[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: RFC 2279 (UTF-8) to Full Standard

To: Martin Duerst <[email protected]>, Kenneth Whistler <[email protected]>,[email protected]
Subject: Re: RFC 2279 (UTF-8) to Full Standard
From: Mark Davis <[email protected]>
Date: Fri, 12 Apr 2002 07:15:16 -0700
Cc: [email protected]
References: <4.2.0.58.J.20020412151604.034b1008@localhost>
Reply-to: Mark Davis <[email protected]>

Yes, they have; and it is quoted in the UTF-32 TR. Moreover, it is of
course safest if the RFC UTF-8 is restricted to 10FFFF, since any
higher values will not convert to UTF-16, and could even cause
security problems if converted incorrectly (e.g. overlaying legitimate
codes).

Mark
—————

Γνῶθι σαυτόν — Θαλῆς
[For transliteration, see http://oss.software.ibm.com/cgi-bin/icu/tr]

http://www.macchiato.com

----- Original Message -----
From: "Martin Duerst" <[email protected]>
To: "Kenneth Whistler" <[email protected]>; <[email protected]>
Cc: <[email protected]>
Sent: Thursday, April 11, 2002 23:29
Subject: RE: RFC 2279 (UTF-8) to Full Standard


> At 18:10 02/04/11 -0700, Kenneth Whistler wrote:
>
> >I agree, even though the Unicode Standard only describes UTF-8
> >out to U+10FFFF. 10646 still gives the full scheme to U-7FFFFFFF,
> >and it will be awhile (if ever) before we can change that to
> >deprecate all the 5- and 6-byte values.
>
> I thought ISO had adopted a standing policy on not allocating
> anything beyond U+10FFFF. Ken, do you know the exact status of
> this? Can you tell us?
>
> >So I see no good reason
> >right now to put RFC 2279 out of synch with 10646, particularly
> >if it would slow down a revision of RFC 2279 now.
>
> I think the new document should clearly state that codepoints above
> U+10FFFF cannot be encoded in UTF-16, that the Unicode consortium
> won't allocate any codepoints above that, that ISO has some relevant
> policy (if they do),... Also, pointing to UTF-32 might be a good
idea.
> (I just found out that it has been approved for registration, but
> is not yet listed in the relevant file.)
>
>
> Regards,   Martin.
>

Follow-Ups:
- Re: RFC 2279 (UTF-8) to Full Standard
  - From: Tony Hansen <[email protected]>

References:
- RE: RFC 2279 (UTF-8) to Full Standard
  - From: Martin Duerst <[email protected]>

Prev by Date: Re: RFC 2279 (UTF-8) to Full Standard
Next by Date: Re: RFC 2279 (UTF-8) to Full Standard
Prev by thread: RE: RFC 2279 (UTF-8) to Full Standard
Next by thread: Re: RFC 2279 (UTF-8) to Full Standard
Index(es):
- Date
- Thread