Re: Registration of new charset CESU-8



At 1:48 AM -0800 1/2/02, toby_phipps@peoplesoft.com wrote:
>Published specification(s):
>    Unicode Technical Report #26
>    "Compatibility Encoding Scheme for UTF-16: 8-bit (CESU-8)"
>    http://www.unicode.org/unicode/reports/tr26

The summary in that TR says:

>This document specifies an 8-bit Compatibility Encoding Scheme for 
>UTF-16 (CESU) that is intended for internal use within systems 
>processing Unicode in order to provide an ASCII-compatible 8-bit 
>encoding that is similar to UTF-8 but preserves UTF-16 binary 
>collation. It is not intended nor recommended as an encoding used 
>for open information exchange. The Unicode Consortium, does not 
>encourage the use of CESU-8, but does recognize the existence of 
>data in this encoding and supplies this technical report to clearly 
>define the format and to distinguish it from UTF-8. This encoding 
>does not replace or amend the definition of UTF-8.
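
To illustrate what "similar to UTF-8 but preserves UTF-16 binary collation" means in practice, here is a minimal Python sketch. It assumes the TR's description that CESU-8 encodes each UTF-16 code unit of a supplementary character as its own three-byte sequence; the helper name cesu8_encode is hypothetical and not part of any library.

```python
def cesu8_encode(text: str) -> bytes:
    """Sketch of a CESU-8 encoder (hypothetical helper, not a library API)."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp >= 0x10000:
            # Supplementary character: split into a UTF-16 surrogate pair,
            # then encode each surrogate as a three-byte sequence.
            cp -= 0x10000
            high = 0xD800 + (cp >> 10)
            low = 0xDC00 + (cp & 0x3FF)
            for surrogate in (high, low):
                out += chr(surrogate).encode("utf-8", "surrogatepass")
        else:
            # BMP characters are encoded exactly as in UTF-8.
            out += ch.encode("utf-8")
    return bytes(out)


supplementary = "\U00010400"  # outside the BMP
bmp = "\uff5e"                # inside the BMP, above the surrogate range

# The two encodings differ only for supplementary characters.
print(supplementary.encode("utf-8").hex())  # f0909080
print(cesu8_encode(supplementary).hex())    # eda081edb080

# Binary order: CESU-8 agrees with UTF-16, UTF-8 does not.
utf16_order = supplementary.encode("utf-16-be") < bmp.encode("utf-16-be")
cesu8_order = cesu8_encode(supplementary) < cesu8_encode(bmp)
utf8_order = supplementary.encode("utf-8") < bmp.encode("utf-8")
print(utf16_order, cesu8_order, utf8_order)  # True True False
```

The six-byte form is not valid UTF-8, which is why the TR takes care to distinguish the two.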

The statement "It is not intended nor recommended as an encoding used 
for open information exchange." is underlined in the TR for emphasis.

Charset labels exist for exchanging information between systems, which 
is exactly the use the TR says CESU-8 is not intended for. Thus, CESU-8 
is *not* a candidate for having a charset label.

--Paul Hoffman, Director
--Internet Mail Consortium