[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Suggested character set policy for the IETF
Harald.T.Alvestrand@uninett.no writes:
> Protocols MUST be able to use the ISO 10646 coded character set,
> with the UTF-8 character encoding scheme, for all text. (This is
> called "UTF-8" in the rest of this document)
Well, UTF-8 is not a character encoding scheme, as it always implies
10646 characters, you could write:
"in the UTF-8 transformation format" instead of "with the UTF-8 ...,"
> They MAY specify how to use other character sets or other
> character encoding schemes, such as UTF-16, but lack of an ability
I propose to delete "or other character encoding schemes" as
"character encoding schemes" is already included in your above definition
of "character set", and UTF-16 is not a character encoding scheme anyway,
as it is not orthogonal with the 10646 coded character set (it always
implies 10646).
> to use UTF-8 needs clear and solid justification in the protocol
> specification document before being entered into or advanced upon
> the standards track.
I would like also to suggest the use of the term "charset" instead
of the term "character set" as the latter has a different meaning
in ISO standards than the one you use.
Keld Simonsen