[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Revised proposal for UTF-16



At 14:51 24.07.98 +0900, Martin J. Duerst wrote:
>What I think we should worry is whether and how UTF-16 should be used
>in traditional protocol headers, based on MIME encoded words. Several
>solutions are possible:
>
>- Discourage or disallow UTF-16 in such headers (there are other
>  cases, in particular Korean Email, where there are differences
>  between the encoding used in the header and in the body).
>

This is reasonable.

>- Use a different specification for these headers (headers would
>  probably be in big-endian without a BOM, and nothing else,
>  bodies could tolerate little-endian and/or recommend/mandate
>  the BOM). The difference is justified because headers need
>  additional encoding/decoding anyway, and the user expectations
>  for their legibility are somewhat lower.

This means that there are 2 almost-equal character sets.
Since they're not completely equal, they have to have different names.
That this seems attractive is an example of why I don't think mandating
the BOM is likely to be a Good Idea for all cases of UTF-16.

>- Use exactly the same specifications for both headers and bodies.

This is reasonable.

               Harald A

-- 
Harald Tveit Alvestrand, Maxware, Norway
Harald.Alvestrand@maxware.no