
RE: Revised proposal for UTF-16



At 18:26 25.05.98 -0700, Dan Kegel wrote:
>The underlying standard has the BOM.  
>The authors of that standard knew the issue was
>a hot potato, and decided to go both ways.

And they chose to be wishy-washy about it. Bad Move.
I haven't checked Unicode, but ISO/IEC 10646 is truly wishy-washy; all I could
find about byte order is this little paragraph from Annex F:

>If an application which uses one of these signatures recognises its coded
>representation in reverse sequence (e.g. hexadecimal FFFE), the application
>can identify that the coded representations of the following characters use
>the opposite octet sequence to the sequence expected, and may take the
>necessary action to recognise the characters correctly.
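
For illustration, the check that paragraph describes might be sketched
roughly as follows in Python. The function name detect_byte_order, the
"expected" parameter, and the choice to fall back to the expected order
when no signature is present are all assumptions made for this sketch;
the annex text only says what an application "can" and "may" do.

```python
SIGNATURE = 0xFEFF           # the signature read in the expected octet order
REVERSED_SIGNATURE = 0xFFFE  # the same two octets read in the opposite order


def detect_byte_order(data, expected="big"):
    """Return (byte_order, payload) for a run of 16-bit coded octets.

    `expected` is the octet order the reader assumes ("big" or "little").
    Hypothetical helper: the fallback for unsigned data is an assumption.
    """
    opposite = "little" if expected == "big" else "big"
    if len(data) >= 2:
        # Read the first 16-bit unit using the order we expect.
        first = int.from_bytes(data[:2], expected)
        if first == SIGNATURE:
            # Signature seen as-is: data is in the expected order.
            return expected, data[2:]
        if first == REVERSED_SIGNATURE:
            # Signature seen reversed: data uses the opposite order.
            return opposite, data[2:]
    # No signature: nothing to go on, so keep the expected order.
    return expected, data
```

The returned payload could then be decoded with "utf-16-be" or
"utf-16-le" according to the detected order. Note that the sketch only
answers how to read a signature that is there; it says nothing about
which data elements ought to carry one.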

Question: For what data element size do we expect the BOM to be used?
For long pieces of text, it's pretty obvious.
But what about databases? Structured values? ASN.1 SET OFs?
On every string, only the first string (whatever that means), or no string at all?

I'm not worried about wasting space, but about clarity on when to use it.

                                 Harald A




-- 
Harald Tveit Alvestrand, Maxware, Norway
Harald.Alvestrand@maxware.no