[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: windows-1252
Markus Scherer wrote:
> Unassigned code points are of course allowed. Without double-checking
> the tables which Mike pointed to in his requests, I think what Frank
> alluded to is how Windows treats unassigned codes in its SBCS
> charsets: It usually roundtrips unassigned bytes xx to/from Unicode
> U+00xx, rather than mapping unassigned codes to some SUBstitution
> character, and some but not all published tables reflect this.
The current registration of windows-1252 points to a document that lists
those codepoints as unassigned. It does not matter what other tables say
about them. If an implementation decides to round-trip these unassigned
codepoints, they do so for particular reasons. But that does not change
the fact that those codepoints are actually unassigned, and that people
should not use them.
Erik