[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Registration of new charset: UTF-32
Misha Wolf wrote:
> Has anyone looked to see how this ties in with:
> Extensible Markup Language (XML) 1.0 (Second Edition)
> Autodetection of Character Encodings (Non-Normative)
> http://www.w3.org/TR/REC-xml#sec-guessing
No problem. XML defines slightly more protocol: Since the first character is either a BOM or a '<', one can detect UTF-32 in either endianness even without a BOM.
markus