[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: draft-hoffman-utf16-01.txt available
Larry Masinter wrote:
> Appendix F of the XML specification
>
> (http://www.w3.org/TR/1998/REC-xml-19980210#sec-guessing)
>
> makes it clear that BOM isn't actually necessary for auto-guessing
> of charset between UTF-16LE and UTF-16BE, because of the differing
> encodings of "<" and "?" in the initial "<?".
Actually, external DTD subsets, external parsed entities, and external
parameter entites do not have to begin with "<". In particular,
external parsed entities can begin with any character. To allow
autodetection, XML 1.0 requires that external parsed entities must begin
either the BOM or text declarations, if not in UTF-8.
Makoto
Fuji Xerox Information Systems
Tel: +81-44-812-7230 Fax: +81-44-812-7231
E-mail: murata@apsdc.ksp.fujixerox.co.jp