[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: Registration of new charset "UTF-16"
> > What about end of line canonicalization? Do we stick with CRLF, or should
> > we use the ISO 10646 Line Separator and Paragraph Separator characters?
> > Or do we give up on a canonical form and just state that widetext/etext
> > probably isn't suitable for use with digital signatures.
I don't think any of these alternatives are correct. A canonical form
is a choice among equivalent forms, where transformation among the equivalents
is allowed. Here are two choices that might be acceptable:
a) treat widetext/etext as you treat application, for the purpose of
digital signatures: no transformations allowed.
b) specify that the 'canonical form' of "widetext/blah" is the transformation
into "text/blah;charset=utf-8" with CRLF end-of-line.
What do you think about a "utf-16" specific top level type, e.g.,
utf-16/html == text/html;charset="utf-16"
utf-16/plain == text/plain;charset="utf-16"
no charset parameter allowed. It would simplify things; there aren't so
many 16-bit charsets.