[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Volunteer needed to serve as IANA charset reviewer
Frank Ellermann schrieb:
> Claus Färber wrote:
>> "UNKNOWN-8BIT" (already used by some mail transport agents
> First defined in RFC 1428, used in RFC 1700 and RFC 2557, it's
> already registered.
Oops.
>> "UNKNOWN-UTF16"
> What's the difference from UTF-16 ?
UTF-16 "SHOULD be interpreted as being big-endian" if there's no BOM,
RFC 2781, 4.3. UNKNOWN-UTF16 would not have such a fall back.
>> with alias "UNICODE".
> Ugh, thanks, but no thanks.
The idea is to deprecate the label "UNICODE" by tying it to an
incompletly specified charset.
>> "UNKNOWN-ISO-8859" with alias "ANSI".
>> "UNKNOWM-IBMPC" with alias "OEM".
>
> One of those could do, "unknown-ascii-8bit", alias "oem".
We already have UNKNOWN-8BIT.
When you convert legacy data, you often DO know that something is in a
DOSish (IBMPC-based) or Windowsish (ANSI-based) charset. Having charset
labels to carry this information (instead of the unspecified
UNKNOWN-8BIT) is a good idea.
Claus