[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Suggested character set policy for the IETF
On Thu, 26 Jun 1997, Mark Crispin wrote:
> I am, however, sympathetic to Martin's position. I agree that "charset"
> should be the commonly used term, leading to wording such as:
> In this document, the term "character set" (commonly called a
> "charset") refers to the combination of coded character set and
> character encoding scheme. Non-IETF specifications use the term
> "character set" to refer to the "coded character set", so the
> term "charset" is preferred for the IETF definition.
> (both CCS and CES should be defined earlier).
I tend to agree with Mark on this issue, although I don't believe this
definition is correct.
A charset in the MIME sense is a mapping from octets to characters and
related presentation information. One way of constructing a MIME charset
is to combine a CCS with an invertible CES (note that a CCS and
non-invertible CES is *not* a MIME charset). While CCS and CES are useful
concepts for people who build character sets; the MIME charset concept is
the useful concept when presenting plain text.