[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: q about gb 2312/gbk
Markus,
I believe you are not missing GBK -- IANA is. In the work I did to enable
the Oracle XML Parser for a large list of charsets, I have also found the
following charsets are missing from IANA. A brief check today indicates that
these are still missing, but I could have missed something.
EUC-TW
ISO2022CN-CNS
ISO2022CN-GB
MS874
MS932
MS936
MS949
MS950
ISCII
GB18030
Additionally, missing from IANA registry were a large number of IBM code
pages (CPXXX, CPXXXX, CPXXXXX). I can send these to you or the IETF list
upon request, but I'm hesitant to litter just yet.
I apologize that I have not yet had time to file RFCs for the addition of
all of these charsets. Just the same, should someone else like to file these
as RFCs, please don't wait for me.
Thanks,
Craig R. Cummings
Java NLS Architect
Oracle Corporation
----- Original Message -----
From: Markus Scherer <markus.scherer@jtcsv.com>
To: charsets <ietf-charsets@iana.org>
Sent: Wednesday, August 22, 2001 10:30 AM
Subject: q about gb 2312/gbk
> Hello, I have two questions about GB* simplified-Chinese charsets:
>
> 1. There are two entries for GB 2312:
> 1.a)
> Name: GB_2312-80 [RFC1345,KXS2]
> MIBenum: 57
> Source: ECMA registry
> Alias: iso-ir-58
> Alias: chinese
> Alias: csISO58GB231280
>
> 1.b)
> Name: GB2312 (preferred MIME name)
> MIBenum: 2025
> Source: Chinese for People's Republic of China (PRC) mixed one byte,
> two byte set:
> 20-7E = one byte ASCII
> A1-FE = two byte PRC Kanji
> See GB 2312-80
> PCL Symbol Set Id: 18C
> Alias: csGB2312
>
> How are they different?
> The second one is clearly the commonly used MBCS charset.
> Is the first one the DBCS-only part for ISO 2022, or is it also MBCS?
Please clarify.
>
>
> 2. I cannot find a registration for GBK (Microsoft 936).
> Am I just missing it?
>
> markus
>