[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: q about gb 2312/gbk



Markus,

I believe you are not missing GBK -- IANA is. In the work I did to enable
the Oracle XML Parser for a large list of charsets, I have also found the
following charsets are missing from IANA. A brief check today indicates that
these are still missing, but I could have missed something.

   EUC-TW
   ISO2022CN-CNS
   ISO2022CN-GB
   MS874
   MS932
   MS936
   MS949
   MS950
   ISCII
   GB18030

Additionally, missing from IANA registry were a large number of IBM code
pages (CPXXX, CPXXXX, CPXXXXX). I can send these to you or the IETF list
upon request, but I'm hesitant to litter just yet.

I apologize that I have not yet had time to file RFCs for the addition of
all of these charsets. Just the same, should someone else like to file these
as RFCs, please don't wait for me.

Thanks,


Craig R. Cummings
Java NLS Architect
Oracle Corporation

----- Original Message -----
From: Markus Scherer <markus.scherer@jtcsv.com>
To: charsets <ietf-charsets@iana.org>
Sent: Wednesday, August 22, 2001 10:30 AM
Subject: q about gb 2312/gbk


> Hello, I have two questions about GB* simplified-Chinese charsets:
>
> 1. There are two entries for GB 2312:
> 1.a)
> Name: GB_2312-80                                        [RFC1345,KXS2]
> MIBenum: 57
> Source: ECMA registry
> Alias: iso-ir-58
> Alias: chinese
> Alias: csISO58GB231280
>
> 1.b)
> Name: GB2312  (preferred MIME name)
> MIBenum: 2025
> Source: Chinese for People's Republic of China (PRC) mixed one byte,
>         two byte set:
>           20-7E = one byte ASCII
>           A1-FE = two byte PRC Kanji
>         See GB 2312-80
>         PCL Symbol Set Id: 18C
> Alias: csGB2312
>
> How are they different?
> The second one is clearly the commonly used MBCS charset.
> Is the first one the DBCS-only part for ISO 2022, or is it also MBCS?
Please clarify.
>
>
> 2. I cannot find a registration for GBK (Microsoft 936).
> Am I just missing it?
>
> markus
>