Re: Registration of some code pages



(2010/09/09 1:46), Shawn Steele wrote:
> Out of curiosity, is anyone aware of differences between 31J and windows' implementation?

The definition of Windows-31J is as follows:

Name: Windows-31J
MIBenum: 2024
Source: Windows Japanese.  A further extension of Shift_JIS
         to include NEC special characters (Row 13), NEC
         selection of IBM extensions (Rows 89 to 92), and IBM
         extensions (Rows 115 to 119).  The CCS's are
         JIS X0201:1997, JIS X0208:1997, and these extensions.
         This charset can be used for the top-level media type "text",
         but it is of limited or specialized use (see RFC2278).
         PCL Symbol Set id: 19K
Alias: csWindows31J

So:
* it doesn't include user-defined characters
* it isn't clear about best-fit characters
* the original CP932 has some odd mappings, such as U+0080 and U+00FF:
http://icu-project.org/repos/icu/data/trunk/charset/data/ucm/windows-932-2000.ucm
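
As a rough illustration of the points above, here is a minimal sketch that assumes Python 3 and its built-in "cp932" and "shift_jis" codecs. Those codecs are one implementation of Microsoft code page 932 and of plain Shift_JIS; they are not the IANA Windows-31J definition itself, and they are not necessarily identical to the ICU table linked above. The helper names try_encode and try_decode are made up for this example.

```python
# Sketch: compare Python's "cp932" codec with its strict "shift_jis" codec.
# Both helper functions are hypothetical names used only for this example.

def try_encode(text, codec):
    """Return the encoded bytes, or None if the codec cannot encode text."""
    try:
        return text.encode(codec)
    except UnicodeEncodeError:
        return None


def try_decode(data, codec):
    """Return the decoded string, or None if the codec rejects the bytes."""
    try:
        return data.decode(codec)
    except UnicodeDecodeError:
        return None


# NEC special characters (Row 13): CIRCLED DIGIT ONE, U+2460, is row 13 cell 1,
# which corresponds to the Shift_JIS-style byte pair 0x87 0x40.
nec_cp932 = try_encode("\u2460", "cp932")
nec_sjis = try_encode("\u2460", "shift_jis")
print("U+2460 via cp932:    ", nec_cp932)
print("U+2460 via shift_jis:", nec_sjis)

# Odd single-byte mapping: the ICU table maps byte 0x80 to U+0080.
# What a given implementation does with it can simply be printed and compared.
for codec in ("cp932", "shift_jis"):
    decoded = try_decode(b"\x80", codec)
    shown = None if decoded is None else "U+%04X" % ord(decoded)
    print("0x80 via %-9s -> %s" % (codec, shown))
```

Running the same probes against another implementation (ICU, Windows itself, and so on) would be one way to collect the concrete differences being asked about.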

-- 
NARUSE, Yui  <naruse@airemix.jp>