Re: shift_jis / windows-31J

Hello Shawn, Yui,

On 2010/11/11 18:57, NARUSE, Yui wrote:
> (2010/11/11 7:52), Shawn Steele wrote:
>> A dozen years ago windows-31J was created because people noticed that
>> there were lots of different flavors of shift_jis floating around.
>> Uniquely identifying them may have made sense, however the windows-31J
>> term has never really been widely adopted for the windows code page
>> 932 behavior.
>> So I’d like to propose the following updates, loosly based on
>> discussion about variants some time ago. I’d be happy to accept other
>> suggestions that help users discover that some test is tagged with the

Shouldn't that be 'text' instead of 'test'?

>> less-specific shift_jis name rather than the more specific vendor
>> charset name.
>> Name: Windows-31J
>> MIBenum: 2024
>> Source: Windows Japanese. A variant of Shift_JIS to include
>> NEC special characters (Row 13), NEC selection of IBM
>> extensions (Rows 89 to 92), and IBM extensions (Rows
>> 115 to 119). The CCS's are JIS X0201:1997,
>> JIS X0208:1997, and these extensions. This charset
>> can be used for the top-level media type "text", but
>> it is of limited or specialized use (see RFC2278).

I think when you say 'it can be used for the top-level media type
"text"', you also need to say that it is not 7-bit.

Anyway, it seems that you are using an old template (or none). I think
it would be best to use the newest template.

>> PCL Symbol Set id: 19K.

I had no clue what "PCL Symbol set" was. Is this important? It doesn't 
turn up in other charset registrations.

>> Windows-31J text is commonly
>> declared with the shift_jis name of the parent charset.

I'd suggest changing "commonly" to "often". To me, "commonly" has too
much of a touch of "that's the right thing to do".
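
To illustrate why the label matters, here is a minimal Python sketch.
It assumes that Python's "cp932" codec stands in for Windows-31J and
that its "shift_jis" codec follows JIS X 0208 strictly; the helper
try_decode is hypothetical, and other decoders may well be more lenient
with the shift_jis label. The byte pair is taken from the NEC special
characters (Row 13) mentioned in the registration text.

```python
from typing import Optional


def try_decode(raw: bytes, codec: str) -> Optional[str]:
    """Return the decoded text, or None if the bytes are invalid in codec."""
    try:
        return raw.decode(codec)
    except UnicodeDecodeError:
        return None


# 0x87 0x40 lies in Row 13 (NEC special characters), which code page 932
# assigns but plain JIS X 0208 leaves unassigned.
nec_row13_bytes = b"\x87\x40"

as_cp932 = try_decode(nec_row13_bytes, "cp932")
as_shift_jis = try_decode(nec_row13_bytes, "shift_jis")

print("cp932:    ", repr(as_cp932))
print("shift_jis:", repr(as_shift_jis))
```

A receiver that trusts the shift_jis label and decodes strictly would
lose such characters, which is the argument for a more specific label.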

>> Alias: csWindows31J
>> Alias: shift_jis+cp932
>> Name: Shift_JIS (preferred MIME name)
>> MIBenum: 17
>> Source: This charset is an extension of csHalfWidthKatakana by
>> adding graphic characters in JIS X 0208. The CCS's are
>> JIS X0201:1997 and JIS X0208:1997. The
>> complete definition is shown in Appendix 1 of JIS
>> X0208:1997.
>> This charset can be used for the top-level media type "text".
>> Several vendor specific charsets that derive from shift_jis
>> often use the shift_jis name instead of a more specific
>> vendor charset name.
>> Alias: MS_Kanji
>> Alias: csShiftJIS

I'm not sure why the registration for Shift_JIS turns up here. Are you 
also updating that?

> I object to create new alias name.

Yui, can you say why you object?

> Moreover XML doesn't allow "+" for EncName.
> http://www.w3.org/TR/REC-xml/#NT-EncName

I agree that this is a serious show-stopper.
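
For illustration, a small Python sketch of that check. The pattern is my
transcription of the EncName production from the XML specification
linked above; the function name is_valid_encname is hypothetical.

```python
import re

# EncName ::= [A-Za-z] ([A-Za-z0-9._] | '-')*
ENC_NAME = re.compile(r"[A-Za-z][A-Za-z0-9._\-]*")


def is_valid_encname(name: str) -> bool:
    """True if name could appear in an XML encoding declaration."""
    return ENC_NAME.fullmatch(name) is not None


# The proposed alias plus the alternatives mentioned in this thread.
for candidate in ["shift_jis+cp932", "Windows-31J", "CP932", "MS932",
                  "Windows-932"]:
    print(candidate, is_valid_encname(candidate))
```

Only the alias containing "+" fails the check, so it could never be
written in an XML encoding declaration.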

Regards,    Martin.

> If add aliases to Windows-31J, they should be CP932, MS932, or Windows-932.
> I agree with adding more description.

#-# Martin J. Dürst, Professor, Aoyama Gakuin University
#-# http://www.sw.it.aoyama.ac.jp   mailto:duerst@it.aoyama.ac.jp