
Re: Registration of new charset [ISO-2022-JP-2004]



Now that the final bytes have reached the letter O, we should probably
write them in hex to avoid confusion with the digit 0?

ESC $ ( O -> 1B 24 28 4F or \x1B\x24\x28\x4F (C/C++/etc)?

ISO 2022 writes each byte as two decimal numbers, one per nibble
(4-bit unit == quartet), separated by a slash: O -> 4/15
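
For anyone who wants to check the two notations against each other, here is a rough Python sketch of the conversion. The function names to_column_row and from_column_row are made up for illustration; they are not from any standard or library.

```python
def to_column_row(byte: int) -> str:
    """Format a byte in column/row notation, e.g. 0x4F -> '4/15'."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("expected a single byte value (0..255)")
    # High nibble is the column, low nibble is the row, both in decimal.
    return f"{byte >> 4}/{byte & 0x0F}"


def from_column_row(text: str) -> int:
    """Parse column/row notation back into a byte, e.g. '4/15' -> 0x4F."""
    column, row = (int(part) for part in text.split("/"))
    if not (0 <= column <= 15 and 0 <= row <= 15):
        raise ValueError("column and row must each be in 0..15")
    return (column << 4) | row


if __name__ == "__main__":
    # The letter O and the digit 0 look alike but are different bytes.
    for ch in "O0":
        print(repr(ch), f"0x{ord(ch):02X}", to_column_row(ord(ch)))
```

Running it shows the letter O as 0x4F (4/15) and the digit 0 as 0x30 (3/0), which is the confusion that hex or column/row notation avoids.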

However, most character encoding experts use hex, so we should too?

Anyway, I think iso-2022-jp-2004 may be using the latest registration:

http://www.itscj.ipsj.or.jp/ISO-IR/233.pdf

It says that the final byte of the escape sequence is 5/1, i.e. \x51 == 'Q'
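
As a quick sanity check, a small Python sketch that builds the bytes for that final byte. It assumes the sequence has the same ESC $ ( F shape as the others in this thread (the registration itself should be consulted for the exact form), and designate_g0_multibyte is a hypothetical helper name.

```python
ESC = 0x1B  # 1/11


def designate_g0_multibyte(final: str) -> bytes:
    """Build ESC $ ( F, with the final byte F given in column/row notation.

    Assumption: the ESC 2/4 2/8 F shape, as in the ESC $ ( O example.
    """
    column, row = (int(part) for part in final.split("/"))
    if not (0 <= column <= 15 and 0 <= row <= 15):
        raise ValueError("column and row must each be in 0..15")
    return bytes([ESC, 0x24, 0x28, (column << 4) | row])


if __name__ == "__main__":
    seq = designate_g0_multibyte("5/1")
    # Hex dump of the four bytes, then the printable part after ESC.
    print(" ".join(f"{b:02X}" for b in seq))
    print("ESC " + " ".join(chr(b) for b in seq[1:]))
```

With 5/1 this gives 1B 24 28 51, i.e. ESC $ ( Q, whereas 4/15 gives the ESC $ ( O form quoted below.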

Also, I am quite concerned about the "full plane 1" and "subset of
plane 1" mentioned below, and the use of ESC $ ( B instead of the more
traditional ESC $ B.

Erik

On 10/2/06, Martin Duerst <duerst@it.aoyama.ac.jp> wrote:
> At 02:31 06/10/02, Erik van der Poel wrote:
> >Which escape sequences are permitted in iso-2022-jp-2004?
>
> - ESC ( B      for ISO/IEC 646 IRV
> - ESC $ ( O    for the full plane 1 of JIS X 0213
> - ESC $ ( P    for plane 2 of JIS X 0213
> - ESC $ ( B    for a subset of plane 1 of JIS X 0213
>                (also a subset of the plane/table from JIS X 0208)