
Re: Registration of new charset [IBM1047]



As requested, below is the revised registration form for the new charset
IBM1047.

=======================================================================
Charset name:     IBM1047

Charset aliases:  IBM-1047

Suitability for use in MIME text:   NO

Published specification(s):
IBM1047 (EBCDIC Latin 1/Open Systems) can be found at:
http://www-1.ibm.com/servers/eserver/iseries/software/globalization/pdf/cp01047z.pdf
.

ISO 10646 equivalency table:
   1) For the mapping table from IBM CP 1047 to ISO 10646, refer to the
table below:

**************************************************************************
* Copyright IBM Corporation 1995
* Name:            Mapping table from IBM CP 1047 to ISO 10646.
* Creation date:   Fri Nov 10 16:43:19 1995
* ISO 10646-1:     UCS-2 (level-3 tolerant)
* Table Version:   1.00
* Table Owner:     NLTC, IBM Canada Ltd.
* PUA used:        NO
* Round-Trip:      YES
* GCGIDs mapped:   ALL
* Table Format:
*                  Columns separated by spaces:
*                    Col1: (pos.  1-8  ) IBM CP 1047 code in hex
*                    Col2: (pos. 10-13 ) ISO 10646 code in hex
*                    Col3: (pos. 16-23 ) IBM GCGID
*                    Col4: (pos. 25-32 ) Mapping indicator to specify:
*                         1. Synonym GCGID used when reverse mapping for
*                            this GCGID is not the same.
*                         2. Code page specific exceptions.
*                    Col5: (pos.035->  )  ISO 10646 character name
* Sorting:         The entries are in IBM CP 1047 code point order
* General Notes:
*                  1. Control codes are mapped as per CDRA specifications
*                     for mapping between EBCDIC and ISO8 controls
*                  2. Characters with no UCS-2 equivalent identified with
*                     ????
*                  3. For PC-Data code pages, SM24 and SM25 are mapped as
*                     explained in: CDRA Level-2, Registry, page 52, Exception.
**************************************************************************
*-------------------------------------------------------------------------
*CP      UCS   GCGID    SYNONYM   ISO 10646 NAME
*-------------------------------------------------------------------------
00       0000  ..NUL...           (CC) Null
01       0001  ..SOH...           (CC) Start of Heading
02       0002  ..STX...           (CC) Start of Text
03       0003  ..ETX...           (CC) End of Text
04       009C  ..SEL... ...ST...  (CC) String Terminator
05       0009  ...HT...           (CC) Horizontal Tabulation
06       0086  ..RNL... ..SSA...  (CC) Start of Selected Area
07       007F  ..DEL...           (CC) Delete
08       0097  ...GE... ..EPA...  (CC) End of Guarded Area
09       008D  ..SPS... ...RI...  (CC) Reverse Line Feed (or Reverse Index)
0A       008E  ..RPT... ..SS2...  (CC) Single Shift Two
0B       000B  ...VT...           (CC) Vertical Tabulation
0C       000C  ...FF...           (CC) Form Feed
0D       000D  ...CR...           (CC) Carriage Return
0E       000E  ...SO... .SO/LS1.  (CC) Locking Shift One (Shift Out)
0F       000F  ...SI... .SI/LS0.  (CC) Locking Shift Zero (Shift In)
10       0010  ..DLE...           (CC) Data Link Escape
11       0011  ..DC1...           (CC) Device Control One
12       0012  ..DC2...           (CC) Device Control Two
13       0013  ..DC3...           (CC) Device Control Three
14       009D  ........ ..OSC...  (CC) Operating System Command
15       0085  ...NL... ..NEL...  (CC) Next Line
16       0008  ...BS...           (CC) Backspace
17       0087  ..POC... ..ESA...  (CC) End of Selected Area
18       0018  ..CAN...           (CC) Cancel
19       0019  ...EM...           (CC) End of Medium
1A       0092  ..UBS... ..PU2...  (CC) Private Use Two
1B       008F  ..CU1... ..SS3...  (CC) Single Shift Three
1C       001C  ..IFS...           (CC) Information File Separator
1D       001D  ..IGS... ...GS...  (CC) Group Separator
1E       001E  ..IRS... ...RS...  (CC) Record Separator
1F       001F  ........ ...US...  (CC) Unit Separator
20       0080  ...DS... ........
21       0081  ..SOSI.. ........
22       0082  ...FS... ..BPH...  (CC) Break Permitted Here
23       0083  ..WUS... ..NBH...  (CC) No Break Here
24       0084  ........ ..IND...  (CC) Index
25       000A  ...LF...           (CC) Line Feed
26       0017  ..ETB...           (CC) End of Transmission Block
27       001B  ..ESC...           (CC) Escape
28       0088  ...SA... ..HTS...  (CC) Character Tabulation Set
29       0089  ..SFE... ..HTJ...  (CC) Character Tabulation with Justification
2A       008A  .SM/SW.. ..VTS...  (CC) Line Tabulation Set
2B       008B  ..CSP... ..PLD...  (CC) Partial Line Down
2C       008C  ..MFA... ..PLU...  (CC) Partial Line Up
2D       0005  ..ENQ...           (CC) Enquiry
2E       0006  ..ACK...           (CC) Acknowledge
2F       0007  ..BEL...           (CC) Bell
30       0090  ........ ..DCS...  (CC) Device Control String
31       0091  ........ ..PU1...  (CC) Private Use One
32       0016  ..SYN...           (CC) Synchronous Idle
33       0093  ...IR... ..STS...  (CC) Set Transmit State
34       0094  ...PP... ..CCH...  (CC) Cancel Character
35       0095  ..TRN... ...MW...  (CC) Message Waiting
36       0096  ..NBS... ..SPA...  (CC) Start of Guarded Area
37       0004  ..EOT...           (CC) End of Transmission
38       0098  ..SBS... ..SOS...  (CC) Start of String
39       0099  ...IT... ........
3A       009A  ..RFF... ..SCI...  (CC) Single Character Introducer
3B       009B  ..CU3... ..CSI...  (CC) Control Sequence Introducer
3C       0014  ..DC4...           (CC) Device Control Four
3D       0015  ..NAK...           (CC) Negative Acknowledge
3E       009E  ........ ...PM...  (CC) Privacy Message
3F       001A  ..SUB...           (CC) Substitute
40       0020  SP010000           SPACE
41       00A0  SP300000           NO-BREAK SPACE
42       00E2  LA150000           LATIN SMALL LETTER A WITH CIRCUMFLEX
43       00E4  LA170000           LATIN SMALL LETTER A WITH DIAERESIS
44       00E0  LA130000           LATIN SMALL LETTER A WITH GRAVE
45       00E1  LA110000           LATIN SMALL LETTER A WITH ACUTE
46       00E3  LA190000           LATIN SMALL LETTER A WITH TILDE
47       00E5  LA270000           LATIN SMALL LETTER A WITH RING ABOVE
48       00E7  LC410000           LATIN SMALL LETTER C WITH CEDILLA
49       00F1  LN190000           LATIN SMALL LETTER N WITH TILDE
4A       00A2  SC040000           CENT SIGN
4B       002E  SP110000           FULL STOP
4C       003C  SA030000           LESS-THAN SIGN
4D       0028  SP060000           LEFT PARENTHESIS
4E       002B  SA010000           PLUS SIGN
4F       007C  SM130000           VERTICAL LINE
50       0026  SM030000           AMPERSAND
51       00E9  LE110000           LATIN SMALL LETTER E WITH ACUTE
52       00EA  LE150000           LATIN SMALL LETTER E WITH CIRCUMFLEX
53       00EB  LE170000           LATIN SMALL LETTER E WITH DIAERESIS
54       00E8  LE130000           LATIN SMALL LETTER E WITH GRAVE
55       00ED  LI110000           LATIN SMALL LETTER I WITH ACUTE
56       00EE  LI150000           LATIN SMALL LETTER I WITH CIRCUMFLEX
57       00EF  LI170000           LATIN SMALL LETTER I WITH DIAERESIS
58       00EC  LI130000           LATIN SMALL LETTER I WITH GRAVE
59       00DF  LS610000           LATIN SMALL LETTER SHARP S (German)
5A       0021  SP020000           EXCLAMATION MARK
5B       0024  SC030000           DOLLAR SIGN
5C       002A  SM040000           ASTERISK
5D       0029  SP070000           RIGHT PARENTHESIS
5E       003B  SP140000           SEMICOLON
5F       005E  SD150000           CIRCUMFLEX ACCENT
60       002D  SP100000           HYPHEN-MINUS
61       002F  SP120000           SOLIDUS
62       00C2  LA160000           LATIN CAPITAL LETTER A WITH CIRCUMFLEX
63       00C4  LA180000           LATIN CAPITAL LETTER A WITH DIAERESIS
64       00C0  LA140000           LATIN CAPITAL LETTER A WITH GRAVE
65       00C1  LA120000           LATIN CAPITAL LETTER A WITH ACUTE
66       00C3  LA200000           LATIN CAPITAL LETTER A WITH TILDE
67       00C5  LA280000           LATIN CAPITAL LETTER A WITH RING ABOVE
68       00C7  LC420000           LATIN CAPITAL LETTER C WITH CEDILLA
69       00D1  LN200000           LATIN CAPITAL LETTER N WITH TILDE
6A       00A6  SM650000           BROKEN BAR
6B       002C  SP080000           COMMA
6C       0025  SM020000           PERCENT SIGN
6D       005F  SP090000           LOW LINE
6E       003E  SA050000           GREATER-THAN SIGN
6F       003F  SP150000           QUESTION MARK
70       00F8  LO610000           LATIN SMALL LETTER O WITH STROKE
71       00C9  LE120000           LATIN CAPITAL LETTER E WITH ACUTE
72       00CA  LE160000           LATIN CAPITAL LETTER E WITH CIRCUMFLEX
73       00CB  LE180000           LATIN CAPITAL LETTER E WITH DIAERESIS
74       00C8  LE140000           LATIN CAPITAL LETTER E WITH GRAVE
75       00CD  LI120000           LATIN CAPITAL LETTER I WITH ACUTE
76       00CE  LI160000           LATIN CAPITAL LETTER I WITH CIRCUMFLEX
77       00CF  LI180000           LATIN CAPITAL LETTER I WITH DIAERESIS
78       00CC  LI140000           LATIN CAPITAL LETTER I WITH GRAVE
79       0060  SD130000           GRAVE ACCENT
7A       003A  SP130000           COLON
7B       0023  SM010000           NUMBER SIGN
7C       0040  SM050000           COMMERCIAL AT
7D       0027  SP050000           APOSTROPHE
7E       003D  SA040000           EQUALS SIGN
7F       0022  SP040000           QUOTATION MARK
80       00D8  LO620000           LATIN CAPITAL LETTER O WITH STROKE
81       0061  LA010000           LATIN SMALL LETTER A
82       0062  LB010000           LATIN SMALL LETTER B
83       0063  LC010000           LATIN SMALL LETTER C
84       0064  LD010000           LATIN SMALL LETTER D
85       0065  LE010000           LATIN SMALL LETTER E
86       0066  LF010000           LATIN SMALL LETTER F
87       0067  LG010000           LATIN SMALL LETTER G
88       0068  LH010000           LATIN SMALL LETTER H
89       0069  LI010000           LATIN SMALL LETTER I
8A       00AB  SP170000           LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
8B       00BB  SP180000           RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
8C       00F0  LD630000           LATIN SMALL LETTER ETH (Icelandic)
8D       00FD  LY110000           LATIN SMALL LETTER Y WITH ACUTE
8E       00FE  LT630000           LATIN SMALL LETTER THORN (Icelandic)
8F       00B1  SA020000           PLUS-MINUS SIGN
90       00B0  SM190000           DEGREE SIGN
91       006A  LJ010000           LATIN SMALL LETTER J
92       006B  LK010000           LATIN SMALL LETTER K
93       006C  LL010000           LATIN SMALL LETTER L
94       006D  LM010000           LATIN SMALL LETTER M
95       006E  LN010000           LATIN SMALL LETTER N
96       006F  LO010000           LATIN SMALL LETTER O
97       0070  LP010000           LATIN SMALL LETTER P
98       0071  LQ010000           LATIN SMALL LETTER Q
99       0072  LR010000           LATIN SMALL LETTER R
9A       00AA  SM210000           FEMININE ORDINAL INDICATOR
9B       00BA  SM200000           MASCULINE ORDINAL INDICATOR
9C       00E6  LA510000           LATIN SMALL LIGATURE AE
9D       00B8  SD410000           CEDILLA
9E       00C6  LA520000           LATIN CAPITAL LIGATURE AE
9F       00A4  SC010000           CURRENCY SIGN
A0       00B5  SM170000           MICRO SIGN
A1       007E  SD190000           TILDE
A2       0073  LS010000           LATIN SMALL LETTER S
A3       0074  LT010000           LATIN SMALL LETTER T
A4       0075  LU010000           LATIN SMALL LETTER U
A5       0076  LV010000           LATIN SMALL LETTER V
A6       0077  LW010000           LATIN SMALL LETTER W
A7       0078  LX010000           LATIN SMALL LETTER X
A8       0079  LY010000           LATIN SMALL LETTER Y
A9       007A  LZ010000           LATIN SMALL LETTER Z
AA       00A1  SP030000           INVERTED EXCLAMATION MARK
AB       00BF  SP160000           INVERTED QUESTION MARK
AC       00D0  LD620000 LD640000  LATIN CAPITAL LETTER ETH (Icelandic)
AD       005B  SM060000           LEFT SQUARE BRACKET
AE       00DE  LT640000           LATIN CAPITAL LETTER THORN (Icelandic)
AF       00AE  SM530000           REGISTERED SIGN
B0       00AC  SM660000           NOT SIGN
B1       00A3  SC020000           POUND SIGN
B2       00A5  SC050000           YEN SIGN
B3       00B7  SD630000           MIDDLE DOT
B4       00A9  SM520000           COPYRIGHT SIGN
B5       00A7  SM240000           SECTION SIGN
B6       00B6  SM250000           PILCROW SIGN
B7       00BC  NF040000           VULGAR FRACTION ONE QUARTER
B8       00BD  NF010000           VULGAR FRACTION ONE HALF
B9       00BE  NF050000           VULGAR FRACTION THREE QUARTERS
BA       00DD  LY120000           LATIN CAPITAL LETTER Y WITH ACUTE
BB       00A8  SD170000           DIAERESIS
BC       00AF  SM150000 SD310000  MACRON
BD       005D  SM080000           RIGHT SQUARE BRACKET
BE       00B4  SD110000           ACUTE ACCENT
BF       00D7  SA070000           MULTIPLICATION SIGN
C0       007B  SM110000           LEFT CURLY BRACKET
C1       0041  LA020000           LATIN CAPITAL LETTER A
C2       0042  LB020000           LATIN CAPITAL LETTER B
C3       0043  LC020000           LATIN CAPITAL LETTER C
C4       0044  LD020000           LATIN CAPITAL LETTER D
C5       0045  LE020000           LATIN CAPITAL LETTER E
C6       0046  LF020000           LATIN CAPITAL LETTER F
C7       0047  LG020000           LATIN CAPITAL LETTER G
C8       0048  LH020000           LATIN CAPITAL LETTER H
C9       0049  LI020000           LATIN CAPITAL LETTER I
CA       00AD  SP320000           SOFT HYPHEN
CB       00F4  LO150000           LATIN SMALL LETTER O WITH CIRCUMFLEX
CC       00F6  LO170000           LATIN SMALL LETTER O WITH DIAERESIS
CD       00F2  LO130000           LATIN SMALL LETTER O WITH GRAVE
CE       00F3  LO110000           LATIN SMALL LETTER O WITH ACUTE
CF       00F5  LO190000           LATIN SMALL LETTER O WITH TILDE
D0       007D  SM140000           RIGHT CURLY BRACKET
D1       004A  LJ020000           LATIN CAPITAL LETTER J
D2       004B  LK020000           LATIN CAPITAL LETTER K
D3       004C  LL020000           LATIN CAPITAL LETTER L
D4       004D  LM020000           LATIN CAPITAL LETTER M
D5       004E  LN020000           LATIN CAPITAL LETTER N
D6       004F  LO020000           LATIN CAPITAL LETTER O
D7       0050  LP020000           LATIN CAPITAL LETTER P
D8       0051  LQ020000           LATIN CAPITAL LETTER Q
D9       0052  LR020000           LATIN CAPITAL LETTER R
DA       00B9  ND011000           SUPERSCRIPT ONE
DB       00FB  LU150000           LATIN SMALL LETTER U WITH CIRCUMFLEX
DC       00FC  LU170000           LATIN SMALL LETTER U WITH DIAERESIS
DD       00F9  LU130000           LATIN SMALL LETTER U WITH GRAVE
DE       00FA  LU110000           LATIN SMALL LETTER U WITH ACUTE
DF       00FF  LY170000           LATIN SMALL LETTER Y WITH DIAERESIS
E0       005C  SM070000           REVERSE SOLIDUS
E1       00F7  SA060000           DIVISION SIGN
E2       0053  LS020000           LATIN CAPITAL LETTER S
E3       0054  LT020000           LATIN CAPITAL LETTER T
E4       0055  LU020000           LATIN CAPITAL LETTER U
E5       0056  LV020000           LATIN CAPITAL LETTER V
E6       0057  LW020000           LATIN CAPITAL LETTER W
E7       0058  LX020000           LATIN CAPITAL LETTER X
E8       0059  LY020000           LATIN CAPITAL LETTER Y
E9       005A  LZ020000           LATIN CAPITAL LETTER Z
EA       00B2  ND021000           SUPERSCRIPT TWO
EB       00D4  LO160000           LATIN CAPITAL LETTER O WITH CIRCUMFLEX
EC       00D6  LO180000           LATIN CAPITAL LETTER O WITH DIAERESIS
ED       00D2  LO140000           LATIN CAPITAL LETTER O WITH GRAVE
EE       00D3  LO120000           LATIN CAPITAL LETTER O WITH ACUTE
EF       00D5  LO200000           LATIN CAPITAL LETTER O WITH TILDE
F0       0030  ND100000           DIGIT ZERO
F1       0031  ND010000           DIGIT ONE
F2       0032  ND020000           DIGIT TWO
F3       0033  ND030000           DIGIT THREE
F4       0034  ND040000           DIGIT FOUR
F5       0035  ND050000           DIGIT FIVE
F6       0036  ND060000           DIGIT SIX
F7       0037  ND070000           DIGIT SEVEN
F8       0038  ND080000           DIGIT EIGHT
F9       0039  ND090000           DIGIT NINE
FA       00B3  ND031000           SUPERSCRIPT THREE
FB       00DB  LU160000           LATIN CAPITAL LETTER U WITH CIRCUMFLEX
FC       00DC  LU180000           LATIN CAPITAL LETTER U WITH DIAERESIS
FD       00D9  LU140000           LATIN CAPITAL LETTER U WITH GRAVE
FE       00DA  LU120000           LATIN CAPITAL LETTER U WITH ACUTE
FF       009F  ...EO... ..APC...  (CC) Application Program Command
************************END OF TABLE**************************************

     2) An alternate mapping table including roundtrip/fallback information
is also available at:
http://oss.software.ibm.com/cvs/icu/charset/data/ucm/ibm-1047_P100-2000.ucm
.
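
The table format described in the header above (hex code point, hex UCS value, GCGID, optional synonym, character name, with comment lines starting with "*") can be read mechanically. The following is a minimal sketch in Python, not part of the registration itself: the names SAMPLE_ROWS, parse_mapping, decode and is_round_trip are hypothetical, and only a handful of rows are copied from the table for brevity. It assumes each data row starts with a two-digit hex code followed by a four-digit hex UCS value.

```python
import re

# A few rows copied from the table above; a full run would feed every row.
SAMPLE_ROWS = """\
*CP      UCS   GCGID    SYNONYM   ISO 10646 NAME
15       0085  ...NL... ..NEL...  (CC) Next Line
40       0020  SP010000           SPACE
C1       0041  LA020000           LATIN CAPITAL LETTER A
C2       0042  LB020000           LATIN CAPITAL LETTER B
C9       0049  LI020000           LATIN CAPITAL LETTER I
D4       004D  LM020000           LATIN CAPITAL LETTER M
F1       0031  ND010000           DIGIT ONE
"""

# Col1 is the IBM CP 1047 code, Col2 the ISO 10646 code, both in hex.
ROW = re.compile(r"^([0-9A-F]{2})\s+([0-9A-F]{4})\s")


def parse_mapping(text):
    """Return {CP 1047 byte value: UCS code point} for every data row."""
    table = {}
    for line in text.splitlines():
        match = ROW.match(line)
        if match:  # comment lines start with '*' and never match
            table[int(match.group(1), 16)] = int(match.group(2), 16)
    return table


def decode(data, table):
    """Decode CP 1047 bytes to a string using the parsed table."""
    return "".join(chr(table[byte]) for byte in data)


def is_round_trip(table):
    """True if no two code points map to the same UCS value."""
    return len(set(table.values())) == len(table)


if __name__ == "__main__":
    mapping = parse_mapping(SAMPLE_ROWS)
    print(decode(bytes([0xC9, 0xC2, 0xD4, 0x40, 0xF1]), mapping))  # prints: IBM 1
    print(is_round_trip(mapping))  # prints: True
```

Fed with all 256 rows, is_round_trip gives a quick check of the "Round-Trip: YES" claim in the table header; with the sample rows it only covers those seven entries.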

Person and email address to contact for further information:
Reuel Robrigado
IBM Globalization Centre of Competency, Toronto
8200 Warden Avenue, Markham, ON. L6G 1C7 Canada
email: reuelr@ca.ibm.com

Intended usage:   LIMITED USE
=======================================================================


Thanks and best regards,
Reuel Robrigado
IBM Globalization Center of Competency, Toronto
8200 Warden Avenue, Markham, ON. L6G 1C7
Tel (905) 413-4975; Tie 969; Fax (905) 413-4903; reuelr@ca.ibm.com


                                                                                                                                       
Harald Tveit Alvestrand <harald@alvestrand.no>
09/19/2002 10:28 PM

To:       Reuel Robrigado/Toronto/IBM@IBMCA, ietf-charsets@iana.org
cc:
Subject:  Re: Registration of new charset [IBM1047]
                                                                                                                                       



thank you.
as a mere matter of form - could you:

- insert the equivalence table into the registration form (the IANA process
  has so far not had to handle attachments)
- add the word NO to the "suitable for use in MIME text"
- change "intended usage" to the standard string LIMITED USE?

I'll forward this to IANA when that's done.

             Harald (apologies for being slow)

--On 10. september 2002 17:40 -0400 reuelr@ca.ibm.com wrote:

> It's been a little over 6 weeks since this request was sent and so far,
> there was only one comment coming from Markus Scherer
> <markus.scherer@jtcsv.com>, re: mapping table showing roundtrip/fallback
> information (available at
> http://oss.software.ibm.com/cvs/icu/charset/data/ucm/ibm-1047_p100-2000.ucm
> ).
>
> Note too that the "Suitability for use in MIME text" for this charset
> should be "NO".
>
> The 2-week feedback period has long passed, and I would like to request
> that this charset registration be finalized and considered complete.
>
> Thanks and best regards,
> Reuel Robrigado
> IBM Globalization Center of Competency, Toronto
> 8200 Warden Avenue, Markham, ON. L6G 1C7
> Tel (905) 413-4975; Tie 969; Fax (905) 413-4903; reuelr@ca.ibm.com
> ----- Forwarded by Reuel Robrigado/Toronto/IBM on 09/10/2002 04:24 PM -----
>
> Reuel Robrigado
> 07/25/2002 04:34 PM
>
> To:       ietf-charsets@iana.org
> cc:
> From:     Reuel Robrigado/Toronto/IBM@IBMCA
> Subject:  Registration of new charset [IBM1047]
>
>
> This is to request the registration of charset IBM1047, details of
> which are noted below:
>
> To: ietf-charsets@iana.org
> Subject: Registration of new charset [IBM1047]
>
> Charset name:     IBM1047
>
> Charset aliases:  cp1047
>                   IBM-1047
>
> Suitability for use in MIME text:
>
> Published specification(s):
> http://www-1.ibm.com/servers/eserver/iseries/software/globalization/pdf/cp01047z.pdf
>
> ISO 10646 equivalency table:  see attached file
> (See attached file: IBM1047-ISO10646-equivalency-table.txt)
>
> Additional information:
> Reuel Robrigado
> reuelr@ca.ibm.com
>
> Intended usage:
> Meant for limited use to meet specific requirements
>
> Thanks and regards,
> Reuel Robrigado
> IBM Globalization Center of Competency, Toronto
> 8200 Warden Avenue, Markham, ON. L6G 1C7
> Tel (905) 413-4975; Tie 969; Fax (905) 413-4903; reuelr@ca.ibm.com
>