
RE: Update of charset windows-1252, draft 2

To: Kent Karlsson <kent.karlsson14@comhem.se>,Erik van der Poel <erikv@google.com>
Subject: RE: Update of charset windows-1252, draft 2
From: Shawn Steele <Shawn.Steele@microsoft.com>
Date: Mon, 30 Oct 2006 11:09:32 -0800
Cc: Martin Duerst <duerst@it.aoyama.ac.jp>, ietf-charsets@iana.org,Mike Ksar <mikeksar@microsoft.com>
List-Id: <ietf-charsets.mail.apps.ietf.org>
References: <000e01c6fae6$8e507f20$6500a8c0@chalmers95a69n>

 
>> By the way, the windows-1255 charset has changed recently. See the
>>  mapping for 0xCA in:
http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1255.TXT
http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WindowsBestFit/bestfit1255.txt

Actually, the behavior of the Windows code page has not changed; the previous mapping for 0xCA is identical to the current mapping.  I'm guessing that since it wasn't a real code point, it was filtered out by whoever created CP1255.TXT (I wasn't here, so I'm not sure how it came to be :)).
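
For readers who want to check a difference like this themselves, here is a minimal sketch of comparing two mapping tables. It assumes each data line has the shape "0xCA<whitespace>0x05BA<whitespace>#comment", with the code point left out when a byte is undefined. The helper names (parse_table, diff_tables) and the sample lines are hypothetical and for illustration only; they are not copied from the files linked above, and the best-fit file may need additional handling.

```python
import re

# Assumed line shape: "0xCA<ws>0x05BA<ws>#comment". A byte with no
# code point after it is treated as undefined (None).
_LINE = re.compile(r"^(0x[0-9A-Fa-f]+)(?:\s+(0x[0-9A-Fa-f]+))?")


def parse_table(text):
    """Return {byte: code point or None} for each mapping line in text."""
    table = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop the trailing comment
        match = _LINE.match(line)
        if match:
            byte = int(match.group(1), 16)
            code_point = int(match.group(2), 16) if match.group(2) else None
            table[byte] = code_point
    return table


def diff_tables(old, new):
    """Return {byte: (old_cp, new_cp)} for bytes where the tables disagree."""
    return {
        byte: (old.get(byte), new.get(byte))
        for byte in sorted(set(old) | set(new))
        if old.get(byte) != new.get(byte)
    }


# Hypothetical sample lines, NOT copied from the published files.
OLD = "0xC9\t0x05B9\t#sample\n0xCA\t      \t#UNDEFINED\n"
NEW = "0xC9\t0x05B9\t#sample\n0xCA\t0x05BA\t#sample\n"

for byte, (before, after) in diff_tables(parse_table(OLD), parse_table(NEW)).items():
    def show(cp):
        return "undefined" if cp is None else f"U+{cp:04X}"
    print(f"0x{byte:02X}: {show(before)} -> {show(after)}")
```

A difference reported this way only shows that the two published files disagree; as noted above, it does not by itself mean the behavior of the code page changed.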

>> So we may want to update the out-of-date one (CP1255.TXT)

> I think that is for Microsoft to do. But it does make their stated
> policy of not updating any of their codepages less credible.
 
I'm not about to touch that data file, since I'm not sure how it was created.  Evidently, best-fit mappings and unassigned Unicode code points were filtered out.  (A few code pages also map to the PUA, but those mappings aren't in the older Unicode tables.)

> While doing that, I would suggest they fix the character name
> comments (both in the cp* files and in the bestfit* files) to
> align with Unicode 5.0. It is so much less confusing that way.

These are effectively our raw source files, and they were provided without any manipulation in order to avoid the risk of introducing a technical error.  It'd be nice if the comments were pretty, but as it is we can easily prove that the files are the same as the Windows tables.
 
- Shawn
 
Shawn Steele
Windows International
Microsoft

