
Re: Registering a charset alias

To: Shawn Steele <Shawn.Steele@microsoft.com>,Erik van der Poel <erikv@google.com>
Subject: Re: Registering a charset alias
From: Anne van Kesteren <annevk@opera.com>
Date: Wed, 19 Aug 2009 23:04:05 +0200
Cc: Markus Scherer <markus.icu@gmail.com>,Ira McDonald <blueroofmusic@gmail.com>, ietf-charsets <ietf-charsets@iana.org>
Organization: Opera Software ASA
User-Agent: Opera Mail/10.00 (Linux)

On Wed, 19 Aug 2009 22:35:43 +0200, Shawn Steele <Shawn.Steele@microsoft.com> wrote:
> I'm not sure they're easy to find, I stuck a list of aliases that .Net  
> uses at  
> http://blogs.msdn.com/shawnste/archive/2009/08/18/alternate-encoding-names-recognized-by-net-ie.aspx
>
> http://msdn.microsoft.com/en-us/library/system.text.encoding.getencodings.aspx  
> has a list of the names that .Net calls the various encodings (webname)

Very cool, thanks!

So, if I understand this data correctly, IE does not treat ISO-8859-1 and Windows-1252 the same? That is not my experience, but maybe I do not understand the concept of code pages well enough.
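
To make the question concrete, here is a minimal Python sketch of where the two encodings disagree (bytes 0x80-0x9F) and of what treating the label "iso-8859-1" as Windows-1252 would look like. The `LABEL_OVERRIDES` table and `decode_like_a_browser` are hypothetical names made up for illustration; they are not taken from IE, .NET, or any registry, and the behaviour shown is that of Python's codec tables only.

```python
# Bytes 0x80-0x9F are where ISO-8859-1 and windows-1252 disagree.
sample = b"\x93quoted\x94 \x80 100"

# Strict ISO-8859-1 maps these bytes to C1 control characters.
as_latin1 = sample.decode("iso-8859-1")

# windows-1252 maps the same bytes to curly quotes and the euro sign.
as_cp1252 = sample.decode("windows-1252")

# Hypothetical override table (an assumption for illustration): a client
# that treats the two labels the same decodes "iso-8859-1" content with
# the windows-1252 table.
LABEL_OVERRIDES = {"iso-8859-1": "windows-1252"}


def decode_like_a_browser(data: bytes, label: str) -> str:
    """Decode data, applying the hypothetical label override first."""
    name = label.strip().lower()
    return data.decode(LABEL_OVERRIDES.get(name, name))


print(repr(as_latin1))
print(repr(as_cp1252))
print(repr(decode_like_a_browser(sample, "ISO-8859-1")))
```

If a client behaves this way, a page labelled ISO-8859-1 that contains byte 0x93 shows a left curly quote rather than an invisible control character.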

> Note that IE's code page detection is pretty fixed and we're suggesting  
> use of UTF-8 for new content, it's unlikely that any additional aliases  
> would be added or changed in many significant ways.

Understood.

> I think most of our encodings don't lend themselves to the superset  
> concept.  There're probably variations for individual code points even  
> in closely related code pages.  GB18030 might be an exception there.
>
> I'd much rather have the community push for UTF encodings rather than  
> trying to do perfect detection of imperfect code pages.

I agree that we should get everyone to use UTF-8.

This effort is not about new content, however; it is about dealing with the vast amount of legacy data out there and allowing new and existing clients to handle that content properly without having to reverse engineer the market leader.

> Even when names  
> are identical there are still unique quirks of different systems with  
> various code pages.  Sometimes it's just a code point difference, other  
> times it's a bigger problem.

I do think it would help a lot if these quirks were publicly documented.
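
As a rough idea of what such documentation could be generated from, the sketch below tabulates every byte at which two single-byte decoders disagree. The function name `single_byte_differences` is made up for this example, and the result reflects Python's codec tables only, which are not necessarily identical to what IE or .NET ship.

```python
def single_byte_differences(codec_a: str, codec_b: str) -> dict:
    """Map each byte value to (char_a, char_b) where the two codecs differ.

    Bytes a codec leaves undefined decode to U+FFFD (errors="replace"),
    so "undefined in one, defined in the other" also counts as a difference.
    """
    diffs = {}
    for value in range(256):
        raw = bytes([value])
        a = raw.decode(codec_a, errors="replace")
        b = raw.decode(codec_b, errors="replace")
        if a != b:
            diffs[value] = (a, b)
    return diffs


diffs = single_byte_differences("iso-8859-1", "cp1252")
for value, (a, b) in sorted(diffs.items()):
    print(f"0x{value:02X}: U+{ord(a):04X} vs U+{ord(b):04X}")
```

The same comparison only works directly for single-byte encodings; multi-byte code pages would need to be compared over their valid byte sequences instead.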

-- 
Anne van Kesteren
http://annevankesteren.nl/
