
[OT] Re: Windows-1252 Best Fit tables.



At 03:54 06/10/27, Shawn Steele wrote:


>I've blogged about best fit at <http://blogs.msdn.com/shawnste/archive/2006/01/19/515047.aspx>


Very interesting, thanks for the pointer. Just one correction.
You write:
   ... the best plan is to use Unicode when possible, either UTF-8 or
   UTF-16 is usually a good choice.
and then later in the same paragraph:
   In those cases finding extensions or newer protocols that handle Unicode
   are good, but some, like e-mail headers [;)], we're stuck with.

This is not true. First, with the current email standard (RFC 2822),
you can put UTF-8 into headers the same way you can put Latin-1
into headers: by using RFC 2047 encoded-words. The results look terrible
(e.g. =?UTF-8?Q?Martin_D=C3=BCrst?= or =?iso-8859-1?Q?Martin_D=FCrst?=),
but they usually work.
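
To see what those encoded-words carry, here is a minimal sketch using
Python's standard email.header module. The helper names encode_rfc2047
and decode_rfc2047 are made up for this illustration, and the library
may pick either Q or B encoding for a given charset, so the wire form
it prints need not match the examples above byte for byte.

```python
from email.header import Header, decode_header


def encode_rfc2047(text, charset):
    """Encode text as RFC 2047 encoded-word(s) in the given charset."""
    return Header(text, charset).encode()


def decode_rfc2047(value):
    """Decode a header value that may contain RFC 2047 encoded-words."""
    parts = []
    for chunk, charset in decode_header(value):
        if isinstance(chunk, bytes):
            # Unencoded runs come back with charset None; treat them as ASCII.
            parts.append(chunk.decode(charset or "ascii"))
        else:
            parts.append(chunk)
    return "".join(parts)


name = "Martin D\u00fcrst"
for charset in ("utf-8", "iso-8859-1"):
    wire = encode_rfc2047(name, charset)
    print(charset, "->", wire, "->", decode_rfc2047(wire))
```

Both charsets decode back to the same name; only the bytes inside the
encoded-word differ (C3 BC for UTF-8, FC for Latin-1).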

Second, there is now an effort underway, the IETF EAI WG, to move
to raw UTF-8 email headers (as well as UTF-8 in SMTP, POP, IMAP,...).
The basics are very easy (much easier, of course, than RFC 2047);
the main problem is fallbacks for those email servers that may
not support this emerging standard for a while.
You can find more details at
   http://www.ietf.org/html.charters/eai-charter.html
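
As a rough illustration of the difference between raw UTF-8 headers
and the RFC 2047 fallback, the sketch below assumes a recent Python
whose email package provides the policy.SMTPUTF8 and policy.SMTP
policies. The address user@example.org and the helper name
serialize_from are placeholders invented for this example.

```python
from email import policy
from email.message import EmailMessage


def serialize_from(display, pol):
    """Build a message with only a From header and serialize it to bytes."""
    msg = EmailMessage(policy=pol)
    # user@example.org is a placeholder address, not a real mailbox.
    msg["From"] = f"{display} <user@example.org>"
    return msg.as_bytes()


name = "Martin D\u00fcrst"

# Raw UTF-8 header, for servers that accept it.
raw_utf8 = serialize_from(name, policy.SMTPUTF8)

# ASCII-only fallback: the same header as an RFC 2047 encoded-word.
legacy = serialize_from(name, policy.SMTP)

print(raw_utf8)
print(legacy)
```

The first form carries the name as plain UTF-8 bytes; the second is
what a sender would have to fall back to when the receiving server
does not support UTF-8 headers.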

Regards,     Martin.


#-#-#  Martin J. Du"rst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp       mailto:duerst@it.aoyama.ac.jp