FW: How-To handle i18n when you don't know charset?

From: Chris Wendt (christw@MICROSOFT.com)
Date: Tue Jul 11 2000 - 12:28:54 EDT

Speaking for the Windows versions:

All language versions of IE5 behave the same. The only difference in
behavior is the encoding of the base part of URLs which defaults to UTF-8
for all translations except the Traditional Chinese and Korean ones.

The initial default encoding for documents is the same as the Windows system
code page (GetACP()). As user chooses an encoding from the View.Encoding
menu, it becomes the new default. The default encoding for frames is the
encoding of the frameset document. Encountering a page with a charset tag
does not alter the default but it is indicated as the current encoding in
the View.Encoding menu. In other words the black dot in the Encoding menu
does NOT indicate the default encoding, it indicates the encoding currently
used by the browser - if framed, for the frameset page.

To restore the default charset setting to be like in a fresh install of
Explorer, delete the key HKEY_CURRENT_USER\Software\Microsoft\Internet
Explorer\International,Default_CodePage. The value of the key is the code
page number in hex.


-----Original Message-----
From: Alan Wood [mailto:alan.wood@context.co.uk]
Sent: Tuesday, July 11, 2000 8:36 AM
To: Unicode List
Subject: RE: How-To handle i18n when you don't know charset?

Mike Brown kindly supplied some JavaScript to determine the current and
default encoding for Internet Explorer 4+.

This gives some interesting results for default encoding:

Mac IE 4.5 - utf-8
Mac IE 5 - utf-8
Win IE 5.01 - x-user-defined
Win IE 5.01 SP1 - big5

Would anyone from Microsoft like to explain why a Chinese Traditional
encoding has been made the default for an English version of IE 5?

Alan Wood
(Documentation Writer / Web Master)
Context Limited
(Electronic publishers of UK and EU legal and official documents)
http://www.alanwood.net/ (Unicode, special characters, pesticide names)

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:05 EDT