RE: CP1252 under Unix

From: Robert A. Rosenberg (
Date: Thu Mar 30 2000 - 11:03:51 EST

At 05:16 AM 03/29/2000 -0800, Robert Brady wrote:
>On Tue, 28 Mar 2000, Robert A. Rosenberg wrote:
> > match MS's choice of mappings). For UNIX, just do this automatically even
> > if the HTML says ISO-8859-1 since there should never be any control
> > characters in that range and if the codepoints do occur then they are
>That would be acceptance of standards subversion. And it doesn't work.
>(Theres plenty of KOI8-R, ISO-8859-2, [insert other standard here]) tagged
>as ISO-8859-1, at least on usenet.

While I will accept your claim (of lots of misID'ed charsets) for the sake
of this discussion, I fail to see its relevance. If I create a
message/whatever in ISO-8859-2 but mark it as ISO-8859-1, the high-bit
xA0-xFF codepoint range will display incorrectly (you'll get the 8859-1
glyphs, not the expected 8859-2 ones). Treating any ISO-8859-1 claim as if
marked as Windows-1252 will CORRECTLY display all valid (x00-x7F+xA0-xFF
codepoints only) ISO-8859-1 content while still allowing for the use of the
extra glyphs that Windows-1252 assigns in the 32-position x80-x9F range (in
particular the Typographic Quotes). For the other misID'ed cases
that you refer to, you will get the same incorrect display as using
ISO-8859-1 so you have no worse a display by treating ISO-8859-1 as

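The argument above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name is hypothetical, and the fallback for the five byte values that Python's cp1252 codec leaves unassigned (x81, x8D, x8F, x90, x9D) is one possible choice, not something the discussion specifies.

```python
def decode_latin1_as_cp1252(data: bytes) -> str:
    """Decode bytes labelled ISO-8859-1 as if they were Windows-1252.

    Hypothetical helper for illustration only.
    """
    out = []
    for value in data:
        byte = bytes([value])
        try:
            out.append(byte.decode("cp1252"))
        except UnicodeDecodeError:
            # Python's cp1252 codec has no mapping for x81, x8D, x8F, x90, x9D.
            # Assumption: fall back to Latin-1, which yields a C1 control.
            out.append(byte.decode("latin-1"))
    return "".join(out)


# Typographic quotes (x93/x94) around text that also uses xE9 (e-acute).
sample = b"\x93caf\xe9\x94"
print(decode_latin1_as_cp1252(sample))   # quotes become U+201C / U+201D
print(repr(sample.decode("latin-1")))    # quotes become C1 controls instead
```

Bytes in x00-x7F and xA0-xFF decode identically either way, so valid ISO-8859-1 content is unaffected, and ISO-8859-2 or KOI8-R text mislabelled as ISO-8859-1 comes out equally wrong in the xA0-xFF range under both decodings.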

