Re: U+xxxx, U-xxxxxx, and the basics

From: Erik van der Poel (erik@netscape.com)
Date: Wed Mar 08 2000 - 15:45:26 EST


Mike Brown wrote:
>
> Keld wrote:
> >
> > iso/iec 8859-1 does not include all of us-ascii, as the
> > controls 0-31 and 127 are not in this standard. The IETF
> > charset iso-8859-1 includes both the us-ascii controls
> > and the C1 control characters of ISO/IEC 6429.
>
> Wow, thanks for that clarification. I didn't realize IETF's charset and the
> ISO standard were different. I'll note that change for the next revision of
> the materials.

By the way, if you are interested in the real world, some popular pieces
of software such as Netscape Navigator and Communicator treat incoming
text labelled as "iso-8859-1" as though it were "windows-1252", which
normally has the usual suspects in the C0 range (0x00 - 0x1F) and Delete
(0x7F), ASCII characters in the 0x20 - 0x7E range (0x20 is Space), and
ISO 8859-1 characters in the 0xA0 - 0xFF range, but it has graphic
(non-control) characters in the 0x80 - 0x9F range (including the euro,
"smart quotes", etc).

Erik



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:59 EDT