Strange characters in the Collation Charts

From: Jukka K. Korpela (
Date: Thu Nov 17 2005 - 13:51:48 CST

  • Next message: Neil Harris: "Re: Hebrew script in IDN (was Exemplar Characters)"

    Looking at the handly Collation Charts at
    I was surprised when I looked at the "Null" part there, which is
    addressable as

    Viewed on IE 6, it seems to contain characters like the euro sign,
    some punctuation marks, and some letters in positions U+0080 through
    U+009F. Apparently, these positions are reserved for control characters
    in Unicode, not assigned to some printable characters as in windows-1252.

    The source code contains character references like €, which are
    undefined according to HTML specifications. It seems that the chart has
    been programmatically generated without handling some special cases
    as they would need to be handled. I'm afraid people might get rather
    confused with this.

    Jukka "Yucca" Korpela,

    This archive was generated by hypermail 2.1.5 : Thu Nov 17 2005 - 13:54:15 CST