Re: Is there a term for strictly-just-this-encoding-and-not-really-that-encoding?

From: Tim Greenwood (timothy@greenwood.name)
Date: Wed Nov 10 2010 - 18:09:42 CST

  • Next message: Jim Monty: "Re: Is there a term for strictly-just-this-encoding-and-not-really-that-encoding?"

    Even more interesting is Windows 1252 and ISO8859-15 where the former is a
    repertoire superset of the latter for the graphic characters, but not an
    encoding superset.

    On Wed, Nov 10, 2010 at 5:53 PM, Kenneth Whistler <kenw@sybase.com> wrote:

    > Mark Davis wrote:
    >
    > > What are also tricky are the 'almost' supersets, where there are only a
    > few
    > > different characters. Those definitely cause problems because the
    > difference
    > > in data is almost undetectable.
    >
    > For example, Mark is referring to cases such as ISO 8859-1 and 8859-15.
    >
    > Those share all the same encoded characters except those at
    > the code points 0xA4, 0xA6, 0xA8, 0xB4, 0xB8, and 0xBC..0xBE.
    >
    > So neither of the repertoires is a proper subset of the other,
    > but the two coded character sets share the vast majority
    > of their characters, including almost all of the common ones.
    >
    > --Ken
    >
    >
    >



    This archive was generated by hypermail 2.1.5 : Wed Nov 10 2010 - 18:13:09 CST