Re: Surrogate pairs and UTF-8

From: Otto Stolz (Otto.Stolz@uni-konstanz.de)
Date: Thu Jun 22 2006 - 03:25:35 CDT

  • Next message: Raymond Mercier: "Baskerville type"

    Labdien, Pavils Jurjans,

    you have asked:
    > Can you give an example of some
    > surrogate pairs, and how do their respective character look like?

    See the table under
    <http://www.systems.uni-konstanz.de/Otto/Vortrag/Charset/Unicode-Grundlagen.html#U-UTF>
    which is from an introductory lecture, in German. Note that I have replaced the
    Quarter Rest character with an in-line picture, because fonts containing that character
    are not installed, on most computers -- but with suitable fonts installed,
    the encoded strings presented in this example would be rendered as depicted
    in the 1st row of the table.

    A few screens down, there is a similar table for the encoding schemes
    (i. e. taking the byte order of longer encoding units into account):
    <http://www.systems.uni-konstanz.de/Otto/Vortrag/Charset/Unicode-Grundlagen.html#UB-endian>

    Still further down, there is the same example with a BOM:
    <http://www.systems.uni-konstanz.de/Otto/Vortrag/Charset/Unicode-Grundlagen.html#UB-BOM>.

    Best wishes,
       Otto Stolz



    This archive was generated by hypermail 2.1.5 : Thu Jun 22 2006 - 03:57:52 CDT