RE: Newbie questions: 1) Surrogates in WinXP? 2) Unicode in PostS cript?

From: Rick Cameron (Rick.Cameron@businessobjects.com)
Date: Wed Apr 07 2004 - 15:11:28 EDT

  • Next message: Peter Constable: "RE: Doulos SIL (was: French typographic thin space)"

    > -----Original Message-----
    > From: unicode-bounce@unicode.org
    > [mailto:unicode-bounce@unicode.org] On Behalf Of Peter Constable
    > Sent: April 7, 2004 0:52
    > To: Unicode Mailing List
    > Subject: RE: Newbie questions: 1) Surrogates in WinXP? 2)
    > Unicode in PostScript?
    >
    >
    > In the .Net Framework, the string class (System
    > namespace) and the System.Globalization and System.Text
    > namespaces *are* designed to be aware of supplementary plane
    > characters.
    >
    >
    > Peter Constable
    >

    Hi, Peter

    IMHO, that's a bit misleading. The String class itself does not appear to be
    aware of SMP characters. It clearly uses UTF-16, and the length it reports
    is the number of code units, not the number of characters or graphemes in
    the string.

    The StringInfo class, however, does provide support for working with
    complete graphemes.

    Does anyone know of a String class, in C++, Java or .NET, that hides the
    actual encoding used, and provides a public API based on code points?

    Cheers

    - rick



    This archive was generated by hypermail 2.1.5 : Wed Apr 07 2004 - 16:07:39 EDT