RE: Newbie questions: 1) Surrogates in WinXP? 2) Unicode in PostScript?

From: Asmus Freytag (
Date: Thu Apr 08 2004 - 02:32:25 EDT

  • Next message: "CJK U+3ADA and U+66F6"

    At 10:49 PM 4/7/2004, Peter Constable wrote:
    > >, and the length it reports
    > > is the number of code units, not the number of characters or graphemes
    > > the string.
    >True; that is documented.

    However, that's very common; many APIs relating to UTF-8 would report
    the number of bytes, not the number of characters.

    While it's interesting to have a method that can derive grapheme
    boundaries, e.g. for UI support, it's far less useful to get a grapheme count.


    This archive was generated by hypermail 2.1.5 : Thu Apr 08 2004 - 03:25:07 EDT