Re: Looking for a C library that converts UTF-8 strings from their decomposed to pre-composed form

From: Eric Muller (emuller@adobe.com)
Date: Mon Nov 08 2004 - 23:28:48 CST


    Deborah Goldsmith wrote:

    > It's worth pointing out that there is no such thing as "precomposed
    > Unicode". Normalization form C (NFC) could be called "as precomposed
    > as possible." There are some sequences of Unicode that can only be
    > expressed using combining marks.
    >
    There are also single (precomposed) characters whose NFC form is a
    sequence of more than one character. So NFC is not even "as precomposed
    as possible".
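
Both points can be checked with a short sketch. It is in Python rather than C only because the standard `unicodedata` module makes it self-contained. The helper `nfc_codepoints` and the sample characters are illustrative choices, not taken from the thread: U+0958 DEVANAGARI LETTER QA is a composition exclusion, and q with an acute accent has no precomposed code point.

```python
import unicodedata

def nfc_codepoints(s):
    """Return the NFC form of s as a list of U+XXXX strings (hypothetical helper)."""
    return [f"U+{ord(c):04X}" for c in unicodedata.normalize("NFC", s)]

# A single precomposed character whose NFC form is a two-character sequence:
# U+0958 DEVANAGARI LETTER QA is excluded from composition.
qa = nfc_codepoints("\u0958")

# A sequence with no precomposed equivalent: q + COMBINING ACUTE ACCENT.
q_acute = nfc_codepoints("q\u0301")

# For contrast, e + COMBINING ACUTE ACCENT does compose into one character.
e_acute = nfc_codepoints("e\u0301")

print(qa)       # ['U+0915', 'U+093C']
print(q_acute)  # ['U+0071', 'U+0301']
print(e_acute)  # ['U+00E9']
```

The first result is longer after NFC than before, and the second stays decomposed, which is why neither "precomposed" nor "as precomposed as possible" describes NFC exactly.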

    Eric.


