Looking for a C library that converts UTF-8 strings from their decomposed to pre-composed form

From: Tay, William (William.Tay@xerox.com)
Date: Mon Nov 08 2004 - 11:21:51 CST

  • Next message: Edward H. Trager: "Re: not font designers?"

    Hi,

    It seems that accented characters generated in MacOS X are represented in UTF-8 decomposed form, e.g. the character ¨¦ is represented as 65 cc 81, instead of c3 a9 (the pre-composed form), and the character ¥º is represented as e3 82 b9 e3 82 99 instead of e3 82 ba. My Solaris application needs to process these characters that are generated from MacOS X.

    Is there any C library available that converts the decomposed UTF-8 byte streams into the pre-composed equivalent?

    Thanks

    Will
        



    This archive was generated by hypermail 2.1.5 : Mon Nov 08 2004 - 11:24:33 CST