Looking for a C library that converts UTF-8 strings from their decomposed to pre-composed form

From: Tay, William (William.Tay@xerox.com)
Date: Mon Nov 08 2004 - 11:21:51 CST


    It seems that accented characters generated on MacOS X are represented in decomposed UTF-8 form. For example, the character é is represented as 65 cc 81 (e followed by the combining acute accent U+0301) instead of c3 a9 (the pre-composed form, U+00E9), and the character ズ is represented as e3 82 b9 e3 82 99 (U+30B9 followed by the combining voiced sound mark U+3099) instead of e3 82 ba (U+30BA). My Solaris application needs to process these characters generated on MacOS X.
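To make the byte sequences above easier to check, here is a minimal sketch of a UTF-8 decoder that turns each sequence into its code point. The function name `utf8_decode` is hypothetical and not part of any library; the sketch assumes mostly well-formed input and does not reject overlong forms or surrogates.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: decode one UTF-8 sequence starting at s, with n
 * bytes available. Stores the code point in *cp and returns the number
 * of bytes consumed, or 0 if the input is truncated or malformed.
 * Overlong forms and surrogates are NOT rejected in this sketch. */
size_t utf8_decode(const unsigned char *s, size_t n, uint32_t *cp)
{
    size_t len, i;
    uint32_t c;

    assert(s != NULL && cp != NULL);
    if (n == 0)
        return 0;

    c = s[0];
    if (c < 0x80) {                    /* 0xxxxxxx: ASCII */
        len = 1;
    } else if ((c & 0xE0) == 0xC0) {   /* 110xxxxx: 2-byte sequence */
        len = 2; c &= 0x1F;
    } else if ((c & 0xF0) == 0xE0) {   /* 1110xxxx: 3-byte sequence */
        len = 3; c &= 0x0F;
    } else if ((c & 0xF8) == 0xF0) {   /* 11110xxx: 4-byte sequence */
        len = 4; c &= 0x07;
    } else {
        return 0;                      /* stray continuation or invalid lead byte */
    }
    if (len > n)
        return 0;                      /* truncated sequence */

    for (i = 1; i < len; i++) {
        if ((s[i] & 0xC0) != 0x80)     /* continuation bytes must be 10xxxxxx */
            return 0;
        c = (c << 6) | (uint32_t)(s[i] & 0x3F);
    }
    *cp = c;
    return len;
}
```

Calling it on 65 cc 81 yields U+0065 and then U+0301, while c3 a9 yields the single code point U+00E9, which is the difference between the decomposed and pre-composed forms described above.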

    Is there any C library available that converts the decomposed UTF-8 byte streams into the pre-composed equivalent?
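To illustrate the kind of conversion being asked for, here is a toy sketch that rewrites only the two decomposed sequences quoted in this message into their pre-composed bytes. It is not a normalization library: the table, the struct `compose_pair`, and the function `compose_known_pairs` are all hypothetical names made up for this example. Full canonical composition (NFC, as defined in Unicode Standard Annex #15) needs the complete Unicode composition data plus canonical reordering of combining marks, which this sketch does not attempt.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical two-entry table: decomposed UTF-8 bytes and their
 * pre-composed equivalents, taken from the examples in the message. */
struct compose_pair {
    const char *decomposed;
    const char *composed;
};

static const struct compose_pair pairs[] = {
    { "\x65\xCC\x81",             "\xC3\xA9"     }, /* e + U+0301 -> U+00E9 */
    { "\xE3\x82\xB9\xE3\x82\x99", "\xE3\x82\xBA" }, /* U+30B9 + U+3099 -> U+30BA */
};

/* Copy n bytes of src into dst, replacing any known decomposed sequence
 * with its pre-composed form. Returns the number of bytes written, or
 * (size_t)-1 if dst (capacity cap) is too small. The output is never
 * longer than the input, so cap >= n is always sufficient. */
size_t compose_known_pairs(const char *src, size_t n, char *dst, size_t cap)
{
    size_t in = 0, out = 0, i;

    assert(src != NULL && dst != NULL);
    while (in < n) {
        const char *rep = src + in;   /* default: copy one byte unchanged */
        size_t rep_len = 1, used = 1;

        for (i = 0; i < sizeof pairs / sizeof pairs[0]; i++) {
            size_t dlen = strlen(pairs[i].decomposed);
            if (dlen <= n - in &&
                memcmp(src + in, pairs[i].decomposed, dlen) == 0) {
                rep = pairs[i].composed;
                rep_len = strlen(rep);
                used = dlen;
                break;
            }
        }
        if (rep_len > cap - out)
            return (size_t)-1;        /* not enough room in dst */
        memcpy(dst + out, rep, rep_len);
        out += rep_len;
        in += used;
    }
    return out;
}
```

Matching on raw bytes is safe here because both table entries begin with a byte that can only start a character (an ASCII byte or a lead byte), so a match can never begin in the middle of another UTF-8 sequence. A real solution would replace the two-entry table with the full composition rules from the Unicode Character Database.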


