From: Tay, William (William.Tay@xerox.com)
Date: Mon Nov 08 2004 - 11:21:51 CST
It seems that accented characters generated in MacOS X are represented in UTF-8 decomposed form, e.g. the character ¨¦ is represented as 65 cc 81, instead of c3 a9 (the pre-composed form), and the character ¥º is represented as e3 82 b9 e3 82 99 instead of e3 82 ba. My Solaris application needs to process these characters that are generated from MacOS X.
Is there any C library available that converts the decomposed UTF-8 byte streams into the pre-composed equivalent?
This archive was generated by hypermail 2.1.5 : Mon Nov 08 2004 - 11:24:33 CST