Our company produces a product that addresses your problem, including all
the issues mentioned by John Jenkins below. We call it our
Chinese-to-Chinese Script Converter, or C2C for short.
In particular it does not only code-point conversion but also orthographic
and lexemic conversions, based on a set of cross-idiom dictionaries and word
identification in streams of Chinese text. It is fully Unicode based
internally, although conversion to and from other character sets is also
You can read more about it at
http://www.basistech.com/products/Chinese-Converter.html, or contact me
directly for more information.
Director of Product Management
Basis Technology Corp.
One Kendall Square
Cambridge, MA 02139
From: John H. Jenkins [mailto:firstname.lastname@example.org]
Sent: Tuesday, May 01, 2001 2:54 PM
To: Magda Danish (Unicode); email@example.com
Subject: Re: FW: chinese conversion tables
At 11:21 AM -0700 5/1/01, Magda Danish (Unicode) wrote:
>From: Michal Gerling [mailto:firstname.lastname@example.org]
>Sent: Tuesday, May 01, 2001 7:24 AM
>Subject: chinese conversion tables
>I am working with UNICODE and the CJK market and need to know: Is there
>any one table or formula for moving from simplified to traditional
>characters and back in UNICODE? thank you very much for your help!
Partial data to interconvert between simplified and traditional
characters is available through the Unihan database. However, the
problem is not a simple one, as there are frequently multiple
traditional forms that correspond to a single simplified form.
Moreover, the vocabulary used in the PRC with simplified characters
differs on occasion from the vocabulary used in Taiwan and elsewhere
for traditional ones (e.g., the names of the chemical elements, until
recently the word for "computer"). It really isn't possible to
convert between simplified and traditional characters without doing a
-- ===== John H. Jenkins email@example.com firstname.lastname@example.org http://homepage.mac.com/jenkins/
This archive was generated by hypermail 2.1.2 : Fri Jul 06 2001 - 00:18:16 EDT