If there were a simple mapping between simplified and traditional Chinese characters, this would be treated somewhat like case-folding (see http://www.unicode.org/unicode/reports/tr21/). My understanding, however, is that the mapping is not algorithmic -- that it essentially takes parsing plus a dictionary look-up to do a good job. I believe that Basis Technology offers a package for doing that. Can anyone else comment?
Michael Friedman wrote:
> Q: How can I normalize Unicode text so that equivalent Simplified and Traditional Chinese characters compare equal? How can I normalize the text so that equivalent Simplified and Traditional Chinese characters do not compare equal? --- I actually don't know the answer to this. UTR #15 doesn't seem to cover this. Mike
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:02 EDT