Ideograph Variants-What They Are and How To Handle Them
Qin Lu - The Hong Kong Polytechnic University
Ideographic variants have been created and used since the start of the ideographic characters for many different reasons, and different people have different understanding of variants. Some variants are created purely for different calligraphy style, some are due to extensions in meanings. Variants are very confusing and ideograph variants have been causing problems in computerization of Chinese information. Unicode in its early stage has a lot of problem with variants and a set of unification rules were developed to avoid giving unnecessary code points for variants. However, there are still a lack of systematic study and handling on CJK variant handling.
This paper will first give the definition of Chinese variants, they discusses the issues in variant representation. It further discusses the feasibility of using the variant symbols introduced in Unicode fort the representation of variants. It then goes through the character decomposition scheme for an algorithmic way to extract variant information of different characters.
Keywords: Character decomposition, ideograph variants, new applications
|When the world wants to talk, it speaks Unicode|
International Unicode Conferences are organized by Global Meeting Services, Inc., (GMS).
GMS is pleased to be able to offer the International Unicode Conferences under an exclusive
license granted by the Unicode Consortium. All responsibility for conference finances and
operations is borne by GMS. The independent conference board serves solely at the pleasure
of GMS and is composed of volunteers active in Unicode and in international software
development. All inquiries regarding International Unicode Conferences should be addressed
Unicode and the Unicode logo are registered trademarks of Unicode, Inc. Used with permission.
14 January 2002, Webmaster