UnicodeIUC21
Program Showcase Registration Accommodation Travel Sponsors
Unicode Standard Conference Board Conference CD Last Conference Past Conferences Next Conference
Abstract

Ideograph Variants-What They Are and How To Handle Them

Qin Lu - The Hong Kong Polytechnic University

Intended Audience: Managers, Software Engineers, Systems Analysts
Session Level: Beginner, Intermediate

Ideograph variants have been created and used since ideographic characters first appeared, for many different reasons, and different people understand the term differently. Some variants arise purely from differences in calligraphic style; others result from extensions in meaning. Variants are a frequent source of confusion and have caused problems in the computerization of Chinese information. In its early stages, Unicode had many problems with variants, and a set of unification rules was developed to avoid assigning unnecessary code points to them. However, systematic study and handling of CJK variants is still lacking.

This paper first defines Chinese variants and then discusses the issues in variant representation. It further discusses the feasibility of using the variant symbols introduced in Unicode for the representation of variants. Finally, it goes through a character decomposition scheme that provides an algorithmic way to extract variant information for different characters.
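
As a rough illustration of the decomposition idea, the following Python sketch groups characters whose component sequences become identical once component-level variants are normalized. It is not the paper's scheme: the table DECOMPOSITION, the list COMPONENT_VARIANTS, and all function names are hypothetical, and the few entries are hand-picked for the example rather than taken from any published decomposition database.

```python
# Hypothetical decomposition table: character -> tuple of components.
# The entries are illustrative only, not taken from a real database.
DECOMPOSITION = {
    "說": ("言", "兌"),
    "説": ("言", "兑"),
    "悅": ("忄", "兌"),
    "悦": ("忄", "兑"),
    "語": ("言", "吾"),
}

# Hypothetical classes of components treated as variants of each other.
COMPONENT_VARIANTS = [{"兌", "兑"}]


def canonical_component(component):
    """Map a component to one representative of its variant class."""
    for group in COMPONENT_VARIANTS:
        if component in group:
            return min(group)  # any deterministic choice works
    return component


def variant_key(ch):
    """Component sequence of ch with component variants normalized."""
    components = DECOMPOSITION.get(ch, (ch,))
    return tuple(canonical_component(c) for c in components)


def group_variants(chars):
    """Return groups of characters that share the same normalized key."""
    groups = {}
    for ch in chars:
        groups.setdefault(variant_key(ch), []).append(ch)
    return [sorted(g) for g in groups.values() if len(g) > 1]


if __name__ == "__main__":
    for group in group_variants(DECOMPOSITION):
        print(" / ".join(f"{ch} (U+{ord(ch):04X})" for ch in group))
```

In this sketch 說 and 説 fall into one group, and 悅 and 悦 into another, because each pair differs only in a component listed as a variant, while 語 stays ungrouped. A real system would need a much fuller decomposition table and a way to tell which component differences are variants and which change the character's identity.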

Keywords: Character decomposition, ideograph variants, new applications


International Unicode Conferences are organized by Global Meeting Services, Inc., (GMS). GMS is pleased to be able to offer the International Unicode Conferences under an exclusive license granted by the Unicode Consortium. All responsibility for conference finances and operations is borne by GMS. The independent conference board serves solely at the pleasure of GMS and is composed of volunteers active in Unicode and in international software development. All inquiries regarding International Unicode Conferences should be addressed to info@global-conference.com.

Unicode and the Unicode logo are registered trademarks of Unicode, Inc. Used with permission.

14 January 2002, Webmaster