Unihan : Traditional characters having two simplified equivalents

From: koxinga (koxinga@wanadoo.fr)
Date: Mon Dec 29 2008 - 08:57:36 CST

Next message: John H. Jenkins: "Re: Zhuang Han (was: Re: Emoji survey)"
Previous message: vunzndi@vfemail.net: "Re: Old Hungarian (was: Zhuang Han was: Re: Emoji survey)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

Hi !

I am wondering how the Unicode Consortium selected the traditional-simplified pairs. Is there somewhere a detailed explanation of how it was done ?

More precisely, I don't understand the rationales behind the seven "one traditional character-two simplified characters" links.

* 瀋: 沈, 渖
* 畫: 划, 画
* 鍾: 钟, 锺
* 靦: 腼, 䩄 (U+4A44)
* 餘: 余, 馀
* 鯰: 鲇, 鲶
* 鹼: 硷, 碱

According to what I found :
* "餘: 余, 馀" exists because of possible ambiguities between 餘 and 余, which already existed in traditional chinese.
* "瀋: 沈, 渖" and "鍾: 钟, 锺" exist because of conflicting general rules (審 -> 审 and 釒 -> 钅) and specific simplifications.
* the other four are links to synonymous characters or old variants. For example "鯰: 鲇, 鲶". 鯰 -> 鲶 and 鮎 -> 鲇 are ok, it is the general rule. 鯰 and 鮎 were synonym, but is it a reason to add 鯰 -> 鲇 (and why 鮎 -> 鲶 doesn't exist ?).

My 新华字典 gives me :
* 瀋: 沈
* 畫: 画
* 鍾: 钟
* 餘: 余, 馀 (with a note explaining why)
* 鹼: 硷
It does not write the simplification following general rules so these two are implied :
* 靦: 䩄 (U+4A44)
* 鯰: 鲶

Are these exceptions in Unicode because of older character set compatibilities ? Are there some specific reasons ? Are these reasons explained somewhere ?

thanks,

Koxinga

Next message: John H. Jenkins: "Re: Zhuang Han (was: Re: Emoji survey)"
Previous message: vunzndi@vfemail.net: "Re: Old Hungarian (was: Zhuang Han was: Re: Emoji survey)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.5 : Fri Jan 02 2009 - 15:33:07 CST