Japanese shinjitai/kyuujitai mappings in Unihan

From: Leonardo Boiko <leoboiko_at_gmail.com>
Date: Wed, 17 Aug 2011 23:05:22 -0300


There are lots of variant fields mapping characters in the Unihan
database. Do any of the fields (or a combination) reproduce the
Japanese kyūjitai↔shinjitai (“old character forms”/“new character
forms”) mappings, as present in the standard Jōyō Kanji-hyō table[1]?

I looked around but failed to find documentation about this. I’ve
tried extracting data from the Jōyō PDF[2], and from a quick look, it
seems these pairs might be present in Unihan in either direction as
one or more of kTraditionalVariant, kZVariant, kSemanticVariant,
kSpecializedSemanticVariant, or kCompatibilityVariant. 22 pairs
apparently aren’t present at all (namely the shinjitai characters 壱 艶
穏 巻 挙 郷 勲 鶏 研 県 鉱 渋 嬢 挿 対 滝 闘 弐 悩 錬 and two of the three 弁 mappings);
but I double-checked my list and they’re all in the Jōyō list.

If the shinjitai/kyūjitai information isn’t available in Unicode,
would it considered relevant enough to add? I don’t know what I’m
doing, so please forgive me if I’m missing something obvious.

[1] http://www.bunka.go.jp/kokugo_nihongo/jyoyokanji_kokuji.html
[2] http://namakajiri.net/kanjigen/shinjitai.txt
Received on Wed Aug 17 2011 - 21:07:32 CDT

This archive was generated by hypermail 2.2.0 : Wed Aug 17 2011 - 21:07:32 CDT