L2/08-281 Date/Time: Sat Aug 2 00:01:40 CDT 2008 Contact: eyedunno11@hotmail.com Name: Joshua J. Smith Report Type: Error Report Opt Subject: Errors/Inconsistencies in Unihan Database I downloaded an Excel spreadsheet of the Unihan Database, and in an attempt to convert the "kJapaneseOn" section to kana, I encountered the following errors: ------------------------------------------ U+5AD9 SEB should probably be SEN, based on similar characters, but I don't have Morohashi's Dai Kanwa Jiten to consult on this U+5A83 JYUU should be JUU (JYUU is not correct in either Hepburn or Kunrei romanization) U+7E98 SAB should be SAN U+913C SAB should be SAN U+59A0 NANDNON should be checked in Morohashi, if available U+6F6C DDAN should be DAN U+7AE8 DYOU should be JOU (DYOU might be correct as ヂョウ, but modern dictionaries list it as ジョウ) U+50E0 FAN - Maybe HAN or BAN, but this should be checked out in Morohashi, if available U+8A22 FGIN should be GIN, and KI should probably not be there (though again, I don't have a Morohashi, so it might be worth doublechecking) U+6945 HYKU should be HYOKU U+6E62 HYKI should be HIKI U+809C CHN should probably not even be there (I'd guess it's a mistaken input from the Mandarin section, but I'm not sure) U+95DE KN should be KAN U+562B DAM should be checked in Morohashi, if available U+829B SHUTS - my dictionary lists only I for this one, and it should probably be doublechecked in Morohashi U+901A TS UTSU should be TSUU TSU U+6733 HATCHI should be HACHI, and HECHI is also apparently possible ------------------------------------------ In addition, Hepburn romanization seems to dominate in these entries, but there are some exceptions where Kunrei romanization is found. These include: ------------------------------------------ U+528B SYOU should maybe be SHOU U+5435 SYOU should maybe be SHOU U+9443 TYOU should maybe be CHOU, and JOU is also apparently possible U+5281 SYOU should maybe be SHOU, but I couldn't find this particular kanji [MANY ENTRIES] HU is scattered throughout the kJapaneseOn and kJapaneseKun sections, but proper Hepburn romanization is FU ------------------------------------------ Thanks for providing a great resource (though an offline searchable version of the Unihan Database would be even better), and hopefully these corrections are of some assistance. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- Date/Time: Sat Aug 2 16:29:19 CDT 2008 Contact: eyedunno11@hotmail.com Name: Joshua J. Smith Report Type: Error Report Opt Subject: Further Unihan Database corrections (mostly kJapaneseKun) I did a semi-automated kana conversion on the kJapaneseKun section of my Unihan Database Excel file today (it was more difficult than kJapaneseOn because ZU is used for both ず and づ and JI is used for both じ and ぢ) and found the following errors: ------------------------------------------ U+55E2 MUSEBBU should be MUSEBU U+544E FIUTO should be FIITO (English "feet", as in the measurement unit) U+5B68 MINASHGO should be MINASHIGO U+8E89 HASHKE should be checked in Morohashi, if available U+5924 NOBIRY should be NOBIRU U+5E2E TASKERU should be TASUKERU U+572E KUTSUGAYERU should be KUTSUGAERU U+6AF0 INUENJIYU should be INUENJU U+8961 NAGAJIYUBAN should be NAGAJUBAN U+5A16 TSUTUSMU should be TSUTSUMU U+9F16 TSUTUMIUTSU should be TSUZUMIUTSU (鼓打つ・つづみうつ) U+7FEC TIBU should be TOBU U+8347 HANAJIYUNSAI should be HANAJUNSAI U+8395 HANAJIYUNSAI should be HANAJUNSAI ------------------------------------------ I also found the following inconsistencies with Hepburn romanization: ------------------------------------------ U+54F0 HAKKIRISINAI should maybe be HAKKIRISHINAI U+8E31 HADASI should maybe be HADASHI [MANY ENTRIES] As with kJapaneseOn, HU seems to be used at random where only FU should be used ------------------------------------------ Finally, I checked several more sources on yesterday's corrections for kJapaneseOn and found a few more results on one character: ------------------------------------------ U+829B (芛) /Kanjigen/ lists I as the only reading. However, /Shin-Kangorin/ lists ITSU and ICHI, and Microsoft's IME-Pad interface lists only YUI. Only /Kanjigen/ accounts for one of the listed readings (I SHUN SHUTS JUCHI). Hopefully this information is of some value, and in any case, SHUTS should be changed (if only to SHUTSU). ------------------------------------------ (End of Report)