From: Mark Davis (firstname.lastname@example.org)
Date: Fri Jan 20 2006 - 10:05:10 CST
A couple of things. Look at the extracted files for the numeric info,
since they also contain the CJK.
Secondly, where letters are given numeric value as part of a traditional
numbering system (such as Greek or Hebrew systems), those values are not
marked. If you are interested in non-decimal systems, I'd suggest
consulting Georges Ifrah's book for background information.
Tom Emerson wrote:
>Kit Peters writes:
>>As I mentioned in an earlier post, I am investigating the parsing of
>>non-Western numerals. An example of non-Western numerals would certainly be
>>kanji, but in looking through the 4.0 UnicodeData.txt, I see no entries for
>>the kanji (Juu, Roku, Hachi, Hyaku). Why is this?
>The Unified Ideographs are documented in Unihan.txt, not UnicodeData.txt.
>Ideographs with numeric uses will have kPrimaryNumeric,
>kAccountingNumeric, or kOtherNumeric values.
This archive was generated by hypermail 2.1.5 : Fri Jan 20 2006 - 10:09:02 CST