No, there are no semantic markers associated with the Han numbers: I was
told that the committee decided against assigning any such marker to the
ideographs. However, the following table may be useful to you:
http://cymru.basistech.com/~tree/numbers.html
HTH,
-tree
-- Tom Emerson Basis Technology Corp. Language Hacker http://www.basistech.com "Beware the lollipop of mediocrity: lick it once and you suck forever"-----Original Message----- From: John D. Burger [mailto:john@mitre.org] Sent: Thursday, January 06, 2000 14:46 To: Unicode List Subject: General Category for CJK ranges?
Hello -
I am a computational linguist currently working with some Chinese text. Is there anything in the Unicode Database that indicates the semantic category of CJK characters, at a minimum numeric versus non-numeric? The version I examined [1] seems to indicate that all characters in the ranges U+3400 - U+4DB5 and U+4E00 - U+9FA5 are of category Lo (letter other).
[1] ftp://ftp.unicode.org/Public/UNIDATA/UnicodeData.html
Thanks for any information you can provide.
- John Burger john@mitre.org
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:58 EDT