RE: General Category for CJK ranges?

From: Tom Emerson (
Date: Thu Jan 06 2000 - 16:39:32 EST

No, there are no semantic markers associated with the Han numbers: I was
told that the committee decided against assigning any such marker to the
ideographs. However, the following table may be useful to you:



Tom Emerson                                          Basis Technology Corp.
Language Hacker                          
  "Beware the lollipop of mediocrity: lick it once and you suck forever"

-----Original Message----- From: John D. Burger [] Sent: Thursday, January 06, 2000 14:46 To: Unicode List Subject: General Category for CJK ranges?

Hello -

I am a computational linguist currently working with some Chinese text. Is there anything in the Unicode Database that indicates the semantic category of CJK characters, at a minimum numeric versus non-numeric? The version I examined [1] seems to indicate that all characters in the ranges U+3400 - U+4DB5 and U+4E00 - U+9FA5 are of category Lo (letter other).


Thanks for any information you can provide.

- John Burger

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:58 EDT