U+3007 Ideographic 0

From: Tom Emerson (Tree@basistech.com)
Date: Sun Jul 11 1999 - 12:54:59 EDT


Greetings,

Recently, while working on some code that does lexical analysis on Japanese
text, I came across the following sequence in some of my test data (culled
from various sources on the WWW):

U+4E5D U+3007 U+5E74

CJK Ideograph Nine, Ideographic Number Zero, and the ideograph for "year"
(On reading "nen")

I was interested to see that U+3007 is not classified as a Decimal Digit,
but only as Numeric (while the ideographic numbers, such as U+4E5D, are
neither).
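
For anyone who wants to check these properties, here is a minimal sketch
using Python's standard unicodedata module. The describe() helper and the
SEQUENCE name are made up for illustration. The values printed come from
whatever version of the Unicode Character Database ships with the Python
runtime, so they may not match the data files I was looking at; in
particular, newer databases may report a numeric value for U+4E5D.

```python
import unicodedata


def describe(ch):
    """Return (code point, category, decimal, digit, numeric) for one character.

    Each numeric lookup returns None when the bundled database defines
    no such value for the character.
    """
    return (
        f"U+{ord(ch):04X}",
        unicodedata.category(ch),
        unicodedata.decimal(ch, None),
        unicodedata.digit(ch, None),
        unicodedata.numeric(ch, None),
    )


# The sequence from the test data: nine, ideographic zero, "year".
SEQUENCE = "\u4e5d\u3007\u5e74"

for ch in SEQUENCE:
    print(describe(ch))
```

A lexer that wants to treat U+3007 as a zero cannot rely on the decimal
lookup alone; it has to consult the numeric value (or its own table) as
well.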

Thanks in advance,

        -tre

--
Tom Emerson                                          Basis Technology Corp.
Language Hacker                                    http://www.basistech.com
  "Beware the lollipop of mediocrity: lick it once and you suck forever"


