Re: U+3007 Ideographic 0

From: Linus Toshihiro Tanaka (ttanaka@us.oracle.com)
Date: Mon Jul 12 1999 - 15:36:33 EDT


As far as I know, the year "90" can be one of the below in Japanese
language. The 5th one is too formal in most cases (used on checks or legal
documents ?)

1. U+0039 U+0030 U+5E74
2. U+FF19 U+FF10 U+5E74
3. U+4E5D U+3007 U+5E74
4. U+4E5D U+5341 U+5E74
5. U+4E5D U+62FE U+5E74

I have never seen the below, so it may not be used (not sure, though).

6. U+4E5D U+96F6 U+5E74

P.S. Japan uses Gregorian and Imperial calendars (and Lunar calendar
     in some cases), but that's a separate issue

+-----------------------------------------------------------------------+
| Linus Toshihiro Tanaka 500 Oracle Parkway M/S 4op7 |
| Server Globalization Technology Redwood Shores, CA 94065 U.S.A. |
| Oracle Corporation email: ttanaka@us.oracle.com |
+-----------------------------------------------------------------------+

Tom Emerson wrote:
>
> Greetings,
>
> Recently while working on some code that does lexical analysis on Japanese
> text I came across the following sequence in some of my test data (culled
> from various sources on the WWW):
>
> U+4E5D U+3007 U+5E74
>
> CJK Ideograph Nine, Ideographic Number Zero, On reading 'nen', "year"
>
> I was interested to see that U+3007 is not considered a Decimal Digit, but
> simply as a Numeric (while the ideographic numbers, such as U+4E5D, are
> not).
>
> Thanks in advance,
>
> -tre
>
> --
> Tom Emerson Basis Technology Corp.
> Language Hacker http://www.basistech.com
> "Beware the lollipop of mediocrity: lick it once and you suck forever"



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:48 EDT