Re: Unihan number types and values

From: Kent Karlsson ([email protected])
Date: Tue Nov 30 2010 - 07:36:02 CST

Next message: Mahesh T. Pai: "Re: Phishing and enforcing Confusables.txt"

Previous message: Martin J. Dürst: "Re: Phishing and enforcing Confusables.txt"
In reply to: Kenneth Whistler: "Re: Unihan number types and values"

On 2010-11-29 23:24, "Kenneth Whistler" <[email protected]> wrote:

...
> they are quite often used in traditional numbering in
> East Asia, which does not use decimal radix forms. Handling
> Han numeric ideographs requires special processing to
> parse numeric values correctly.

CLDR and ICU have (some) support for that. See:
http://www.unicode.org/cldr/trac/browser/trunk/common/rbnf/zh_Hant.xml
http://www.unicode.org/cldr/trac/browser/trunk/common/rbnf/zh.xml
http://www.unicode.org/cldr/trac/browser/trunk/common/rbnf/ja.xml

The data in these files is used by the RBNF (rule-based number format)
formatter and parser APIs in ICU:
http://icu-project.org/apiref/icu4c/classRuleBasedNumberFormat.html
(None of them permits substituting "numerically equivalent" Han characters
when parsing.)
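
To make concrete why the traditional (non-positional) notation needs
special parsing, here is a minimal sketch in Python. It is a hand-written
illustration and not ICU's RBNF engine; the name parse_han_number and the
lookup tables are hypothetical. The tables deliberately list a few
variant characters (e.g. both 万 and 萬), which is an assumption of this
sketch and not something the RBNF data does.

```python
# Illustrative only: a tiny multiplicative-additive parser, NOT ICU's RBNF.
# All names and tables here are hypothetical and chosen for this sketch.
DIGITS = {"零": 0, "〇": 0, "一": 1, "二": 2, "三": 3, "四": 4,
          "五": 5, "六": 6, "七": 7, "八": 8, "九": 9}
SMALL_UNITS = {"十": 10, "百": 100, "千": 1000}
BIG_UNITS = {"万": 10**4, "萬": 10**4, "亿": 10**8, "億": 10**8}


def parse_han_number(text):
    """Parse a simple Han numeral such as 二百三十四 into an int."""
    if not text:
        raise ValueError("empty numeral")
    total = 0    # groups already multiplied by a big unit (万, 億)
    section = 0  # value accumulated inside the current group (< 10**4)
    digit = 0    # pending digit waiting for its unit
    for ch in text:
        if ch in DIGITS:
            digit = DIGITS[ch]
        elif ch in SMALL_UNITS:
            # A bare unit (as in 十二) counts as one of that unit.
            section += (digit or 1) * SMALL_UNITS[ch]
            digit = 0
        elif ch in BIG_UNITS:
            total += (section + digit) * BIG_UNITS[ch]
            section = digit = 0
        else:
            raise ValueError(f"not a Han numeral character: {ch!r}")
    return total + section + digit


print(parse_han_number("二百三十四"))  # 234
print(parse_han_number("一万二千"))    # 12000
```

Note that simply reading the characters as decimal digits would not
work here: 十 and 百 are multipliers, not digits. The sketch ignores
many real-world cases (nested big units, financial forms, fractions).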

More on numbering systems in CLDR: see
http://www.unicode.org/cldr/trac/browser/trunk/common/supplemental/numberingSystems.xml
One, and just one (for now at least), decimal positional system using Han
characters is supported, called "hanidec". The names listed in
numberingSystems.xml can be used in the ICU API to ask for the numbering
system in question. (Some of the number spellout systems, including the
Han character ones, can be requested that way; but most cannot, and one
must then use the RBNF API directly.)
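
For contrast, a decimal positional system such as "hanidec" needs no
special grammar: each character is just a digit. A minimal Python sketch,
assuming the digit string 〇一二三四五六七八九 for hanidec; the function
names to_hanidec and from_hanidec are hypothetical and this does not go
through the ICU API:

```python
# Illustrative only: plain digit substitution, not a call into ICU.
# The digit string is assumed to match the "hanidec" numbering system.
HANIDEC_DIGITS = "〇一二三四五六七八九"


def to_hanidec(n):
    """Format a non-negative int using Han decimal digits."""
    if n < 0:
        raise ValueError("negative numbers are not handled in this sketch")
    return "".join(HANIDEC_DIGITS[int(d)] for d in str(n))


def from_hanidec(text):
    """Parse a string of Han decimal digits into an int."""
    if not text:
        raise ValueError("empty numeral")
    value = 0
    for ch in text:
        idx = HANIDEC_DIGITS.find(ch)
        if idx < 0:
            raise ValueError(f"not a hanidec digit: {ch!r}")
        value = value * 10 + idx
    return value


print(to_hanidec(2010))        # 二〇一〇
print(from_hanidec("二〇一〇"))  # 2010
```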

/Kent K

