Re: Unihan number types and values

From: John H. Jenkins
Date: Mon Nov 29 2010 - 15:51:30 CST


    On 2010/11/29, at 下午2:17, M.-A. Lemburg wrote:

    > Question: Why don't these code points have the "Nd" category ?

    Because they're not numerals, they're words/parts of words. East Asian writing systems don't traditionally distinguish the numeral "1" from the number-word "one." Yes, you *can* use them as straight decimal digits, but that's an extended use, and it's more akin to "two zero one zero" than "2010."
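    As a rough illustration of the distinction, the sketch below uses Python's standard unicodedata module to compare ASCII "2" with U+4E8C. The helper name describe is made up for this example, and the values reported depend on the Unicode data bundled with the Python build in use.

```python
import unicodedata

def describe(ch):
    """Return (general category, numeric value, has a decimal-digit value) for one character.

    `describe` is a hypothetical helper for illustration only.
    """
    return (
        unicodedata.category(ch),                   # e.g. "Nd" for decimal digits
        unicodedata.numeric(ch, None),              # numeric value, or None if there is none
        unicodedata.decimal(ch, None) is not None,  # True only for true decimal digits
    )

ascii_two = describe("2")      # ASCII digit two
han_two = describe("\u4e8c")   # U+4E8C, the ideograph for "two"

print(ascii_two)
print(han_two)
```

    The ideograph carries a numeric value but no decimal-digit value, which is consistent with its being treated as a letter rather than an "Nd" digit.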

    > Related to this, it is also unclear what to use as official zero
    > for these number systems (U+3007 is often recommended).

    There isn't an official zero. U+3007 and U+96F6 are the two that are generally used. Which you want to use is somewhat context-dependent; if possible, the choice should be left to the user.
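    One way to leave that choice to the user is to make the zero a parameter. The following is only a sketch of the digit-by-digit ("two zero one zero") usage; the names to_han_digits, HAN_DIGITS_1_TO_9 and ZERO_CHOICES are invented for this example, and the default zero is an arbitrary assumption, not a recommendation.

```python
# Ideographs for one through nine: U+4E00 U+4E8C U+4E09 U+56DB U+4E94 U+516D U+4E03 U+516B U+4E5D
HAN_DIGITS_1_TO_9 = "\u4e00\u4e8c\u4e09\u56db\u4e94\u516d\u4e03\u516b\u4e5d"

# The two zeros in general use; neither is "official".
ZERO_CHOICES = ("\u3007", "\u96f6")  # U+3007, U+96F6

def to_han_digits(number, zero="\u3007"):
    """Spell a non-negative integer digit by digit, using the caller's choice of zero.

    Hypothetical helper: the default zero here is an assumption made for the sketch.
    """
    if zero not in ZERO_CHOICES:
        raise ValueError("zero must be U+3007 or U+96F6")
    if number < 0:
        raise ValueError("only non-negative integers are handled in this sketch")
    table = zero + HAN_DIGITS_1_TO_9  # index 0..9 maps to the matching ideograph
    return "".join(table[int(d)] for d in str(number))

print(to_han_digits(2010))                 # uses U+3007 for zero
print(to_han_digits(2010, zero="\u96f6"))  # uses U+96F6 for zero
```

    This covers only the positional, digit-string style; it does not attempt the traditional number-word forms that use characters for ten, hundred, and so on.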

    John H. Jenkins
