Re: Unihan number types and values

From: karl williamson (public@khwilliamson.com)
Date: Mon Nov 29 2010 - 16:42:10 CST

  • Next message: CE Whitehead: "RE: Phishing and enforcing Confusables.txt"

    John H. Jenkins wrote:
    > On 2010/11/29, at δΈ‹εˆ2:17, M.-A. Lemburg wrote:
    >
    >> Question: Why don't these code points have the "Nd" category ?
    >>
    >
    > Because they're not numerals, they're words/parts of words. East Asian writing systems don't traditionally distinguish the numeral "1" from the number-word "one." Yes, you *can* use them as straight decimal digits, but that's an extended use, and it's more akin to "two zero one zero" than "2010."
    >

    Also, in Unicode 6.0, only code points that form a contiguous block of
    10, with the first (0th) in the block having the numeric value of 0, the
    next 1; etc. are classified Nd.

    >> Related to this, it is also unclear what to use as official zero
    >> for these number systems (U+3007 is often recommended).
    >>
    >
    > There isn't an official zero. U+3007 and U+96F6 are the two that are generally used. Which you want to use is somewhat context-dependent; if possible, the choice should be left to the user.
    >
    > =====
    > John H. Jenkins
    > jenkins@apple.com
    >
    >
    >
    >
    >



    This archive was generated by hypermail 2.1.5 : Mon Nov 29 2010 - 16:43:39 CST