Re: Proposed Update of UTS #10: Unicode Collation Algorithm

From: Jungshik Shin (jshin@mailaps.org)
Date: Sat May 10 2003 - 22:22:27 EDT

    On Fri, 9 May 2003, Mark Davis wrote:

    > There is a new Proposed Update of UTS #10: Unicode Collation
    > Algorithm, on:
    >
    > http://www.unicode.org/reports/tr10/tr10-10.html

      Just a quick question before I read it through and comment on it: will
    allkeys.txt for 4.0 keep the weights given to Hangul Jamos? The following
    is written under the assumption that it will.

    Specifically, U+1102 (Nieun), U+1103 (Tikeut) and U+1113 (Nieun-Kiyeok) are
    given the primary weights 1832, 1833 and 1844, respectively. With these,
    U+1113 will be sorted after U+1103, right? Or am I missing something
    (I haven't read UTS #10 through yet)? That order differs from the
    way (South) Koreans (at least, most Korean dictionary editors) expect
    them to be sorted. We expect U+1113 (and the other cluster consonants whose
    first component is U+1102, namely U+1114, U+1115 and U+1116) to be put
    after U+1102 but before U+1103. The same is true of any cluster Jamos.
    Is it the UTC's intention to leave the task of making Hangul Jamos collate in
    accordance with (South) Koreans' expectations to (South) Korean-specific
    tailoring?
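
    To make the ordering question concrete, here is a minimal sketch in
    Python. It uses only the three primary weights quoted above, read as
    hexadecimal (an assumption; the radix does not change the order). The
    names DEFAULT_PRIMARY, FIRST_COMPONENT, default_order and tailored_order
    are made up for illustration, and the tailored key is just one possible
    way to express the expected order, not the UCA tailoring mechanism.

```python
# Primary weights as quoted in the message (assumed hexadecimal).
DEFAULT_PRIMARY = {
    "\u1102": 0x1832,  # NIEUN
    "\u1103": 0x1833,  # TIKEUT
    "\u1113": 0x1844,  # NIEUN-KIYEOK (cluster starting with NIEUN)
}

# Hypothetical table: cluster consonant -> its first simple component.
FIRST_COMPONENT = {
    "\u1113": "\u1102",
}


def default_order(jamos):
    """Sort by the quoted primary weights only."""
    return sorted(jamos, key=lambda j: DEFAULT_PRIMARY[j])


def tailored_key(jamo):
    """Illustrative key: group a cluster right after its first component."""
    base = FIRST_COMPONENT.get(jamo, jamo)
    is_cluster = 1 if jamo in FIRST_COMPONENT else 0
    return (DEFAULT_PRIMARY[base], is_cluster, DEFAULT_PRIMARY[jamo])


def tailored_order(jamos):
    """Sort so that clusters follow their first component."""
    return sorted(jamos, key=tailored_key)


sample = ["\u1113", "\u1103", "\u1102"]
print([f"U+{ord(j):04X}" for j in default_order(sample)])
print([f"U+{ord(j):04X}" for j in tailored_order(sample)])
```

    With the quoted weights, the first list puts U+1113 after U+1103, while
    the second puts it between U+1102 and U+1103, which is the order
    described above as the expected one.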

    Thanks,

    Jungshik


