Unicode Collation Algorithm

From: Mike (mike-list@pobox.com)
Date: Wed Apr 26 2006 - 13:48:36 CST

  • Next message: Magda Danish \(Unicode\): "Save the Date: 30th Internationalization & Unicode Conference Washington, D.C., USA; November 15-17, 2006"


    I am implementing the UCA and am having trouble
    passing the conformance test. The problem is
    that I believe my code is correct and the test
    is wrong. For example the sequence:

         09C7 1D165 09BE 0061

    is supposed to come before

         09C7 0001 09D7 0061

    according to the test. What I am observing is
    that 09C7 combines with 09BE according to steps
    S2.1.1 thru S2.1.3. The intervening 1D165 is
    ignored since it is not of combining class 0.
    The combination 09C7 09BE becomes 09CB, which
    sorts after 09C7.

    Note that this is the NON_IGNORABLE test. I
    have the same problem with the SHIFTED test.
    And also this is for version 4.1.0.

    If I comment out the code that implements
    steps S2.1.1 through S2.1.3, then things break
    that were working correctly. Has anyone been
    able to resolve this problem?

    Thanks for any insight.


    This archive was generated by hypermail 2.1.5 : Wed Apr 26 2006 - 14:13:53 CST