Re: More Permanent Faults? - Unicode 5.0 Casefolding - Correction on Lithuanian

From: Richard Wordingham (richard.wordingham@ntlworld.com)
Date: Fri Jun 09 2006 - 03:18:01 CDT

    Richard Wordingham Friday, June 09, 2006 3:40 AM
    > However, though I have not double-checked, there is an equivalent
    > Lithuanian case-folding that works by adding the following rules:
    >
    > 0307; L; After_Soft_Dotted; # COMBINING DOT ABOVE
    <transforms snipped>

    It's not a case-folding. It isn't idempotent. :-(
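
    To make the idempotence objection concrete, here is a rough sketch, not the snipped transforms themselves. It assumes one reading of the quoted rule: the After_Soft_Dotted context is tested against the input string, U+0307 is deleted when the context matches, and everything else is simply lowercased. The names toy_fold and SOFT_DOTTED are made up for illustration, and SOFT_DOTTED is only a partial stand-in for the real Soft_Dotted property.

```python
import unicodedata

# Illustrative, partial stand-in for the Soft_Dotted property (assumption).
SOFT_DOTTED = {"i", "j", "\u012F"}

def after_soft_dotted(s, idx):
    """True if a soft-dotted character precedes s[idx] with no intervening
    character of combining class 0 or 230."""
    for ch in reversed(s[:idx]):
        if ch in SOFT_DOTTED:
            return True
        if unicodedata.combining(ch) in (0, 230):
            return False
    return False

def toy_fold(s):
    """Hypothetical fold: drop U+0307 after a soft-dotted letter,
    lowercase everything else. Context is checked on the input."""
    out = []
    for idx, ch in enumerate(s):
        if ch == "\u0307" and after_soft_dotted(s, idx):
            continue  # the quoted rule: 0307 folds to nothing in this context
        out.append(ch.lower())
    return "".join(out)

once = toy_fold("I\u0307")   # 'I' is not soft-dotted, so the dot survives
twice = toy_fold(once)       # now it follows 'i', so the dot is removed
print(ascii(once), ascii(twice))
```

    Under these assumptions one pass and two passes give different results, which is exactly what a folding must not do.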

    > It preserves canonical equivalence if you fix the issue of U+0131.

    That was not a canonical equivalence problem. However, <U+0130 LATIN
    CAPITAL LETTER I WITH DOT ABOVE> still presents a problem. Without further
    accents, it folds with <U+0049>, and does not fold with its decomposition
    <U+0049><U+0307>. Still GIGO.
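
    A small check of the canonical-equivalence point, again as a sketch only. hypothetical_fold is an invented stand-in that merely reproduces the behaviour described above (U+0130 folds like U+0049, while a separate U+0307 after a capital I is kept); it is not the snipped rule set. Python's built-in str.casefold is included for comparison as the default full case folding.

```python
import unicodedata

def nfd(s):
    return unicodedata.normalize("NFD", s)

def hypothetical_fold(s):
    """Invented stand-in: U+0130 folds to 'i'; other characters are lowercased."""
    return "".join("i" if ch == "\u0130" else ch.lower() for ch in s)

def preserves_canonical_equivalence(fold, s):
    """Does folding s agree, up to canonical equivalence, with folding NFD(s)?"""
    return nfd(fold(s)) == nfd(fold(nfd(s)))

# U+0130 decomposes canonically to <U+0049, U+0307>.
print(ascii(nfd("\u0130")))
print(preserves_canonical_equivalence(hypothetical_fold, "\u0130"))
print(preserves_canonical_equivalence(str.casefold, "\u0130"))
```

    The stand-in fails the check for U+0130 because the precomposed letter and its decomposition land on different results; the default full folding passes because both land on <U+0069, U+0307>.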

    Richard.


