RE: Case mapping of dotless lowercase letters

From: Kent Karlsson (kentk@cs.chalmers.se)
Date: Wed Dec 17 2003 - 08:34:59 EST

  • Next message: Kent Karlsson: "RE: Case mapping of dotless lowercase letters"

    Peter Kirk wrote:
    ...
    > used on Turkic text, violates the very sensible rule DO NOT USE
    > COMBINING DOTS WITH I's, and leads to all sorts of potential
    > confusion
    > e.g. that both simple and full case folding and lowercasing
    > applied to
    > NFD Turkic text generate the nonsensical <i, dot above>. This
    > could be a
    > serious problem - although one that may not be worth fixing.

    <i, dot above> is not non-sensical. It is used in Lithuanian for
    such things as <i, dot above, tilde above>, as well as other
    additonal accents above an i or a j that keeps its dot.

                    /kent k

    Lithuanian alphabet (not listing all the uppercase
    accented letters)

     Aa (Àà, Áá Ãã Aa {A´}{a´}), Bb, Cc (CHch), Cc, Dd,
     Ee (Ee, Ee è é ? e {e´} {e~} e {e´} {e~}), Ff, Gg, Hh,
     Ii (Ì{i?`} Í{i?´} I{i?~} Ii {I´}{i?´} {I~}{i?~}, Yy, Ýý, ??),
     Jj ({J~}{j?~}), Kk, Ll ({l~}), Mm ({m~}), Nn (Ññ),
     Oo (ò, ó, õ), Pp, [Qq], Rr (r~), Ss, Šš, Tt,
     Uu (ù ú u Uu {u´} {u~} Uu {u´}), Vv, [Ww], [Xx], Zz, Žž



    This archive was generated by hypermail 2.1.5 : Wed Dec 17 2003 - 09:31:48 EST