RE: Case mapping of dotless lowercase letters

From: Philippe Verdy (verdy_p@wanadoo.fr)
Date: Tue Dec 16 2003 - 18:35:04 EST

    Chris Jacobs [mailto:chris.jacobs@freeler.nl]
    > From: "Philippe Verdy" <verdy_p@wanadoo.fr>
    > > Stefan Persson writes:
    > > > Isn't the sequence "dotless i + combining acute" canonically
    > > > equivalent to "dotted i + combining acute"?
    > >
    > > NO. There's no canonical equivalence between distinct pairs of
    > > characters if the first letters of the pairs are not also
    > > canonically equivalent.
    >
    > compare ę̈ (U+00EB U+0328) with ę̈ (U+0119 U+0308)
    >
    > The first pair has e trema as its first letter, the second pair e ogonek.
    > Yet these pairs are canonically equivalent.

    That is true under your reading of my sentence, but by the "first letter"
    of each pair I mean the first non-decomposable character of each pair. In
    your example, both of those are the simple vowel "e".
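
    As a minimal sketch of this point, the following uses Python's standard
    unicodedata module (my choice for illustration, not something used in this
    thread) to decompose both sequences fully. The helper names nfd and
    codepoints are hypothetical. After canonical decomposition and reordering,
    both sequences start with the same base letter "e" and carry the same marks
    in the same order.

```python
import unicodedata

# U+00EB LATIN SMALL LETTER E WITH DIAERESIS + U+0328 COMBINING OGONEK
first = "\u00EB\u0328"
# U+0119 LATIN SMALL LETTER E WITH OGONEK + U+0308 COMBINING DIAERESIS
second = "\u0119\u0308"


def nfd(text):
    """Return the canonical decomposition (NFD) of text."""
    return unicodedata.normalize("NFD", text)


def codepoints(text):
    """List the code points of text in U+XXXX notation."""
    return [f"U+{ord(ch):04X}" for ch in text]


nfd_first = nfd(first)
nfd_second = nfd(second)

# Both lines print the same sequence: base "e", then ogonek, then diaeresis.
print(codepoints(nfd_first))
print(codepoints(nfd_second))

# The raw strings differ, but their canonical decompositions are identical.
print(first == second, nfd_first == nfd_second)
```

    The ogonek sorts before the diaeresis because canonical reordering places
    marks with a lower combining class first.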

    Neither "dotted lowercase i" nor "dotless lowercase i" is decomposable,
    unlike "dotted uppercase I".
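
    This can be checked with a short sketch, again assuming Python's standard
    unicodedata module; the constant names are only illustrative. It looks up
    the decomposition mapping of each letter and then compares the two
    "i + combining acute" sequences from the original question.

```python
import unicodedata

DOTTED_SMALL_I = "\u0069"    # LATIN SMALL LETTER I
DOTLESS_SMALL_I = "\u0131"   # LATIN SMALL LETTER DOTLESS I
DOTTED_CAPITAL_I = "\u0130"  # LATIN CAPITAL LETTER I WITH DOT ABOVE
COMBINING_ACUTE = "\u0301"   # COMBINING ACUTE ACCENT

# unicodedata.decomposition returns an empty string when a character
# has no decomposition mapping.
decompositions = {
    "dotted small i": unicodedata.decomposition(DOTTED_SMALL_I),
    "dotless small i": unicodedata.decomposition(DOTLESS_SMALL_I),
    "dotted capital I": unicodedata.decomposition(DOTTED_CAPITAL_I),
}
for name, mapping in decompositions.items():
    print(f"{name}: {mapping or '(not decomposable)'}")

# Compare the two sequences after canonical decomposition.
dotless_acute = unicodedata.normalize("NFD", DOTLESS_SMALL_I + COMBINING_ACUTE)
dotted_acute = unicodedata.normalize("NFD", DOTTED_SMALL_I + COMBINING_ACUTE)
sequences_equivalent = dotless_acute == dotted_acute
print("canonically equivalent:", sequences_equivalent)
```

    Only the dotted capital I has a mapping (to capital I plus COMBINING DOT
    ABOVE), and the two acute sequences do not compare equal.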

    Well, Outlook 2000 is unable to render any e with ogonek and trema from
    your example. So although the sequences are canonically equivalent, they
    are rendered differently:

    - "ë?" <SMALL LETTER E WITH DIAERESIS, COMBINING OGONEK>
      displays <SMALL LETTER E WITH DIAERESIS, MISSING SPACING GLYPH FOR
      COMBINING OGONEK>
      in an unbreakable sequence of glyphs or editable grapheme clusters
      (keyboard editing cannot move into the middle, but a mouse selection
      can break before the ogonek).

    - "e¨" <SMALL LETTER E WITH OGONEK, COMBINING DIAERESIS>
      and "e?¨" <SMALL LETTER E, COMBINING OGONEK, COMBINING DIAERESIS>
      both display <E WITH OGONEK, SPACING DIAERESIS>
      with a break between the glyphs, as if they were two distinct editable
      grapheme clusters.

    All of these would better display <E WITH OGONEK, MISSING NON-SPACING GLYPH
    FOR COMBINING DIAERESIS>.
    Isn't there a distinct glyph for missing glyphs representing spacing
    diacritics, or at least a spacing glyph with a dotted circle? In addition,
    grapheme clusters are incorrectly mapped for editing in Outlook.
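
    To illustrate what a single editable cluster would mean here, the sketch
    below groups each base character with the combining marks that follow it.
    This is a deliberate simplification, not the full Unicode grapheme cluster
    rules, and the function name simple_clusters is hypothetical. It assumes
    Python's standard unicodedata module.

```python
import unicodedata


def simple_clusters(text):
    """Group each character with the combining marks that follow it.

    A character with a non-zero canonical combining class is attached to
    the preceding cluster; anything else starts a new cluster.
    """
    clusters = []
    for ch in text:
        if clusters and unicodedata.combining(ch) != 0:
            clusters[-1] += ch
        else:
            clusters.append(ch)
    return clusters


samples = {
    "e-diaeresis + ogonek": "\u00EB\u0328",
    "e-ogonek + diaeresis": "\u0119\u0308",
    "e + ogonek + diaeresis": "e\u0328\u0308",
}

# Each of the three encodings forms exactly one cluster under this rule.
counts = {name: len(simple_clusters(s)) for name, s in samples.items()}
print(counts)

# A spacing diaeresis (U+00A8) is not a combining mark, so it starts a
# cluster of its own, which matches the two-cluster behaviour described.
spacing_case = simple_clusters("\u0119\u00A8")
print(len(spacing_case))
```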


    This archive was generated by hypermail 2.1.5 : Tue Dec 16 2003 - 19:09:22 EST