Re: "markers" codepoints for some combining letter sets in Dravidian scripts

From: Antoine Leca (
Date: Wed Apr 12 2006 - 11:15:25 CST

  • Next message: Kenneth Whistler: "Re: Unicode 5.0 Character Count?"

    On Wednesday, April 12th, 2006 12:27Z, N. Ganesan wrote:

    >>> Cillus can come at the end of words
    >>> and word-medially. ka, na, nna, la, lla, llla, r, t and m
    >> On the other hand, I do not see clearly what do you mean by cillu-t
    >> (there are two candidates, <0D31, 0D4D, 200D> and <0D24, 0D4D,
    >> 200D>).
    > cillu_t and cillu_l are of the same glyphs,
    > but in code-points, if some want to differentiate
    > it's possible to differentiate between cillu_t and cillu_l.

    Sorry, you have to be a little more precise about the real use.

    I too would be pleased sometimes to be able to differentiate between J in
    Latin, English, Castilian Spanish, German, Sankrit, French, etc...
    But I cannot. I have to use U+004A, everywhere.

    Until now, it is not known if "cillu-l" (and, as far as I can see, your
    putative cillu-t as well) should be encoded as <0D31, 0D4D, 200D> or U+0D7B.
    But nothing more. In order to recognize other combinations, you have to put
    forward quite strong arguments (and to hurry up, too.)

    >>> (4) Malayalam short u marker
    >> I read this has already been decided to be encoded as
    >> <U+0D41, U+0D4D>.
    > Any ref.s? From Unicode?

    There was a thread on this in final August-mid September 2005, on this
    mailing list. Someone with the same From: field as you posted to this very
    thread. I did not see any denegation; and even an acknowledgement from a
    major implementer of this being in place since mid-2004.


    This archive was generated by hypermail 2.1.5 : Wed Apr 12 2006 - 11:26:25 CST