Re: Telugu vs Kannada confusables

From: Christopher Fynn (chris.fynn@gmail.com)
Date: Sat Nov 27 2010 - 05:59:11 CST

  • Next message: Shriramana Sharma: "Re: Telugu vs Kannada confusables"

    This is going to cause some fun.

    I wonder, in a case like this, which of the two scripts takes precedence?

    - C

    On 25/11/2010, Shriramana Sharma <samjnaa@gmail.com> wrote:

    > Hello. Here's a Telugu vs Kannada confusables list I cooked up right
    > now. As this is an important security issue, I post to all the lists
    > so that people may contribute. Also, some of this is probably already
    > there but I'm going for completeness:

    > ANUSVARA
    > VISARGA
    > LETTER A
    ...
    <SNIP>
    ...
    > DIGIT NINE

    > That makes sixty two characters in all including the ones marked ?.
    > Even *without* the ones marked ? (which I did because I suspected
    > others may contest these cases) it comes to fifty two.

    > Now to count the characters NOT common or confusable (obviously much
    > lesser):
    >
    > LETTER U
    > LETTER UU
    ...
    <SNIP>
    ...
    > DIGIT SEVEN

    > That comes to sixteen.

    > I left out the LLLA of Kannada and the fractions of Telugu which are
    > *not* present (as of Unicode 6.0) in the other script because
    > obviously there can be no comparison on those.

    > So there are at least *thrice* (or at most *four times*) as many
    > confusable characters between Kannada and Telugu than there are
    > NON-confusables.

    > Now can you beat that! Speaking of scripts with a common origin and
    > causing potential confusion in IDNs, *I* say Kannada and Telugu takes
    > the cake!

    > Shriramana Sharma.



    This archive was generated by hypermail 2.1.5 : Sat Nov 27 2010 - 06:04:33 CST