From: Christopher Fynn (chris.fynn@gmail.com)
Date: Sat Nov 27 2010 - 05:59:11 CST
This is going to cause some fun.
I wonder, in a case like this, which of the two scripts takes precedence?
- C
On 25/11/2010, Shriramana Sharma <samjnaa@gmail.com> wrote:
> Hello. Here's a Telugu vs Kannada confusables list I cooked up right
> now. As this is an important security issue, I post to all the lists
> so that people may contribute. Also, some of this is probably already
> there but I'm going for completeness:
> ANUSVARA
> VISARGA
> LETTER A
...
<SNIP>
...
> DIGIT NINE
> That makes sixty two characters in all including the ones marked ?.
> Even *without* the ones marked ? (which I did because I suspected
> others may contest these cases) it comes to fifty two.
> Now to count the characters NOT common or confusable (obviously much
> lesser):
>
> LETTER U
> LETTER UU
...
<SNIP>
...
> DIGIT SEVEN
> That comes to sixteen.
> I left out the LLLA of Kannada and the fractions of Telugu which are
> *not* present (as of Unicode 6.0) in the other script because
> obviously there can be no comparison on those.
> So there are at least *thrice* (or at most *four times*) as many
> confusable characters between Kannada and Telugu than there are
> NON-confusables.
> Now can you beat that! Speaking of scripts with a common origin and
> causing potential confusion in IDNs, *I* say Kannada and Telugu takes
> the cake!
> Shriramana Sharma.
This archive was generated by hypermail 2.1.5 : Sat Nov 27 2010 - 06:04:33 CST