From: srivas sinnathurai (sisrivas@blueyonder.co.uk)
Date: Sat Nov 27 2010 - 16:55:16 CST
While on this topic, what about Tamil-Tamil confusables?
For example, 0bb6 is a duplicate of ob9A.
If one goes to courts, say over a domain name, according to grammar. the UC
added 0bb6 is a duplicate of oB9A.
According to UC these appear different. In usage, within Tamil these are
confusables. Is their such a list for future mediations?
Should there be a confusable list forconceptually duplicate characters,
similar to appearing confusables between languages?
Sinnathurai
On 27 November 2010 13:50, Shriramana Sharma <samjnaa@gmail.com> wrote:
> On Sat, Nov 27, 2010 at 5:29 PM, Christopher Fynn <chris.fynn@gmail.com>
> wrote:
> > I wonder, in a case like this, which of the two scripts takes precedence?
>
> Where's the question of precedence? As I understand it, confusable
> mappings go from higher codepoint to lower codepoint, so it's just a
> question of folding -- something like case folding.
>
> అరగ ಅರಗ అರగ ಅరಗ (you'll have to look at that via UniView to get the
> difference) will all fold to అరగ (all Telugu) if I am not mistaken
> (provided the appropriate Confusables.txt entries are present) and
> given that mixed script domain names are (almost) prohibited, whoever
> registers whichever first -- whether all Kannada or all Telugu -- will
> get precedence, even though in internal processing the Kannada
> codepoints will fold to Telugu.
>
> @ the techies here: I hope I got that right...
>
> Shriramana Sharma.
>
>
>
This archive was generated by hypermail 2.1.5 : Sat Nov 27 2010 - 17:00:05 CST