Re: Mark-Driven Script Categorisation (was: Compliant Tailoring of Normalisation for the Unicode Collation Algorithm)

From: Philippe Verdy <verdy_p_at_wanadoo.fr>
Date: Thu, 17 May 2012 22:56:51 +0200

2012/5/17 Richard Wordingham <richard.wordingham_at_ntlworld.com>:
> On Wed, 16 May 2012 21:46:17 -0700
> Mark Davis ☕ <mark_at_macchiato.com> wrote:
>
>> No, it's not.
>>
>> Including x in Lao for some pedagogical (I'm guessing) purpose is
>> completely out of scope. That'd be like including π in Latin because
>> it sometimes occurs in the middle of English text.
>
> No, it's more like including Devanagari candrabindu in the Latin
> script because it sometimes occurs on Latin letters in discussions of
> Sanskrit. (Actually, I can only recall it on lower-case 'l'.) We
> already have U+0310 COMBINING CANDRABINDU.
>
> The problem is that 'x' then takes the full set of Lao vowel symbols,
> forming a default grapheme cluster.

Oh well... then the next time we'll discuss about including the Han
sinograms in the Latin script because we find discussions in English
about these sinograms. Then we'll start mixing all scripts together as
if they were all in the same system.
Received on Thu May 17 2012 - 15:58:52 CDT

This archive was generated by hypermail 2.2.0 : Thu May 17 2012 - 15:58:53 CDT