Re: Compliant Tailoring of Normalisation for the Unicode Collation Algorithm

From: Ken Whistler <kenw_at_sybase.com>
Date: Wed, 16 May 2012 15:32:31 -0700

On 5/16/2012 2:54 PM, Richard Wordingham wrote:
> Similar remarks apply to 'reorder'. What if I move 'Q' and 'q' into
> the Cyrillic sequence? (I've a recollection that this letter is used
> in Kurdish written in Cyrillic.)

Obsolete recollection. See:

051A;CYRILLIC CAPITAL LETTER QA;Lu;0;L;;;;;N;;;;051B;
051B;CYRILLIC SMALL LETTER QA;Ll;0;L;;;;;N;;;051A;;051A

which are already in DUCET as Cyrillic.

> I have been wondering if U+0078 LATIN
> SMALL LETTER X should be made common script because of its use for
> displaying Lao vowels, but perhaps the principle of separation of
> scripts should lead to LAO LETTER SMALL X.

Please, no! ;-)

Orthographies which mix in random characters from other scripts do not
(or should not) drive the identity of characters for *scripts* per se.
And edge cases for making mixed script collation work should not drive
such decisions, either.

--Ken
Received on Wed May 16 2012 - 17:39:25 CDT

This archive was generated by hypermail 2.2.0 : Wed May 16 2012 - 17:39:42 CDT