Re: Compliant Tailoring of Normalisation for the Unicode Collation Algorithm

From: Richard Wordingham
Date: Sun, 20 May 2012 17:05:00 +0100

On Sun, 20 May 2012 16:15:24 +0100
Richard Wordingham wrote:


> For the general case, we ought to be able to express a rule such as
> 'ignore the countering of soft-dottedness', as in Lithuanian casing,
> but I don't see any finite method of expressing it under the UCA,

As we have discontiguous contractions, the three rules for
0049+0307+0300 etc. will cover Lithuanian quite nicely. In general,
we need the rules for <soft-dotted indecomposable>+0307+<ccc=230>, a
finite set. These then remove the Lithuanian FCD booby traps.
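
To show that the set really is finite, here is a rough sketch that
enumerates the sequences such rules would have to cover. It assumes the
combining class meant is 230 (Above), and it uses a small hand-picked
sample of soft-dotted characters, since Python's unicodedata module does
not expose the Soft_Dotted property; the complete list would have to come
from PropList.txt. The names SOFT_DOTTED_SAMPLE, above_marks and
contraction_sequences are mine, purely for illustration.

```python
import sys
import unicodedata

# Hand-picked sample of Soft_Dotted characters with no decomposition
# (an assumption for illustration; the full list is in PropList.txt).
SOFT_DOTTED_SAMPLE = [0x0069, 0x006A, 0x0268, 0x029D]

COMBINING_DOT_ABOVE = 0x0307


def above_marks():
    """Return every code point whose canonical combining class is 230."""
    return [cp for cp in range(sys.maxunicode + 1)
            if unicodedata.combining(chr(cp)) == 230]


def contraction_sequences(bases=SOFT_DOTTED_SAMPLE):
    """Yield each <base, 0307, ccc=230 mark> sequence as a code point tuple."""
    marks = above_marks()
    for base in bases:
        # Only indecomposable bases are wanted here.
        assert unicodedata.decomposition(chr(base)) == ""
        for mark in marks:
            yield (base, COMBINING_DOT_ABOVE, mark)
```

The number of sequences is simply the number of bases times the number of
ccc=230 marks in whatever Unicode version the Python build carries, so the
rule set is large but finite.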

Received on Sun May 20 2012 - 11:06:08 CDT
