Special casing clarification

From: Carl W. Brown (cbrown@xnetinc.com)
Date: Thu Feb 08 2001 - 21:22:31 EST

Some of the special casing rules are not clear.

# FINAL: The letter is not followed by a letter of category L* (e.g. Ll,
Lt, Lu, Lm, or Lo).

What happens if the word with the final sigma is followed by a period or
comma etc. It should be final. But what about a hyphenated word?
Technically it is still followed by a letter. The text needs clarification.

It seems that final should be when it is followed by a space before a letter
or followed by no more letters.

Another issue - case folding. Case folding appears to be the same as a
shift to upper followed by a shift to lower. The sigma adjustment is not
necessary because the two forms are adjacent and will not affect sort
sequences. The consolidation if dotted and dotless i should not impact
compares in Turkish locations but the Lithuanian removal of u0307 (combining
dot above) after i will affect the Lithuanian locale. However, this should
not affect other locales. It is probably a good idea to do this for all


This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:18 EDT