Please note that the language tags in plane 14 are pure ASCII in nature.
The Turkish I problem doesn't enter, nor do ß and accented latin characters.
> -----Original Message-----
> From: Antoine Leca [SMTP:Antoine.Leca@renault.fr]
> Sent: Thursday, June 29, 2000 7:56 AM
> To: Unicode List
> Subject: Re: Plane 14 language tags
> Brendan Murray wrote:
> > Murray Sargent <email@example.com> wrote:
> > > Note that in C, it's essentially just as fast to make character
> > > comparisons with (ch | 0x20) as with ch alone, i.e., if you know
> > > ch is in an ASCII range (0 - 0x7F or 0xE0000 - 0xE007F), you can
> > > do a case insensitive compare as quickly as a case sensitive one.
> > Except, of course, in Turkey where the lowercase of 'I' is not 'i' and
> > uppercase of 'i' is not 'I'.
> Unless I missed a very recent draft (that ought to be refused, IMHO),
> Turkey (or Azerbaijani) was not used for the plane 14 language tags,
> was it?
> And of course, the lowercase of "SS" in German is sometimes ß, the lower
> case of an initial "E" in French followed by a consonnant is more often
> "é" than "e", except if followed by "x"/"X" or a doubled one (like
> "ff") or two consonant, first a nasal (like "mb", "nc", "MP", ...),
> the lowercase of Italian (or Corsican) "A'", "E'", ... at the end of a
> word is likely to be "à", "é/è", ... (Marco, is it really true? and how
> é and è are handled?)
> Et cætera.
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:05 EDT