RE: Plane 14 language tags

From: Murray Sargent (murrays@microsoft.com)
Date: Thu Jun 29 2000 - 15:46:10 EDT


Please note that the language tags in plane 14 are pure ASCII in nature.
The Turkish I problem doesn't enter, nor do ß and accented latin characters.

Thanks
Murray

> -----Original Message-----
> From: Antoine Leca [SMTP:Antoine.Leca@renault.fr]
> Sent: Thursday, June 29, 2000 7:56 AM
> To: Unicode List
> Subject: Re: Plane 14 language tags
>
> Brendan Murray wrote:
> >
> > Murray Sargent <murrays@microsoft.com> wrote:
> > > Note that in C, it's essentially just as fast to make character
> > > comparisons with (ch | 0x20) as with ch alone, i.e., if you know
> > > ch is in an ASCII range (0 - 0x7F or 0xE0000 - 0xE007F), you can
> > > do a case insensitive compare as quickly as a case sensitive one.
> >
> > Except, of course, in Turkey where the lowercase of 'I' is not 'i' and
> the
> > uppercase of 'i' is not 'I'.
>
> Unless I missed a very recent draft (that ought to be refused, IMHO),
> Turkey (or Azerbaijani) was not used for the plane 14 language tags,
> was it?
>
> And of course, the lowercase of "SS" in German is sometimes ß, the lower
> case of an initial "E" in French followed by a consonnant is more often
> "é" than "e", except if followed by "x"/"X" or a doubled one (like
> "ff") or two consonant, first a nasal (like "mb", "nc", "MP", ...),
> the lowercase of Italian (or Corsican) "A'", "E'", ... at the end of a
> word is likely to be "à", "é/è", ... (Marco, is it really true? and how
> é and è are handled?)
> Et cætera.
>
>
> Antoine



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:05 EDT