Re: TATAP => TATAR

From: Mark Davis (markdavis@ispchannel.com)
Date: Tue Sep 19 2000 - 13:48:16 EDT


If those can be confirmed, then the SpecialCasing file should be modified to add
them. Could you verify this in time for the next UTC?

Mark

Cathy Wissink wrote:

> I believe Azeri also uses the dotless i/dotted i Turkish-style casing.
>
> Cathy
>
> -----Original Message-----
> From: Carl W. Brown [mailto:cbrown@xnetinc.com]
> Sent: Tuesday, September 19, 2000 9:03 AM
> To: Unicode List
> Subject: RE: TATAP => TATAR
>
> >-----Original Message-----
> >From: Herman Ranes [mailto:herman@iet.hist.no]
> >Sent: Tuesday, September 19, 2000 6:30 AM
> >To: Unicode List
> >Cc: unicode@unicode.org
> >Subject: Re: TATAP => TATAR
>
> >Several Tatar language links here:
> >http://members.tripod.com/~anttikoski/eng_tatar.html
>
> >In particular, the Tatar-Bashkir latin alphabet is presented in RFE/RL's
> >site at
> >http://rferl.org/bd/tb/tatar/TATAR/abs.html
>
> >Are all these characters supported in UNICODE?
>
> I was unaware that they were moving back to the Latin alphabet.
> What jumps out at me is that case conversion code like the code that I just
> submitted for inclusion into ICU is wrong. Turkish is not the only language
> with dotted and dot less i. I assume that Tatar and Bashkir should follow
> the same rules as Turkish. Are there other languages?
>
> So I guess that I should check for "ba", "tt" & "tr" for special case
> shifting. I presume that the alphabet is listed in proper sort order?
>
> Carl



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:13 EDT