RE: TATAP => TATAR

From: Carl W. Brown (cbrown@xnetinc.com)
Date: Tue Sep 19 2000 - 12:17:12 EDT


>-----Original Message-----
>From: Herman Ranes [mailto:herman@iet.hist.no]
>Sent: Tuesday, September 19, 2000 6:30 AM
>To: Unicode List
>Cc: unicode@unicode.org
>Subject: Re: TATAP => TATAR

>Several Tatar language links here:
>http://members.tripod.com/~anttikoski/eng_tatar.html

>In particular, the Tatar-Bashkir latin alphabet is presented in RFE/RL's
>site at
>http://rferl.org/bd/tb/tatar/TATAR/abs.html

>Are all these characters supported in UNICODE?

I was unaware that they were moving back to the Latin alphabet.
What jumps out at me is that case conversion code, like the code that I just
submitted for inclusion in ICU, is wrong: Turkish is not the only language
with dotted and dotless i. I assume that Tatar and Bashkir should follow
the same rules as Turkish. Are there other languages?

So I guess that I should check for the language codes "ba", "tt" and "tr"
when applying the special case-shifting rules. I presume that the alphabet
is listed in proper sort order?
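
To make the idea concrete, here is a rough sketch of the language check in Python. It is not the code submitted to ICU; the names DOTTED_I_LANGS, to_upper and to_lower are made up for illustration, and it simply assumes that Tatar and Bashkir follow the Turkish rule, which is the open question above.

```python
# Language codes assumed to use Turkish-style dotted/dotless i casing.
# These are the three codes named in this message; the set may be incomplete.
DOTTED_I_LANGS = {"tr", "tt", "ba"}

# Turkish rule: i <-> U+0130 (capital I with dot above),
#               U+0131 (dotless small i) <-> I
_UPPER_I = str.maketrans({"i": "\u0130", "\u0131": "I"})
_LOWER_I = str.maketrans({"I": "\u0131", "\u0130": "i"})


def to_upper(text, lang):
    """Uppercase text, applying the dotted/dotless i rule for listed languages."""
    if lang in DOTTED_I_LANGS:
        # Map the i letters first so the default mapping cannot merge them.
        text = text.translate(_UPPER_I)
    return text.upper()


def to_lower(text, lang):
    """Lowercase text, applying the dotted/dotless i rule for listed languages."""
    if lang in DOTTED_I_LANGS:
        text = text.translate(_LOWER_I)
    return text.lower()
```

With these helpers, to_upper("tatar dili", "tt") keeps the dot on the capital letters, while the same call with "en" falls back to the default mapping and yields a plain capital I.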

Carl


