Re: Unicode-based Cyrillic-Latin transliteration table

From: DougEwell2@cs.com
Date: Wed May 30 2001 - 02:31:19 EDT


In a message dated 2001-05-29 4:28:09 Pacific Daylight Time,
Peter_Constable@sil.org writes:

>> The goal is to improve an existing program I wrote which automatically
>> detects the encoding form of Cyrillic text (8-bit character sets such as
DOS
>> CP 866, Windows CP 1251, or KOI-8, as well as UTF-8) and optionally
>> transliterates the text to a 7-bit ASCII representation that an English
>> speaker can reasonably sound out.
>
> Do you want transliteration or transcription? TC46/SC2's definitions for
> these can be seen at http://www.elot.gr/tc46sc2/purpose.html

After reading these definitions, I see that I was mistaken in my terminology.
 Since I am looking for conversions that are geared toward English speakers,
and explicitly favor solutions like "ch" for U+0427 instead of "tj" or
"tsch," the proper name for what I am trying to do is "transcription."

Thanks to Peter for pointing me toward the definitions, and apologies to John
Cowan for blowing him off when he tried to do the same.

-Doug Ewell
 Fullerton, California



This archive was generated by hypermail 2.1.2 : Fri Jul 06 2001 - 00:18:17 EDT