UnicodeIUC22
Abstract

Transliteration of Indic Scripts: Implementation in ICU and Lessons Learned

Ram Viswanadha - IBM Corporation

Intended Audience: Software Engineers
Session Level: Beginner, Intermediate

This talk gives an overview of transliterating Indic scripts to Latin and between different Indic scripts (e.g., Devanagari-Telugu, Gujarati-Tamil) and discusses the issues involved.

Different standards for transliterating between Indic scripts and Latin are discussed, and their respective merits and drawbacks are highlighted.

It is often perceived that transliteration between different Indic scripts is straightforward because all Indic scripts have a common origin in Brahmi script.

The ISCII standard is based on this similarity between scripts, and the placement of Unicode code points for each Indic script is based on an early version of ISCII.
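
As an illustration of the parallel layout described above, here is a minimal Python sketch (not the ICU implementation). It assumes the standard block starts for Devanagari (U+0900), Gujarati (U+0A80), Tamil (U+0B80), and Telugu (U+0C00). The function names `shift_script` and `unassigned` are hypothetical, chosen for this example. The sketch moves each character to the same offset in the target block and then reports any results that land on unassigned code points.

```python
import unicodedata

# Start of each script's 128-code-point block in Unicode.
BLOCK_START = {
    "Devanagari": 0x0900,
    "Gujarati": 0x0A80,
    "Tamil": 0x0B80,
    "Telugu": 0x0C00,
}
BLOCK_SIZE = 0x80


def shift_script(text, source, target):
    """Naively move each character to the same offset in the target block."""
    src = BLOCK_START[source]
    tgt = BLOCK_START[target]
    out = []
    for ch in text:
        offset = ord(ch) - src
        if 0 <= offset < BLOCK_SIZE:
            out.append(chr(tgt + offset))
        else:
            out.append(ch)  # characters outside the source block pass through
    return "".join(out)


def unassigned(text):
    """Return the characters of text that are unassigned code points."""
    return [ch for ch in text if unicodedata.category(ch) == "Cn"]


# DEVANAGARI LETTER KA (U+0915) lands on TELUGU LETTER KA (U+0C15).
telugu_ka = shift_script("\u0915", "Devanagari", "Telugu")

# DEVANAGARI LETTER KHA (U+0916) has no Tamil counterpart: U+0B96 is unassigned.
tamil_kha = shift_script("\u0916", "Devanagari", "Tamil")
holes = unassigned(tamil_kha)
```

The second call shows why a pure offset shift is not enough: the target script may have no letter at the corresponding position.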

Correct transliteration between Indic scripts takes advantage of this allocation but must also handle special cases where the scripts diverge. The ICU implementation uses a script-neutral pivot range in Unicode. An online demonstration is presented.
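
The pivot idea can be sketched in Python as follows. This is an illustration under stated assumptions, not ICU's actual rules. The pivot base `PIVOT_BASE` (placed here in the Private Use Area), the `FALLBACKS` table, and the function names are all hypothetical. The Tamil fallback entries simply map three consonants that Tamil lacks to its letter KA, as one example of a special case.

```python
# Hypothetical script-neutral pivot range (Private Use Area), for illustration only.
PIVOT_BASE = 0xE000
BLOCK_SIZE = 0x80

# Start of each script's block in Unicode.
BLOCK_START = {"Devanagari": 0x0900, "Tamil": 0x0B80, "Telugu": 0x0C00}

# Hypothetical per-target exceptions, keyed by offset within the pivot range.
# Tamil has no KHA, GA, or GHA (offsets 0x16-0x18); fall back to KA (0x15).
FALLBACKS = {
    "Tamil": {0x16: 0x15, 0x17: 0x15, 0x18: 0x15},
}


def to_pivot(text, source):
    """Map characters of the source script into the pivot range."""
    start = BLOCK_START[source]
    out = []
    for ch in text:
        offset = ord(ch) - start
        if 0 <= offset < BLOCK_SIZE:
            out.append(chr(PIVOT_BASE + offset))
        else:
            out.append(ch)
    return "".join(out)


def from_pivot(text, target):
    """Map pivot characters into the target script, applying its exceptions."""
    start = BLOCK_START[target]
    fallbacks = FALLBACKS.get(target, {})
    out = []
    for ch in text:
        offset = ord(ch) - PIVOT_BASE
        if 0 <= offset < BLOCK_SIZE:
            offset = fallbacks.get(offset, offset)
            out.append(chr(start + offset))
        else:
            out.append(ch)
    return "".join(out)


def transliterate(text, source, target):
    """Transliterate via the pivot: source -> pivot -> target."""
    return from_pivot(to_pivot(text, source), target)
```

One motivation for routing through a pivot is economy: with N scripts, each script needs only a mapping to the pivot and one back from it (2N mappings) rather than a separate mapping for each of the N×(N−1) ordered script pairs.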


International Unicode Conferences are organized by Global Meeting Services, Inc. (GMS). GMS is pleased to be able to offer the International Unicode Conferences under an exclusive license granted by the Unicode Consortium. All responsibility for conference finances and operations is borne by GMS. The independent conference board serves solely at the pleasure of GMS and is composed of volunteers active in Unicode and in international software development. All inquiries regarding International Unicode Conferences should be addressed to info@global-conference.com.

Unicode and the Unicode logo are registered trademarks of Unicode, Inc. Used with permission.

23 May 2002, Webmaster