XML Content Localization and Unicode
Ultan Ó Broin - Oracle Corporation
In this paper, we will examine the eXtensible Markup Language (XML) techniques for the internationalization and localization of content.
Although XML can support any defined character set, the basis for XML globalization comes from its support of the multilingual Unicode standard.
To provide a basis for localization, we will first consider typical internationalization requirements, the role of the Universal Character Set (which we refer to as Unicode), and outline language related XML features for ruby text, vertical text, combined text, bi-directional text, the preservation of white space and more. Then we will examine how localization processes are best served by XML and the emerging XML Localisation Interchange File Format (XLIFF).
Our conclusion is that the use of Unicode and XML/XLIFF is the best way for an open, scalable approach to developing multilingual content for global deployment.
|When the world wants to talk, it speaks Unicode|
International Unicode Conferences are organized by Global Meeting Services, Inc., (GMS).
GMS is pleased to be able to offer the International Unicode Conferences under an exclusive
license granted by the Unicode Consortium. All responsibility for conference finances and
operations is borne by GMS. The independent conference board serves solely at the pleasure
of GMS and is composed of volunteers active in Unicode and in international software
development. All inquiries regarding International Unicode Conferences should be addressed
Unicode and the Unicode logo are registered trademarks of Unicode, Inc. Used with permission.
4 March 2002, Webmaster