Version 4.1 of UCA Released

From: Rick McGowan
Date: Fri May 06 2005 - 16:26:15 CDT

    Version 4.1 of UCA Released:
    Searching, Sorting, and Matching Unicode Characters

    Mountain View, CA, May 6, 2005 - The Unicode Consortium announced today
    the release of Version 4.1.0 of UTS #10, The Unicode Collation Algorithm
    (UCA), used for searching, sorting, and matching Unicode characters.

    The major changes in this release are:

    - increase in repertoire to cover the recently-released Unicode 4.1.0
    - a revised handling of Thai/Lao via contractions
    - enhancements to sorting and matching, with new conformance requirements
    - changes to the handling of ignorable characters
    - guidelines on the use of the combining grapheme joiner (U+034F)
    - additional introductory text
    - changes in weights for a small number of characters.

    The UCA also serves as the basis for collation in the Common Locale Data
    Repository (CLDR); the language-specific collation sequences in the CLDR
    are expressed as tailorings of the default UCA table. For more information,
    see .
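
    As a rough illustration of what "tailoring" means here, the sketch below
    layers a language-specific override on top of a default weight table. It
    is a toy model only: the weights, the names DEFAULT_WEIGHTS and
    SWEDISH_TAILORING, and the helper make_sort_key are all invented for this
    example and are not taken from the UCA table or from CLDR. The real
    algorithm uses multi-level collation elements rather than a single
    integer per character.

```python
# Toy model of "default table + tailoring". All weights are hypothetical and
# chosen only to reproduce the orderings discussed; they are NOT real UCA data.
DEFAULT_WEIGHTS = {"a": 10, "ä": 11, "o": 20, "ö": 21, "z": 30}

# Hypothetical Swedish-style tailoring: ä and ö are moved to sort after z.
SWEDISH_TAILORING = {"ä": 31, "ö": 32}


def make_sort_key(tailoring=None):
    """Return a key function built from the default weights plus overrides."""
    weights = {**DEFAULT_WEIGHTS, **(tailoring or {})}

    def sort_key(text):
        # One weight per character; real collation keys are multi-level.
        return [weights[ch] for ch in text]

    return sort_key


letters = ["z", "ä", "a", "ö", "o"]

default_order = sorted(letters, key=make_sort_key())
swedish_order = sorted(letters, key=make_sort_key(SWEDISH_TAILORING))

print(default_order)  # ['a', 'ä', 'o', 'ö', 'z']
print(swedish_order)  # ['a', 'o', 'z', 'ä', 'ö']
```

    The point of the design is that a tailoring only lists the characters
    whose position differs from the default; everything else is inherited
    unchanged.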

    About the Unicode Standard

    The Unicode Standard is a fundamental component of all modern software and
    information technology protocols. It provides a uniform, universal
    architecture and encoding for all languages of the world -- with over
    96,000 characters currently encoded -- and is the basis for processing,
    storage, and seamless data interchange of text data worldwide. Unicode is
    required by modern standards such as XML, Java, C#, ECMAScript
    (JavaScript), LDAP, CORBA 3.0, WML, IDN, etc., and is the official way to
    implement ISO/IEC 10646.

    About the Unicode Consortium

    The Unicode Consortium is a non-profit organization founded to develop,
    extend and promote software globalization. The membership of the consortium
    represents a broad spectrum of corporations and organizations in the
    computer and information processing industry. Full members (the highest
    level) are: Adobe Systems, L'Agence intergouvernementale de la
    Francophonie, Apple Computer, Government of India - Ministry of Information
    Technology, Government of Pakistan - National Language Authority, HP, IBM,
    Justsystem, Microsoft, Monotype Imaging, Oracle, RLG, SAP, Sun
    Microsystems, and Sybase. In addition, there are about 100 Supporting,
    Associate, Liaison, and Individual members.

    For more information, please contact the Unicode Consortium.
