 

Components of The Unicode Standard
Version 5.1.0

This page lists the components of Version 5.1.0 of the Unicode Standard. The version numbering, the symbols used in the file listings, and the role of each component are explained in Versions of The Unicode Standard.

Note: All files available via HTTP are also mirrored on FTP, so either http://www.unicode.org/Public/ or ftp://www.unicode.org/Public/ can be used.
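As a minimal sketch of the mirroring described in the note, the following Python builds the HTTP and FTP URLs for the same path under a versioned directory. The function name `mirror_urls` is hypothetical, and the `ucd/UnicodeData.txt` path in the usage lines assumes that data files sit in a `ucd` subdirectory, which this page does not state.

```python
# Root URLs taken from the note above.
HTTP_ROOT = "http://www.unicode.org/Public/"
FTP_ROOT = "ftp://www.unicode.org/Public/"


def mirror_urls(version, relative_path=""):
    """Return the (HTTP, FTP) URL pair for a path under a versioned directory.

    `mirror_urls` is an illustrative helper, not part of any Unicode tooling.
    """
    suffix = f"{version}/{relative_path}"
    return HTTP_ROOT + suffix, FTP_ROOT + suffix


# Assumption: the data files live under a "ucd" subdirectory.
http_url, ftp_url = mirror_urls("5.1.0", "ucd/UnicodeData.txt")
print(http_url)
print(ftp_url)
```

Both URLs differ only in the scheme, so a client can fall back from one to the other without changing the rest of the path.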

 


Released: April 4, 2008

The Unicode Consortium. The Unicode Standard, Version 5.1.0, defined by: The Unicode Standard, Version 5.0 (Boston, MA, Addison-Wesley, 2007. ISBN 0-321-48091-0) (http://www.unicode.org/versions/Unicode5.0.0/), as amended by Unicode 5.1.0 (http://www.unicode.org/versions/Unicode5.1.0/)

The following is a sample reference format for a UAX. For unversioned reference formats, see the References section of the Versions page.

Unicode Standard Annex #15, "Unicode Normalization Forms," by Mark Davis and Martin Dürst, an integral part of The Unicode Standard. Version 5.1.0. 2008-03-28. (http://www.unicode.org/reports/tr15/tr15-29.html)
Latest Version: http://www.unicode.org/reports/tr15/

Version 5.1.0 of the Unicode Standard is defined by the following components. For a summary of the contents of this version, see Unicode 5.1.0.

Major Reference
The Unicode Consortium. The Unicode Standard, Version 5.0
Boston, MA, Addison-Wesley Developers Press, 2007. ISBN 0-321-48091-0.
Unicode Standard Annexes
UAX #9: Unicode Bidirectional Algorithm
UAX #11: East Asian Width
UAX #14: Unicode Line Breaking Algorithm
UAX #15: Unicode Normalization Forms
UAX #24: Unicode Script Property
UAX #29: Unicode Text Segmentation
UAX #31: Unicode Identifier and Pattern Syntax
UAX #34: Unicode Named Character Sequences
UAX #38: Unicode Han Database (Unihan)
UAX #41: Common References for Unicode Standard Annexes
UAX #42: Unicode Character Database in XML
UAX #44: Unicode Character Database
Unicode Character Database
http://www.unicode.org/Public/5.1.0, or
ftp://www.unicode.org/Public/5.1.0
Documentation
D Index.txt
F NamesList.html
T ReadMe.txt
- StandardizedVariants.html
T UCD.html
T Unihan.html
Core Data
D ArabicShaping.txt
D BidiMirroring.txt
D Blocks.txt
T CompositionExclusions.txt
D EastAsianWidth.txt
- HangulSyllableType.txt
- Jamo.txt
D LineBreak.txt
- NameAliases.txt
D NamedSequences.txt
D NamedSequencesProv.txt
D NamesList.txt
T NormalizationCorrections.txt
D PropertyAliases.txt
D PropertyValueAliases.txt
D PropList.txt
D Scripts.txt
T SpecialCasing.txt
F StandardizedVariants.txt
D UnicodeData.txt
D Unihan.txt (very large file, see Unihan.zip)
Derived Data
D CaseFolding.txt
D DerivedAge.txt
D DerivedCoreProperties.txt
D DerivedNormalizationProps.txt
Extracted Data
D DerivedBidiClass.txt
D DerivedBinaryProperties.txt
D DerivedCombiningClass.txt
D DerivedDecompositionType.txt
D DerivedEastAsianWidth.txt
D DerivedGeneralCategory.txt
D DerivedJoiningGroup.txt
D DerivedJoiningType.txt
D DerivedLineBreak.txt
D DerivedNumericType.txt
D DerivedNumericValues.txt
Conformance Test Data
D NormalizationTest.txt
Auxiliary Data for UAX #14 and UAX #29
D GraphemeBreakProperty.txt
D SentenceBreakTest.txt
D GraphemeBreakTest.txt
N LineBreakTest.txt
D SentenceBreakProperty.txt
D WordBreakProperty.txt
D WordBreakTest.txt
Documentation for Auxiliary Data
D GraphemeBreakTest.html
N LineBreakTest.html
D SentenceBreakTest.html
D WordBreakTest.html
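
The file listings above pair a one-character status symbol with a file name. The following Python is a rough sketch of splitting such lines into symbol, file name, and trailing note, then grouping file names by symbol. The helper names `parse_listing` and `group_by_symbol` are hypothetical, and the code deliberately does not interpret the symbols, since their meaning is given in Versions of The Unicode Standard rather than here.

```python
from collections import defaultdict


def parse_listing(lines):
    """Split listing lines into (symbol, filename, note) tuples.

    A line counts as an entry only if it starts with a one-character
    symbol followed by a token containing a dot; section headings such
    as "Core Data" are skipped. Any run of whitespace is tolerated.
    """
    entries = []
    for line in lines:
        parts = line.split(None, 2)
        if len(parts) < 2:
            continue
        symbol, filename = parts[0], parts[1]
        if len(symbol) != 1 or "." not in filename:
            continue
        note = parts[2] if len(parts) == 3 else ""
        entries.append((symbol, filename, note))
    return entries


def group_by_symbol(entries):
    """Map each status symbol to the list of file names carrying it."""
    groups = defaultdict(list)
    for symbol, filename, _note in entries:
        groups[symbol].append(filename)
    return dict(groups)


# Sample lines copied from the listing above.
sample = [
    "Core Data",
    "D ArabicShaping.txt",
    "T CompositionExclusions.txt",
    "- Jamo.txt",
    "F     StandardizedVariants.txt",
    "D Unihan.txt (very large file, see Unihan.zip)",
]

entries = parse_listing(sample)
groups = group_by_symbol(entries)
for symbol, names in groups.items():
    print(symbol, names)
```

Keeping the trailing note as a separate field preserves remarks such as the one attached to Unihan.txt without letting them leak into the file name.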