[Unicode]  The Unicode Standard Home | Site Map | Search

Components of The Unicode® Standard
Version 12.1.0

This page lists the components of Version 12.1.0 of the Unicode Standard. The version numbering and the role of each component are explained in Versions of The Unicode Standard. See Unicode 12.1.0 for a summary of the contents of this version. That page also provides chapter-by-chapter links to the core specification and an index for block-by-block access to the code charts, for easier browsing of the content of the standard.


Unicode 12.1.0 (May 7, 2019)

Major Reference

The Unicode Consortium. The Unicode Standard, Version 12.1.0, (Mountain View, CA: The Unicode Consortium, 2019. ISBN 978-1-936213-25-2)

In Version 12.1.0, the core specification, as well as the Unicode Standard Annexes with the exception of UAX #42, remain published as they were for Version 12.0.0. They are incorporated by reference into Version 12.1.0, with no changes to their text or dates of publication. The single exception is Unicode Standard Annex #42, "Unicode Character Database in XML," whose schema has been updated to reflect the changes in the UCD for Version 12.1.0.

The following is a sample reference format for a UAX. For unversioned reference formats, see the Reference Examples section of the Versions page.

Unicode Standard Annex #15, "Unicode Normalization Forms," edited by Ken Whistler, an integral part of The Unicode Standard. Version 12.0.0. 2019-02-04. (http://www.unicode.org/reports/tr15/tr15-48.html)
Latest Version: http://www.unicode.org/reports/tr15/


Core Specification
UnicodeStandard-12.0.pdf (size: 14 MB)
As noted above, Versions 12.0.0 and 12.1.0 of the Unicode Standard share the same core specification.
Code Charts and Radical-Stroke Index
Code Charts (size: 108 MB)
Radical-Stroke Index (size: 35 MB)
Note that Versions 11.0.0, 12.0.0, and 12.1.0 of the Unicode Standard share the same Unihan radical-stroke index.
Unicode Standard Annexes
UAX #9: Unicode Bidirectional Algorithm
UAX #11: East Asian Width
UAX #14: Unicode Line Breaking Algorithm
UAX #15: Unicode Normalization Forms
UAX #24: Unicode Script Property
UAX #29: Unicode Text Segmentation
UAX #31: Unicode Identifier and Pattern Syntax
UAX #34: Unicode Named Character Sequences
UAX #38: Unicode Han Database (Unihan)
UAX #41: Common References for Unicode Standard Annexes
UAX #42: Unicode Character Database in XML (Version 12.1.0)
UAX #44: Unicode Character Database
UAX #45: U-Source Ideographs
UAX #50: Unicode Vertical Text Layout
Unicode Character Database
The change status labels that accompany the data files listed below are defined in the Key table of the Versions page.
D Index.txt
- NamesList.html
- ReadMe.txt
Core Data
- ArabicShaping.txt
- BidiBrackets.txt
- BidiMirroring.txt
- Blocks.txt
- CJKRadicals.txt
- CompositionExclusions.txt
D EastAsianWidth.txt
- EmojiSources.txt
- EquivalentUnifiedIdeograph.txt
- HangulSyllableType.txt
- IndicPositionalCategory.txt
- IndicSyllabicCategory.txt
- Jamo.txt
D LineBreak.txt
- NameAliases.txt
- NamedSequences.txt
- NamedSequencesProv.txt
D NamesList.txt
- NormalizationCorrections.txt
- NushuSources.txt
- PropertyAliases.txt
D PropertyValueAliases.txt
- PropList.txt
D Scripts.txt
D ScriptExtensions.txt
- SpecialCasing.txt
- StandardizedVariants.txt
- TangutSources.txt
D UnicodeData.txt
D VerticalOrientation.txt
Unihan Database (Unihan.zip)
- Unihan_DictionaryIndices.txt
- Unihan_DictionaryLikeData.txt
- Unihan_IRGSources.txt
- Unihan_NumericValues.txt
- Unihan_OtherMappings.txt
- Unihan_RadicalStrokeCounts.txt
- Unihan_Readings.txt
- Unihan_Variants.txt
Note that there are no changes in the Unihan data between Versions 12.0.0 and 12.1.0 of the Unicode Standard.
Furthermore, in Version 12.1.0, the constituent files of Unihan.zip show Version 12.0.0 in their header comments.
Data for UAX #45
D   USourceData.txt
-   USourceGlyphs.pdf
-   USourceRSChart.pdf
Note that there are no changes in USourceGlyphs.pdf and USourceRSChart.pdf between Versions 12.0.0 and 12.1.0
of the Unicode Standard. Furthermore, in Version 12.1.0, the two PDF data files show Version 12.0.0 on their cover pages.
Derived Data
- CaseFolding.txt
D DerivedAge.txt
D DerivedCoreProperties.txt
D DerivedNormalizationProps.txt
Extracted Data
D DerivedBidiClass.txt
- DerivedBinaryProperties.txt
D DerivedCombiningClass.txt
D DerivedDecompositionType.txt
D DerivedEastAsianWidth.txt
D DerivedGeneralCategory.txt
- DerivedJoiningGroup.txt
- DerivedJoiningType.txt
D DerivedLineBreak.txt
D DerivedName.txt
- DerivedNumericType.txt
- DerivedNumericValues.txt
Conformance Test Data
-   BidiCharacterTest.txt
-   BidiTest.txt
D   NormalizationTest.txt
Auxiliary Data for UAX #14 and UAX #29
-   GraphemeBreakProperty.txt
-   GraphemeBreakTest.txt
-   LineBreakTest.txt
-   SentenceBreakProperty.txt
-   SentenceBreakTest.txt
-   WordBreakProperty.txt
-   WordBreakTest.txt
Documentation for Auxiliary Data
-   GraphemeBreakTest.html
-   LineBreakTest.html
-   SentenceBreakTest.html
-   WordBreakTest.html