Unicode Standard Annex #41

Common References for Unicode Standard Annexes

Version 5.0.0
Authors Magda Danish
Date 2006-09-15
This Version http://www.unicode.org/reports/tr41/tr41-1.html
Previous Version none
Latest Version http://www.unicode.org/reports/tr41/
Revision 1

Summary

This annex presents a common set of references for the Unicode Standard Annexes.

Status

This document has been reviewed by Unicode members and other interested parties, and has been approved for publication by the Unicode Consortium. This is a stable document and may be used as reference material or cited as a normative reference by other specifications.

A Unicode Standard Annex (UAX) forms an integral part of the Unicode Standard, but is published online as a separate document. The Unicode Standard may require conformance to normative content in a Unicode Standard Annex, if so specified in the Conformance chapter of that version of the Unicode Standard. The version number of a UAX document corresponds to the version of the Unicode Standard of which it forms a part.

Please submit corrigenda and other comments with the online reporting form [Feedback]. For the latest version of the Unicode Standard see [Unicode]. For a list of current Unicode Technical Reports, see [Reports]. For more information about versions of the Unicode Standard, see [Versions].


1 References to Publications by the Unicode Consortium

Publications may be listed more than once under different headings.

[Bidi] UAX #9: Unicode Bidirectional Algorithm
http://www.unicode.org/reports/tr9/
[Blocks] Blocks data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/Blocks.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/Blocks.txt
[Boundaries] UAX #29: Text Boundaries
http://www.unicode.org/reports/tr29/
[Charts] Online Code Charts
http://www.unicode.org/charts/
An index to character names with links to the corresponding chart is found at
http://www.unicode.org/charts/charindex.html
[Charts15] Normalization Charts
http://www.unicode.org/reports/tr15/charts
[Charts29] Charts for the test files
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/auxiliary/GraphemeBreakTest.html
http://www.unicode.org/Public/UNIDATA/auxiliary/WordBreakTest.html
http://www.unicode.org/Public/UNIDATA/auxiliary/SentenceBreakTest.html
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/auxiliary/GraphemeBreakTest.html
http://www.unicode.org/Public/5.0.0/ucd/auxiliary/WordBreakTest.html
http://www.unicode.org/Public/5.0.0/ucd/auxiliary/SentenceBreakTest.html
[CLDR] Common Locale Data Repository
http://www.unicode.org/cldr/
[Code9] Reference code implementing the Bidirectional Algorithm
For the original verified C/C++ reference implementation, see: http://www.unicode.org/reports/tr9/BidiReferenceCpp/
For the original verified Java reference implementation, see: http://www.unicode.org/reports/tr9/BidiReferenceJava/
For updates to the C/C++ sample code, see: http://www.unicode.org/Public/PROGRAMS/BidiReferenceCpp/
[Code14] Sample code implementing the Line Break Algorithm using a pair table
http://www.unicode.org/Public/PROGRAMS/LineBreakSampleCpp/
Contains the code samples shown in UAX #14 together with driver code.
[Collation] UTS #10: Unicode Collation Algorithm (UCA)
http://www.unicode.org/reports/tr10/
[Corrections] Normalization Corrections
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/NormalizationCorrections.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/NormalizationCorrections.txt
[Corrigendum1] Corrigendum #1: UTF-8 Shortest Form
http://www.unicode.org/versions/corrigendum1.html
[Corrigendum2] Corrigendum #2: Yod with Hiriq Normalization
http://www.unicode.org/versions/corrigendum2.html
[Corrigendum3] Corrigendum #3: U+F951 Normalization
http://www.unicode.org/versions/corrigendum3.html
[Corrigendum4] Corrigendum #4: Five CJK Canonical Mapping Errors
http://www.unicode.org/versions/corrigendum4.html
[Corrigendum5] Corrigendum #5: Normalization Idempotency
http://www.unicode.org/versions/corrigendum5.html
[Data9] Bidi Mirroring
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/BidiMirroring.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/BidiMirroring.txt
[Data11] East Asian Width property data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/EastAsianWidth.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/EastAsianWidth.txt
[Data14] Line Break property data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/LineBreak.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/LineBreak.txt
[Data24] Scripts data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/Scripts.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/Scripts.txt
[Data34] Named Sequences data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/NamedSequences.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/NamedSequences.txt
[DataProv] Provisional Named Sequences data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/NamedSequencesProv.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/NamedSequencesProv.txt
[DerivedBIDI] Derived Bidi Properties
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/extracted/DerivedBidiClass.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/extracted/DerivedBidiClass.txt
[EAW] UAX #11: East Asian Width
http://www.unicode.org/reports/tr11/
[Errata] Updates and Errata
http://www.unicode.org/errata
[Exclusions] Composition Exclusion Table
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/CompositionExclusions.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/CompositionExclusions.txt
[FAQ] Unicode Frequently Asked Questions
http://www.unicode.org/faq/
For answers to common questions on technical issues.
[Feedback] Reporting Form
http://www.unicode.org/reporting.html
For reporting errors and requesting information online.
[Glossary] Unicode Glossary
http://www.unicode.org/glossary/
For explanations of terminology used in this and other documents.
[HangulST] Hangul Syllable Types
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/HangulSyllableType.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/HangulSyllableType.txt
[LineBreak] UAX #14: Line Breaking Properties
http://www.unicode.org/reports/tr14/
[NormProps] Derived Normalization Properties
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/DerivedNormalizationProps.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/DerivedNormalizationProps.txt
[Policies] Unicode Policies
http://www.unicode.org/policies/
[Props] Property Data:
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/auxiliary/GraphemeBreakProperty.txt
http://www.unicode.org/Public/UNIDATA/auxiliary/WordBreakProperty.txt
http://www.unicode.org/Public/UNIDATA/auxiliary/SentenceBreakProperty.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/auxiliary/GraphemeBreakProperty.txt
http://www.unicode.org/Public/5.0.0/ucd/auxiliary/WordBreakProperty.txt
http://www.unicode.org/Public/5.0.0/ucd/auxiliary/SentenceBreakProperty.txt
[PropValue] Property Value Aliases data file
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/PropertyValueAliases.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/PropertyValueAliases.txt
[RegEx] UTS #18: Unicode Regular Expressions
http://www.unicode.org/reports/tr18/
[Reports] Unicode Technical Reports
http://www.unicode.org/reports/
For information on the status and development process for technical reports, and for a list of technical reports.
[Sample] Sample Normalizer code
http://www.unicode.org/reports/tr15/Normalizer.html
[Security] UTR #36: Security Considerations for the Implementation of Unicode and Related Technology
http://www.unicode.org/reports/tr36/
[Stability] Unicode Consortium Stability Policies
http://www.unicode.org/standard/stability_policy.html 
[Tests15] Normalization Conformance Test
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/NormalizationTest.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/NormalizationTest.txt
[Tests29] Test data:
For the latest version, see:
http://www.unicode.org/Public/UNIDATA/auxiliary/GraphemeBreakTest.txt
http://www.unicode.org/Public/UNIDATA/auxiliary/WordBreakTest.txt
http://www.unicode.org/Public/UNIDATA/auxiliary/SentenceBreakTest.txt
For the 5.0.0 version, see:
http://www.unicode.org/Public/5.0.0/ucd/auxiliary/GraphemeBreakTest.txt
http://www.unicode.org/Public/5.0.0/ucd/auxiliary/WordBreakTest.txt
http://www.unicode.org/Public/5.0.0/ucd/auxiliary/SentenceBreakTest.txt
[UAX14] UAX #14: Line Breaking Properties
http://www.unicode.org/reports/tr14/
[UAX15] UAX #15: Unicode Normalization Forms
http://www.unicode.org/reports/tr15/
[UAX29] UAX #29: Text Boundaries
http://www.unicode.org/reports/tr29/
[UAX31] UAX #31: Identifier and Pattern Syntax
http://www.unicode.org/reports/tr31/
[UCA] UTS #10: Unicode Collation Algorithm
http://www.unicode.org/reports/tr10/
[UCD] Unicode Character Database
http://www.unicode.org/ucd/
For an overview of the Unicode Character Database and a list of its associated files, see:
http://www.unicode.org/Public/UNIDATA/UCD.html
[UCDDoc] Unicode Character Database Documentation
http://www.unicode.org/Public/UNIDATA/UCD.html
[Unicode] The Unicode Standard
For the latest version, see:
http://www.unicode.org/versions/latest/
For the 5.0.0 version, see:
http://www.unicode.org/versions/Unicode5.0.0/
[Unicode3.0] The Unicode Consortium. The Unicode Standard, Version 3.0 (Reading, MA, Addison-Wesley, 2000. ISBN 0-201-61633-5).
[Unicode3.1] The Unicode Consortium. The Unicode Standard, Version 3.1.0, defined by: The Unicode Standard, Version 3.0 (Reading, MA, Addison-Wesley, 2000. ISBN 0-201-61633-5), as amended by the Unicode Standard Annex #27: Unicode 3.1
http://www.unicode.org/reports/tr27/
[Unicode3.2] The Unicode Consortium. The Unicode Standard, Version 3.2.0, defined by: The Unicode Standard, Version 3.0 (Reading, MA, Addison-Wesley, 2000. ISBN 0-201-61633-5), as amended by the Unicode Standard Annex #27: Unicode 3.1 and the Unicode Standard Annex #28: Unicode 3.2
http://www.unicode.org/reports/tr28/
[Unicode4.0] The Unicode Consortium. The Unicode Standard, Version 4.0 (Boston, MA, Addison-Wesley, 2003. ISBN 0-321-18578-1).
[Unicode4.0.1] The Unicode Consortium. The Unicode Standard, Version 4.0.1, defined by: The Unicode Standard, Version 4.0 (Boston, MA, Addison-Wesley, 2003. ISBN 0-321-18578-1), as amended by Unicode 4.0.1
http://www.unicode.org/versions/Unicode4.0.1/
[Unicode4.1] The Unicode Consortium. The Unicode Standard, Version 4.1.0, defined by: The Unicode Standard, Version 4.0 (Boston, MA, Addison-Wesley, 2003. ISBN 0-321-18578-1), as amended by Unicode 4.0.1 and by Unicode 4.1.0
http://www.unicode.org/versions/Unicode4.1.0/
[Unicode5.0] The Unicode Consortium. The Unicode Standard, Version 5.0 (Boston, MA, Addison-Wesley, 2007. ISBN 0-321-48091-0).
[UTC] Unicode Technical Committee
http://www.unicode.org/consortium/utc.html
[UTN5] UTN #5: Canonical Equivalences in Applications
http://www.unicode.org/notes/tn5
[UTR36] UTR #36: Unicode Security Considerations
http://www.unicode.org/reports/tr36/
[UTS35] UTS #35: Locale Data Markup Language (LDML)
http://www.unicode.org/reports/tr35/
[UTS10] UTS #10: Unicode Collation Algorithm (UCA)
http://www.unicode.org/reports/tr10/
[UTS18] UTS #18: Unicode Regular Expressions
http://www.unicode.org/reports/tr18/
[UTS39] UTS #39: Unicode Security Mechanisms
http://www.unicode.org/reports/tr39/
[Versions] Versions of the Unicode Standard
http://www.unicode.org/versions/
For information on version numbering, and citing and referencing the Unicode Standard, the Unicode Character Database, and Unicode Technical Reports.
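
Many data file entries in the list above pair a "latest" URL under Public/UNIDATA/ with a version-specific URL under Public/5.0.0/ucd/. The following is a minimal sketch of that pattern only. The function name ucd_file_url and its parameters are hypothetical, not part of any Unicode Consortium tooling, and the versioned layout is assumed from the 5.0.0 entries in this annex; other versions may be organized differently.

```python
# Base location of the data files, as given in the reference list above.
BASE = "http://www.unicode.org/Public"


def ucd_file_url(filename, version=None, subdir=""):
    """Build the URL of a UCD data file (hypothetical helper).

    version=None selects the "latest" location (UNIDATA); a version
    string such as "5.0.0" selects the version-specific location,
    assuming the <version>/ucd/ layout seen in this annex.
    subdir is an optional subdirectory such as "auxiliary" or "extracted".
    """
    parts = [BASE]
    if version is None:
        parts.append("UNIDATA")
    else:
        parts.extend([version, "ucd"])
    if subdir:
        parts.append(subdir)
    parts.append(filename)
    return "/".join(parts)


if __name__ == "__main__":
    print(ucd_file_url("Blocks.txt"))
    print(ucd_file_url("Blocks.txt", version="5.0.0"))
    print(ucd_file_url("WordBreakTest.txt", version="5.0.0", subdir="auxiliary"))
```

The sketch only assembles strings and performs no network access; whether a given URL resolves should be checked against the entries listed in this annex.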

2 References to Other Standards

[10646] International Organization for Standardization. Information Technology—Universal Multiple-Octet Coded Character Set (UCS). (ISO/IEC 10646:2003).
For availability, see:
http://www.iso.org
[ISO15924] ISO 15924: Code for the Representation of Names of Scripts
http://www.unicode.org/iso15924/
[JIS] JIS X 4051-1995. Line Composition Rules for Japanese Documents.
(『日本語文書の行組版方法』) Japanese Standards Association. 1995.
[XML1.1] Extensible Markup Language (XML) 1.1
http://www.w3.org/TR/xml11/

3 Other References

[Cedar97] Cy Cedar, David Veintimilla, Michel Suignard, and Asmus Freytag, Report from the Trenches: Microsoft Publisher goes Unicode. Proceedings of the Eleventh International Unicode Conference, San Jose, CA, 1997.
[CharLint] Charlint—A Character Normalization Tool
http://www.w3.org/International/charlint/
[CharMod] W3C Character Model for the World Wide Web
http://www.w3.org/TR/charmod/
[CharReq] W3C Requirements for String Identity Matching and String Indexing
http://www.w3.org/TR/WD-charreq
[Knuth78] Donald E. Knuth and Michael F. Plass, Breaking Lines into Paragraphs, republished in Digital Typography, CSLI 78 (Stanford, California: CSLI Publications, 1997).
[Suign98] Michel Suignard, Worldwide Typography and How to Apply JIS X 4051-1995 to Unicode. Proceedings of the Twelfth International Unicode/ISO 10646 Conference, Tokyo, Japan, 1998.
[TEX] Donald E. Knuth, TEX, the Program, Volume B of Computers & Typesetting (Reading, MA, Addison-Wesley, 1986).