Code Charts - Help and Links

Unicode® Code Charts Help and Links

SCRIPTS | SYMBOLS & PUNCTUATION | NAME INDEX

About the Code Charts

The code charts are provided as a convenient reference to the character contents of the latest version of the Unicode Standard. For the normative code charts for a specific version, see Access to Specific Versions. There are several other types of informative code charts—see the links to the left.

Code charts are an essential resource, but do not provide all the information needed to fully support individual scripts or symbol collections using the Unicode Standard. Proper Unicode support requires considerably more than providing glyphs for characters, and requires consulting the Unicode Standard, including the Unicode Character Database and the Unicode Standard Annexes.

Blocks and Scripts

The list of code charts is divided into two separate sections, one covering scripts and the other covering punctuation, symbols, and notational systems. Each section of the map is organized so related scripts and symbols are grouped together, as described below.

The Unicode character code charts are divided into character blocks, such that many scripts require the use of several blocks. In such cases, all blocks related to a script are listed together. The links from the block names are to the most recent version of the corresponding character code chart.

Conventions in the Code Charts Index

Script Group Multi-block Script Single Script Block Partial Block

Groups scripts by region; groups symbols, punctuation and other by major classes. Represents a single script (or symbol type), usually with a link to the main block on the subhead. Represents a single block closely associated with the block or subhead shown on a light blue background above it. Designates a range of characters within a block, with a link to the block, and not specifically to the range.

Script and symbol groups are arranged in an order that provide the best fit for the table. The order of presentation may change as additional code charts are added. The names of blocks may be an abbreviated or otherwise modified form of the official block name. In some cases, a subhead may simply be a name which covers the content of several closely related blocks. The names of ranges are freely chosen and may change.

Key to the Unicode Code Charts

Symbol Meaning Examples

※ Character name alias ※ LATIN SMALL LETTER GHA

= Informative alias(es) = barred o, o bar

• Informative note • lowercase is 0275 ɵ

• Portuguese, Estonian

• this is a spacing character

→ Cross-reference → 0283 ʃ latin small letter esh

≡ Canonical decomposition mapping ≡ 0075 u 031B ◌̛

≈ Compatibility decomposition mapping ≈ 006E n 006A j

~ Standardized variation sequence ~ 2205 FE00 zero with long diagonal stroke overlay form

For a full explanation of all the conventions and symbols used in the Unicode code charts, including the special layout used for the CJK unified ideographs, see About the Code Charts in the core specification. See also the discussion of Character Decomposition Mapping in UAX #44, Unicode Character Database. For the specification of syntax used in the Unicode names list, see NamesList.html. Documentation and a complete list of the normative character name aliases can be found in NameAliases.txt.

Search and Index

To look up a character by character code, use the lookup field at the top of the page. For an alphabetical index of characters and blocks, use the Unicode Character Name Index. To look up information on a Han ideograph, use the searchable Unihan database.

Access to Specific Versions

Version-specific code charts are only created for new blocks or blocks changed by the addition of new characters; these charts show the new characters with yellow highlighting, For each version, for example Unicode 15.0, there is a list showing all of the affected blocks, with the number of new characters added, and with links to the corresponding version-specific charts for each block.

Use the table of links to Version-Specific Charts in the navigation bar to the left to look up the code charts which changed in a particular version of the Unicode Standard.

Starting from Unicode 4.1.0, archived versions of the code charts provide the complete character code charts for a given version of the Unicode Standard. For example the directory https://www.unicode.org/Public/6.0.0/charts/ contains the archived code charts for Unicode 6.0.0. These charts do not indicate which characters were added in a particular version.

For versions of the standard prior to Unicode 3.0, scanned versions of code charts from the hardcopy book editions can be found via the links in the Archive of Unicode Versions.

Version-specific or archived code charts are not updated with errata fixes.

Translated Code Charts

From time to time, translations of particular versions of the Unicode Code Charts may be posted. For example, see the French Code Charts. The goal of such translations is to help the readers of the target language to better understand the identity of those characters in the standard by translating the names for characters or other annotational material in the code chart, or by providing additional annotations in the target language, such as additional local aliases for character names.

Any translations of the Unicode Code Charts have informative status only. Their content is not formally reviewed and approved by the Unicode Technical Committee, and they do not constititute part of any particular release of the Unicode Standard. However, such charts must meet certain criteria. Any character shown in such a translated chart must use the correct code point for that version. Glyphs shown must not deviate from the normative chart, other than insubstantial changes resulting from the production process that do not affect the identity of the character. Translated names for characters should be unique, but have no normative status and are not subject to the same stability guarantees applicable to the normative character names.

Charts and Ancillary Data for ISO/IEC 10646

The Unicode web site also hosts charts and ancillary data for ISO/IEC 10646 to assist in the publication of that standard. Currently, the code charts and ancillary data files are available for ISO/IEC 10646:2017, 5th Edition.

In the future, content corresponding to earlier editions or amendments may be added, as well as content corresponding to future editions of ISO/IEC 10646.

The code charts for ISO/IEC 10646 are produced with the same tools and names list files used to produce the code charts for the Unicode Standard, so the conventions for symbols used in the code charts are identical. The ancillary data contains source information for large repertoires such as CJK unified ideographs, Tangut ideographs, and Nushu characters. The data files also contain descriptions of 10646 character collections, an ordered list of character names, and various tabular information better represented in plain text data files. The code charts and ancillary data files are directly referenced by the ISO/IEC 10646 standard.

See the code charts and data pages for ISO/IEC 10646 for any details regarding exactly which parts of the code charts and data have normative or informative status specific to that standard.

See the WG2 Home Page for further information about ISO/IEC 10646 and for links to the full text of 10646, accessible as a publicly available specification from the ISO site.

Other Charts

In addition to the character code charts formatted in PDF, there is additional chart material available about characters in the latest Unicode version, formatted as HTML tables. These additional charts make use of Unicode characters directly, rather than embedded images, so require good font support on your viewing platform for best display.

These additional charts list Unicode characters grouped by script (Script Charts), list Unicode characters in block order, with links into the annotated names list (Names List Charts), and show casing relationships (Case Mapping Charts) and normalization relationships (Normalization Charts). Also, there are charts which show the default ordering of Unicode characters according to the Unicode Collation Algorithm (Collation Charts). See the individual charts for notes about their use and conventions.

Errata

For an up-to-date list of errata to the Standard, including errata to the code charts, see Errata and Updates. If you find what you suspect is a glyph error in the charts, or an incorrect annotation in the names list, you may use the online contact form to report it.

Known Issues

Note that the code charts are produced with hundreds of fonts. Many of those fonts are Unicode-encoded, but many are not. There is no guarantee that any particular glyph shown in the code charts is using a Unicode value as its glyph code. One should not attempt to copy/paste characters out of the PDF code charts—the code charts are not intended for or valid as an input method via copy/paste. Please do not report as an error any mismatch of glyph code for a character copy/pasted out of the code charts.

Proposed Additions to Unicode

Information on characters currently being considered for addition to the Unicode Standard can be found at Proposed Scripts and Proposed Characters. A placeholder for additions of entire scripts or blocks for may be tentatively entered into the list of code charts and marked with the version number of the target release. Any such information is tentative and subject to change.

Terms of Use

Unicode Code Charts are protected by copyright, and permission must be obtained prior to any reproduction, modification, or other use not permitted by the Terms of Use.

The Unicode Consortium specifically grants ISO a license to produce such code charts with their associated character names list to show the repertoire of characters for that standard, as a normatively referenced, integral part of that standard.

Unicode uses most fonts under restricted license from the original font designer who contributed the font. You may not extract, copy, modify, or distribute fonts or font data from any Unicode Products, including the Unicode Code Charts, without license from the font owner.

The authors, contributors, and publishers have taken care in the preparation of the code charts provided here, but make no express or implied representation or warranty of any kind and assume no responsibility or liability for errors or omissions or for consequential or incidental damages that may arise therefrom. All code charts are provided “AS-IS” without charge as a convenience to users.