[Unicode] Code Charts Look up by character code: Tech Site | Site Map | Search
 

 

UnicodeĀ® Code Charts Help and Links

SCRIPTS   |   SYMBOLS & PUNCTUATION   |   NAME INDEX

About the Online Code Charts

These charts are provided as a convenient online reference to the character contents of the Unicode Standard but do not provide all the information needed to fully support individual scripts using the Unicode Standard. Proper Unicode support requires considerably more than providing glyphs for characters, and requires consulting the Unicode Standard, including the Unicode Character Database and the Unicode Standard Annexes. There are several other types of charts, see the links on the left.

Blocks and Scripts

The list of code charts is divided into two separate sections, one covering scripts and the other covering punctuation, symbols, and notational systems. Each section of the map is organized so related scripts and symbols are grouped together, as described below.

The Unicode character code charts are divided into character blocks, such that many scripts require the use of several blocks. In such cases, all blocks related to a script are listed together. The links from the block names are to the most recent version of the corresponding character code chart.

Conventions in the Code Charts Index

Script Group Multi-block Script Single Script Block Partial Block
Groups scripts by region; groups symbols, punctuation and other by major classes. Represents a single script (or symbol type), usually with a link to the main block on the subhead. Represents a single block closely associated with the block or subhead shown on a light blue background above it. Designates a range of characters within a block, with a link to the block, and not specifically to the range.

Script and symbol groups are arranged in an order that provide the best fit for the table. The order of presentation may change as additional code charts are added. The names of blocks may be an abbreviated or otherwise modified form of the official block name. In some cases, a subhead may simply be a name which covers the content of several closely related blocks. The names of ranges are freely chosen and may change.

Key to the Unicode Code Charts

Symbol Meaning Examples
Character name alias ※ LATIN SMALL LETTER GHA
= Informative alias(es) = barred o, o bar
Informative note • lowercase is 0275 ɵ
• Portuguese, Estonian
• this is a spacing character
Cross-reference → 0283 ʃ  latin small letter esh
Canonical decomposition mapping ≡ 0075 u 031B ◌̛
Compatibility decomposition mapping ≈ 006E n 006A j
~ Standardized variation sequence ~ 2205 FE00 zero with long diagonal stroke overlay form

For a full explanation of all the conventions and symbols used in the Unicode code charts, including the special layout used for the CJK unified ideographs, see About the Code Charts in the core specification. See also the discussion of Character Decomposition Mapping in UAX #44, Unicode Character Database. For the specification of syntax used in the Unicode names list, see NamesList.html. Documentation and a complete list of the normative character name aliases can be found in NameAliases.txt.

Search and Index

To look up a character by character code, use the lookup field at the top of the page. For an alphabetical index of characters and blocks, use the Unicode Character Name Index. To look up information on a Han ideograph, use the searchable Unihan database.

Access to Specific Versions

Version-specific code charts are only created for new blocks or blocks changed by the addition of new characters; these charts show the new characters with yellow highlighting, For each version, for example Unicode 14.0, there is a list showing all of the affected blocks, with the number of new characters added, and with links to the corresponding version-specific charts for each block.

Use the table of links to Version-Specific Charts in the navigation bar to the left to look up the code charts which changed in a particular version of the Unicode Standard.

Starting from Unicode 4.1.0, archived versions of the code charts provide the complete character code charts for a given version of the Unicode Standard. For example the directory https://www.unicode.org/Public/6.0.0/charts/ contains the archived code charts for Unicode 6.0.0. These charts do not indicate which characters were added in a particular version.

For versions of the standard prior to Unicode 3.0, scanned versions of code charts from the hardcopy book editions can be found via the links in the Archive of Unicode Versions.

Version-specific or archived code charts are not updated with errata fixes.

Translated Code Charts

From time to time, translations of particular versions of the Unicode Code Charts may be posted. For example, see the French Code Charts. The goal of such translations is to help the readers of the target language to better understand the identity of those characters in the standard by translating the names for characters or other annotational material in the code chart, or by providing additional annotations in the target language, such as additional local aliases for character names.

Any translations of the Unicode Code Charts have informative status only. Their content is not formally reviewed and approved by the Unicode Technical Committee, and they do not constititute part of any particular release of the Unicode Standard. However, such charts must meet certain criteria. Any character shown in such a translated chart must use the correct code point for that version. Glyphs shown must not deviate from the normative chart, other than insubstantial changes resulting from the production process that do not affect the identity of the character. Translated names for characters should be unique, but have no normative status and are not subject to the same stability guarantees applicable to the normative character names.

Charts and Ancillary Data for ISO/IEC 10646

The Unicode web site also hosts charts and ancillary data for ISO/IEC 10646 to assist in the publication of that standard. Currently, the code charts and ancillary data files are available for ISO/IEC 10646:2017, 5th Edition.

In the future, content corresponding to earlier editions or amendments may be added, as well as content corresponding to future editions of ISO/IEC 10646.

The code charts for ISO/IEC 10646 are produced with the same tools and names list files used to produce the code charts for the Unicode Standard, so the conventions for symbols used in the code charts are identical. The ancillary data contains source information for large repertoires such as CJK unified ideographs, Tangut ideographs, and Nushu characters. The data files also contain descriptions of 10646 character collections, an ordered list of character names, and various tabular information better represented in plain text data files. The code charts and ancillary data files are directly referenced by the ISO/IEC 10646 standard.

See the code charts and data pages for ISO/IEC 10646 for any details regarding exactly which parts of the code charts and data have normative or informative status specific to that standard.

See the WG2 Home Page for further information about ISO/IEC 10646 and for links to the full text of 10646, accessible as a publicly available specification from the ISO site.

Other Charts

In addition to the character code charts formatted in PDF, there is additional chart material available about characters in the latest Unicode version, formatted as HTML tables. These additional charts make use of Unicode characters directly, rather than embedded images, so require good font support on your viewing platform for best display.

These additional charts list Unicode characters grouped by script (Script Charts), list Unicode characters in block order, with links into the annotated names list (Names List Charts), and show casing relationships (Case Mapping Charts) and normalization relationships (Normalization Charts). Also, there are charts which show the default ordering of Unicode characters according to the Unicode Collation Algorithm (Collation Charts). See the individual charts for notes about their use and conventions.

Errata

For an up-to-date list of errata to the Standard, including errata to the code charts, see Errata and Updates. If you find what you suspect is a glyph error in the charts, or an incorrect annotation in the names list, you may use the online contact form to report it.

Proposed Additions to Unicode

Information on characters currently being considered for addition to the Unicode Standard can be found at Proposed Scripts and Proposed Characters. A placeholder for additions of entire scripts or blocks for may be tentatively entered into the list of code charts and marked with the version number of the target release. Any such information is tentative and subject to change.

Disclaimers

The code charts are provided as a public service by Unicode, Inc. They present information copyrighted by Unicode, Inc., but not all files are guaranteed to accurately reflect the most current version of The Unicode Standard. For major and minor versions of the standard, except as otherwise noted, the authoritative version-specific reference is the archived version of the code charts. For major versions of the standard prior to Unicode 5.2, the authoritative reference is the printed version of the code charts.

The information on these pages may be updated from time to time. The Unicode Consortium is not liable for errors or omissions in these charts or the standard itself.

Fonts

The fonts used in these charts were provided to the Unicode Consortium by a number of different font designers who own the rights to the fonts. Note that the glyphs in these charts are only representative; there can be wide variation in the glyphs used to represent any particular character, as discussed in the standard.

Terms of Use

You may freely use these code charts for personal or internal business uses only. You may not incorporate them wholly or in part into any product or publication, or otherwise distribute them without express written permission from the Unicode Consortium. However, you may provide links to these charts.

The ISO/IEC 10646:2017 code charts present information copyrighted by Unicode, Inc. The Unicode Consortium specifically grants ISO the right to use those code charts with their associated character names list to show the repertoire of characters for that standard, as a normatively referenced, integral part of that standard.