Unicode® Code Charts Help and Links
SYMBOLS & PUNCTUATION
| NAME INDEX
About the Online Code Charts
These charts are provided as a convenient
online reference to the character contents of the
but do not provide all the information needed to fully support
individual scripts using the Unicode Standard. Proper Unicode
support requires considerably more than providing glyphs
for characters, and requires consulting the Unicode Standard,
including the Unicode
Character Database and the
Standard Annexes. There are several other types of charts, see
the links on the left.
Blocks and Scripts
The list of code charts is divided into two separate
sections, one covering
and the other covering
symbols, and notational systems. Each section of the
map is organized so related scripts and symbols are
grouped together, as described
The Unicode character code charts are divided into character
blocks, such that many scripts require the use of several
blocks. In such cases, all blocks related to a script are
listed together. The links from the block names are to the
most recent version of the corresponding character code
Conventions in the Code Charts Index
||Single Script Block
|Groups scripts by region; groups symbols, punctuation and other by major classes.
||Represents a single script (or symbol type), usually with a link to the
main block on the subhead.
||Represents a single block closely associated with the block or subhead
shown on a light blue background above it.
||Designates a range of characters
within a block, with a link to the block, and not specifically to the range.
Script and symbol groups are arranged in an order that provide
the best fit for the table. The order of presentation may change as
additional code charts are added. The names of blocks may be an abbreviated
or otherwise modified form of the official block name. In some cases, a subhead
may simply be a name which covers the content of several closely related blocks.
The names of ranges are freely chosen and may change.
Key to the Unicode Code Charts
||Character name alias
||※ LATIN SMALL LETTER GHA
||= barred o, o bar
||• lowercase is 0275 ɵ
|• Portuguese, Estonian
|• this is a spacing character
||→ 0283 ʃ latin small letter esh
||Canonical decomposition mapping
||≡ 0075 u 031B ◌̛
||Compatibility decomposition mapping
||≈ 006E n 006A j
||Standardized variation sequence
||~ 2205 FE00 zero with long diagonal stroke overlay form
For a full explanation of all the conventions and symbols used in
the Unicode code charts, including the special layout used for
the CJK unified ideographs, see
About the Code Charts
in the core specification. See also the discussion of
Character Decomposition Mapping
in UAX #44, Unicode Character Database.
For the specification of syntax used in the
Unicode names list, see
Documentation and a complete list of the normative character name aliases
can be found in
Search and Index
To look up a character by character code, use the lookup field at the
top of the page. For an alphabetical index of characters and blocks,
Character Name Index. To look up information on a Han ideograph,
Access to Specific Versions
Version-specific code charts are only created for new blocks or blocks changed by the addition of new characters; these charts show the new characters with yellow highlighting, For each version, for example
Unicode 5.0, there is a list showing all of the affected blocks, with the number of new characters added, and with links to the corresponding version-specific charts for each block.
Use the table of links to Version-Specific Charts in the navigation bar to the left to look up the code charts which changed in a particular version of the Unicode Standard.
Starting from Unicode 4.1.0, archived versions of the code charts provide the complete character code charts for a given version of the Unicode Standard. For example the directory
https://www.unicode.org/Public/6.0.0/charts/ contains the archived code charts for Unicode 6.0.0. These charts do not indicate which characters were added in a particular version.
For versions of the standard prior to Unicode 3.0, scanned versions of code charts
from the hardcopy book editions can be found via the links in the
Archive of Unicode Versions.
Version-specific or archived code charts are not updated with errata fixes.
Translated Code Charts
From time to time, translations of particular versions of the Unicode Code Charts
may be posted. For example, see the French Code Charts.
The goal of such translations is to help the readers of the target language
to better understand the identity of those characters in the standard by translating
the names for characters or other annotational material in the code chart, or
by providing additional annotations in the target language, such as additional
local aliases for character names.
Any translations of the Unicode Code Charts have informative status only. Their content
is not formally reviewed and approved by the Unicode Technical Committee, and
they do not constititute part of any particular release of the Unicode Standard.
However, such charts must meet certain criteria. Any character shown in such a
translated chart must use the correct code point for that version.
Glyphs shown must not deviate from the normative chart, other than
insubstantial changes resulting from the production process that
do not affect the identity of the character.
Translated names for characters should be unique, but have no normative status
and are not subject to the same stability guarantees applicable to the
normative character names.
Charts and Ancillary Data for ISO/IEC 10646
The Unicode web site also hosts charts and ancillary data for ISO/IEC 10646 to
assist in the publication of that standard. Currently, the
code charts and
ancillary data files
are available for ISO/IEC 10646:2017, 5th Edition.
In the future, content corresponding to earlier editions or amendments may be
added, as well as content corresponding to future editions of ISO/IEC 10646.
The code charts for ISO/IEC 10646 are produced with the same tools and names list
files used to produce the code charts for the Unicode Standard, so the conventions
for symbols used in the code charts are identical. The ancillary data contains
source information for large repertoires such as CJK unified ideographs, Tangut
ideographs, and Nushu characters. The data files also contain descriptions of
10646 character collections, an ordered list of character names, and various
tabular information better represented in plain text data files. The
code charts and ancillary data files are directly referenced by the ISO/IEC 10646
See the code charts and data
pages for ISO/IEC 10646 for any details regarding exactly which parts of the
code charts and data have normative or informative status specific to that standard.
See the WG2 Home Page for further information
about ISO/IEC 10646 and for links to the full text of 10646, accessible as
a publicly available specification from the ISO site.
In addition to the character code charts formatted in PDF, there is additional
chart material available about characters in the latest Unicode version, formatted as HTML
tables. These additional charts make use of Unicode characters directly,
rather than embedded images, so require good font support on your viewing
platform for best display.
These additional charts list Unicode characters grouped
by script (Script Charts), list Unicode characters in block order,
with links into the annotated names list (Names List
and show casing relationships (Case Mapping
normalization relationships (Normalization
there are charts which show the default ordering of Unicode characters
according to the Unicode Collation Algorithm (Collation
See the individual charts for notes about their use and conventions.
For an up-to-date list of errata to the Standard, including
errata to the code charts, see
Errata and Updates. If you find what you suspect is
a glyph error in the charts, or an incorrect annotation
in the names list, you may use the
form to report it.
Proposed Additions to Unicode
Information on characters currently being considered
for addition to the Unicode Standard can be found at
Characters. A placeholder for additions of entire scripts
or blocks for may be tentatively entered into the list of
code charts and marked with the version number of the target
release. Any such information is tentative and subject to
The code charts are provided as a public service by Unicode,
Inc. They present information copyrighted by Unicode, Inc.,
but not all files are guaranteed to accurately reflect the
most current version of The Unicode Standard.
For major and minor versions of the standard, except
as otherwise noted, the
authoritative version-specific reference is the archived
version of the code charts. For major versions of the standard prior
to Unicode 5.2, the authoritative reference is the
printed version of the code charts.
The information on these pages may be updated from time
to time. The Unicode Consortium is not liable for errors
or omissions in these charts or the standard itself.
The fonts used in these charts were provided to the Unicode
Consortium by a number of different
font designers who own the rights to the fonts. Note
that the glyphs in these charts are only representative;
there can be wide variation in the glyphs used to represent
any particular character, as discussed in the standard.
You may freely use these code charts for personal or
internal business uses only. You may not incorporate them
wholly or in part into any product or publication, or otherwise
distribute them without express written permission from
the Unicode Consortium. However, you may provide links to
The ISO/IEC 10646:2017 code charts present information copyrighted
by Unicode, Inc. The Unicode Consortium specifically grants
ISO the right to use those code charts with their associated
character names list to show the repertoire of characters
for that standard, as a normatively referenced, integral part
of that standard.