[Unicode] Character Proposals Home | Site Map | Search
 

Current Allocation

The following tables give the statistics for currently unassigned (reserved) code points in the Basic Multilingual Plane (BMP) and in the Supplementary Multilingual Plane (SMP) for Unicode 5.1.0.

First, a few definitions.

column
A set of 16 Unicode code points that all have the same "div 16" value, e.g. U+2040..U+204F or U+1D120..U+1D12F. (The name comes from the fact that they occupy a vertical column in the character charts.)
empty column
A column where none of the code points are designated.
partial column
A column where some code points are designated and some are reserved.

For practical reasons, the Unicode Technical Committee avoids splitting character blocks across columns. For that reason, it is important in new allocation to distinguish code points from these sources:

  • reserved code points not in blocks
  • reserved code points in empty columns (within assigned blocks)
  • reserved code points in partially allocated columns (within assigned blocks)
  • designated code points (includes assigned characters, private use, surrogate code points, and noncharacters—all of which are unavailable for assigning new characters) 

Here is the breakdown in Unicode 5.1.0.

SUMMARY

  Reserved Designated
  Not in Blocks in Empty Columns in Partial Columns

Code Points

1,664

704

1,288

61,880

Columns

104

44

n/a

3,948

 

SMP SUMMARY

  Reserved Designated
  Not in Blocks in Empty Columns in Partial Columns

Code Points

61,168

256

256

3,856

Columns

3,823

16

n/a

257

The following lists the code points in empty columns in more detail. It is separated into two parts: empty columns in unassigned blocks (or areas), and empty columns in assigned blocks. The (xx) are the number of code points in empty columns in that block or area.

Reserved Ranges Not in Blocks

0800..08FF256 General Scripts Area - Right to Left
18B0..18FF80 General Scripts Area
1A20..1AFF224 General Scripts Area
1BC0..1BFF64 General Scripts Area
1C80..1CFF128 General Scripts Area
2FE0..2FEF16 Symbols Area
A4D0..A4FF48 General Scripts Area
A6A0..A6FF96 General Scripts Area
A830..A83F16 General Scripts Area
A8E0..A8FF32 General Scripts Area
A960..A9FF160 General Scripts Area
AA60..ABFF416 General Scripts Area
D7B0..D7FF80 General Scripts Area
10200..1027F128 General Scripts Area
102E0..102FF16 General Scripts Area
10350..1037F48 General Scripts Area
103E0..103FF16 General Scripts Area
104B0..107FF848 General Scripts Area
10840..108FF192 General Scripts Area - Right to Left
10940..109FF192 General Scripts Area - Right to Left
10A60..10FFF1,440 General Scripts Area - Right to Left
11000..11FFF4,096 General Scripts Area
12480..1CFFF43,904 General Scripts Area
1D250..1D2FF176 Symbols Area
1D800..1EFFF6,144 Symbols Area
1F0A0..1FFFD3,934 Symbols Area

Empty Columns in Assigned Blocks

0DE0..0DEF16 [Sinhala]
0E60..0E7F32 [Thai]
0EE0..0EFF32 [Lao]
0FD0..0FFF48 [Tibetan]
20C0..20CF16 [Currency Symbols]
23F0..23FF16 [Miscellaneous Technical]
2430..243F16 [Control Pictures]
2450..245F16 [Optical Character Recognition]
26D0..26FF48 [Miscellaneous Symbols]
2B60..2BFF160 [Miscellaneous Symbols and Arrows]
2D70..2D7F16 [Tifinagh]
2E40..2E7F64 [Supplemental Punctuation]
9FD0..9FFF48 [CJK Unified Ideographs]
A630..A63F16 [Vai]
A790..A7EF96 [Latin Extended-D]
FAE0..FAFF32 [CJK Compatibility Ideographs]
FBC0..FBCF16 [Arabic Presentation Forms-A]
FD40..FD4F16 [Arabic Presentation Forms-A]
10060..1007F32 [Linear B Syllabary]
101A0..101CF48 [Ancient Symbols]
12370..123FF144 [Cuneiform]
1D1E0..1D1FF32 [Musical Symbols]