Roadmap to the SIP
The following table shows a map of the actual and proposed allocation on Plane 2, the SIP (Supplementary Ideographic Plane). A description of the presentation conventions used in the table is at the bottom of this page. The SIP is intended for encoding of additional repertoire of Unified CJK ideographs or Compatibility CJK ideographs. Other CJK related characters, including strokes, radicals, and punctuation are encoded on the BMP.
This document is informative and those parts reflecting proposals are of a provisional nature. Please send corrigenda and other comments to the authors using the online contact form.
Plane 2 is tentatively mapped out to the following zones:
00020000-0002A6DF CJK Unified Ideographs Extension B
0002A700-0002B73F CJK Unified Ideographs Extension C
0002B740-0002B81F CJK Unified Ideographs Extension D
0002B820-0002CEAF CJK Unified Ideographs Extension E
0002CEB0-0002EBEF CJK Unified Ideographs Extension F
0002F800-0002FA1F CJK Compatibility Ideographs Supplement
NOTE: Should Plane 2, the SIP (Supplementary Ideographic Plane) prove insufficient for future Han character encoding, it is anticipated that further allocations may be provided on Plane 3.
- Bold text indicates an allocated (that is, published) character collection.
For blocks containing assigned graphic or format characters, there is a link to the charts on the Unicode web site.
- (Bold text between parentheses) indicates scripts which have been formally accepted by UTC or WG2 for processing toward inclusion in the standard. There is generally a link to a mature proposal for the script.
- (Text between parentheses) indicates scripts for which proposals have been formally submitted to the UTC or to WG2. There is generally a link to the formal proposal.
- ¿Text between question marks? indicates scripts for which detailed proposals have not yet been written. There may be a link to an exploratory code table.
- ??? in a block indicates that no suggestion has been made regarding the block allocation.
- Color highlighting is used to indicate blocks and unassigned ranges which default to right-to-left character behavior.
- Grey shading is used to indicate ranges of control characters and noncharacters.
NOTE: When scripts are actually proposed to the UTC or to WG2, the practice is to "front" them in the zones to which they are tentatively allocated, and to adjust the block size with regard to the allocation proposed.
The size and location of the unallocated script blocks are merely proposals based on the current state of planning. The size and location of a script may change during final allocation of the script.
The Roadmap Committee maintains and updates this document as a service to the Unicode Technical Committee and to ISO/IEC JTC1/SC2/WG2.