
PRI 326: Combined registration of the MSARG collection and of sequences in that collection


A submission for the "Combined registration of the MSARG collection and of sequences in that collection" has been received by the IVD Registrar. This submission was reviewed according to the procedures of UTS #37, Unicode Ideographic Variation Database; the review period was expected to close on 2016-08-12.

At the end of the review period, the submission was incorporated into the 2016-08-15 version of the IVD as 21 registered IVSes for the MSARG collection. No substantive comments were received during the review period.

This page remains available for archival purposes.

Review instructions

Reviewers are encouraged to comment on any aspect of the submission, and in particular on:

  • whether the glyphic subset corresponding to a proposed sequence is indeed a glyphic subset of the base character for the sequence
  • whether the proposed sequences are congruent with the scope of their collection, or whether a new collection may be more appropriate

All comments should be sent via the reporting form and will be forwarded to the submitter. The content of the submission may be adjusted during the review period to account for the comments received.

Submission details

The content of this section was provided by the submitter and edited by the IVD Registrar.


  • Name and address of registrant: Public Administration and Civil Service Bureau (SAFP), Macao Special Administrative Region, China, Rua do Campo, no. 162, Edificio Administracao Publica, 21-27 Andares, Macau
  • Names and email addresses of representatives: Chau Cheuk Kwan (Clement) cchau@safp.gov.mo, Lam Sok Chi sokchil@safp.gov.mo & Professor Lu Qin csluqin@comp.polyu.edu.hk
  • URL of the website describing the collection: https://www.iso10646hk.net/ivd/MSARG/ (NOTE: This is a temporary website; it will be moved to another address in the future.)
  • Suggested identifier for the collection: MSARG
  • Pattern for the sequence identifiers: M([AB]_[0-9A-F]{4}|C_[0-9]{5}|D_[0-9A-F]{4,5}|E_[0-9A-F]{4,5}_[0-9]{3})
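
As an illustration of how the pattern above can be applied, the following minimal Python sketch checks candidate identifiers against it. The regular expression is copied verbatim from the submission; the function name `is_valid_msarg_id` and the sample identifiers are made up for this example and are not taken from the registered data.

```python
import re

# Pattern copied verbatim from the submission details.
MSARG_ID = re.compile(
    r"M([AB]_[0-9A-F]{4}|C_[0-9]{5}|D_[0-9A-F]{4,5}|E_[0-9A-F]{4,5}_[0-9]{3})"
)


def is_valid_msarg_id(identifier: str) -> bool:
    """Return True if the whole string matches the MSARG identifier pattern."""
    # fullmatch anchors the pattern at both ends, so trailing text is rejected.
    return MSARG_ID.fullmatch(identifier) is not None


if __name__ == "__main__":
    # Hypothetical identifiers, chosen only to exercise each branch.
    for candidate in ["MB_A440", "MC_00001", "MD_2A6D6", "ME_36C7_001",
                      "ME-36C7-001", "MC_1"]:
        print(candidate, is_valid_msarg_id(candidate))
```

Note that the pattern accepts only uppercase hexadecimal digits and underscores, so a hyphenated source reference such as `ME-36C7-001` is rejected as written.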


To facilitate more effective electronic communication among government units, the Macao Special Administrative Region Government (MSARG) plans to establish and implement the Macao SAR Information Systems Character Set (MISCS), tentatively named MISCS-2016. This character set will include all the Chinese characters and symbols used in the computer systems of MSARG. In this character set, 11 Chinese character variants are each unified with a character already encoded in ISO/IEC 10646, but the glyphs used in Macao differ substantially from those of the encoded characters. To make these glyphs available in Macao's computer systems, it is necessary to register Ideographic Variation Sequences (IVSes) for 21 variants, 10 of which correspond to base characters whose representative glyphs are the same as in the code charts.

MISCS-2016, as a complete named character set under the ISO/IEC 10646 international encoding standard, covers all approved Chinese characters and symbols used in Macao's computer systems. MISCS-2016 will include: 1) the Big-5 character set; 2) HKSCS-2008; 3) Macao's Vertical Extension to ISO/IEC 10646; 4) Macao's Horizontal Extension to ISO/IEC 10646; and 5) Macao's variants (excluding base characters) with registered IVSes. MISCS-2016 uses the following coding scheme for its source references under ISO/IEC 10646:

  • MB-hhhh is used to refer to all characters in the Big-5 character set, in which "hhhh" is the hexadecimal Big-5 code.
  • MA-hhhh is used to refer to all characters already encoded in HKSCS-2008, in which "hhhh" is the corresponding hexadecimal Big-5 code in HKSCS-2008.
  • MC-nnnnn is used for characters vertically extended to ISO/IEC 10646, in which "nnnnn" is an MISCS-assigned source reference code between 00001 and 99999, assigned in sequence.
  • MD-hhhh[h] is used for characters horizontally extended to ISO/IEC 10646, in which "hhhh[h]" is the four- or five-digit hexadecimal code of the character in the ISO/IEC 10646 international standard. For characters in the Basic Multilingual Plane (BMP or Plane 0), the code points contain four hexadecimal digits. For characters in other planes, the code points contain five hexadecimal digits.
  • ME-hhhh[h]-nnn is used for character variants with registered IVSes, in which "hhhh[h]" is the four- or five-digit hexadecimal code of the base character in the ISO/IEC 10646 international standard, and "nnn" is an MISCS-assigned number between 001 and 999. For character variants corresponding to the same base character, "nnn" is assigned in sequence.

Note that the sequence identifiers use underscores in lieu of hyphens per Section 3 of UTS #37.
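
The relationship between a hyphenated source reference and its underscore-based sequence identifier can be sketched as follows. This is a minimal illustration, not the registrant's tooling: the helper names are hypothetical, and the reference `ME-36C7-001` is an assumed example (U+36C7 appears elsewhere on this page as a base character, but the number 001 is not taken from the data file).

```python
import re

# ME-hhhh[h]-nnn: base character code point plus an MISCS-assigned number.
ME_REF = re.compile(r"ME-([0-9A-F]{4,5})-([0-9]{3})")


def to_sequence_identifier(source_ref: str) -> str:
    """Replace hyphens with underscores, as the note above describes."""
    return source_ref.replace("-", "_")


def parse_variant_ref(source_ref: str) -> tuple[str, int]:
    """Split an ME source reference into (base character, variant number)."""
    match = ME_REF.fullmatch(source_ref)
    if match is None:
        raise ValueError(f"not an ME source reference: {source_ref!r}")
    base_code_point = int(match.group(1), 16)  # hexadecimal digits
    variant_number = int(match.group(2))       # decimal, 001 to 999
    return chr(base_code_point), variant_number


if __name__ == "__main__":
    ref = "ME-36C7-001"  # hypothetical example
    print(to_sequence_identifier(ref))
    print(parse_variant_ref(ref))
```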

List of proposed sequences

A data file listing the proposed sequences is available at https://www.unicode.org/ivd/pri/pri326/IVD_Sequences_MSARG.txt.
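
For readers who want to process such a data file, here is a minimal parsing sketch. It assumes the semicolon-separated layout that UTS #37 describes for IVD_Sequences.txt (the sequence as two hexadecimal code points, then the collection identifier, then the sequence identifier, with `#` starting a comment line). The sample line in the usage section is hypothetical and is not copied from the MSARG file; the names `IvdEntry` and `parse_ivd_line` are likewise invented for this example.

```python
from typing import NamedTuple, Optional


class IvdEntry(NamedTuple):
    base: int         # code point of the base ideograph
    selector: int     # code point of the variation selector
    collection: str   # collection identifier, e.g. "MSARG"
    identifier: str   # sequence identifier within the collection

    def ivs(self) -> str:
        """Return the variation sequence as a two-character string."""
        return chr(self.base) + chr(self.selector)


def parse_ivd_line(line: str) -> Optional[IvdEntry]:
    """Parse one data line; return None for blank lines and comment lines."""
    line = line.strip()
    if not line or line.startswith("#"):
        return None
    sequence, collection, identifier = (field.strip() for field in line.split(";"))
    base_hex, selector_hex = sequence.split()
    base, selector = int(base_hex, 16), int(selector_hex, 16)
    # IVSes use the supplementary variation selectors U+E0100..U+E01EF.
    if not 0xE0100 <= selector <= 0xE01EF:
        raise ValueError(f"unexpected variation selector: U+{selector:04X}")
    return IvdEntry(base, selector, collection, identifier)


if __name__ == "__main__":
    # Hypothetical line in the assumed format.
    entry = parse_ivd_line("36C7 E0100; MSARG; ME_36C7_001")
    print(entry)
```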

A one-line description of the collection to be added to the IVD_Collections.txt file is available at https://www.unicode.org/ivd/pri/pri326/IVD_Collections_MSARG.txt.

Representative Glyph Charts

Representative glyphs for the submitted sequences are available in PDF format at https://www.unicode.org/ivd/pri/pri326/Glyphs_List_MSARG.pdf.

NOTE: The IVD Registrar is aware of a minor inconsistency between the left-side component of the base characters U+36C7 㛇 and U+5557 啗 and that of their variants. It has been reported to the registrant and does not affect the submission.

Updates and Comments received

This page may be updated from time to time to inform reviewers of some of the comments received.