L2/06-187 Title: WG2 Consent Docket Source: Ken Whistler Date: May 10, 2006 Following my usual procedure, I have rolled up all items from the latest WG2 meeting (WG2 #48, Mountain View, CA, April 24 - 27, 2006) for which there is a synchronization issue that the UTC needs to address. The main outcome of the WG2 meeting was the resolution of ballot comments on PDAM 3 and the decision to reissue Amendment 3 for another PDAM ballot, in part because of the agreement to add significant new content to Amendment 3. This consent docket is largely aimed at precise specification of how the content agreed for Amendment 3 differs from what the UTC has approved to date. ================================================================ A. Name Changes for Characters Accepted by the UTC A.1 Malayalam 0D79 MALAYALAM ORDINAL INDICATOR WG2 accepted a name change to: 0D79 MALAYALAM DATE MARK This has been discussed on the unicore list, and I believe there is consensus that the revised name is more accurate. A.2 Latin 2C7A LATIN SMALL LETTER O WITH RING INSIDE DOWN WG2 accepted a name change to: 2C7A LATIN SMALL LETTER O WITH LOW RING INSIDE The UTC should accept this name change. A.3 Saurashtra A8B4 SAURASHTRA LETTER UPAKSHARA WG2 accepted a name change to: A8B4 SAURASHTRA CONSONANT SIGN HAARU The UTC should accept this name change. ================================================================= B. Myanmar Additions B.1 Core additions for Burmese WG2 accepted 7 additional Myanmar characters, on the basis of WG2 N3043 (= L2/06-077). See also the report of the WG2 ad hoc regarding Myanmar, WG2 N3099 (= L2/06-140). 102B MYANMAR VOWEL SIGN TALL AA 103A MYANMAR SIGN ASAT 103B MYANMAR CONSONANT SIGN MEDIAL YA 103C MYANMAR CONSONANT SIGN MEDIAL RA 103D MYANMAR CONSONANT SIGN MEDIAL WA 103E MYANMAR CONSONANT SIGN MEDIAL HA 103F MYANMAR LETTER GREAT SA These have engendered much controversy and a long document trail, but in my opinion should now be accepted by the UTC. B.2 Glyph changes for existing core Burmese characters WG2 also approved glyph changes for two Myanmar characters, based on WG2 N3043 (= L2/06-077): 1039 MYANMAR SIGN VIRAMA 104E MYANMAR SYMBOL AFOREMENTIONED The glyph change for 1039 is linked to item B.1, because of the separation in function between the virama and the killer (U+103A MYANMAR SIGN ASAT). Both glyph changes have consensus among all relevant parties at this point, and should be approved by the UTC. B.3 Additions for Mon and S'gaw Karen WG2 accepted 14 additional Myanmar characters for the minority languages Mon and S'gaw Karen, on the basis of WG2 N3044 (= L2/06-078). See also the report of the WG2 ad hoc regarding Myanmar, WG2 N3099 (= L2/06-140). 1028 MYANMAR LETTER MON E 1033 MYANMAR VOWEL SIGN MON II 1034 MYANMAR VOWEL SIGN MON O 105A MYANMAR LETTER MON NGA 105B MYANMAR LETTER MON JHA 105C MYANMAR LETTER MON BBA 105D MYANMAR LETTER MON BBE 105E MYANMAR CONSONANT SIGN MON MEDIAL NA 105F MYANMAR CONSONANT SIGN MON MEDIAL MA 1060 MYANMAR CONSONANT SIGN MON MEDIAL LA 1061 MYANMAR LETTER SGAW KAREN SHA 1062 MYANMAR VOWEL SIGN SGAW KAREN EU 1063 MYANMAR TONE MARK SGAW KAREN HATHI 1064 MYANMAR TONE MARK SGAW KAREN KE PHO Assuming that the UTC accepts the repertoire in B.1, I think that there is then consensus regarding these additional 14 characters, and the UTC should accept them. However, as for all the Myanmar additions, other feedback documents should be considered. (Cf. L2/06-161, etc.) ================================================================= C. Sundanese Script WG2 approved the Sundanese script for encoding, 55 characters in the range 1B80..1BB9, in a new block, Sundanese (1B80..1BBF), on the basis of WG2 N3022. The UTC should also approve this script for encoding. ================================================================= D. Lepcha Addition WG2's approval of the Lepcha script includes one character not yet approved by the UTC: U+1C35 LEPCHA CONSONANT SIGN KANG Further clarification about this character was provided by the Irish NB in response to the query from the U.S. NB, and at this point, I think the correct course is to approve this addition, to bring the UTC and WG2 back in synch for Lepcha. ================================================================= E. Combining Diacritical Marks Additions E.1 Lithuanian dialectology WG2 approved: 1DCB COMBINING BREVE-MACRON 1DCC COMBINING MACRON-BREVE on the basis of WG2 N3048. I think the characters are justified, and the UTC should go on record as approving them. E.2 Medievalist combining marks WG2 approved a series of 26 combining marks, U+1DCD..U+1DE6, of various types, on the basis of WG2 N3027 (= L2/06-074). For details, see WG2 N3059 (= L2/06-147). The UTC should go on record as approving them. ================================================================= F. Latin Extended Additions F.1 Medievalist Latin characters WG2 approved the following 9 characters in the Latin Extended Additional block, on the basis of WG2 N3027 (= L2/06-074): 1E9C LATIN SMALL LETTER LONG S WITH STROKE 1E9D LATIN SMALL LETTER LONG S WITH HIGH STROKE 1E9F LATIN SMALL LETTER DELTA 1EFA LATIN CAPITAL LETTER MIDDLE-WELSH LL 1EFB LATIN SMALL LETTER MIDDLE-WELSH LL 1EFC LATIN CAPITAL LETTER MIDDLE-WELSH V 1EFD LATIN SMALL LETTER MIDDLE-WELSH V 1EFE LATIN CAPITAL LETTER Y WITH LOOP 1EFF LATIN SMALL LETTER Y WITH LOOP The UTC should approve them. F.2 Mayanist additions The UTC had approved the following 4 characters in the Latin Extended-C block: 2C6F LATIN LETTER TRESILLO 2C70 LATIN LETTER CUATRILLO 2C7B LATIN CAPITAL LETTER TZ 2C7C LATIN SMALL LETTER TZ WG2 accepted, instead, a revised an extended repertoire of 10 Mayanist Latin additions, based on WG2 N3082 (= L2/06-121), in the Latin Extended-D block: A726 LATIN CAPITAL LETTER HENG A727 LATIN SMALL LETTER HENG A728 LATIN CAPITAL LETTER TZ A729 LATIN SMALL LETTER TZ A72A LATIN CAPITAL LETTER TRESILLO A72B LATIN SMALL LETTER TRESILLO A72C LATIN CAPITAL LETTER CUATRILLO A72D LATIN SMALL LETTER CUATRILLO A72E LATIN CAPITAL LETTER CUATRILLO WITH COMMA A72F LATIN SMALL LETTER CUATRILLO WITH COMMA This constitutes a move of two characters already approved, a move, name change, and case cloning for two more (tresillo and cuatrillo), and the addition of four more characters. This change has been quite controversial, but the emerging consensus is that the case pairs, while marginal, are justified. The UTC should discuss, but I recommend the approval of the revised repertoire. F.3 UPA additions WG2 approved the following 3 character in the Latin Extended-C block, on the basis of WG2 N3070: 2C7B LATIN LETTER SMALL CAPITAL TURNED E 2C7C LATIN SUBSCRIPT SMALL LETTER J 2C7D MODIFIER LETTER CAPITAL V The UTC should approve them. F.4 Medievalist Latin characters WG2 approved 73 characters in the Latin Extended-D block, on the basis of WG2 N3027 (= L2/06-074). These are documented in WG2 N3059 (= L2/06-147). The UTC should go on record as approving them. ================================================================= G. CJK Strokes WG2 accepted an additional set of 20 CJK strokes, in the range 31D0..31E3 in the CJK Strokes block, to complete the set of basic stroke type symbols. The UTC should approve them. ================================================================= H. Vai additions H.1 Vai nasal vowel syllables for foreign sounds WG2 accepted 4 additional Vai characters, based on WG2 N3081R (= L2/06-120R): A501 VAI SYLLABLE EEN A525 VAI SYLLABLE IN A572 VAI SYLLABLE OON A596 VAI SYLLABLE UN And as a result, the entire Vai block was rearranged slightly to interpolate these values into the block. The UTC should accept these four characters and the rearranged code points for the rest of the Vai block, based on WG2 N3059 (= L2/06-147). H.2 Vai digits WG2 accepted 10 Vai digits, based on WG2 N3081R (= L2/06-120R): A620 VAI DIGIT ZERO ... A629 VAI DIGIT NINE The UTC should accept these characters and also specify that the Vai block extends from A500..A62F. ================================================================= I. Kayah Li Script WG2 approved the Kayah Li script for encoding, 64 characters in the range A900..A92F, in a new block, Kayah Li (A900..A92F), on the basis of WG2 N3038 (= L2/06-073). The UTC should also approve this script for encoding. ================================================================= J. Rejang Script WG2 approved the Rejang script for encoding, 37 characters in the range A930..A95F, in a new block, Rejang (A930..A95F), on the basis of WG2 N3096 (= L2/06-139). The UTC should also approve this script for encoding. ================================================================= K. Phaistos Disc Symbols WG2 approved the Phaistos Disc symbols for encoding, 46 characters in the range 101D0..101FD, in a new block, Phaistos (101D0..101FF), on the basis of WG2 N3066R (= L2/06-095). The UTC should also approve this script for encoding. ================================================================= L. Combining Marks for Old Cyrillic The UTC approved 22 Old Cyrillic combining marks in the range 2DE0..2DF5. The WG2 did not take these up, because the proposal was considered premature. At some point, the UTC will be getting a revised proposal for consideration, and at that point may need to reconsider the already approved repertoire. My current recommendation on this is to take no action, pending the submission of the revised proposal. ================================================================= M. Named Character Sequences Named Sequences added by WG2 (Lithuanian) These are not uniquified yet, neither by name nor by sequence LATIN CAPITAL LETTER A WITH OGONEK AND ACUTE; 0104 0301 LATIN SMALL LETTER A WITH OGONEK AND ACUTE; 0105 0301 LATIN CAPITAL LETTER A WITH OGONEK AND TILDE; 0104 0303 LATIN SMALL LETTER A WITH OGONEK AND TILDE; 0105 0303 LATIN CAPITAL LETTER E WITH OGONEK AND ACUTE; 0118 0301 LATIN SMALL LETTER E WITH OGONEK AND ACUTE; 0119 0301 LATIN CAPITAL LETTER E WITH OGONEK AND TILDE; 0118 0303 LATIN SMALL LETTER E WITH OGONEK AND TILDE; 0119 0303 LATIN CAPITAL LETTER E WITH DOT ABOVE AND ACUTE; 0116 0301 LATIN SMALL LETTER E WITH DOT ABOVE AND ACUTE; 0117 0301 LATIN CAPITAL LETTER E WITH DOT ABOVE AND TILDE; 0116 0303 LATIN SMALL LETTER E WITH DOT ABOVE AND TILDE; 0117 0303 LATIN SMALL LETTER I WITH DOT ABOVE AND GRAVE; 0069 0307 0300 LATIN SMALL LETTER I WITH DOT ABOVE AND ACUTE; 0069 0307 0301 LATIN SMALL LETTER I WITH DOT ABOVE AND TILDE; 0069 0307 0303 LATIN CAPITAL LETTER I WITH OGONEK AND ACUTE; 012E 0301 LATIN SMALL LETTER I WITH OGONEK AND DOT ABOVE AND ACUTE; 012F 0307 0301 LATIN CAPITAL LETTER I WITH OGONEK AND TILDE; 012E 0303 LATIN SMALL LETTER I WITH OGONEK AND DOT ABOVE AND TILDE; 012F 0307 0303 LATIN CAPITAL LETTER J WITH TILDE; 004A 0303; LATIN SMALL LETTER J WITH DOT ABOVE AND TILDE; 06A 0307 0303 LATIN CAPITAL LETTER L WITH TILDE; 004C 0303 LATIN SMALL LETTER L WITH TILDE; 006C 0303 LATIN CAPITAL LETTER M WITH TILDE; 004D 0303 LATIN SMALL LETTER M WITH TILDE; 006D 0303 LATIN CAPITAL LETTER R WITH TILDE; 0052 0303 LATIN SMALL LETTER R WITH TILDE; 0072 0303 LATIN CAPITAL LETTER U WITH OGONEK AND ACUTE; 0172 0301 LATIN SMALL LETTER U WITH OGONEK AND ACUTE; 0173 0301 LATIN CAPITAL LETTER U WITH OGONEK AND TILDE; 0172 0303 LATIN SMALL LETTER U WITH OGONEK AND TILDE; 0173 0303 LATIN CAPITAL LETTER U WITH MACRON AND ACUTE; 016A 0301 LATIN SMALL LETTER U WITH MACRON AND ACUTE; 016B 0301 LATIN CAPITAL LETTER U WITH MACRON AND TILDE; 016A 0303 LATIN SMALL LETTER U WITH MACRON AND TILDE;016B 0303