L2/14-128 Title: WG2 Consent Docket Author: Ken Whistler Date: May 7, 2014 Action: For consideration by UTC WG2 #62 met in San Jose, the week of February 24-28, 2014. During that meeting a number of resolutions were taken which progressed the DIS for 10646 4th Edition and the PDAM 1 to the 4th Edition. See L2/14-073 (= WG2 N4554) for the full details of all the resolutions. As usual, in this consent docket, I summarize just the parts of the actions taken by WG2 which result in a different status between WG2 and the UTC regarding various character approvals. These are the differences where the UTC needs to make some decision regarding how to synchronize approvals (or to oppose a proposed change). For convenience, the changes are grouped here by FDIS, DAM, or PDAM. Note that the pipeline page: http://www.unicode.org/alloc/Pipeline.html has already been updated to reflect changes in approvals by WG2, and to highlight differences from the current approvals by the UTC, so that page can be useful in following the discussion below on the individual issues. ======================================================================= Corrections Related to the Repertoire Published for Unicode 7.0 A. U+2B81 Name Fix WG2 agreed to correct the misspelling in the name of U+2B81, to fix the "...LEFTWARDS DOWNWARDS OF..." to "...LEFTWARDS OF DOWNWARDS..." Formally, the UTC had approved the incorrect name, so the record of approvals is out of synch now. To take account of this, the UTC also approved a formal name alias with the fix, and gave me an action item to cover updating the relevant data files. However, anticipating that WG2 would proceed with formally fixing the name for the publication of the 4th Edition, I held off on the formal name alias. Instead, the corrected name was propagated into the 7.0 UnicodeData.txt and other UCD files immediately following the WG2 meeting which approved the name correction. Recommendation: The UTC may wish to formally note the correction to the name and rescind the action to add the formal name alias. However, because the UCD for 7.0 already reflects the fix, it may be simplest to just allow this fix to be swept up with the overall approval of the release of Unicode 7.0, based on the updated beta files for the UCD. This latter approach may be less confusing, actually, than trying to document any formal name correction at this stage in the process. ======================================================================= Changes Related to the DIS for the 4th Edition The 4th Edition is now being progressed to FDIS status. The FDIS ballot is a non-technical ballot, so at this point it is too late to make further technical changes on its content. I recommend that the UTC simply approve the few points where it is out of synch with former UTC approvals. Note that this is all content aimed at inclusion in Unicode 8.0. The full listing of the revised DIS content for FDIS balloting, including a number of glyph changes, can be seen in WG2 N4571 (= L2/14-080). B. Siddham Variant Letters The Siddham variant letters were approved by WG2. They were split into two parts for progression in ballots. The non-controversial additions were included in the FDIS. Two others were included in Amendment 1, instead (see below). Recommendation: The UTC should approve the following characters: 115D8 SIDDHAM LETTER THREE-CIRCLE ALTERNATE I 115D9 SIDDHAM LETTER TWO-CIRCLE ALTERNATE I 115DA SIDDHAM LETTER TWO-CIRCLE ALTERNATE II 115DB SIDDHAM LETTER ALTERNATE U =========================================================================== Changes Related to DAM 1 to the 4th Edition Amendment 1 to the 4th Edition is now being progressed to a DAM ballot. This content is also aimed at inclusion in Unicode 8.0, and contains the less-controversial content from PDAM 1 plus high-priority additions. Less urgent additions from PDAM 1 were moved to the repertoire for progression in Amendment 2 (see below). The full listing of the additional repertoire for DAM 1 balloting can be seen in WG2 N4568 (= L2/14-078). C. Siddham Variant Letters The two alternate forms for combining vowel signs in Siddham were added to the DAM 1 repertoire, in order to give national bodies one more chance for technical review. Recommendation: The UTC should review the technical issues regarding the the following characters. After review, I recommend that the characters be approved. 115DC SIDDHAM VOWEL SIGN ALTERNATE U 115DD SIDDHAM VOWEL SIGN ALTERNATE UU D. Cherokee The lowercase Cherokee letters were approved by WG2, along with one other character. The allocation areas agreed with the areas that the UTC had already approved, with the new block U+AB70..U+ABBF Cherokee Supplement. So at this point, to get back into synch, the UTC needs to consider and approve the following repertoire of 87 characters: 13F5 CHEROKEE LETTER MV 13F8 CHEROKEE SMALL LETTER YE ... 13FD CHEROKEE SMALL LETTER MV AB70 CHEROKEE SMALL LETTER A ... ABBF CHEROKEE SMALL LETTER YA =========================================================================== Changes Related to PDAM 2 to the 4th Edition The PDAM 2 to the 4th Edition is now being progressed to a first PDAM ballot. This content is generally aimed at inclusion in Unicode 9.0 (in 2016). Note that some parts of the PDAM 2 repertoire might be accelerated for publication into Unicode 8.0 (in 2015), depending on what happens during future ballot resolutions. The full listing of the additional repertoire for PDAM2 balloting can be seen in WG2 N4569 (= L2/14-079). E. Tangut Ideographs Tangut was approved for ballot in PDAM 2. One character, the TANGUT REPETITION MARK, which isn't actually an ideograph, was moved from U+17000 to U+16FE0, in a new Ideographic Symbols and Punctuation block (U+16FE0..U+16FFF). WG2 agreed to ballot Tangut with the algorithmic names already approved by the UTC, e.g., TANGUT IDEOGRAPH-17001, etc. There is a complication here, because in turns out that holes at the beginning or middle of ideographic blocks causes problems for the unibook chart tool. Michel has proposed that one solution is to move the remaining range of Tangut ideographs U+17001..U+187ED be moved up by one code point. That solution is also apparently favored by the authors of the proposal, Andrew West and Michael Everson. Recommendation: The UTC should approve the code point change for the Tangut repetition mark: U+16FE0 TANGUT REPETITION MARK and the new block: U+16FE0..U+16FFF Ideographic Symbols and Punctuation Discuss what to do about the code point shift for the Tangut block itself. F. Nushu The Nushu script was also approved for ballot in PDAM 2. This block range is U+1B100..U+1B28F, with 397 characters approved in the range U+1B100.. U+1B28C. The UTC has not yet approved this repertoire or block definition. Note that although there is considerable desire to get Nushu encoded soon, the proposal documents and data files still have a number of problems, so I anticipate that there will be a number of problems still found in the data. Recommendation: Discuss options going forward. G. Latin Capital Letter Small I One Latin character was added. This character was discussed in the context of Unifon additions, but evidence was adduced that this particular character was attested as part of a case pair for other orthographic use. Recommendation: Approve U+A7AE LATIN CAPITAL LETTER SMALL CAPITAL I