L2/21-009

Approved Minutes of UTC Meeting 166
Mountain View, CA — January 19 and 21, 2021
Hosted virtually on Zoom

UTC #166 Agenda
Revision date: August 3, 2021


Tuesday, January 19, 2021

Meeting opened at 9:30

10 Full members, 3 Institutional, 2 Supporting.

Full Members in regular attendance: 7
Institutional Members in regular attendance: 1
Supporting Members in regular attendance: 1
Quorum: 5

9.5 members in regular attendance. Quorum is 5.

5 members represented: Adobe, Apple, Google, Microsoft, UCB.

Recognition of some members.

L2 meeting opened.

A.5 Action item review [L2/SD2]
A.5.1 Recently closed action items [L2/21-010]

Review minutes of previous meeting.

[166-C1] Consensus: Approve minutes of UTC #165 as documented in L2/20-237.

A.6 Calendar review [Calendar]

Added UTC #170, January 24-27, 2022. Host TBD, Google checking.

A.7 Liaison Reports [ISO, IRG, IETF/ICANN, INFITT, SEI, Mongolian, ICU, CLDR, TC37/SC2]

SEI liaison. Oral report by Deborah Anderson.

ICU liaison. Oral report by Markus Scherer.

CLDR liaison. Oral report by Mark Davis.

FYI - Milestone - Unicode was incorporated 30 years ago, January 4.

IRG liaison. Oral report by Ken Lunde.

SC2 liaison. Oral report by Michel Suignard.

Mongolian ad-hoc. Oral report by Liang Hai.

TC37 liaison. Oral report by Peter Constable.

Ken Whistler also reported on TC37/SC2.

Short break until 10:50.

C.1 Unihan Ad Hoc Recommendations for UTC #165 Meeting [Lunde, Jenkins, et al, L2/21-015]

C.1 — L2/20-277: Proposal to Add One UTC-Source Ideograph to UAX #45

[166-C2] Consensus: Accept one new U-Source ideograph as UTC-03234 with a UAX #45 status value of V, based on document L2/20-277 and Unihan-UTC166-R01 in document L2/21-015, for Unicode Version 14.0.

[166-A1] Action Item for John Jenkins: Add one new record to USourceData.txt and its representative glyph to USourceGlyphs.pdf, based on document L2/20-277 and Unihan-UTC166-R01 in document L2/21-015, for Unicode Version 14.0.

C.1 — L2/21-025: Proposal to Add to UAX #45 Four Ideographs from Japan

[166-C3] Consensus: Accept four new U-Source ideographs as UTC-03235 through UTC-03238 with a UAX #45 status value of N, based on document L2/21-025 and Unihan-UTC166-R02 in document L2/21-015, for Unicode Version 14.0.

[166-A2] Action Item for John Jenkins: Add four new records to USourceData.txt and their representative glyphs to USourceGlyphs.pdf, based on document L2/21-025 and Unihan-UTC166-R02 in document L2/21-015, for Unicode Version 14.0.

C.1 — 2) UAX #38 / Unihan Database Public Feedback

[166-A3] Action Item for Peter Edberg: Ask the CLDR-TC to check the proposed kMandarin property value changes for U+7B7D 筽 and U+9730 霰 then report back to the UTC. See document L2/21-011 and Unihan-UTC166-R03 in document L2/21-015.

C.1 — 3) UAX #38 / Unihan Database Documents C.1 — L2/21-022: Status on the Update to the Unihan kCantonese Field

[166-C4] Consensus: Accept 26,698 kCantonese property values. (Note that the kCantonese property is still provisional.)

[166-A4] Action Item for John Jenkins: Update the Unihan database by replacing the existing kCantonese property values with the 26,698 provisional kCantonese property values, based on document L2/21-022 and Unihan-UTC166-R04 in document L2/21-015, for Unicode Version 14.0.

C.1 — L2/21-032: Proposal to remove and improve provisional Unihan database properties (draft)

[166-A5] Action Item for Ken Lunde: Follow up with CITPC’s liaison contact, Taro YAMAMOTO, about the status of the request for permission to use the Moji Jōhō Kiban Database to 1) establish the provisional kMojiJoho property as proposed in L2/20-146; and 2) improve and expand the existing the kMorohashi property as proposed in L2/21-032 (draft).

C.1 — 4) Other Unihan docs C.1 — L2/20-276: Suggestion concerning tentative roadmap placement of Kanbun Extended block

[166-A6] Action Item for Ken Lunde: Convey to the proposal author the CJK & Unihan Group feedback in document L2/21-015 and Script Ad Hoc Recommendations in document L2/21-016 (page 33).

B.1 — 2 Cypro-Minoan

[166-C36] Consensus: The UTC accepts one new Cypro-Minoan character, U+12FCE CYPRO-MINOAN SIGN CM075B, and moves U+12FCE..U+12FF1 to U+12FCF..U+12FF2, listed in SAH-UTC166-R1. With the additional character, the total number of new characters in the Cypro-Minoan block will be 99. Properties for the Cypro-Minoan characters are as documented in L2/20-154 (though code points need adjusting) and glyphs are as shown in Figure 4 of L2/20-156R. (Reference: L2/20-156R and L2/20-154).

[166-A100] Action Item for Ken Whistler: Update the Pipeline according to Section 2 of L2/21-016 Script Ad Hoc Recommendations.

[166-A101]Action Item for Deborah Anderson, Michael Everson: Provide a font for the characters in Section 2 of L2/21-016 Script Ad Hoc Recommendations.

C.1 — L2/20-291: Request for consideration to disunify U+3B3F (IRG N2443)

[166-C5] Consensus: Accept the disunification proposed in L2/20-291, and encode the disunified ideograph at code point U+2B736 with Unihan database changes, based on Unihan-UTC166-R07 in document L2/21-015, for Unicode Version 14.0.

[166-A7] Action Item for Ken Whistler: Update the pipeline to add the disunified ideograph with code point U+2B736. See Unihan-UTC166-R07 in document L2/21-015.

[166-A8] Action Item for John Jenkins, Michel Suignard: Update the Unihan database by adding, removing, and changing property values, based on document L2/20-291 and Unihan-UTC166-R07 in document L2/21-015, and as amended by the group, for Unicode Version 14.0.

[166-A9] Action Item for John Jenkins: Update the records for UTC-00443 and UTC-00732 in USourceData.txt, based on document L2/20-291 and Unihan-UTC166-R07 in document L2/21-015, for Unicode Version 14.0.

[166-A10] Action Item for John Jenkins: Update the representative glyphs for UTC-00345, UTC-00414, UTC-00728 through UTC-00732, UTC-00734, and UTC-00735 to reflect a left-side form of Radical #130 that adheres to Hong Kong SAR regional conventions, based on document L2/20-291 and Unihan-UTC166-R07 in document L2/21-015, for Unicode Version 14.0.

[166-A11] Action Item for John Jenkins, Lee Collins: Provide updated U- and V-Source fonts to Michel Suignard. See L2/21-015.

C.1 — L2/21-018: Mismatched T-Source Identifiers

[166-C6] Consensus: Make changes to the Unihan database, based on document L2/21-018 and Unihan-UTC166-R08 in document L2/21-015, for Unicode Version 14.0.

[166-A12] Action Item for John Jenkins, Michel Suignard: Update the Unihan database by changing three kIRG_TSource property values, based on document L2/21-018 and Unihan-UTC166-R08 in document L2/21-015, for Unicode Version 14.0.

C.1 — L2/21-029: Proposal to Correct Four Vietnamese Glyphs (IRG N2445)

[166-C7] Consensus: Make changes to the Unihan database, based on document L2/21-029 and Unihan-UTC166-R09 in document L2/21-015, for Unicode Version 14.0.

[166-A13] Action Item for Lee Collins: Provide an updated V-Source font to Michel Suignard. See document L2/21-015.

[166-A14] Action Item for John Jenkins, Michel Suignard: Update the Unihan database by changing property values, based on document L2/21-029 and Unihan-UTC166-R09 in document L2/21-015, for Unicode Version 14.0.

C.1 — L2/21-044 UNC Proposal for One G Source Ideograph

[166-C8] Consensus: Accept the urgently needed character proposed in L2/21-044 with code point U+2B737, based on Unihan-UTC166-R10 in L2/21-015, for Unicode Version 14.0.

[166-A15] Action Item for Ken Whistler: Update the pipeline to add the urgently needed character with code point U+2B737. See Unihan-UTC166-R10 in document L2/21-015.

[166-A16] Action Item for John Jenkins, Michel Suignard: Update the Unihan database by adding property values, based on document L2/21-044 and Unihan-UTC166-R10 in document L2/21-015, for Unicode Version 14.0.

[166-A17] Action Item for Peter Edberg: Ask the CLDR-TC to check the proposed kMandarin property value for U+2B737 then report back to the UTC. See document L2/21-044 and Unihan-UTC166-R10 in document L2/21-015.

Lunch break 12:00 - 14:00

B. Script Ad Hoc Report
B.1 Recommendations to UTC #166 January 2021 on Script Proposals [Anderson, L2/21-016]

B.1 — 3 Latin
B.1 — 3a. Phonetic characters (extIPA, VoQS, phonetic click letters, IPA retroflex letters and similar letters with hooks)

[166-C9] Consensus: The UTC accepts 38 phonetic characters for encoding in a future version of the standard as specified in SAH-UTC166-R2, with properties as documented in L2/20-115R, L2/20-116R, and L2/20-125R (though code points may need adjusting) and glyphs and names in L2/21-021. The UTC approves new blocks U+10780..U+107BF Latin Extended-F and U+1DF00..U+1DFFF Latin Extended-G. (Reference: Section 3a of document L2/21-016)

[166-A18] Action Item for Ken Whistler: Update the Pipeline for the new blocks and characters in section 3a of L2/21-016 Script Ad Hoc Recommendations.

[166-A19] Action Item for Deborah Anderson, Michael Everson: Provide a font for the characters in Section 3a of L2/21-016 Script Ad Hoc Recommendations.

B.1 — 3b. IPA Modifier Letters - Pulmonic

[166-C10] Consensus: The UTC accepts 35 modifier letter characters in a new Latin Extended-F block (U+10780..U+107BF) for encoding in a future version of the standard, with glyphs and properties as documented in L2/20-252R and listed in UTC #166 SAH recommendations L2/21-016 section 3b. (SAH-UTC166-R3)

[166-A20] Action Item for Ken Whistler: Update the Pipeline according to L2/20-252R and UTC #166 SAH recommendations L2/21-016 section 3b.

B.1 — 3c. IPA Modifier Letters – Non-Pulmonic

[166-C11] Consensus: The UTC accepts 11 modifier letter characters in a new Latin Extended-F block (U+10780..U+107BF) for encoding in a future version of the standard, with glyphs and properties as documented in L2/20-253R and UTC #166 SAH recommendations L2/21-016 section 3c. (SAH-UTC166-R4)

[166-A21] Action Item for Ken Whistler: Update the Pipeline according to L2/20-253R and UTC #166 SAH recommendations L2/21-016 section 3c.

B.1 — 3d. Addendum to Unicode request L2/20-253

For information only.

B.1 — 3e. Modifier Latin Capital Letters

[166-C12] Consensus: The UTC accepts the following 3 characters for encoding in a future version of the standard, with glyphs and properties as documented in L2/20-251 and UTC #166 SAH recommendations L2/21-016 section 3e (SAH-UTC166-R5):

	A7F2 MODIFIER LETTER CAPITAL C
	A7F3 MODIFIER LETTER CAPITAL F
	A7F4 MODIFIER LETTER CAPITAL Q

[166-A22] Action Item for Ken Whistler: Update the Pipeline according to L2/20-251 and UTC #166 SAH recommendations L2/21-016 section 3e.

[166-A23] Action Item for Michael Everson: Provide a font for the three characters in L2/20-251 and UTC #166 SAH recommendations L2/21-016 section 3e.

B.1 — 3f. Subscript Modifier Letters

For information only.

B.1 — 3g. Phonetic Punctuation and Diacritics

[166-C13] Consensus: The UTC accepts 13 phonetic punctuation and diacritic characters for encoding in a future version of the standard, with glyphs and properties as documented in L2/21-042 and UTC #166 SAH recommendations L2/21-016 section 3g:

	1AC5 COMBINING SQUARE BRACKETS ABOVE
	1AC7 COMBINING INVERTED DOUBLE ARCH ABOVE
	1AC8 COMBINING PLUS SIGN ABOVE
	1ACD COMBINING DOUBLE PLUS SIGN ABOVE
	1ACE COMBINING DOUBLE PLUS SIGN BELOW
	2E55 LEFT SQUARE BRACKET WITH STROKE
	2E56 RIGHT SQUARE BRACKET WITH STROKE
	2E57 LEFT SQUARE BRACKET WITH DOUBLE STROKE
	2E58 RIGHT SQUARE BRACKET WITH DOUBLE STROKE
	2E59 TOP HALF LEFT PARENTHESIS
	2E5A TOP HALF RIGHT PARENTHESIS
	2E5B BOTTOM HALF LEFT PARENTHESIS
	2E5C BOTTOM HALF RIGHT PARENTHESIS

[166-A24] Action Item for Ken Whistler: Update the Pipeline according to L2/21-042 and UTC #166 SAH recommendations L2/21-016 section 3g.

[166-A25] Action Item for Deborah Anderson, Kirk Miller: Provide a font for the 13 characters in L2/21-042 and UTC #166 SAH recommendations L2/21-016 section 3g.

B.1 — 3h. Additional Para-IPA Letters

[166-C14] Consensus: The UTC accepts 3 phonetic characters for encoding in a future version of the standard, with glyphs and properties as documented in L2/21-041 and UTC #166 SAH recommendations L2/21-016 section 3h:

	107BA MODIFIER LETTER SMALL S WITH CURL
	1DF1D LATIN SMALL LETTER C WITH RETROFLEX HOOK
	1DF1E LATIN SMALL LETTER S WITH CURL

[166-A26] Action Item for Ken Whistler: Update the Pipeline according to L2/21-041 and UTC #166 SAH recommendations L2/21-016 section 3h.

[166-A27] Action Item for Deborah Anderson, Michael Everson: Provide a font for the 3 characters in L2/21-041 and UTC #166 SAH recommendations L2/21-016 section 3h.

B.1 — 3i. Dezh with Retroflex Hook

[166-C15] Consensus: The UTC accepts one phonetic character for encoding in a future version of the standard, glyphs and properties as documented in L2/21-004 and UTC #166 SAH recommendations L2/21-016 section 3i:

	1DF19 LATIN SMALL LETTER DEZH WITH RETROFLEX HOOK

[166-A28] Action Item for Ken Whistler: Update the Pipeline according to L2/21-004 and UTC #166 SAH recommendations L2/21-016 section 3i.

B.1 — 3j. Old Polish Nasal Vowel Letter

[166-C16] Consensus: The UTC accepts two Latin characters for encoding in a future version of the standard, with glyphs and properties as documented in L2/21-039 and UTC #166 SAH recommendations L2/21-016 section 3j:

	U+A7C0 LATIN CAPITAL LETTER OLD POLISH O
	U+A7C1 LATIN SMALL LETTER OLD POLISH O

[166-A29] Action Item for Ken Whistler: Update the Pipeline according to L2/21-039 and UTC #166 SAH recommendations L2/21-016 section 3j.

B.1 — 3k. Medieval Punctuation

[166-C17] Consensus: The UTC accepts 2 medieval punctuation characters for encoding in a future version of the standard, with glyphs and properties as documented in L2/20-270R and UTC #166 SAH recommendations L2/21-016 section 3k:

	2E53 MEDIEVAL EXCLAMATION MARK
	2E54 MEDIEVAL QUESTION MARK

[166-A30] Action Item for Ken Whistler: Update the Pipeline according to document L2/20-270R and UTC #166 SAH recommendations L2/21-016 section 3k.

B.1 — 3l. Ten Characters for Middle English (Ormulum)

[166-C18] Consensus: The UTC accepts 8 characters used for Middle English for encoding in a future version of the standard, with glyphs and properties as documented in L2/20-268 and UTC #166 SAH recommendations L2/21-016 section 3l:

	1AC9 COMBINING TRIPLE ACUTE ACCENT
	1ACA COMBINING LATIN SMALL LETTER INSULAR G
	1ACB COMBINING LATIN SMALL LETTER INSULAR R
	1ACC COMBINING LATIN SMALL LETTER INSULAR T
	A7D0     LATIN CAPITAL LETTER CLOSED INSULAR G
	A7D1     LATIN SMALL LETTER CLOSED INSULAR G
	A7D3     LATIN CAPITAL LETTER DOUBLE THORN
	A7D5     LATIN SMALL LETTER DOUBLE THORN

[166-A31] Action Item for Ken Whistler: Update the Pipeline according to L2/20-268 and UTC #166 SAH recommendations L2/21-016 section 3l, minus capital double thorn and double wynn.

[166-A32] Action Item for Deborah Anderson, Michael Everson: Provide a font for 8 characters used in Middle English. See consensus 166-C18 above.

B.1 — 3m. SIGMOID S

[166-C19] Consensus: The UTC accepts 2 sigmoid S characters for encoding in a future version of the standard, with glyphs and properties as documented in L2/20-269 and UTC #166 SAH recommendations L2/21-016 section 3m:

	A7D8 LATIN CAPITAL LETTER SIGMOID S
	A7D9 LATIN SMALL LETTER SIGMOID S

[166-A33] Action Item for Ken Whistler: Update the Pipeline according to L2/20-269 and UTC #166 SAH recommendations L2/21-016 section 3m.

B.1 — 3n. Two Characters for Middle Scots

[166-C20] Consensus: The UTC accepts 2 Middle Scots S characters for encoding in a future version of the standard, with glyphs and properties as documented in L2/20-267 and UTC #166 SAH recommendations L2/21-016 section 3n:

	A7D6 LATIN CAPITAL LETTER MIDDLE SCOTS S
	A7D7 LATIN SMALL LETTER MIDDLE SCOTS S

[166-A34] Action Item for Ken Whistler: Update the Pipeline according to L2/20-267 and UTC #166 SAH recommendations L2/21-016 section 3n.

[166-A35] Action Item for Deborah Anderson, Michael Everson: Provide a font for Middle Scots S. See consensus 166-C20 above.

B.1 — 3o. Modifier Letter Heavy Prime

[166-A36] Action Item for Deborah Anderson: Relay feedback to the proposal author of L2/20-286. (Reference: Section 3n of L2/21-016 Script Ad Hoc Recommendations)

B.1 — 3p. Oblique Hyphen.

[166-C21] Consensus: The UTC accepts 1 punctuation character for encoding in a future version of the standard, with glyph and properties as documented in L2/21-036: U+2E5D OBLIQUE HYPHEN (Reference: L2/21-036)

[166-A37] Action Item for Michael Everson: Update the proposal and change the property for OBLIQUE HYPHEN from Po to Pd, change figure “5” to figure 4, and send the revised proposal for posting in the document register. (Note: This action has been done.)

[166-A38] Action Item for Ken Whistler: Update the Pipeline to include U+2E5D OBLIQUE HYPHEN. (Reference: L2/21-036)

B.1 — 4 Todhri

Discussion. UTC took no action at this time.

B.1 — 5 Vithkuqi

[166-C22] Consensus: The UTC accepts 70 Vithkuqi characters in a new Vithkuqi block (U+10570..U+105BF) for encoding in a future version of the standard, with glyphs and properties as documented in L2/20-187R2, but leaving reserved code points at U+1057B, U+1058B, U+10593, U+10596, U+105A2, U+105B2, U+105BA, U+105BD. (Reference: Section 5 of L2/21-016 Script Ad Hoc Recommendations; L2/20-187R2)

[166-A39] Action Item for Ken Whistler: Update the pipeline to include 70 Vithkuqi characters. (Reference L2/20-187R2)

[166-A40] Action Item for Deborah Anderson, Michael Everson: Provide a font for Vithkuqi. See consensus 166-C22 above.

B.1 — Mayan and Adinkra

For information only at this time.

B.1 — 8 Egyptian Hieroglyphs
B.1 — 8a. Glyph changes to Egyptian Hieroglyphs block

[166-C23] Consensus: The UTC approves the proposed Egyptian Hieroglyph glyph changes for version 14.0 and notes that an erratum notice should be posted to document the changes. (Reference: L2/21-028)

[166-A41] Action Item for Michel Suignard, Editorial Committee: Issue a glyph erratum notice on the Egyptian Hieroglyph characters in L2/21-028. (Reference: L2/21-028)

B.1 — 8b. Proposal to add one column to the Egyptian Hieroglyph Format Controls

The UTC requests the Roadmap Committee adjust the Egyptian Hieroglyphs Extended-A block by one column, so it starts at U+13450, instead of U+13440, leaving one column unallocated at U+13440..U+1344F. (Reference: Section 8b of L2/21-016 Script Ad Hoc Recommendations).

[166-A42] Action Item for Ken Whistler, Editorial Committee: Update FAQ on blocks to mention that their boundaries may change.

B.1 — 8c. Summary from Zoom calls on Egyptian Hieroglyphs

Discussion. UTC took no action at this time.

Short break 4:17 until 4:25.

B.1 — 9 Ethiopic

[166-C24] Consensus: The UTC accepts 28 Ethiopic characters for the Gurage orthography in a new Ethiopic Extended-B block (U+1E7E0..U+1E7FF) for encoding in a future version of the standard, with glyphs and properties as documented in L2/21-037. (Reference: L2/21-037)

[166-A43] Action Item for Ken Whistler: Update the Pipeline to include 28 Ethiopic characters for the Gurage orthography. (Reference: L2/21-037)

B.1 — 10 Kore Sebeli

[166-A44] Action Item for Deborah Anderson: Relay feedback to the proposal author of L2/20-180, including comments from the July 2020 Script Ad Hoc Recommendations L2/20-169.

B.1 — 11a. Glyph changes and annotations for Kazakh, Kyrgyz, and Uyghur

[166-A45] Action Item for Deborah Anderson: Forward the proposed annotations to the names list editor for the following characters: U+0626 (Reference: page 3 of L2/20-289) U+06C7 (Reference page 4 of L2/20-289) U+0675..U+0678 for Unicode 14.0 (Reference page 5 of L2/20-289)

[166-A46] Action Item for Michel Suignard, Lorna Evans, Editorial Committee: Issue a glyph erratum notice for U+06C5, U+FBE0, U+FBE1, U+0677, U+06C7, U+FBD7, U+FBD8, U+FBDD, U+0674..U+0678 for Unicode 14.0. (Reference: L2/20-289)

[166-A47] Action Item for Roozbeh Pournader: Change ArabicShaping.txt for: U+06C5 from WAW WITH BAR to WAW WITH LOOP (Reference: page 4 of L2/20-289) U+0677 and U+06C7 from WITH DAMMA ABOVE to WITH COMMA ABOVE for Unicode 14.0 (Reference: page 4 of L2/20-289)

[166-A48] Action Item for Lorna Evans, Editorial Committee: Prepare text for the Core Spec for Unicode 14.0, based on proposed wording on U+0626 on page 3 of L2/20-289, and the wording regarding U+0674..U+0678 on page 5 of L2/20-289.

[166-A49] Action Item for Ken Whistler: Adjust the weights in the DUCET for U+0675..U+0678 for Unicode 14.0 (Reference page 6 of L2/20-289)

B.1 — 11b. Sindhi and Behdini Kurdish

[166-A50] Action Item for Lorna Evans, Michel Suignard, Editorial Committee: Change the glyph for U+06FE ARABIC SIGN SINDHI POSTPOSITION MEN for Unicode 14.0 (Reference: page 2 of L2/20-288).

[166-A51] Action Item for Lorna Evans, Editorial Committee: to prepare text for the Core Spec for Unicode 14.0 on REVERSED COMMA and REVERSED SEMICOLON in Sindhi (reference page 1 of L2/20-288), and Sindhi meem (reference: page 2 of L2/20-288).

[166-A52] Action Item for Deborah Anderson: Forward the proposed annotation to the names list editor for U+0645 ARABIC LETTER MEEM for Unicode 14.0 (Reference: page 1 of L2/20-288).

B.1 — 14 Old Uyghur

[166-C25] Consensus: SAH-UTC166-R22: Accept 26 Old Uyghur characters (U+10F70..U+10F89) in a new Old Uyghur block that extends from U+10F70..U+10FAF for encoding in a future version of the standard, with glyphs and properties as documented in L2/20-191.

[166-A53] Action Item for Ken Whistler: Update the Pipeline to include 26 Old Uyghur characters. (Reference: L2/20-191)

[166-A54] Action Item for Anshuman Pandey, Deborah Anderson: Provide Michel Suignard with a font for 26 Old Uyghur characters.

B.1 — 13 Mundari Bani

Discussion. UTC took no action at this time.

Meeting adjourned for the day at 5:30.


Thursday, January 21, 2021

Meeting opened at 9:30

5 members represented: Adobe, Apple, Microsoft, Google, UCB

D. Properties and Algorithms Group Report
D.1 UTC #166 properties feedback & recommendations [Scherer, et al, L2/21-012]

[166-A55] Action Item for Mark Davis: For Unicode 14 confusables.txt, add ß ~ β and ẞ ~ B if feasible. And Middle Scots S. See document L2/21-016 (U+A7D6 LATIN CAPITAL LETTER MIDDLE SCOTS S, U+A7D7 LATIN SMALL LETTER MIDDLE SCOTS S)

[166-A56] Action Item for Mark Davis, Editorial Committee: Prepare a proposed update UTS #18 (section 2.7 Full Properties) to modify the table to add
a. RGI_Emoji_Flag_Sequence* and
b. Emoji_Keycap_Sequence*

[166-A57] Action Item for Rick McGowan: Respond to feedback submitter (Monday, December 14, 2020, 12:22:02 AM PST) with information as in section F3, recommended action 1 in document L2/21-012. // Rick will copy and send that text verbatim.

[166-A58] Action Item for Mark Davis, Properties and Algorithms Group: Produce a set of criteria to be met by scripts to be considered for Identifier_Type=Recommended in UTS #39 and UAX #31. Determine what information needs to accompany any proposal to reclassify.

[166-A59] Action Item for Markus Scherer, Editorial Committee: Prepare a proposal for the UTC to add and clarify terminology for testing for canonical and other equivalences in chapter 3 (such as sections 3.7, 3.11, and 3.13) and UAX #15, and use that terminology in appropriate places, as outlined in document L2/21-012 item F4.

[166-A60] Action Item for Asmus Freytag, Michel Suignard: Contact the W3C to see if we can engage in working together to solve the French punctuation line-break issues. Reference: Document L2/21-012 item D1.

[166-A61] Action Item for Markus Scherer, Norbert Lindenberg, Editorial Committee: Propose changes to the specification of variation sequences in TUS chapter 23.4 and appropriate additions to chapter 3, based on document L2/21-012 item D2. The intent is to clarify the restrictions on initial characters in order to avoid issues under normalization. Include examples of characters and sequences that are excluded. See also action item 152-A5a.

[166-A62] Action Item for Markus Scherer, Mark Davis: Add an invariant test to make sure that initial characters of variation sequences conform to their restrictions. See document L2/21-012 item D2.

[166-A63] Action Item for Mark Davis, Editorial Committee: Make editorial changes and EBNF clarifications to UTS #18 as proposed in L2/21-002 and L2/21-003, in a new proposed update. Include in the Modifications section a list of technical changes to the EBNF.

[166-A64] Action Item for Rick McGowan: Post a proposed update of UTS #18.

[166-A65] Action Item for Mark Davis, Editorial Committee: Add to the proposed update for UTS #39 text to address the PRI #423 feedback from Asmus on 2020-dec-31. Also add the term “widespread” to “everyday common use”.

Short break.

F. Editorial Committee Report
F.1 Editorial Committee Report and Recommendations for UTC #166 Meeting [Whistler, L2/21-013]

[166-C26] Consensus: The UTC authorizes a PRI for an Alpha review period for the Unicode 14.0 repertoire, to start once the initial repertoire has been decided, and an appropriate subset of the UCD data files and charts can be prepared. To close April 20, 2021.

[166-A66] Action Item for Ken Whistler, Editorial Committee: Prepare the PRI background document for the Alpha review of Unicode 14.0.

[166-A67] Action Item for Ken Whistler: Prepare updated UCD data files for the Alpha review of Unicode 14.0.

[166-A68] Action Item for Michel Suignard: Prepare Alpha review code charts for Unicode 14.0.

[166-A69] Action Item for Rick McGowan: Post the PRI for the Alpha review of Unicode 14.0, to close April 20, 2021.

Noted: All fonts need to be to Michel Suignard by January 31, 2021 in order to be part of the 14.0 alpha.

[166-A70] Action Item for Mark Davis, Markus Scherer, Editorial Committee: Make corrections to (and possibly reformat) the table in Section 2.6, Wildcards in Property Values, in UTS #18. (Reference: Section D1 in L2/21-013.)

[166-A71] Action Item for Liang Hai, Editorial Committee: Update text in Section 18.9, Lisu in the Core Specification, to clarify issues of advance width for tone letters. For Unicode 14.0.

[166-A72] Action Item for Ken Whistler, Editorial Committee: Update Rules S1, S2, S3 in the Syriac Shaping subsection of Section 9.3, Syriac in the Core Specification, to explicitly add the end of text edge case. For Unicode 14.0.

[166-C27] Consensus: The UTC approves a name change for the previously approved candidate character U+1CF2D to ZNAMENNY COMBINING MARK KRYZH ON LEFT.

[166-A73] Action Item for Ken Whistler: Update the Pipeline for the name change to U+1CF2D.

B. Script Ad Hoc Report
B.2 Feedback on PRI #426 Proposed Update UTR #53 Unicode Arabic Mark Rendering [L2/21-011]

B.1 Recommendations to UTC #166 January 2021 on Script Proposals [Anderson, L2/21-016]

[166-C28] Consensus: The UTC accepts one Kannada character for encoding in Unicode version 14.0, with glyph and properties as documented in L2/20-228R: U+0CDD KANNADA LETTER NAKAARA POLLU (Reference: L2/20-228R)

[166-A74] Action Item for Ken Whistler: Update the Pipeline to include U+0CDD KANNADA LETTER NAKAARA POLLU. (Reference: L2/20-228R)

[166-A75] Action Item for Liang Hai: Update the Kannada block intro in the Core Spec to include U+0CDD KANNADA LETTER NAKAARA POLLU. (Reference: Section 12 of L2/21-016 Script Ad Hoc Recommendations)

[166-C29] Consensus: The UTC accepts 89 Tangsa characters (U+16A70...U+16ABE and U+16AC0..U+16AC9) in a new Tangsa block that extends from U+16A70..U+16ACF for encoding in Unicode version 14.0, with glyphs and properties as documented in L2/21-027. (Reference L2/21-027)

[166-A76] Action Item for Stephen Morey, Deborah Anderson: Provide Ken Whistler with the user community’s preferred collation order of Tangsa by June 30, 2021. (Reference: Section 15 of L2/21-016 Script Ad Hoc Recommendations).

[166-A77] Action Item for Ken Whistler: Update the Pipeline for Tangsa. (Reference L2/21-027)

[166-C30] Consensus: The UTC accepts 87 Kawi characters in a new Kawi block (U+11F00..U+11F5F) for encoding in a future version of the standard, with glyphs and properties as documented in L2/20-284R. (Reference: L2/20-284R)

[166-A78] Action Item for Liang Hai: Investigate the use of multiple pre-base vowels in clusters of Brahmic scripts, especially rendering of sequences of different pre-base vowels. (Reference: Section 18a of L2/21-016 Script Ad Hoc Recommendations).

[166-A79] Action Item for Ken Whistler: Update the Pipeline to include Kawi letters. (Reference: L2/20-284R)

Lunch break 13:30 - 14:00

Roll call adjustment. Emojipedia now present.

5.5 members represented: Adobe, Apple, Microsoft, Google, UCB, Emojipedia

[166-A80] Action Item for Ken Whistler: Draft a page explaining alpha and beta terminology.

The UTC is authorizing the Emoji Subcommittee to create Public Review Issues about emoji issues, particularly about prioritized lists.

The UTC will discuss the QID emoji proposal (PRI #408) and feedback during the 2021Q2 meeting, UTC #167, April 27 and 29, 2021.

Short break 15:30 - 15:45

B.1 Recommendations to UTC #166 January 2021 on Script Proposals [Anderson, L2/21-016]

B.1 — 22 Kana

[166-C31] Consensus: The UTC accepts 13 Kana characters (U+1AFF0..U+1AFFE) in a new block Kana Extended-B in the range U+1AFF0..U+1AFFF for encoding in Unicode version 14.0, with glyphs and properties as documented in L2/20-209R. (Reference: L2/20-209R)

[166-A81] Action Item for Ken Lunde: Add an entry in VerticalOrientation.txt for U+1AFF0..U+1AFFE as “U”, for Unicode 14.0. (Reference: L2/20-209R)

[166-A82] Action Item for Ken Whistler: Update the Pipeline to include 13 Kana characters. (Reference: L2/20-209R)

B.1 — 24 Tangut

[166-C32] Consensus: The UTC accepts the Tangut glyph changes in L2/20-166. (Reference: L2/20-166)

[166-A83] Action Item for Andrew West, Deborah Anderson: Supply updated Tangut font to Michel Suignard.

B.1 — 25 Math Calligraphic Alphabets

[166-C33] Consensus: The UTC accepts 52 variation sequences to distinguish roundhand and chancery style mathematical script alphabetic characters, for Unicode version 14.0. (Reference: L2/20-275R)

[166-A84] Action Item for Ken Whistler: Add the 52 math variation sequences to StandardizedVariants.txt for Unicode version 14.0.

[166-A85] Action Item for Ken Whistler: Update the pipeline to include 52 math variation sequences.

B.1 — XI. Recommendations for Unicode 14.0

[166-C34] Consensus: Target the list of characters and scripts in section 11 (Script and Character Additions) of L2/21-016R for Unicode 14.0.

[166-A86] Action Item for Ken Whistler: Update the pipeline to reflect the decision about which repertoire to include in 14.0 (and which will be in the bucket for publication in a future release). See above consensus and L2/21-016R.

[166-A87] Action Item for Deborah Anderson: Inform proposers of Kawi about disposition of the proposal. (Reference: L2/20-284R)

[166-C35] Consensus: Target the glyph changes section 11 (Script and Character Additions) of L2/21-016R for Unicode 14.0.

B.1 — X. PUBLIC REVIEW FEEDBACK

[166-A88] Action Item for Ken Whistler: Add an annotation to U+06E0, indicating the name is a translation of the Arabic name. (Reference: Section X of L2/21-016 Script Ad Hoc Recommendations).

[166-A89] Action Item for Ken Whistler, Editorial Committee: Add an annotation U+0886 ARABIC LETTER THIN YEH that no final or isolated forms are attested. (Reference: Section X of L2/21-016 Script Ad Hoc Recommendations).

[166-A90] Action Item for Ken Whistler, Editorial Committee: Add a note to Table 9-8 “Dual-Joining Arabic Characters” of the Core Spec for Unicode 14.0, that U+0886 ARABIC LETTER THIN YEH has no final or isolated forms attested. (Reference: Section X of L2/21-016 Script Ad Hoc Recommendations).

[166-A91] Action Item for Rick McGowan: Relay the feedback on U+0887 to David Corbett. (Reference: Section X of L2/21-016 Script Ad Hoc Recommendations). Item 2 page 37. // See: Public Feedback question 2: If U+0887 ARABIC BASELINE ROUND DOT has gc=Lo, why does U+0888 ARABIC RAISED ROUND DOT have gc=Sk? Page 37 of doc L2/21-016.

[166-A92] Action Item for Lorna Evans, Editorial Committee: Provide a glyph for the isolated form of ROHINGYA YEH for Table 9-9, and remove the note “Isolated form does not occur.” (Reference: Section X of L2/21-016 Script Ad Hoc Recommendations).

[166-A93] Action Item for Rick McGowan: Relay the feedback about SignWriting above to David Corbett. (Reference: Section X of L2/21-016 Script Ad Hoc Recommendations).

[166-A94] Action Item for Deborah Anderson: Relay the feedback on page 39 of L2/21-016 to the authors of the SignWriting proposal, pointing out the error. (Reference: Section X of L2/21-016 Script Ad Hoc Recommendations)

[166-A96] Action Item for Deborah Anderson: Relay to comments above to Kamal Mansour concerning how to make the valid and invalid SignWriting sequences accessible. (Reference: Section X of L2/21-016 Script Ad Hoc Recommendations)

[166-A97] Action Item for Kamal Mansour: Provide tables of glyphs from the font for review by Peter Constable and other interested parties and can be used as a basis for a future UTN. (Reference: Section X of L2/21-016 Script Ad Hoc Recommendations)

[166-A98] Action Item for Rick McGowan: Thank David Corbett for his feedback. (Reference: Section X of L2/21-016 Script Ad Hoc Recommendations).

[166-A99] Action Item for Rick McGowan: Thank Eduardo Marin Silva for his feedback. (Reference: Section X of L2/21-016 Script Ad Hoc Recommendations).

UTC adjourned for the week at 17:00.

L2 continued.


Members Represented

Full Member 01/19/21 01/21/21

1. Adobe

yes yes

2. Apple Inc.

yes yes

3. Facebook

   

4. Google, Inc.

yes yes

5. IBM Corporation

   

6. Microsoft Corporation

yes yes

7. Netflix

   

8. SAP AG

9. Sultanate of Oman, MARA

   

Institutional Member

1. Bangladesh, MSICT

2. India, MICT

3. Tamil Nadu, TVA

4. UCB

yes yes
   

Supporting Member

1. Emojipedia

  yes
   

Associate Member

1. Emojination

yes yes

2. SIL

yes yes
   

UTC Attendance

PersonRepresenting
Julie AllenUnicode
Deborah AndersonUC Berkeley
Fred Brennanself
Jeremy BurgeEmojipedia
Chris ChapmanAdobe
Peter ConstableUnicode
Craig CummingsAmazon
Jennifer DanielGoogle
Mark DavisGoogle
Peter EdbergApple
Lorna EvansSIL
Loïz FilyOffice of Breton Language
Asmus Freytagself
Josh HadleyAdobe
Liang HaiUnicode
Ned HolbrookApple
John JenkinsApple
Kevin Keystoneself
Jan Kučeraself
Jennifer 8. LeeEmojination
Norbert Lindenbergself
Ken LundeUnicode
Zachary Lymself
Rick McGowanUnicode
Lisa MooreUnicode
Anshuman PandeySEI
Marcel Paulukself
Roozbeh PournaderUnicode
Judy Safran-AasenMicrosoft
Murray SargentMicrosoft
Markus SchererGoogle
Jiali ShengMicrosoft
Michel SuignardUnicode
Samantha SunneEmojination
Tex Texinself
Ken WhistlerUnicode
Lawrence Wolf-Sonkinself
Daniel Yacobself
Ben YangPanlex