L2/20-237

Approved Minutes of UTC Meeting 165
Mountain View, CA — October 5 and 7, 2020
Hosted virtually on Zoom

UTC #165 Agenda
Revision date: August 3, 2021


Monday October 5, 2020

Meeting opened at 9:30 am.

10 Full members, 3 Institutional, 2 Supporting.

Full Members in regular attendance: 7
Institutional Members in regular attendance: 1
Supporting Members in regular attendance: 1
Quorum: 5

6 members represented: Apple, Adobe, Google, Facebook, Microsoft, UCB.

A.1 Membership review, proxies, and meeting quorum

A.3 Approval of minutes of prior meeting [L2/20-172]

[165-C1] Consensus: Approve the minutes of UTC #164 as documented in L2/20-172.

A.5 Action item review [L2/SD2]

A.5.1 Recently closed action items [L2/20-238]

Oral review by Ken Whistler.

Roll Call adjustment: Netflix now represented.

7 members represented: Apple, Adobe, Google, Facebook, Microsoft, UCB, Netflix.

A.6 Calendar review [Calendar]

Discussion. UTC #166 will be January 19 and 21, 2021. UTC #167 will also be virtual, April 27 and 29, 2021. UTC #168 will also likely be virtual, July 27 and 29, 2021. UTC #169 will be October 5 and 7, 2021, but a room will be reserved "in case".

[165-C2] Consensus: There will be no further emoji repertoire (characters and sequences) for Unicode version 14.0 after the January 2021 UTC meeting.

A.7 Liaison reports [ISO, IRG, IETF/ICANN, INFITT, SEI, Mongolian, ICU, CLDR, TC37/SC2]

SEI liaison report (L2/20-254), Deborah Anderson.

TC37/SC2 liaison report (L2/20-273), Peter Constable.

[165-A1] Action Item for Ken Whistler, Peter Constable: Follow up with TC37/SC2 regarding the upcoming ad-hoc meeting and provide a liaison statement.

IRG liaison report, Ken Lunde. Will be covered later.

Mongolian ad-hoc report, Lisa Moore. Meetings bi-weekly; progress being made.

CLDR-TC liaison, oral report, Mark Davis.

ICU-TC liaison, oral report, Markus Scherer.

Short break until 11:00.

C.1 Unihan Ad Hoc Recommendations for UTC #165 Meeting [Lunde, Jenkins, et al, L2/20-235]

[165-C3] Consensus: Make changes to USourceData.txt and to the Unihan database based on feedback from Ken Lunde [Fri Aug 31 08:29:11 CDT 2020], based on document L2/20-239, for Unicode Version 14.0.

[165-A2a] Action Item for John Jenkins: Update records in USourceData.txt and Unihan database based on instructions on page 2 of document L2/20-239 and Unihan-UTC165-R01 in document L2/20-235, for Unicode Version 14.0.

[165-A2b] Action Item for John Jenkins: Prepare a proposal to horizontally-extend U+289B1 𨦱 to add UK-02829 as a new source reference and submit to the UTC and IRG, based on document L2/20-239 and Unihan-UTC165-R01 in document L2/20-235, for Unicode Version 14.0.

[165-C4] Consensus: Accept six new U-Source ideographs as UTC-03228 through UTC-03233 with a UAX #45 status value of N, based on document L2/20-206 and Unihan-UTC165-R02 in document L2/20-235, for Unicode Version 14.0.

[165-A2c] Action Item for John Jenkins: Add six new records to USourceData.txt and their representative glyphs to USourceGlyphs.pdf, based on document L2/20-206 and Unihan-UTC165-R02 in document L2/20-235, for Unicode Version 14.0.

[165-C5] Consensus: Add the first residual stroke field, Field 9, and its description to UAX #45 for Unicode Version 14.0, based on document L2/20-229 and Unihan-UTC165-R03 in document L2/20-235.

[165-A3a] Action Item for John Jenkins, Editorial Committee: Update the text of UAX #45 to include the first residual stroke field, Field 9, and its description, based on document L2/20-229 and Unihan-UTC165-R03 in document L2/20-235, for Unicode Version 14.0.

[165-A3b] Action Item for John Jenkins: Add Field 9 to USourceData.txt, based on document L2/20-229 and Unihan-UTC165-R03 in document L2/20-235, for Unicode Version 14.0.

[165-C6] Consensus: Make changes to the Unihan database based on feedback from Jim Breen [Fri Jul 17 20:18:55 CDT 2020 (updated 2020-09-01)], based on document L2/20-239, for Unicode Version 14.0.

[165-A4] Action Item for John Jenkins: Update the Unihan database to add or change the records, based on document L2/20-239 and Unihan-UTC165-R04 in document L2/20-235, for Unicode Version 14.0.

[165-C7] Consensus: Make changes to the Unihan database based on feedback from Jaemin Chung [Thu Sep 3 12:30:38 CDT 2020], based on document L2/20-239, for Unicode Version 14.0.

[165-A5] Action Item for Michel Suignard, John Jenkins: Make changes to the Unihan database based on document L2/20-239, and Unihan-UTC165-R05 in document L2/20-235, for Unicode Version 14.0.

[165-A6] Action Item for Ken Whistler: Update NamesList.txt to add U+20092 as a related CJK Unified Ideograph to U+2EA7, based on document L2/20-239 and Unihan-UTC165-R06 in document L2/20-235, for Unicode Version 14.0.

[165-C8] Consensus: Make changes to the Unihan database based on feedback from Jaemin Chung [Thu Sep 17 18:20:16 CDT 2020], based on document L2/20-239, for Unicode Version 14.0.

[165-A7] Action Item for Michel Suignard, John Jenkins: Make changes to the Unihan database based on document L2/20-239, and Unihan-UTC165-R07 in document L2/20-235, for Unicode Version 14.0.

[165-C9] Consensus: Accept the five urgently needed characters proposed in L2/20-203 and L2/20-204 with code points U+9FFD through U+9FFF, U+2A6DE, and U+2A6DF, along with the representative glyph change for MC-00137 proposed in L2/20-205, and add to a proposed update of UAX #38 for Unicode Version 14.0 to add the new kIRG_GSource and kIRG_MSource source prefixes.

[165-A8] Action Item for John Jenkins, Editorial Committee: Update UAX #38 to add the new kIRG_GSource and kIRG_MSource source prefixes, along with their syntax and descriptions, based on documents L2/20-203 and L2/20-204, and Unihan-UTC165-R09 in document L2/20-235, for Unicode Version 14.0.

[165-A9] Action Item for Ken Whistler: Update the pipeline to add the five urgently needed characters with code points U+9FFD through U+9FFF, U+2A6DE, and U+2A6DF. See document L2/20-235.

[165-A10] Action Item for Ken Lunde: Request that Macao SAR adjust the representative glyph for MC-00137 per L2/20-205, and provide an updated font to Michel Suignard.

[165-A11] Action Item for John Jenkins, Michel Suignard: Add or change Unihan database records, based on documents L2/20-203 and L2/20-204, and Unihan-UTC165-R09 in document L2/20-235, for Unicode Version 14.0.

[165-A12] Action Item for Michel Suignard, Editorial Committee: Review description of URO layout of Macao source in section 24.2 of the core spec for Unicode version 14.0.

Short break until 12:15.

[165-C10] Consensus: Disunify U+722B and encode a new CJK Unified Ideograph at the end of the Extension C block at code point U+2B735 with a kIRG_VSource property value of V0-3D5B, for Unicode Version 14.0.

[165-A13] Action Item for Michel Suignard, John Jenkins: Update and add Unihan database records based on document L2/20-210 and Unihan-UTC165-R10 in document L2/20-235, for Unicode Version 14.0.

[165-A14] Action Item for Lee Collins: Provide a font to Michel Suignard for updates based on document L2/20-210 and Unihan-UTC165-R10 in document L2/20-235.

[165-A15] Action Item for Ken Whistler: Update the pipeline to add U+2B735. See Unihan-UTC165-R10 in document L2/20-235.

[165-C11] Consensus: Add or change Unihan database records, except for the duplicate record for U+2941B and the proposed alternate kTotalStrokes property value for U+4040, along with the new description of the “V4” prefix in UAX #38, based on document L2/20-230 and Unihan-UTC165-R11 in document L2/20-235, for Unicode Version 14.0.

[165-A16] Action Item for John Jenkins, Editorial Committee: Change the description of the UAX #38 kIRG_VSource property's “V4” prefix to: Kho Chữ Hán Nôm Mã Hoá (Hán Nôm Coded Character Repertoire), Hà Nội, 2007.

[165-A17] Action Item for Lee Collins: Provide to Michel Suignard an updated font that also includes the 20 representative glyph changes, based on document L2/20-230 and Unihan-UTC165-R11 in document L2/20-235, for Unicode Version 14.0.

[165-A18] Action Item for Lee Collins: Propose to the IRG a new UCV (Unifiable Component Variations) for the 決 and 决 components.

[165-A19] Action Item for Michel Suignard, John Jenkins: Update, remove, and add Unihan database records based on document L2/20-230 and Unihan-UTC165-R11 in document L2/20-235, for Unicode Version 14.0.

[165-C12] Consensus: Make changes and additions to the kXHC1983 property, based on document L2/20-231 and Unihan-UTC165-R12 in document L2/20-235, for Unicode Version 14.0.

[165-A20] Action Item for Peter Edberg: Ask the CLDR-TC to check the proposed kMandarin property value for U+2B413 𫐓 then report back to the UTC. See document L2/20-231.

[165-A21] Action Item for John Jenkins: Update and add Unihan database records based on document L2/20-231 and Unihan-UTC165-R12 in document L2/20-235, for Unicode Version 14.0.

[165-A22] Action Item for Ken Lunde: Relay the UTC feedback on kanbun to the author of L2/20-232. See document L2/20-235 section 5.

C.2 PRI #421 Proposed Update UAX #38 Unicode Unihan Database

C.2.1 Feedback on PRI #421 [L2/20-239]

[165-A23] Action Item for John Jenkins, Editorial Committee: Review feedback in PRI #421 from Eduardo Marín Silva [Sun Aug 9 01:10:36 CDT 2020] on possible update of the text of UAX #38.

Break for lunch 13:25 - 14:00.

B.1 Recommendations to UTC #165 October 2020 on Script Proposals [Anderson, L2/20-250]

(Section 2, Todhri)

[165-A24] Action Item for Deborah Anderson, Roozbeh Pournader: Write a document that clearly explains the pros and cons of different approaches to Todhri.

(Section 1, Latin)

[165-C13] Consensus: UTC rescinds approval of the 38 Latin characters listed in SAH-UTC165-R1, document L2/20-250, while waiting for a new proposal that consolidates all the Latin letters with revised codepoints.

[165-A25] Action Item for Ken Whistler: Update the pipeline with changes per above consensus 165-C13.

(Section 3, Vithkuqi)

[165-A26] Action Item for Deborah Anderson: Provide feedback to the author of L2/20-187R that the UTC does not support encoding newly invented modern characters without evidence of usage in text.

(Section 4, UCAS)

[165-C14] Consensus: The UTC accepts 16 Unified Canadian Aboriginal Syllabics characters as specified in SAH-UTC165-R4 of document L2/20-250, in a new Unified Canadian Aboriginal Syllabics Extended-A block (U+11AB0..U+11ABF) for encoding in a future version of the standard. See also document L2/20-255.

[165-A27] Action Item for Liang Hai: Provide a font to Michel Suignard for printing 16 new UCAS symbols. See document L2/20-255.

[165-A28] Action Item for Ken Whistler: Update the pipeline to include 16 UCAS symbols. See document L2/20-255.

Short break until 15:15.

(Section 5, Arabic)

[165-C15] Consensus: Accept three Arabic characters with properties as given in L2/20-245 for encoding in a future version of the standard:

	U+061D ARABIC END OF TEXT MARK
	U+0890 ARABIC POUND MARK ABOVE
	U+0891 ARABIC PIASTRE MARK ABOVE

[165-A29] Action Item for Ken Whistler: Update the pipeline to include three new Arabic characters:

	U+061D ARABIC END OF TEXT MARK
	U+0890 ARABIC POUND MARK ABOVE
	U+0891 ARABIC PIASTRE MARK ABOVE

[165-A30] Action Item for Roozbeh Pournader, Lorna Evans: Provide a font to Michel Suignard for printing three Arabic characters with properties as given in L2/20-245.

[165-A31] Action Item for Ken Whistler, Editorial Committee: Change the spelling "Uighur" to "Uyghur" in the names list annotations, to bring them in line with the current spelling conventions in the Core Specification. For 14.0. Reference: Section 5b of L2/20-250.

[165-A32] Action Item for Roozbeh Pournader: Respond to the author of Eastern Arabic Fractions feedback in L2/20-239 about the vulgar fractions and 'Egyptian' two. Reference: Section 5c of L2/20-250.

[165-C16] Consensus: Move the 98 approved characters for Cypro-Minoan at U+12700..U+12761 and its attendant Cypro-Minoan block (U+12700..U+1276F) to U+12F90..U+12FF1 in a Cypro-Minoan block whose range extends from U+12F90..U+12FFF. Reference: Section 7 of L2/20-250.
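The range arithmetic in consensus 165-C16 can be checked with a short sketch. The code point ranges are copied from the consensus text; the helper names (`span`, `relocate`) are illustrative only, and the uniform offset assumes the characters keep their relative order in the new block, which the minutes do not state explicitly.

```python
# Ranges copied from consensus 165-C16 (inclusive first..last).
OLD_CHARS = (0x12700, 0x12761)
NEW_CHARS = (0x12F90, 0x12FF1)
OLD_BLOCK = (0x12700, 0x1276F)
NEW_BLOCK = (0x12F90, 0x12FFF)


def span(first: int, last: int) -> int:
    """Number of code points in the inclusive range first..last."""
    return last - first + 1


# Assumption: every character moves by the same offset (order preserved).
OFFSET = NEW_CHARS[0] - OLD_CHARS[0]


def relocate(cp: int) -> int:
    """Map an old Cypro-Minoan code point to its new location."""
    if not OLD_CHARS[0] <= cp <= OLD_CHARS[1]:
        raise ValueError(f"U+{cp:04X} is outside the old character range")
    return cp + OFFSET


print(span(*OLD_CHARS), span(*NEW_CHARS))  # 98 98
print(span(*OLD_BLOCK), span(*NEW_BLOCK))  # 112 112
print(f"U+{relocate(0x12761):04X}")        # U+12FF1
```

Both character ranges hold the 98 characters named in the consensus, and both blocks span 112 code points (seven columns of 16).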

[165-A33] Action Item for Ken Whistler: Update the pipeline to move Cypro-Minoan characters. See document L2/20-250, Section 7.

[165-C17] Consensus: The UTC accepts seven Ahom characters for encoding in a future version of the standard, with properties as given in L2/20-258, and extends the current Ahom block one column so the block is from U+11700..U+1174F.

	U+11740 AHOM LETTER CA
	U+11741 AHOM LETTER TTA
	U+11742 AHOM LETTER TTHA
	U+11743 AHOM LETTER DDA
	U+11744 AHOM LETTER DDHA
	U+11745 AHOM LETTER NNA
	U+11746 AHOM LETTER LLA
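As a rough check of consensus 165-C17, the sketch below confirms that the seven new letters fall inside the column added to the Ahom block. The new block range and the letter names are taken from the consensus; the previous block end is not stated in the minutes and is derived here from the phrase "extends the current Ahom block one column", assuming a column of 16 code points.

```python
COLUMN_SIZE = 16  # assumption: one code chart column holds 16 code points

# New block range as stated in consensus 165-C17.
NEW_BLOCK = (0x11700, 0x1174F)
# Derived, not stated in the minutes: the block before the extension.
ASSUMED_OLD_BLOCK = (NEW_BLOCK[0], NEW_BLOCK[1] - COLUMN_SIZE)

# The seven accepted letters, copied from the consensus.
NEW_LETTERS = {
    0x11740: "AHOM LETTER CA",
    0x11741: "AHOM LETTER TTA",
    0x11742: "AHOM LETTER TTHA",
    0x11743: "AHOM LETTER DDA",
    0x11744: "AHOM LETTER DDHA",
    0x11745: "AHOM LETTER NNA",
    0x11746: "AHOM LETTER LLA",
}

# Code points gained by the one-column extension.
added_column = range(ASSUMED_OLD_BLOCK[1] + 1, NEW_BLOCK[1] + 1)

print(len(NEW_LETTERS))                             # 7
print(all(cp in added_column for cp in NEW_LETTERS))  # True
print(len(added_column) - len(NEW_LETTERS))         # 9
```

Under these assumptions, nine code points in the added column remain unassigned after the seven letters are placed.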

[165-A34] Action Item for Ken Whistler: Update the pipeline to include seven new Ahom letters. See consensus 165-C17.

[165-A35] Action Item for Deborah Anderson: Provide a font to Michel Suignard for printing seven new Ahom letters. See consensus 165-C17.

[165-A36] Action Item for Ken Whistler: Ask the roadmap committee to extend the Ahom block range.

[165-C18] Consensus: The UTC accepts U+1715 TAGALOG SIGN PAMUDPOD for encoding in a future version of the standard, as documented in L2/20-272.

[165-A37] Action Item for Mark Davis: Add U+1715 TAGALOG SIGN PAMUDPOD to the list of confusables, see L2/20-272.

[165-A38] Action Item for Ken Whistler: Update the pipeline to include U+1715 TAGALOG SIGN PAMUDPOD. See document L2/20-272.

[165-C19] Consensus: The UTC accepts a formal name alias of type "correction" for U+AA6E MYANMAR LETTER KHAMTI HHA, for Unicode version 14.0. The formal name alias will be: MYANMAR LETTER KHAMTI LLA. See document L2/20-263.

[165-A39] Action Item for Ken Whistler: Update NameAliases.txt for Unicode 14.0. See L2/20-263 and L2/20-250.

[165-C20] Consensus: The UTC accepts U+20C0 SOM SIGN for encoding in Unicode version 14.0. See documents L2/20-261 and L2/20-250.

[165-A40] Action Item for Ken Whistler: Update the pipeline to include U+20C0 SOM SIGN. See documents L2/20-261 and L2/20-250.

UTC adjourned for the day at 16:15.


Wednesday October 7, 2020

Meeting opened at 9:30 am.

7 members represented: Apple, Adobe, Google, Facebook, Netflix, Microsoft, UCB.

D.1 UTC #165 properties feedback & recommendations [Scherer, et al, L2/20-240]

[165-A41] Action Item for Mark Davis: Forward L2/20-240 item F1 to CLDR for discussion: Handling of quotation marks in line breaking needs language-specific tailoring.

[165-A42] Action Item for Mark Davis, Editorial Committee: Prepare a proposed update of UAX #31 to clarify when & why ZWJ/ZWNJ should be ignored vs. when not. See L2/20-240 item F4. For Unicode version 14.

[165-A43] Action Item for Rick McGowan, Editorial Committee: Post a PRI for the proposed update of UAX #31 to close December 31, 2020 for Unicode version 14.

[165-A44] Action Item for Asmus Freytag, Michel Suignard: Provide a document proposing an option in UAX #31 to prohibit ZWJ/ZWNJ altogether, for identifier security.

[165-A45] Action Item for Mark Davis, Editorial Committee: In UAX #31 more clearly and consistently refer to CLDR for UnicodeSet syntax, according to L2/20-240 item F5. For Unicode version 14.

[165-A46] Action Item for Mark Davis, Editorial Committee: In UAX #31 prefix the sentence: "The Identifier characters are always a superset of the ID_Start characters" with "by definition", for Unicode version 14.

[165-A47] Action Item for Mark Davis: In security/.../IdentifierType.txt, for U+1B6B..U+1B73 add Identifier_Type=Technical as proposed in L2/20-240 item F6, unless other UTC action items about Identifier_Type classifications contradict this. For Unicode 14.

[165-A48] Action Item for Markus Scherer, Editorial Committee: Update UTS #46 to validate ACE label edge cases, see L2/20-240 item F7. For Unicode 14.

[165-A49] Action Item for Roozbeh Pournader: Re document L2/20-240 item F8, investigate what the right Indic shaping properties should be for certain Vedic characters. See also related AI 164-A63. For Unicode 14.

Short break until 10:45.

[165-A50] Action Item for Rick McGowan: Contact Henri Sivonen re L2/20-202 and refer to L2/20-240, section D2.

D.2 Open Sourcing the Last Resort Font

[165-C21] Consensus: Make the Last Resort Font Github repository public, after updating the ReadMe file appropriately.

[165-A51] Action Item for Ken Lunde, Rick McGowan: Update the ReadMe.md file in Github for the Last Resort Font to include the following in addition to other updates: "This font may be updated for future versions of the standard as time and resources permit."

[165-A52] Action Item for Rick McGowan: Update the Last Resort Font page to refer to the appropriate Github repository.

Short break.

F.1 Editorial Committee Report and Recommendations for UTC #165 Meeting [Whistler, L2/20-241]

[165-A53] Action Item for Ken Whistler, Editorial Committee: Clarify the names list annotation regarding punctus elevatus (U+2E4E) for Unicode 14.0. Ref. David Corbett, July 21, in L2/20-239. [Tue Jul 21 13:06:27 CDT 2020]

[165-A54] Action Item for Liang Hai, Editorial Committee: In Section 12.9, Malayalam, of the core specification, provide clarification about the attestations of candrakkala (U+0D4D) in some irregular forms. For Unicode 14.0. Ref. Ajith, July 29, in L2/20-239. [Wed Jul 29 23:33:52 CDT 2020]

[165-A55] Action Item for Liang Hai, Editorial Committee: In Section 12.9, Malayalam, of the core specification, provide an explanation of the rationale for use of decomposed sequences in two-part vowels in the examples in Table 12-41. For Unicode 14.0. Ref. Ajith, July 30, in L2/20-239. [Thu Jul 30 01:19:40 CDT 2020]

[165-A56] Action Item for Peter Constable, Editorial Committee: Prepare proposed update text for UTS #39 for Version 14.0, incorporating textual suggestions noted in L2/20-239. Ref. Peter Constable, July 30. [Thu Jul 30 15:56:14 CDT 2020, Thu Jul 30 16:27:43 CDT 2020]

[165-A57] Action Item for Ken Whistler, Editorial Committee: Prepare proposed update text for UAX #31 for Version 14.0, incorporating specific textual suggestions noted in L2/20-239. Ref. Peter Constable, July 30. [Thu Jul 30 16:47:52 CDT 2020, Thu Jul 30 17:11:37 CDT 2020]

Lunch break 12:05 - 14:00.

E.1 Recommendations for Emoji, Unicode 14.0 [Daniel/ESC, L2/20-242]

Long discussion.

[165-C22] Consensus: Remove seven provisional emoji candidates based on section IV of L2/20-242R2.

[165-A58] Action Item for Mark Davis, Ned Holbrook: Remove seven provisional emoji candidates based on section IV of L2/20-242R2.

[165-C23] Consensus: Accept thirty-seven draft candidate atomic characters with codepoints and 75 sequences based on document L2/20-242R2.

[165-A59] Action Item for Mark Davis, Ned Holbrook: Update the emoji charts with the thirty-seven draft candidates approved by UTC. See consensus 165-C23 above and document L2/20-242R2.

[165-A60] Action Item for Ken Whistler: Update the pipeline to include thirty-seven emoji candidates. See consensus 165-C23 above and document L2/20-242R2.

E.2 Comments on Emoji 13.1 and 14.0 Candidates [Buff, L2/20-200]

E.2.1 ESC comments on 2020 Q3 feedback [ESC, L2/20-227]

[165-A61] Action Item for Rick McGowan: Point Charlotte Buff to the ESC response document L2/20-227.

UTC adjourned for the week at 16:30.

L2 continued after a short break.


Members Represented

Full Member                   10/05/20  10/07/20
1. Adobe                      yes       yes
2. Apple Inc.                 yes       yes
3. Facebook                   yes       yes
4. Google, Inc.               yes       yes
5. IBM Corporation
6. Microsoft Corporation      yes       yes
7. Netflix                    yes       yes
8. SAP AG
9. Sultanate of Oman, MARA

Institutional Member
1. Bangladesh, MSICT
2. India, MICT
3. Tamil Nadu, TVA
4. UCB                        yes       yes

Supporting Member
1. Emojipedia                           yes
2. Monotype Imaging Corp

Associate Member
1. Emojination                yes       yes
2. SIL                        yes       yes

UTC Attendance

Person                Representing
Deborah Anderson      U.C. Berkeley
Fesseha Atlaw         self
Dragan Besevic        Netflix
Frederick Brennan     self
Jeremy Burge          Emojipedia
Chris Chapman         Adobe
Mia Cinelli           self
Lee Collins           Netflix
Peter Constable       self
Craig Cummings        Amazon
Jennifer Daniel       Google
Mark Davis            Google
Peter Edberg          Apple
Behnam Esfahbod       self
Lorna Evans           SIL
Loïz Fily (BZH)       Office of Breton Language
Rich Gillam           Apple
Andrew Glass          Microsoft
Joshua Hadley         Adobe
Liang Hai             Unicode
Ned Holbrook          Apple
John Jenkins          Apple
Kevin Keystone        self
Jan Kučera            self
Jennifer 8. Lee       Emojination
Kristi Lee            Microsoft
Ken Lunde             Unicode
Rick McGowan          Unicode
Lisa Moore            Unicode
Timo Nijssen          self
Marcel Pauluk         self
Roozbeh Pournader     Facebook
Murray Sargent        Microsoft
Markus Scherer        Google
Jiali Sheng           Microsoft
Michel Suignard       Unicode
Tex Texin             self
Ken Whistler          Unicode
Shawn Xu              Netflix
Daniel Yacob          self
Ben Yang              Panlex

Members not in regular attendance: Tamil Nadu, Oracle, SAP, Sultanate of Oman

Quorum: 5