L2/22-016

Approved Minutes of UTC Meeting 170
Mountain View, CA — January 25 and 27, 2022
Hosted virtually on Zoom

UTC #170 Agenda
Revision date: April 21, 2022


Tuesday, January 25, 2022

Meeting opened at 9:30am. Peter Constable opened the meeting. Craig Cummings opened L2.

6.5 members in regular attendance. Quorum is 3.5.

6 members represented: Adobe, Apple, ETCO, Google, Microsoft, UCB.

A.1 Consortium membership; meeting quorum and proxies

A.2 Agenda review

A.4 Action Item review

Oral review by Ken Whistler. Discussion of plans for AI list updates.

C.1 Editorial Committee Report and Recommendations for UTC #170 Meeting [Whistler, L2/22-020]

[170-C1] Consensus: The UTC authorizes starting the alpha review for Unicode 15.0.

[170-A1] Action Item for Ken Whistler: Prepare an updated NamesList.txt for Unicode 15.0, synched with the Unicode 15.0 repertoire, as finalized during UTC #170.

[170-A2] Action Item for Michel Suignard, Rick McGowan: Prepare a set of Unicode 15.0 alpha review code charts for posting.

[170-A3] Action Item for Ken Whistler, Editorial Committee: Prepare a background document for a PRI on the Unicode 15.0 alpha review.

[170-A4] Action Item for Rick McGowan: Post the PRI for the Unicode 15.0 alpha review, to close April 4, 2022.

Note: Other PRIs close April 10. This is specific to the Alpha. PRI closing dates to be reviewed Thursday in H.1.

[170-A5] Action Item for Mark Davis, Editorial Committee: Correct a list of typos in UTS #18, as noted by Ivan Panchenko in feedback on PRI #427.

[170-A6] Action Item for Rick McGowan, Editorial Committee: Adjust the text of the copyright cover sheet for code charts for Unicode 15.0, to add a link to About Charts in an appropriate location on the sheet.

[170-A7] Action Item for Ken Whistler, Editorial Committee: Update the text following D56 in the Core Specification, to clarify some edge cases involving combining character sequences; coordinate with action item 166-A61. For Unicode 15.0.

[170-A8] Action Item for Ken Whistler, Editorial Committee: Adjust the aliases in the Unicode names list for U+0007 for Unicode 15.0, to better match NameAliases.txt.

[170-A9] Action Item for Ken Whistler, Editorial Committee: Clarify the conventions used in display of normative and informative aliases for control codes, in Section 24.1 of the Core Specification, for Unicode 15.0.

[170-A10] Action Item for Ken Whistler, Editorial Committee: Adjust the mention of the use of Extended_Pictographic in other specifications to use an example but not make the list of those specifications comprehensive. Consider adding or not adding LB30b of UAX #14 to the UAX #44 Table 9 entry for Extended_Pictographic. For Unicode 15.0.

[170-A11] Action Item for Mark Davis, Editorial Committee: Correct the section numbering errors in UTS #39, as noted by Petr Viktorin, for Unicode 15.0.

[170-A12] Action Item for Liang Hai: Investigate the Devanagari and Bengali handling of Jihvamuliya and Upadhmaniya and make suggestions for possible text additions to the Core Specification. See also AI 154-A21.

[170-A13] Action Item for Ken Whistler, Roozbeh Pournader, Editorial Committee: Correct a list of typos in UAX #42, as noted by Ivan Panchenko in L2/22-020 [Fri Jan 7 16:22:34 CST 2022]. For Unicode 15.0.

[170-A14] Action Item for Ken Whistler, Ned Holbrook, Editorial Committee: Correct a list of typos in UTS #51, as noted by Ivan Panchenko in L2/22-020 [Sun Jan 16 10:47:31 CST 2022]. For Unicode 15.0.

B. Liaison Reports

B.1 JTC1/SC2/WG2 Oral report by Michel Suignard.

Short break 10:51 - 11:02.

F. Properties and Algorithms F.1 “Trojan source” Vulnerabilities [Davis, et al, L2/22-007R]

Also see UTC #170 properties feedback & recommendations [L2/22-019], section D4: Avoiding Source Code Spoofing

Long discussion.

[170-C2] Consensus: Form a limited-duration, ad hoc working group as outlined in document L2/22-007R2, section “Proposed Plan”, with Mark Davis as the chair.

[170-A15] Action Item for Mark Davis: Create a working group with at least the following individuals who expressed interest in the meeting: Markus Scherer, Ken Whistler, Asmus Freytag, Dante Gagne, Rich Gillam, Robin Leroy, Kevin Backhouse, Alvaro Munoz, Barry Dorrans, Peter Constable.

F.2 UTC #170 properties feedback & recommendations [Scherer, et al, L2/22-019]

F.2 D3: Proposal to guarantee stability of spelling of property names, values,and aliases in UCD

[170-C3] Consensus: The UTC recommends to the executive officers a new stability policy for not changing the spelling of existing property aliases and property value aliases of UCD properties going forward, excluding provisional and contributory properties, similar to https://www.unicode.org/policies/stability_policy.html#Alias_Stability. See document L2/22-019 item D3.2.

[170-A16] Action Item for Ken Whistler, Properties & Algorithms Group: Determine the earliest Unicode version since which the exact spelling of existing property (value) aliases has been unchanged. See document L2/22-019 item D3.

F.2 Public Review Issues, PRI #427: Proposed Update UTS #18, Unicode Regular Expressions

[170-A17] Action Item for Mark Davis: Clarify in UTS #18 that loose value matching applies to symbolic values of enumerated and catalog properties, and UAX #44 5.9.1 / UAX44-LM1 applies to matching numeric values; use the example of numeric value -0.5 of U+0F33; for details refer to UAX #44; see L2/22-019 item PRI427a.

[170-A18] Action Item for Mark Davis: Adjust the current examples of escape syntax in https://www.unicode.org/reports/tr18/#Hex_notation to account for improved capabilities in regex handling in Java, JavaScript, and ICU.

[170-C4] Consensus: Approve UTS #18 for release, based on PRI #427.

[170-A19] Action Item for Mark Davis, Markus Scherer, Editorial Committee: Finalize the text of UTS #18, based on PRI #427. Include changes for action items 168-A012, 168-A013, and 166-A070 [and for PRI427b] [and for the typos reported via the editorial committee] if feasible, but these are not blockers.

[170-A20] Action Item for Rick McGowan: Close PRI #427.

[170-A21] Action Item for Rick McGowan: Post UTS #18 once the text is finalized, based on PRI #427.

F.2 D5: C0 and C1 stability for Unicode and 10646

Discussion. UTC took no action at this time.

B. Liaison Reports

B.4 CLDR

Oral report by Mark Davis.

B.3 ICU

Oral report by Markus Scherer.

Lunch break 13:00 - 14:00.

Quorum check, OK.

B.6 IETF / ICANN

Oral report by Michel Suignard, Asmus Freytag. See https://icann.org/idn to follow ICANN activities.

D.1 Recommendations to UTC #170 January 2022 on Script Proposals [Anderson, et al, L2/22-023]

D.1 1 Cyrillic Letter Multiocular O [Everson, L2/22-002]

[170-C5] Consensus: Approve a glyph change for U+A66E CYRILLIC LETTER MULTIOCULAR O from a 7-eyed glyph to a 10-eyed glyph for a change in Unicode 15.0. (Reference: L2/22-002)

[170-A22] Action Item for Michael Everson: Provide Michel Suignard with a glyph and propose an annotation for U+A66E CYRILLIC LETTER MULTIOCULAR O, describing the change from 7-eyed glyph to a 10-eyed glyph. (Reference: L2/22-002 and Section 1a of L2/22-023)

[170-A23] Action Item for Michael Everson, Deborah Anderson: Contact Ralph Cleminson and get clarification on his comments. (Reference: L2/22-002 and Section 1a of L2/22-023)

[170-A24] Action Item for Deborah Anderson, Editorial Committee: Create a glyph erratum for U+A66E CYRILLIC LETTER MULTIOCULAR O. (Reference: Section 1a of L2/22-023 and L2/22-002)

D.1 1b Cyrillic Modifier Letters

[170-C6] Consensus: UTC accepts the following two characters, as documented in L2/22-010, for a future version of the standard:

U+1E06D MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
U+1E08F COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I

D.1 2 Latin D.1 2a African Reference Alphabet

[170-A25a] Action Item for Rick McGowan: Relay the comments in Section 2a of L2/22-023 as well as a link to L2/21-247 to the author of L2/21-231.

D.1 2b Casing Pair used by Some African Orthographies

[170-A25b] Action Item for Rick McGowan: Relay comments in Section 2b of L2/22-023 to the author of L2/21-229.

D.1 2c Closed Insular G

UTC notes the documents on Closed Insular G, but takes no action.

D.1 3 Adinkra

[170-A26] Action Item for Deborah Anderson: Relay comments in Section 3 of L2/22-023 to the author of L2/21-237.

D.1 5 Garay [Rovenchak, et al, L2/22-030]

UTC notes this document but takes no further action; the SAH comments have already been conveyed to the author.

D.1 6a Arabic Alef with Right Hamza

[170-A27] Action Item for Deborah Anderson: Forward the comments in Section 6a of L2/22-023 to the author of L2/22-035.

D.1 6b Balochi

[170-A28] Action Item for Roozbeh Pournader, Deborah Anderson: Relay the comments in Section 6b of L2/22-023 to the author of L2/21-238.

D.1 6c Chinese National Body Comments

UTC notes the comments in Section 6c of L2/22-023 but takes no further action.

D.1 6d Lam-Alef Ligature for al-Dani

[170-A29] Action Item for Lorna Evans: Provide text with examples of lam-alef ligature with diacritics for the Hafs and al-Dani orthographies and propose wording for the Ligature Classes subhead of chapter 9.2. (Reference: L2/22-025)

D.1 6e Quranic Superscript Alef Motahafar

[170-A30] Action Item for Ben Yang: Relay the comments in section 6e of document L2/22-023 (including the analysis) to the proposal author of L2/21-204 and ask him to update his proposal.

Short break 15:05 - 15:15.

D.1 7 Linear Elamite

[170-A31] Action Item for Ken Whistler, Roadmap Committee: Update the Roadmap to reflect the allocation for Linear Elamite from U+1C380..U+1C3CF to U+1C380..1C4FF. (Reference: Section 7 of L2/22-023)

D.1 8 Devanagari

[170-A32] Action Item for Deborah Anderson: Relay comments in Section 8 of L2/22-023 to the author of L2/21-240.

D.1 9 Kannada and Telugu

[170-C7] Consensus: UTC accepts the following two characters, as documented in L2/22-006, for a future version of the standard: U+0CDC KANNADA ARCHAIC SHRII, U+0C5C TELUGU ARCHAIC SHRII

Note: the pipeline has already been updated for the characters above.

[170-A33] Action Item for Deborah Anderson: Follow up with Srinidhi/Sridatta to provide a font to Michel Suignard for printing U+0CDC KANNADA ARCHAIC SHRII and U+0C5C TELUGU ARCHAIC SHRII. (Reference: L2/22-006 and Section 9 of L2/22-023.)

D.1 10 Mongolian

[170-A34] Action Item for Liang Hai: Relay comments in Section 10 of document L2/22-023 to the author of L2/21-244.

D.1 11 Sunuwar

[170-C8] Consensus: UTC accepts 44 Sunuwar characters in a new Sunuwar block (U+11BC0..U+11BFF), as documented in L2/21-157R, for encoding in a future version of the standard.

Note: The pipeline has already been updated for 44 Sunuwar characters.

[170-A35] Action Item for Deborah Anderson, Anshuman Pandey: Send Michel Suignard a font for printing Sunuwar. (Reference: L2/21-157R)

D.1 12 Tulu / Tulu-Tigalari

[170-C9] Consensus: UTC accepts 78 Tulu-Tigalari characters in a new Tulu-Tigalari block U+11380..U+113FF as documented in L2/22-031, for encoding in a future version of the standard.

Note: the pipeline has already been updated for Tulu-Tigalari.

[170-A36] Action Item for Vaishnavi Murthy, Deborah Anderson: Send a font to Michel Suignard. (Reference: L2/22-031)

[170-A37] Action Item for Deborah Anderson: Reply to KTSA with feedback regarding creating a more accessible proposal with a clear list of characters and glyphs and identification of any differences in behavior or appearance with the historic Tulu-Tigalari writing system and other comments from Section 12 of L2/22-023.

[170-A38] Action Item for Norbert Lindenberg: Propose text in section 2.11 of the Core Spec on how to handle sequences with multiple left-reordering dependent vowels. (Reference: Section 12 of L2/22-023)

D.1 13 Khmer

[170-A39] Action Item for Norbert Lindenberg: Relay the comments in Section 13 of L2/22-023 to the proposal authors. Also relay work in ICANN for Khmer script label generation rules.

D.1 14 Ideographic Complex Scripts

[170-A40] Action Item for Deborah Anderson, Liang Hai: Relay comments in Section 14 of L2/22-023 to the author of L2/21-165.

D.1 15 Blissymbols

UTC notes this document but takes no further action.

D.1 16 Dwarf Planet Symbols

[170-C10] Consensus: UTC accepts the following five characters, as documented in L2/21-224, for a future version of the standard:

U+1F77B HAUMEA
U+1F77C MAKEMAKE
U+1F77D GONGGONG
U+1F77E QUAOAR
U+1F77F ORCUS

[170-A41] Action Item for Ken Whistler: Update the Pipeline to include five dwarf-planet symbols, as documented in L2/21-224.

D.1 18 Lot of Fortune and Eclipse Symbols

[170-C11] Consensus: UTC accepts the following three characters, as documented in L2/22-005, for encoding in a future version of the standard:

U+1F774 LOT OF FORTUNE
U+1F775 OCCULTATION
U+1F776 LUNAR ECLIPSE

Note: the pipeline has already been updated with three astrological symbols, and Michel Suignard has the font.

Short break 16:21 - 16:30.

D.1 4a Format Control Characters

[170-C12] Consensus: UTC accepts 30 Egyptian Hieroglyph characters, as documented in Tables 29-31 of L2/21-248, for a future version of the standard. (Reference: Section 4a of L2/22-023)

[170-C13] Consensus: UTC Accepts the extension of the Egyptian Hieroglyph Format Controls block from the current allocation U+13430..U+1343F to U+13430..U+1345F. (Reference: Section 4a of L2/22-023)

[170-A42] Action Item for Michel Suignard to create a font for the Egyptian Hieroglyph characters, based on Section 4a of L2/22-023, UTC discussion, and page 3 of L2/21-248.

[170-A43] Action Item for Ken Whistler: Update Blocks.txt with the extension of the Egyptian Hieroglyph characters, based on Section 4a of L2/22-023..

[170-A44] Action Item for Ken Whistler, Deborah Anderson: Confirm the Roadmap changes described in Section 4a of L2/22-023 are incorporated in the Roadmap (i.e., extend Egyptian Hieroglyph Format Controls from U+13430..U+1343F to U+13430..U+1345F, and leave U+13460..U+1347F empty).

Note: the pipeline has already been updated for the above characters.

D.1 4b Variation Sequences for Egyptian Hieroglyphs

[170-C14] Consensus: UTC accepts 98 standardized variants (94 rotations and 4 expanded lost signs), as documented in the attachment to L2/22-012, for encoding in a future version of the standard.

Note: the pipeline has already been updated for the above characters.

[170-A45] Action Item for Ken Whistler: Update StandardizedVariants.txt for the 98 new variation sequences for Egyptian Hieroglyph characters.

D.1 17 Legacy Computing Symbols

[170-C15] Consensus: UTC accepts 731 legacy computing symbols, as documented in L2/21-235, for encoding in a future version of the standard, but changing the gc for the outlined Latin capital letters U+1CCD6..U+1CCEF from Lu to So.

[170-C16] Consensus: UTC accepts a new block allocation, Symbols for Legacy Computing Supplement, as amended in discussion, to (U+1CC00..U+1CEBF) (Reference: L2/21-235)

Note: the pipeline has already been updated to include 731 legacy computing symbols.

[170-A46] Action Item for Ken Whistler and Debbie Anderson: Confirm the Roadmap is updated with Symbols for Legacy Computing Supplement (U+1CC00..U+1CEAF) (Reference L2/21-235)

[170-A47] Action Item for Debbie Anderson and Rebecca Bettencourt: Provide Michel Suignard with a font for printing the legacy computing symbols. (Reference L2/21-235)

[170-A48] Action Item for Doug Ewell and Rebecca Bettencourt: Update the proposal with discussion on the gc properties for the outlined Latin capital letters (U+1CCD6..U+1CCEF) and to adjust the properties accordingly (from gc=Lu to gc=So). (Reference: Section 17 of L2/22-023)

D.1 20 Smalltalk

[170-C17] Consensus: UTC accepts the following 5 Smalltalk symbols, as documented in L2/21-234, for a future version of the standard:

U+1CEB0 HORIZONTAL ZIGZAG LINE
U+1CEB1 KEYHOLE
U+1CEB2 OLD PERSONAL COMPUTER WITH MONITOR IN PORTRAIT ORIENTATION
U+1CEB3 BLACK RIGHT TRIANGLE CARET
U+1F8B2 RIGHTWARDS ARROW WITH LOWER HOOK

[170-A49] Action Item for Ken Whistler: Update the Pipeline with five Smalltalk symbols as documented in L2/21-234.

[170-A50] Action Item for Deborah Anderson, Rebecca Bettencourt: Provide Michel Suignard with a font for printing new Smalltalk symbols. (Reference: L2/21-234)

[170-C18] Consensus: Modify the names of the following six characters (which had been earlier accepted as consensus 169-C6) from:

1DF25 LATIN SMALL LETTER D WITH LEFT HOOK
1DF26 LATIN SMALL LETTER L WITH LEFT HOOK
1DF27 LATIN SMALL LETTER N WITH LEFT HOOK
1DF28 LATIN SMALL LETTER R WITH LEFT HOOK
1DF29 LATIN SMALL LETTER S WITH LEFT HOOK
1DF2A LATIN SMALL LETTER T WITH LEFT HOOK

to:

1DF25 LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK
1DF26 LATIN SMALL LETTER L WITH MID-HEIGHT LEFT HOOK
1DF27 LATIN SMALL LETTER N WITH MID-HEIGHT LEFT HOOK
1DF28 LATIN SMALL LETTER R WITH MID-HEIGHT LEFT HOOK
1DF29 LATIN SMALL LETTER S WITH MID-HEIGHT LEFT HOOK
1DF2A LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK

Reference: Section 21c of L2/22-023.

[170-A51] Action Item for Ken Whistler: Update the Pipeline with name changes to six Latin characters for legacy Malayalam, as documented in Section 21c of L2/22-023.

D.1 22 Bopomofo: Change of Vertical_Orientation property for Bopomofo Tone Marks

[170-A52] Action Item for Liang Hai: Write up a document on the background of the situation re: Bopomofo tone marks. (Reference: Section 21 of L2/21-016 Script Ad Hoc Recommendations and Section 22s of L2/22-023).

D.1 21a Arabic Presentation Forms-A

[170-A53] Action Item for Roozbeh Pournader: Provide annotation to the names list in Arabic Presentation Forms-A, mentioning that the same sequence may have distinct presentation variants, and different forms of ligatures may be needed. (Reference Section 21a of L2/22-023)

D.1 21b Latin Additional Letters

[170-A54] Action Item for Rick McGowan: Relay comments in Section 21b of L2/22-023 to the author of October 13, 2021 feedback on Latin theta contained in L2/22-018.

[170-A55] Action Item for Debbie Anderson: Relay the feedback from Eduardo Marín Silva in Section 21b of L2/22-023, with no comment, to Denis Moyogo Jacquerye.

D.1 21d Old Hungarian

[170-A56] Action Item for Rick McGowan: Relay comments in Section 21d of L2/22-023 to the author of L2/21-246.

[170-A57] Action Item for Ken Whistler and the Editorial Committee: Take into account the editorial suggestions in sections 1, 3, and 7 of L2/21-246.

D.1 21e Symbol for PLAY

[170-A58] Action Item for Rick McGowan: Relay comments in Section 21e of L2/22-023 to the author of Sept. 29, 2021 feedback on PLAY contained in L2/22-018.

D.1 21f Tulu-Tigalari

[170-A59] Action Item for Rick McGowan: Relay comments in Section 21f of L2/22-023 to the author of Sept. 27, 2022 feedback on Tulu-Tigalari contained in L2/22-018.

D.1 19 Punctuation delete mark

[170-A60] Action Item for Rick McGowan: Relay comments in Section 19 of L2/22-023 to the author of L2/21-245.

D.1 23a Recommendations for 15.0

[170-C19] Consensus: The UTC targets the list of characters in section 23a in document L2/22-023 for publication in Unicode 15.0.

[170-A61] Action Item for Markus Scherer, Ken Whistler: Make data file updates in the UCD for Unicode version 15.0.

Meeting adjourned for the day at 17:55.


January 27, 2022

Meeting opened at 9:33.

Peter Constable opened the meeting.

6.5 members in regular attendance. Quorum is 3.5.

6.5 members represented: Adobe, Apple, ETCO, Emojipedia, Google, Microsoft, UCB.

G.1 Emoji Subcommittee Report Q1, 2022 UTC [ESC/Daniel, L2/22-021]

[170-C20] Consensus: Accept the following name changes in candidate emoji for 15.0 Beta for three characters:

	U+1FA76 GRAY HEART
	U+1FACE MOOSE FACE
	U+1FADA GINGER

to these new names:

	U+1FA76 GREY HEART
	U+1FACE MOOSE
	U+1FADA GINGER ROOT

Note: the pipeline has already been updated for the above. No correction was needed for "PEA POD".

[170-C21] Consensus: Accept the change for provisional emoji candidate U+1FABE BLACK BIRD to be represented as a ZWJ sequence, "U+1F426, U+200D, U+2B1B" rather than as an atomic character, for Unicode version 15.0.

[170-A62] Action Item for Ken Whistler: Update the, NamesList.txt, and UnicodeData.txt to remove U+1FABE BLACK BIRD, for Unicode version 15.0.

[170-A63] Action Item for Ned Holbrook: Add the ZWJ sequence for BLACK BIRD, U+1F426, U+200D, U+2B1B,to emoji-zwj-sequences.txt, for Unicode version 15.0.

[170-A64] Action Item for Rick McGowan: Respond to the submitters of the feedback in section 2 of L2/22-021 and point them to that document.

G.1 3. Other Public Feedback

[170-A65] Action Item for Mark Davis and Ned Holbrook: Fix known bug in the tool for emoji charts (see https://github.com/unicode-org/unicodetools/issues/96).

[170-A66] Action Item for Rick McGowan: Respond to the submitter of L2/22-021 section 2 that this is a known bug.

[170-A67] Action Item for Mark Davis, Ned Holbrook: Review feedback from David Corbett [Mon Nov 8 15:04:37 CST 2021] and update UTS #51 as necessary.

A.3 Approval of minutes of prior meeting [L2/21-167]

[170-C22] Consensus: Approve the minutes of UTC #169, as amended in discussion.

[170-A68] Action Item for Rick McGowan: Post the approved minutes of UTC #169.

A.5 Calendar review [Calendar]

Planning for the April 2022 meeting to be a two-day virtual meeting, April 19 and 21.

July meeting discussion. Still planning UTC #172 at Microsoft, July 26-28.

November meeting discussion. Planning UTC# 173 November 1-3; tentatively hosted by Apple, backup Adobe.

Planning for a three-day UTC #174 meeting, January 24-26, 2023.

[170-A69] Action Item for Rick McGowan: Update the calendar(s) appropriately for meeting dates.

D. 2 Mongolian Ad-hoc Report [Liang Hai]

Oral report by Liang Hai. Discussion.

Short break 11:00 - 11:15.

F. Properties and Algorithms F.2: UTC #170 properties feedback & recommendations [Scherer, et al, L2/22-019]

F.2 F2: Mistake about U+0953 and U+0954

[170-C23] Consensus: Remove the explicit Indic_Positional_Category values for U+0953 DEVANAGARI GRAVE ACCENT and U+0954 DEVANAGARI ACUTE ACCENT, letting them default to NA (not applicable), for Unicode version 15.0.

Note these values are already out of the 15.0 draft data file.

F.2 F3: Bad word break of RI ZWJ RI RI

[170-A69a] Action Item for Mark Davis, Chris Chapman: Start a new document that subsumes all old AIs concerning consistency between different segmentation algorithms, with explicit examples of problem cases where available. Once the document is submitted to the registry, request that the old AIs be closed: 160-A73, 149-A50, 142-A64 and possibly more. See L2/22-019 item F3.

F.2 F4: U+0019 in ISO vs. NameAliases.txt vs. chart/NamesList.txt

[170-C24] Consensus: For U+0019, add a Name alias “EM” of type abbreviation, for Unicode version 15.0.

Note, the action proposed in L2/22-019 for this item is already done.

F.2 F5: UAX44-LM2 medial-hyphen clarification

[170-A70] Action Item for Ken Whistler, Editorial Committee: For Unicode version 15.0, clarify the meaning of medial hyphen in loose matching, in UAX #34.

F.2 D1: Proposal to add a derived data file for the “IDNA_Property” to /public/idna

[170-C25] Consensus: Create a new data file for derived data in a new folder “idna2008derived” under https://www.unicode.org/Public/idna/ with data contents as described in L2/21-227 and, for the purpose of regular expressions, with a property name of IDNA2008_Category.

[170-A71] Action Item for Asmus Freytag, Ken Whistler: Post data files with derived IDNA2008 data for Unicode versions 6.1..15.0; see L2/22-019 item D1.

[170-A72] Action Item for Asmus Freytag, Ken Whistler: In UTS #46, document the new files with derived IDNA2008 data; see L2/22-019 item D1.

[170-A73] Action Item for Markus Scherer, Properties & Algorithms Group: For generation of derived IDNA2008 data, either adopt a new tool from Asmus and Ken, or modify the tools that generate the existing idna files to also generate the new file for future versions.

[170-A74] Action Item for Michel Suignard: Convey information about IDNA properties and data files to IETF.

Note: 47 minutes ahead of schedule!

B.5 TC 37/SC 2

Oral report by Peter Constable.

B.7 SEI [Anderson, L2/22-024]

Oral report by Deborah Anderson.

Break for lunch ~12:45 - 14:00.

E. CJK and Unihan E.1 CJK & Unihan Group Recommendations for UTC #170 Meeting [Lunde, L2/22-022] E.1 Section "Public Feedback"

E.1 01) 2021-10-08 20:34:45 CDT [Lunde, L2/22-022]

Ken Lunde noted that he has a draft of a new UTN (Unicode Technical Note) that documents the Unihan Property History spreadsheet and includes it as a downloadable asset.

[170-A75] Action Item for Rick McGowan: Convey to the feedback submitter the CJK & Unihan Group comments in Section 01 of document L2/22-022. See feedback [Fri Oct 8 20:34:45 CDT 2021] in document L2/22-018.

E.1 02) 2021-11-20 04:41:17 CST [Lunde, L2/22-022]

[170-A76] Action Item for Peter Edberg: Ask the CLDR-TC to check the proposed kMandarin property value change for U+266E8, then report back to the UTC. See feedback [Sat Nov 20 04:41:17 CST 2021] in document L2/22-018 and Section 02 of document L2/22-022.

E.1 05) 2022-01-04 07:17:52 CST [Lunde, L2/22-022]

[170-C26] Consensus: Accept the kRSUnicode property value changes for UK-10140 and UK-10989, and corresponding changes to code positions of 557 characters in the Extension H block in the range U+31456..U+31682, based on feedback [Tue Jan 4 07:17:52 CST 2022] in L2/22-018 and as amended in Section 05 of document L2/22-022, for Unicode Version 15.0.

[170-A77] Action Item for Michel Suignard: Change the kRSUnicode property values of UK-10140 and UK-10989, reorder the Extension H block, and provide to John Jenkins an updated Unihan15.txt data file, for Unicode Version 15.0.

[170-A78] Action Item for John Jenkins: Update the Unihan database to reflect the Extension H changes, for Unicode Version 15.0.

[170-A79] Action Item for John Jenkins: Update the Unihan database, based on feedback [Thu Jan 6 07:53:34 CST 2022] in L2/22-018 and as amended in Section 06 of L2/22-022, for Unicode Version 15.0.

E.1 07) 2022-01-06 20:28:54 CST [Lunde, L2/22-022]

[170-C27] Consensus: Accept the kTotalStrokes property value change for U+2AB8F, based on feedback [Thu Jan 6 20:28:54 CST 2022] in L2/22-018 and Section 07 of document L2/22-022, for Unicode Version 15.0.

[170-A80] Action Item for Michel Suignard: Change the kTotalStrokes property value for U+2AB8F from 12 to 13, and provide to John Jenkins an updated Unihan15.txt data file, for Unicode Version 15.0.

[170-A81] Action Item for John Jenkins: Update the Unihan database to reflect the kTotalStrokes property value change for U+2AB8F, for Unicode Version 15.0.

E.1 08) 2022-01-10 18:45:29 CST [Lunde, L2/22-022]

[170-C28] Consensus: Accept the UAX #45 data file changes amended to use the convention of square brackets around these entities, based on feedback [Mon Jan 10 18:45:29 CST 2022] in L2/22-018 and as amended in Section 08 of document L2/22-022, for Unicode Version 15.0.

[170-A82] Action Item for John Jenkins, Editorial Committee: Document the use of square brackets for references to unencoded ideograph components in Field 5 of the UAX #45 data file, based on feedback [Mon Jan 10 18:45:29 CST 2022] in L2/22-018 and as amended in Section 08 of document L2/22-022, for Unicode Version 15.0.

E.1 Section "UAX #38 / Unihan Database Documents"

E.1 09) L2/21-226: Proposed Supplement to the Unihan Database's kTotalStrokes Field [Lunde, L2/22-022]

[170-C29] Consensus: Accept a new provisional Unihan database property, kAlternateTotalStrokes, based on L2/21-226 and as amended in Section 09 of document L2/22-022, with the understanding that adding this property value to any ideograph requires that the existing kTotalStrokes property value for that ideograph be reviewed—and modified, if necessary—so that these properties can be in sync, and change the kTotalStrokes property values of U+9AA8 and U+2A6B2 to “9 10” and 15, respectively, for Unicode Version 15.0.

[170-A83] Action Item for Michel Suignard: Change the kTotalStrokes property values of U+9AA8 and U+2A6B2 to “9 10” and 15, respectively, and provide to John Jenkins an updated Unihan15.txt data file, for Unicode Version 15.0.

[170-A84] Action Item for John Jenkins, Editorial Committee: Document the provisional kAlternateTotalStrokes property in UAX #38, add a new section after Section 3.8 that documents the single-letter IRG source identifiers, and update PRI 437 accordingly, for Unicode Version 15.0.

[170-A85] Action Item for Ken Lunde: Provide to John Jenkins the working data for the initial set of kAlternateTotalStrokes property values that were derived from the kRSAdobe_Japan1_6 property for the purpose of populating the kAlternateTotalStrokes property with an initial, albeit minimal, set of property values, for Unicode Version 15.0.

[170-A86] Action Item for John Jenkins: Update the Unihan database to reflect the kTotalStrokes property value changes for U+9AA8 and U+2A6B2, for Unicode Version 15.0.

[170-A87] Action Item for John Jenkins, CJK & Unihan Group: Populate the kAlternateTotalStrokes property with an initial, albeit minimal, set of property values, for Unicode Version 15.0.

E.1 10) L2/21-228: Request to move the source reference for UK-02830 (IRG N2520) [Lunde, L2/22-022]

[170-C30] Consensus: Accept the proposal to move the kIRG_UKSource property value UK-02830 from U+238A7 to U+4DBE, based on document L2/21-228 and Section 10 of document L2/22-022, for Unicode Version 15.0.

[170-A88] Action Item for Michel Suignard: Move the kIRG_UKSource property value UK-02830 from U+238A7 to U+4DBE, and provide to John Jenkins an updated Unihan15.txt data file, for Unicode Version 15.0.

[170-A89] Action Item for John Jenkins: Update the Unihan database to reflect the property value changes for U+4DBE and U+238A7, based on document L2/21-228 and as amended in Section 10 of document L2/22-022, for Unicode Version 15.0.

Short break 15:00 - 15:15.

E.1 11) L2/21-236: Horizontal extension or disunification request for 14 Khangxi-characters [Lunde, L2/22-022]

[170-A90] Action Item for John Jenkins, CJK & Unihan Group: Use the kIRG_GSource and kIRGKangXi properties to derive additional kKangXi property values.

[170-A91] Action Item for John Jenkins: In UAX #38 for Unicode version 15.0, add a note indicating the relationship between the kIRGKangXi and kKangXi provisional properties.

E.1 12) L2/22-008: Proposal to change various values in the Unihan Database [Lunde, L2/22-022]

[170-C31] Consensus: Accept the kRSUnicode and kTotalStrokes property value changes for U+29867, based on L2/22-008 and Section 12 of document L2/22-022, for Unicode Version 15.0.

[170-A92] Action Item for Peter Edberg: Ask the CLDR-TC to check the proposed kMandarin property value changes and additions, then report back to the UTC. See document L2/22-008 and Section 12 of document L2/22-022.

[170-A93] Action Item for Michel Suignard: Change the kRSUnicode and kTotalStrokes property values for U+29867, based on L2/22-008 and Section 12 of document L2/22-022, and provide to John Jenkins an updated Unihan15.txt data file, for Unicode Version 15.0.

[170-A94] Action Item for John Jenkins: Update the Unihan database, based on L2/22-008 and as amended in Section 12 of document L2/22-022, for Unicode Version 15.0.

E.1 13) L2/22-014: Incorrect Radical for CJK Unified Ideograph 266B9 [Lunde, L2/22-022]

[170-C32] Consensus: Accept the kRSUnicode property value change for U+266B9, based on L2/22-014 and as amended in Section 13 of document L2/22-022, for Unicode Version 15.0.

[170-A95] Action Item for Michel Suignard: Change the kRSUnicode property value for U+266B9, based on L2/22-014 and as amended in Section 13 of document L2/22-022, and provide to John Jenkins an updated Unihan15.txt data file, for Unicode Version 15.0.

[170-A96] Action Item for John Jenkins: Update the Unihan database to reflect the property value changes for U+266B9, based on document L2/22-014 and and as amended in Section 13 of document L2/22-022, for Unicode Version 15.0.

E.1 14) L2/22-027: Proposal to add derived kMandarin property values [Lunde, L2/22-022]

[170-A97] Action Item for Peter Edberg: Ask the CLDR-TC to check the proposed kMandarin property value additions, then report back to the UTC. See document L2/22-027 and Section 14 of document L2/22-022.

E.1 15) UAX #45 Adjustments for Extension H [Lunde, L2/22-022]

[170-C33] Consensus: Authorize a proposed update of UAX #45 for Unicode 15.0 to update the table in Section 2.1 to reflect “H” (“Encoded in Extension H”) as a new status.

[170-A98] Action Item for John Jenkins, Editorial Committee: Update the table in Section 2.1 of UAX #45 to reflect “H” (“Encoded in Extension H”) as a new status, based on Section 15 of document L2/22-022, for Unicode Version 15.0.

[170-A99] Action Item for John Jenkins: Update the UAX #45 data file, USourceData.txt, to reflect “H” (“Encoded in Extension H”) as a new status in its header, and update the records of 161 U-Source ideographs to reflect “H” in Field 1 and the correspond Extension H code point in Field 2, based on Section 15 of document L2/22-022, for Unicode Version 15.0.

[170-A100] Action Item for Rick McGowan: Post the PRI for the proposed update of UAX #45, to close 2022-04-06.

Note: This PRI may close earlier than others.

E.1 16) L2/22-009: Proposal to Add to UAX #45 Three Ideographs with Radical 見 [Lunde, L2/22-022]

[170-C34] Consensus: Accept three new U-Source ideographs as UTC-03252 through UTC-03254 with a UAX #45 status value of N, based on document L2/22-009 and Section 16 of document L2/22-022, for Unicode Version 15.0.

[170-A101] Action Item for John Jenkins: Add three new records to USourceData.txt and their representative glyphs to USourceGlyphs.pdf, based on document L2/22-009 and Section 16 of document L2/22-022, for Unicode Version 15.0.

E.1 17) L2/22-011: UAX #45 Data Issues in Unicode 14.0

[170-C35] Consensus: Accept the proposed UAX #45 data file and representative glyph changes, based on document L2/22-011 and as amended in Section 17 of document L2/22-022, for Unicode Version 15.0.

[170-A102] Action Item for John Jenkins: Update the UAX #45 data file, USourceData.txt, and modify the representative glyph for UTC-00443 to be identical to that of UTC-00345, based on document L2/22-011 and as amended in Section 17 of document L2/22-022, for Unicode Version 15.0.

E.1 19) IRG Working Set 2021 Status

Discussion. UTC took no action at this time.

E.1 18) L2/21-222: Request to revise UAX 50 for harmonization with Adobe Japan1 [Lunde, L2/22-022]

[170-A103] Action Item for Ken Lunde: Convey to the authors of L2/21-222 the CJK & Unihan Group comments in Section 18 of document L2/22-022, along with any feedback from the UTC.

E.2 PRI #436 UTS #37 Unicode Ideographic Variation Database [Lunde, L2/22-039, PRI #436]

[170-C36] Consensus: Approve the proposed update of UTS #37 for publication.

[170-A104] Action Item for Ken Lunde, Editorial Committee: Clean up UTS #37 for publication..

[170-A105] Action Item for Rick McGowan: Close PRI #436.

[170-A106] Action Item for Rick McGowan: Post UTS #37.

H.1 Closing dates for PRIs

[170-A107] Action Item for Rick McGowan:Update the closing dates for PRI #437 and PRI #439 to April 6, 2022.

G.2 Draft Emoji for Unicode Version 15.0

[170-A108] Action Item for Rick McGowan: Close PRI #435.

[170-C36] Consensus: Prepare a new PRI on the draft candidate emoji for Unicode version 15.0.

[170-A109] Action Item for Ned Holbrook: Produce a draft emoji chart for Unicode version 15.0 and the background statement for a PRI. (I.e., freeze-dried PDF of the emoji charts.)

[170-A110] Action Item for Rick McGowan: Post the new PRI on the emoji draft.

Thanks to the new chair.

UTC adjourned for the week at 16:44.

L2 continued.


Members Represented

Full Member 2022-01-25 2022-01-27
1. Adobe yes yes
2. Apple Inc. yes yes
3. ETCO (Oman) yes yes
4. Facebook
5. Google, Inc. yes yes
6. Microsoft Corporation yes yes
7. Netflix
8. Salesforce
9. SAP AG
10. Yat Labs
 
Institutional Member
1. Bangladesh, MSICT
2. Oman (MARA) (yes) (yes)
3. Tamil Nadu, TVA
4. UCB yes yes
 
Supporting Member
1. Emojipedia yes
 
Associate Member
1. Amazon yes yes
2. Canva yes
3. SIL yes
 

UTC Attendance

NameRepresenting
Salim Al MandhariETCO
Julie AllenUnicode
Debbie AndersonUCB
Keith BroniEmojipedia
Jeremy BurgeEmojipedia
Christopher ChapmanAdobe
Peter ConstableMicrosoft
Craig CummingsAmazon
Jennifer DanielGoogle
Mark DavisGoogle
Kamile DemirAdobe
Barry DorransMicrosoft
Peter EdbergApple
Lorna EvansSIL
Zhilin FangVMWare
Michael FicarraTC39
Loïz FilyBZH
Asmus Freytagself
Dante GagneMicrosoft
Richard GillamApple
Andrew GlassMicrosoft
Dan GriffinMicrosoft
Joshua HadleyAdobe
Liang HaiUnicode
Simon HammondCanva
Ned HolbrookApple
John JenkinsApple
Marcel Krüger
self
Jan Kučeraself
Robin LeroyGoogle
Norbert Lindenbergself
Steven LoomisUnicode
Ken LundeApple
Rick McGowanUnicode
Lisa MooreUnicode
Roozbeh PournaderUnicode
Murray SargentMicrosoft
Markus SchererGoogle
Mark Shoulsonself
L Stygerself
Michel SuignardUnicode
Samantha SunneEmojination
Tex TexinXencraft
Ken WhistlerUnicode
Amadeusz WieczorekMicrosoft
Ben YangAdobe