UTC 119/L2 216 Joint Meeting Pre-Preliminary Minutes

L2/09-105

Pre-Preliminary Motions of the UTC 119 / L2 216 Joint Meeting
San Jose, CA -- May 11-15, 2009
Hosted by Adobe
UTC #119 Agenda
Revision date: May 20, 2009

[119-C1] Consensus: Approve the minutes of UTC #118 in L2/09-003R.

[119-C2] Consensus: Approve the minutes of UTC #114 in L2/08-003R.

[119-C3] Consensus: Approve the minutes of UTC #113 in L2/07-345R, with document updated to note the last update date.

[119-M1] Motion: Accept U+065F ARABIC WAVY HAMZA BELOW and U+0620 ARABIC LETTER KASHMIRI YEH for encoding in a future version of the standard, with properties and joining type as given in L2/09-176.

Moved by V S Umamaheswaran, Deborah Anderson

5 for (Sun, IBM, UCB, Apple, Google)
1 against (Sybase)
2 abstain (Microsoft, Adobe)

Motion carried.

[119-M2] Motion: Deprecate U+0673 ARABIC LETTER ALEF WITH WAVY HAMZA BELOW.

Moved by Mark Davis, V S Umamaheswaran

5 for (Sun, IBM, Google, UCB, Apple)
1 against (Sybase)
2 abstain (Microsoft, Adobe)

Motion carried.

[119-C4] Consensus: Accept the change recommended in document L2/09-091 to make "Zinh" the preferred short alias for the Inherited script, in Unicode 5.2.

[119-C5] Consensus: Accept U+0D4E MALAYALAM LETTER DOT REPH with the representative glyph as the "dot reph" enclosed in a dotted box (not a combining character), for encoding in a future version of the standard.

[119-M3] Motion: Give U+1F110 through U+1F129 a casing relationship with the chars U+249C through U+24B5 in Unicode 5.2.

Moved by Mark Davis, seconded by V S Umamaheswaran

4 for (Sun, IBM, Apple, Google)
3 against (Sybase, Adobe, UCB)
1 abstain (Microsoft)

Motion failed.

[119-C6] Consensus: Change the linebreak property for 4 characters as follows in Unicode 5.2:

1B5C: BA -> AL
09F2: PR -> PO
09F3: PR -> PO
09F9: AL -> PO

[119-C7] Consensus: Adopt the solutions proposed in L2/09-179 for fixing rules X6 and N1 of the bidi algorithm UAX #9. Fixing X6 by adding BN; and fixing N1 to make the examples exhaustive, for Unicode 5.2.

[119-C8] Consensus: Add a clause to HL of the bidi algorithm which allows characters with resolved direction of "L" and whose bidi class is "R" or "AL" to be depicted by a mirrored glyph. See L2/09-179.

[119-C9] Consensus: Add a bidi test suite file called BidiTest.txt to the UCD 5.2 beta on the basis of the file that was distributed earlier to the bidi subcommittee.

[119-C10] Consensus: Make the changes in ArabicShaping.txt and in the text of the standard based on L2/09-146 for Unicode 5.2.

[119-C11] Consensus: The UTC does not assign joining groups to any new cursive scripts without a fully-formed joining group proposal for the script. Without such a proposal, the value No_Joining_Group will be assigned.

[119-C12] Consensus: Request that the WG2 Principles and Procedures fully document the restrictions on the ranges for right-to-left scripts and default ignorable characters.

[119-C13] Consensus: Reserve U+1E800 through U+1EFFF for right to left scripts and symbols.

[119-C14] Consensus: Accept 30 Meroitic Hieroglyphs at U+10980 through U+1099D, with block "Meroitic Hieroglyphs" U+10980 - U+1099F, for encoding in a future version of the standard, with properties and names as in L2/09-188R2.

[119-C15] Consensus: Accept 68 Meroitic characters at U+109A0 through U+109F0, with block "Meroitic" U+109A0 through U+109FF, for encoding in a future version of the standard, with properties and names as in L2/09-188R2.

[119-C16] Consensus: Publish the kHanyuPinyin readings as a provisional property in Unihan.txt for Unicode 5.2, with different delimiters for the two current uses of space.

[119-C17] Consensus: Remove the restriction on IDS that they contain only CJK ideographs and radicals.

[119-C18] Consensus: Split up the Unihan.txt within the zip file into separate pieces: one file for normative tags, and other files split using the field types of section 3 of UAX #38.

[119-C19] Consensus: Approve the name changes in section A, B. C of document L2/09-177:

U+1CD4 VEDIC SIGN YAJURVEDIC MIDLINE SVARITA
U+A9C0 JAVANESE PANGKON
U+10A6A OLD SOUTH ARABIAN LETTER SAT
U+10A6F OLD SOUTH ARABIAN LETTER SAMEKH

[119-C20] Consensus: Accept moving codepoints as recommended in sections D and E of L2/09-177.

[119-C21] Consensus: Approve the 42 revised code points for U+1F210 through U+1F231, accounting for the insertion of the SQUARED KATAKANA DE in that range, and U+1F240 through U+1F248, as shown in document L2/09-172.

[119-C22] Consensus: Approve the addition of U+AA7B MYANMAR SIGN PAO KAREN TONE for encoding in Unicode 5.2, as documented in L2/09-100R.

[119-C23] Consensus: Approve 56 Batak characters, in the ranges U+1BC0 through U+1BF3 and U+1BFC through U+1BFF, in a new Batak block (U+1BC0..U+1BFF), with names, glyphs, and code points as shown in L2/09-173 (= WG2 N3626) for encoding in a future version of the standard.

[119-C24] Consensus: Approve the new code point and name: U+3097 HIRAGANA LETTER YE becomes U+1B001 HIRAGANA LETTER ARCHAIC YE.

[119-C25] Consensus: Approve the new names for U+FBB2..U+FBC1, as documented in L2/09-173:

FBB2 ARABIC SYMBOL DOT ABOVE
FBB3 ARABIC SYMBOL DOT BELOW
FBB4 ARABIC SYMBOL TWO DOTS ABOVE
FBB5 ARABIC SYMBOL TWO DOTS BELOW
FBB6 ARABIC SYMBOL THREE DOTS ABOVE
FBB7 ARABIC SYMBOL THREE DOTS BELOW
FBB8 ARABIC SYMBOL THREE DOTS POINTING DOWNWARDS ABOVE
FBB9 ARABIC SYMBOL THREE DOTS POINTING DOWNWARDS BELOW
FBBA ARABIC SYMBOL FOUR DOTS ABOVE
FBBB ARABIC SYMBOL FOUR DOTS BELOW
FBBC ARABIC SYMBOL DOUBLE VERTICAL BAR BELOW
FBBD ARABIC SYMBOL TWO DOTS VERTICALLY ABOVE
FBBE ARABIC SYMBOL TWO DOTS VERTICALLY BELOW
FBBF ARABIC SYMBOL RING
FBC0 ARABIC SYMBOL SMALL TAH ABOVE
FBC1 ARABIC SYMBOL SMALL TAH BELOW

[119-C26] Consensus: Accept 569 Bamum characters at U+16800 through U+16A38 with block "Bamum Supplement" (U+16800..U+16A3F) for encoding in a future version of the standard. See L2/09-102, L2/09-106.

[119-C27] Consensus: Accept 1DFC COMBINING DOUBLE INVERTED BREVE BELOW for encoding in a future version of the standard. See L2/09-028.

[119-C28] Consensus: Accept nine modifier letters:

A7F2 LATIN SUBSCRIPT SMALL LETTER H
A7F3 LATIN SUBSCRIPT SMALL LETTER K
A7F4 LATIN SUBSCRIPT SMALL LETTER L
A7F5 LATIN SUBSCRIPT SMALL LETTER M
A7F6 LATIN SUBSCRIPT SMALL LETTER N
A7F7 LATIN SUBSCRIPT SMALL LETTER P
A7F8 LATIN SUBSCRIPT SMALL LETTER S
A7F9 LATIN SUBSCRIPT SMALL LETTER T
A7FA LATIN LETTER SMALL CAPITAL TURNED M

for encoding in a future version of the standard.

[119-C29] Consensus: Define CJK Unified Ideographs Extension C block in range U+2A700 - U+2B73F. See L2/09-195.

[119-C30] Consensus: Add six derived properties to DerivedCoreProperties.txt and two derived properties to DerivedNormalizationProperties.txt as documented in L2/09-219R2, for Unicode 5.2.

[119-C31] Consensus: Fix the ambiguous variables, add DIGP; add 5.2 chars to tables 4, and 5 of UAX #31 as amended in discussion; add tatweel to table 4; and add U+0F0B and Katakana middle dot to table 3.

[119-C32] Consensus: Change the description of generation of implicit weights as described in feedback item #1 in document L2/09-094.

[119-C33] Consensus: Clarify the definition of "base" for CJK characters using two block values intersected with the unified ideographic property.

[119-C34] Consensus: Change the title of UTR #47 to "Korean Text Representation and Processing in Unicode".

[119-C35] Consensus: Accept 83 Sharada characters at U+11180 through U+111D9, and block Sharada U+11180 through U+111DF for encoding in a future version of the standard. See L2/09-074.

[119-C36] Consensus: Accept 35 Sora Sompeng characters at U+110D0 through U+110F9, with block Sora Sompeng U+110D0 through U+110FF for encoding in a future version of the standard. See L2/09-189

[119-C37] Consensus: Authorize the Unicode 5.2 beta.

[119-C38] Consensus: Rescind earlier acceptance (118-M2, 118-C27) of emoji characters in favor of accepting 748 characters documented in L2/09-173, as follows:

NOTE: THIS LIST MUST BE CHECKED IN DETAIL
23E9-23F3
26CE
2705 270A 270B 2728 274C 274E 2753-2755 2795-2797
2E32
1F0A0-1F0DF (new block Playing Cards 1F0A0-1F0FF)
1F170 1F171 1F17E 1F18E
1F201 1F202 1F232-1F23A 1F250 1F251
(Miscellaneous Pictographc Symbols 1F300-1F5FF) 1F300-1F564
(Emoticons 1F600-1F64F) 1F600-1F63D
(Transport and Map Symbols 1F680-1F6FF) 1F680-1F6C4
Total 748 characters.

NOTE: Refer to document L2/09-153 (which contains the correct listings).

[119-C39] Consensus: Accept U+26E4 PENTAGRAM, U+26E5 RIGHT-HANDED INTERLACED PENTAGRAM, and U+26E6 LEFT-HANDED INTERLACED PENTAGRAM for encoding in a future version of the standard. See L2/09-185R.