L2/01-404

Motions of the UTC 89 / L2 186 Joint Meeting
Mountain View, CA -- November 6 - 9, 2001
December 13, 2001


UTC 88/L2 185 Minutes

[89-M1] Motion: Approve the minutes of UTC 88/L2 185 as documented in L2/01-295R.

Moved by Mark Davis, seconded by Deborah Anderson 

8 for (Adobe, Apple, Basis, IBM, Microsoft, Sybase, Trigeminal, Unisys)
0 against
4 abstain (Compaq, HP, RLG, Sun)

Minutes posted publicly at www.unicode.org/unicode/consortium/utc-minutes.html

Editorial Committee - Unicode 3.2

[89-C1] Consensus: The UTC authorizes a public beta for Unicode 3.2.

IRG

[89-C2] Consensus: Forward Richard Cook's analysis of errors in the database field for the Hanya Da Zidian to the IRG and request that they correct the errors. [L2/01-417] 

[89-C3] Consensus: The UTC authorizes the gathering of a list of ideographs to be submitted to the IRG as candidates for Extension C. [L2/01-417]

[89-C4] Consensus: The UTC is concerned that the 10646 standard could be introducing implementation problems by encoding multiple variants of Han characters, especially without indicating their relationships. [L2/01-417]

[89-C5] Consensus: The UTC requests that the IRG produce data that describes existing variants in Unihan and that they refrain from proposing the encoding of multiple variants in the future. The UTC requests that the IRG maintain two lists: one a list of unified Han characters that are candidates for encoding and a second list of variants with information on the character they are variants of. [L2/01-417]

[89-C6] Consensus: The UTC will ask WG2 to direct the IRG to indicate which proposed compatibility characters are duplicates of existing or proposed compatibility characters.

[89-M2] Motion: In the future the Unicode Standard and 10646 should use variant tags for distinguishing new simplified Chinese variants from their traditional counterparts. [L2/01-417]

Moved by John Jenkins, seconded by Mark Davis 

9 for (Adobe, Apple, Basis, HP, IBM, Microsoft, RLG, Sybase, Trigeminal)
0 against
3 abstain (Compaq, Sun, Unisys)

Properties - ZWJ

[89-C7] Consensus: Under ordinary circumstances, the ZWJ overrides stylistic protocols for ligature control.  The ZWJ is not required for general ligature control and should only be used for linguistically important exceptions. [L2/01-415]

IETF - Transliterating Unicode for IDN

[89-C8] Consensus: The UTC believes the transliteration approach to IDN is not viable. [L2/01-358]

Scripts and New Characters

[89-C9] Consensus: The UTC favors the addition of the remaining Greek acrophonic numerals rather than cloning the existing Greek letters. [L2/01-412]

[89-M3] Motion: Accept the following three Arabic characters for Parkari:  [L2/01-427] 

·U+06EE ARABIC LETTER DAL WITH INVERTED V 

·U+06EF ARABIC LETTER REH WITH INVERTED V 

·U+06FF ARABIC LETTER HEH WITH INVERTED V 

Moved by Ken Whistler, seconded by Mark Davis 

8 for (Adobe, Apple, HP, IBM, Microsoft, RLG, Sybase, Trigeminal)
0 against
4 abstain (Basis, Compaq, Sun, Unisys)

[89-M4] Motion: Add a policy (published in the appropriate manner) that characters will not be advanced from accepted to published state if no font data or font license is available in a format suitable for use in publication by the Consortium. This is intended to synchronize with WG2 policy. 

Moved by Mark Davis, seconded by Asmus Freytag 

8 for (Adobe, Apple, Basis, HP, IBM, Microsoft, RLG, Trigeminal)
1 against (Sybase)
3 abstain (Compaq, Sun, Unisys)

Technical Reports - Normalization

[89-M5] Motion: Remand to the Editorial Committee,  to incorporate text on Fast C or D, FCD, as an informative annex of UAX #15 Unicode Normalization Forms. [L2/01-371]

Moved by Asmus Freytag, seconded by V.S. Umamaheswaran 

9 for (Adobe, Apple, Basis, HP, IBM, Microsoft, Sun, Sybase, Trigeminal)
0 against
3 abstain (Compaq, RLG, Unisys)

Technical Reports - Casing

[89-C10] Consensus: The UTC defines a new property, soft dotted, and includes U+0268, U+0456, and U+0458 in the list of characters with this property, and add to Proplist.txt. [L2/01-441, 445] 

[89-C11] Consensus: The UTC authorizes updating SpecialCasings.txt with the changes given in L2/01-445, section B.3, which will fix the Turkish mapping problems for i's.

[89-C12] Consensus: The UTC authorizes updating SpecialCasings.txt with the changes given in L2/01-445, section 3, which will change all the mappings to be in normalization form C (NFC).

[89-M6] Motion: The UTC authorizes making UTR #21 Case Mappings a Unicode Standard Annex (UAX) and further authorizes a proposed update based on L2/01-441 as amended in discussion. Note that the definitions are normative and the rest is informative.

Moved by Mark Davis, seconded by V.S. Umamaheswaran 

10 for (Adobe, Apple, HP, IBM, Microsoft, RLG, Sun, Sybase, Trigeminal, Unisys)
1 against (Basis)
1 abstain (Compaq)

Properties - Aliases and Derived Age

[89-M7] Failed Motion: Include property aliases as described in L2/01-396R and amended in discussion, in the 3.2 beta.  Document that the intent is to make them normative data files. 

Moved by Mark Davis, seconded by Eric Muller 

5 for (Adobe, Apple, Basis, IBM, Microsoft)
1 against (Trigeminal)
6 abstain (Compaq, HP, RLG, Sun, Sybase, Unisys)

[89-M8] Motion: The UTC accepts the DerivedAge.txt file documented in L2/01-451 and modified in discussion, as a derived datafile in the Unicode Character Database for Unicode 3.2. 

Moved by Asmus Freytag, seconded by Murray Sargent 

11 for (Adobe, Apple, Basis, HP, IBM, Microsoft, RLG, Sun, Sybase, Trigeminal, Unisys)
0 against
1 abstain (Compaq)

New UTC Vice Chair

[89-M8] Motion: The UTC nominates Cathy Wissink as the new UTC Vice Chair, replacing Arnold Winkler.

Moved by Ken Whistler, seconded by V.S. Umamaheswaran 

11 for, unanimous

Technical Reports - Character Mapping

[89-C13] Consensus: Approve the changes to Unicode Technical Report #22 Character Mapping Markup Language, given in the draft L2/01-315 and amended in discussion.

Technical Reports - Script Names

[89-C14] Consensus: The UTC authorizes a Proposed Update to Unicode Technical Report #24 Script Names as described in L2/01-444 and modified during discussion.

Technical Reports - Regular Expressions

[89-C15] Consensus: The UTC authorizes a Proposed Update to Unicode Technical Report #18 Unicode Regular Expression Guidelines, based on L2/01-444 and modified during discussion.

Technical Reports - Collation

[89-C16] Consensus: The UTC authorizes a Proposed Update to Unicode Technical Standard #10 Unicode Collation Algorithm, based on L2/01-446 and modified during discussion. 

[89-C17] Consensus: Add a property to Proplist.txt called Logical_Order_Exception and include  ten Thai characters (0E40..0E44, 0EC0..0EC4). [Section 3.2.1, L2/01-446]

Technical Reports - Character Foldings

[89-C18] Consensus: Make Character Foldings as described in L2/01-447 a Proposed Draft Unicode Technical Report to  be published privately.

Technical Reports - CESU-8

[89-M9] Motion: The UTC approves advancing Proposed Draft Unicode Technical Report #26  Compatibility Encoding Scheme for UTF-16: 8-Bit (CESU-8) to Draft Unicode Technical Report #26  Compatibility Encoding Scheme for UTF-16: 8-Bit (CESU-8) after incorporating updates as discussed. 

Moved by Asmus Freytag, seconded by Toby Phipps 

11 for (Adobe, Apple, Basis, HP, IBM, Microsoft, PeopleSoft, RLG, SAP, Sybase, Unisys)
2 against (Compaq, Trigeminal)
1 abstain (Sun)

Scripts and New Characters - Indic

[89-C19] Consensus: Accept the twelve Indic characters with names and coding positions as documented in L2/01-431R: 

0904 DEVANAGARI SHORT LETTER A
09BD BENGALI SIGN AVAGRAHA
0A01 GURMUKHI SIGN ADAK BINDI
0A03 GURMUKHI SIGN VISARGA
0A8C GUJARATI LETTER VOCALIC L
0AE1 GUJARATI LETTER VOCALIC LL
0AE2 GUJARATI VOWEL SIGN VOCALIC L
0AE3 GUJARATI VOWEL SIGN VOCALIC LL
0AF1 GUJARATI RUPEE SIGN
0B35 ORIYA LETTER VA
0CBC KANNADA SIGN NUKTA
0CBD KANNADA SIGN AVAGRAHA

WG2 - Consent Docket

[89-C20] Consensus:  The UTC accepts the change in encoding positions for compatibility ideographs to FA45..FA6A.  [L2/01-420] 

[89-C21] Consensus:  The UTC accepts the encoding of the Tai Le collection of characters with names (TAI LE LETTER..) and code points (1950..196D, 1970..1974) as described in section 3b, L2/01-420.

[89-C22] Consensus: The UTC approves the addition of U+0234 LATIN SMALL LETTER L
WITH CURL, the encoding of LATIN SMALL LETTER D WITH CURL at U+0221 instead of U+0234, and the encoding of LATIN SMALL LETTER T WITH CURL at U+0236 instead of U+0221. [L2/01-420]

[89-C23] Consensus: The UTC accepts the following list of eight Tamil signs, their names,
and encodings: [L2/01-420]

0BF3    TAMIL DAY SIGN
0BF4    TAMIL MONTH SIGN
0BF5    TAMIL YEAR SIGN
0BF6    TAMIL DEBIT SIGN
0BF7    TAMIL CREDIT SIGN
0BF8    TAMIL AS ABOVE SIGN
0BF9    TAMIL RUPEE SIGN
0BFA    TAMIL NUMBER SIGN

WG2 - Resolutions

[89-C24] Consensus: The UTC confirms the reassignment of the Yijing Monograms to 268A..268F.

Technical Reports - Compression

[89-C25] Consensus: The UTC authorizes a proposed update to Unicode Technical Standard #6  A Standard Compression Scheme for Unicode.

W3C - XML Identifiers

[89-C25] Consensus: The UTC supports the proposal for characters allowed in XML identifiers that is documented in section A of L2/01-454. 

[89-M10] Motion: The UTC agrees to adding a policy that a) there will be no more non-characters so that the ranges are stable, and b) that default ignorable characters will not be assigned outside of the default ignorable code point ranges. [L2/01-454]

Moved by Ken Whistler, seconded by Mark Davis 

12 for (Adobe, Apple, Basis, HP, IBM, Microsoft, PeopleSoft, RLG, SAP, Sybase, Trigeminal, Unisys)
0 against
1 abstain (Compaq)

Properties - Aliases

[89-M11] Motion: The UTC accepts two files of aliases of property values for inclusion in the 3.2 beta.  These files will be updated as new properties are added. [L2/01-396R2]

Moved by Mark Davis, seconded by Eric Muller 

10 for (Adobe, Apple, Basis, HP, IBM, Microsoft, PeopleSoft, RLG, Sybase, Trigeminal)
0 against
3 abstain (Compaq, SAP, Unisys)

Block Boundary Fixes

[89-C26] Consensus: The UTC supports updating Blocks.txt to use the name "Private Use Area" for E000..F8FF. [L2/01-422] 

[89-C27] Consensus: The UTC supports updating the end range in Blocks.txt for CJK Unifed Ideographs Extension A from 4DB5 to 4DBF. [L2/01-422]

[89-C28] Consensus: The UTC supports updating the end range in Blocks.txt for Hangul Syllables from D7A3 to D7AF. [L2/01-422]

[89-C29] Consensus: The UTC supports updating the end range in Blocks.txt for Arabic Presentation Forms-B from FEFE to FEFF. [L2/01-422]

Arabic Enclosing Marks

[89-C27] Consensus: For the ARABIC END OF AYAH the UTC prefers the head model rather than the tail model, like the Syriac abbreviation mark. [L2/01-428] 

[89-C28] Consensus: It should be made clear in the standard that the graphical display of combining enclosing marks apply to preceding grapheme clusters. [L2/01-428]

Dingbat Unification Rules

[89-C29] Consensus: The UTC removes the restriction that the characters in the Zapf dingbat block only have the Zapf dingbat shape.  On a case by case basis, it may make sense to unify these characters with other shapes.  [L2/01-450]

Korean Syllable Structure

[89-C30] Consensus: The UTC authorizes changing the structure of grapheme clusters to recognize that one or more L and V, and zero or more Ts are valid grapheme clusters.  Make appropriated changes to the discussion of syllabification and default rendering for Korean in the standard.