L2/04-003

Pre-Preliminary Minutes of the UTC 98 / L2 195 Joint Meeting
Mountain View, CA -- February 2 - 5, 2004
Hosted by Microsoft
February 26, 2004


February 2, 2004

PRESENT: Adobe Systems, Inc.; Apple Computer; Basis Technology; Hewlett Packard; IBM Corporation; Justsystem Corp; Microsoft Corporation; Oracle Corporation; Peoplesoft; RLG; SAP; Sybase, Inc.

NOT PRESENT: India, MIT; Pakistan, NLA; Sun Microsystems.

Total members represented: 12, Total not represented:  3

Opened at 10:12 am

Calendar update: Feb 7-10 2005 UTC, Bay Area. Action item review.

B.4.1 IRG Liaison Report [Jenkins, L2/04-037]

[98-A1] Action Item for John Jenkins: Determine the frequency of usage of the supplementary Hong Kong characters.

[98-A2] Action Item for John Jenkins: Ask Tom Bishop if he is willing to have CDL data used internally to IRG.

[98-C1] Consensus: The UTC supports the correction of the glyph for U+21E45 in ISO/IEC 10646:2003 as documented in L2/04-008.

[98-A3] Action Item for John Jenkins: At the IRG in May 2004 give a demo of variation selectors.

[98-C2] Consensus: Remove kAlternateMorohashi and kAlternateKangXi fields from the Unihan database after Unicode 4.0.1.

[98-A4] Action Item for John Jenkins: Remove the kAlternateMorohashi and kAlternateKangXi fields from the Unihan database after Unicode 4.0.1

[98-C3] Consensus: For Unicode 4.0.1 UTC will amend the definition of provisional properties to say that they can be deleted in a future version of the standard, without prior notice.

[98-A5] Action Item for John Jenkins: Send to Mark Davis a description of the new 4.0.1 Unihan fields for incorporation into 4.0.1 UCD.html.

[98-A6] Action Item for Mark Davis, Editorial Committee: Modify the definition of "provisional" properties to include language that they can be removed from the standard at any time. See [98-C3] above. For Unicode 4.0.1.

[98-A7] Action Item for John Jenkins: Update the kTang field of the Unihan database using the correct vowels and diacritical marks from Stimson, after Unicode 4.0.1.

[98-A8] Action Item for John Jenkins: Add data from Cheung and Bauer to Unihan.txt after Unicode 4.0.1.

[98-A9] Action Item for John Jenkins: Draft a Public Review Issue that UTC is considering switching from the modified Yale to Linguistic Society of Hong Kong Romanization in Unihan.txt. Item to close June 8, 2004. The item should argue the case for this change. Also, this would change the Cantonese data in the whole file; specify which fields that would be affected (kCantonese).

[98-A10] Action Item for Rick McGowan: Post the Public Review Issue [98-A9 above] that UTC is considering switching from the modified Yale to Linguistic Society of Hong Kong Romanization in Unihan.txt.

[98-A11] Action Item for John Jenkins: Give information to Mark Davis on how to produce the Unihan sort key for incorporation into UCD for Unicode 4.0.1.

[98-A12] Action Item for John Jenkins, Editorial Committee: Document the properties in the Unihan database post 4.0.1

[98-C4] Consensus: Add kUIRGUSource field to Unihan.txt for 4.0.1.

B.15.6 Changing the general category of U+200B ZERO WIDTH SPACE from Zs to Cf [L2/03-389]

[98-M1] Motion: Change the general category of U+200B ZWSP from Zs to Cf in Unicode 4.0.1.

Moved by Ken Whistler, seconded by Mark Davis

9 For
0 Against
3 Abstain (HP, Oracle, Microsoft)

[98-A13] Action Item for Mark Davis, Editorial Committee: Document the above motion [98-M1] to change the general catgegory of U+200B ZWSP from Zs to Cf as part of Unicode 4.0.1.

B.11.1.1.1 Bidi Conformance

Ad hoc meeting to be convened on Tuesday.

B.12.1 UTRs that should be UTSs: 16, 22, 26, 31

[98-C5] Consensus: Issue a proposed update changing UTR #22 to a UTS.

[98-A14] Action Item for Markus Scherer, Editorial Committee: Make an update to the boilerplate of UTR #22 and add a section on conformance, then update and post as a draft UTS (#22).

B.15.3 Misspelling in LineBreak property name [Davis]

[98-A15] Action Item for Asmus Freytag, Editorial Committee: Establish a way to handle multiple aliases for properties and property values.

B.15.1 No "missing" value for Age [Davis, L2/04-013]

[98-C6] Consensus: Add to DerivedAge.txt that if no codepoint is listed the value is "unassigned", for Unicode 4.0.1.

[98-A16] Action Item for Mark Davis, Editorial Committee: Make the change to DerivedAge.txt saying that if no codepoint is listed the value is "unassigned", for Unicode 4.0.1. DerivedAge.txt and PropertyValueAliases.txt need to be changed.


February 3, 2004

PRESENT: Adobe Systems, Inc.; Apple Computer; Basis Technology; Hewlett Packard; IBM Corporation; Justsystem Corp; Microsoft Corporation; Oracle Corporation; Peoplesoft; RLG (by proxy); SAP; Sun Microsystems (by proxy); Sybase, Inc.

NOT PRESENT: India, MIT; Pakistan, NLA.

Total members represented: 13, Total not represented:  2

Proxies from Sun, RLG.

13 members present.

B.11.2 Issue 20: Review of DUTR #31 Identifier and Pattern Syntax [L2/04-026, L2/03-079]

[98-C7] Consensus: Retarget DUTR #31 to a UAX and post as a draft UAX with 2 corrections TBD later.

Note the following 2 associated actions have interpolated numbers.

[98-A16b] Action Item  for Mark Davis, Editorial Committee: Update and post DUTR #31 as a draft UAX for public review.

[98-A16c] Action Item for Rick McGowan: Post draft UAX #31 for public review.

Break for bidi ad hoc. Remaining are RLG, IBM, Peoplesoft, Adobe, Sybase, HP, Oracle, Justsystem, Sun, Microsoft.

B.14.1.1 Revised Cuneiform proposal [Everson, Tinney, L2/03-036]

[98-A17] Action Item for Steve Tinney, Rick McGowan: Document the numeric and other properties for cuneiform. Take into account the Ugaritic and Old Persian properties. Since this is the third cuneiform script coming into the standard. Check to make sure there aren't gratuitous differences.

B.14.1.2 Proposal to Encode Cuneiform Ideographic Descriptors [Snyder, L2/04-048]

B.14.1.3 Fitting Cuneiform Encoding to Cuneiform Script [Anderson, L2/04-041]

[98-C8] Consensus: UTC accepts the repertoire, character names, and encoding points in the range 12000 - 123F3; 984 characters) for Sumero-Akkadian cuneiform, as specified in document L2/04-036, with block name "Cuneiform, 12000 - 123FF", for encoding in a future version of Unicode.

[98-A18] Action Item for Ken Whistler: Update the pipeline to include acceptance of the repertoire, character names, and encoding points in the range 12000 - 123F3; 984 characters) for Sumero-Akkadian cuneiform, as specified in document L2/04-036.

[98-A19] Action Item for Steve Tinney: Update the proposal L2/04-036 with comments received at the meeting and submit to WG2 and UTC in time for the June 2004 meetings. Please use the latest proposal summary form in the revision.

[98-A20] Action Item for Rick McGowan, Steve Tinney: Discuss "named sequences" listed as possible appendix to the revised L2/04-036 document.

Lunch until 1:00.

[98-A21] Action Item for Michael Kaplan, Editorial Committee: Add a section to the Indic FAQ on using Tamil digits.

[98-C9] Consensus: Accept Tamil digit zero at U+0BE6 with properties as described in L2/04-073 for encoding in a future version of the standard.

[98-A22] Action Item for Ken Whistler: Update the pipeline to reflect addition of Tamil digit zero at U+0BE6 as in document L2/04-073.

[98-A23] Action Item for Michael Kaplan: Update the document L2/04-073 and send to Rick McGowan and Mike Ksar for posting prior to the June 2004 WG2 meeting.

[98-A24] Action Item for Michel Suignard: Include Tamil digit zero as in document L2/04-073 in the provisional ballot comments on the current amendment to ISO/IEC 10646:2003.

[98-A25] Action Item for Michael Kaplan: Provide a font to Asmus Freytag for printing the Tamil digit zero as in document L2/04-073.

B.16 Ideograph Variation Selector and Variation Collection Identifier [Hiura, L2/04-050]

B.12.2 Proposed update to Unicode Technical Standard #6: A Standard Compression Scheme for Unicode [Scherer, L2/04-419, L2/04-020]

[98-C10] Consensus: Create a proposed update to UTS #6 as based on committee discussion.

[98-A26] Action Item for Markus Scherer, Asmus Freytag, Editorial Committee: Prepare an update to UTS #6 based on discussion and post for public review.

[98-A27] Action Item for Rick McGowan: Post proposed update to UTS #6 for public review to close June 8, 2004.

[98-C11] Consensus: Issue a proposed draft UTS #33 for BOCU-1 based on L2/04-016 as modified by discussion during the meeting.

[98-A28] Action Item for Markus Scherer, Editorial Committee: Prepare a public review item for Proposed UTS #33.

[98-A29] Action Item for Rick McGowan: Post Public Review Issue for UTS #33 to close June 8, 2004.

B.12.4 Proposed update to Draft Unicode Technical Report #22: Character Mapping Markup Language [Scherer, L2/04-017, L2/04-018]

[98-M2] Motion: Include as part of the proposed update to make a UTR #22 a UTS as per previous consensus; implement the changes proposed in L2/04-018.

Moved by V S Umamaheswaran, seconded by Eric Muller

10 For,
0 Against,
3 Abstain (Apple, Oracle, Basis)

[98-A30] Action Item for Markus Scherer, Editorial Committee: Make the updates to Proposed update UTS #22 from L2/04-018.

B.15.5 Corrections to script property values [Davis, L2/03-427, L2/04-043]

[98-A31] Action Item for Eric Muller: Put updating script codes process on the agenda for next Editorial Committee meeting.

[98-C12] Consensus: UTC supports in principle the separate encoding of Chinese oracle bone and bronze inscription characters, but sees the need for further expert study prior to encoding them. The UTC sees this work as outside the main mission of the IRG.

[98-C13] Consensus: The UTC feels that whether or not the Chinese seal forms should be encoded separately is an issue that requires further study.

B.4.1 Liaison Report [Jenkins, L2/04-037]

[98-C14] Consensus: The UTC requests that missing HKSCS characters be encoded in amendment 1 of 10646:2003 as needed to ensure round trip mapping.

[98-A32] Action Item for Eric Muller, John Jenkins, Michel Suignard: Draft a proposal to encode 113 (?) missing HKSCS characters, dividing the proposal into appropriate segments (unified ideographs, compatibility, non-ideographs, etc) by April 15th, forward to Mike Ksar for submission to WG2 in time for the June 2004 meeting.

[98-A33] Action Item for Michel Suignard: Add to the ballot comments that the UTC requests that the missing 113 (?) KHSCS characters be encoded in amendment 1 of 10646:2003.

[98-A34] Action Item for Michel Suignard: Send a font to Asmus Freytag for printing the missing HKSCS characters.

B.1.7 Proposed Technical Corrigendum to 10646:2003 - missing cell entries in collection 340 [L2/04-014]

[98-C15] Consensus: The UTC supports the corrections to ISO/IEC 10646:2003 as documented in L2/04-014.

B.1.5 Update Proposal Summary Form to include a recommended default ordering [Davis]

[98-C16] Consensus: UTC requests WG2 to update the proposal summary form to include a recommended default ordering and script value.

Adjourned at 5:09 pm.


February 4, 2004

PRESENT: Adobe Systems, Inc.; Apple Computer; Basis Technology; Hewlett Packard; IBM Corporation; Justsystem Corp; Microsoft Corporation; Oracle Corporation; Peoplesoft; RLG; SAP; Sybase, Inc.

NOT PRESENT: India, MIT; Pakistan, NLA; Sun Microsystems.

Total members represented: 12, Total not represented:  3

B.14.11 Revised Proposal to Encode Phonetic Symbols with Palatal Hook [Constable, L2/04-045]

[98-C17] Consensus: Encode 15 phonetic symbols with palatal hook as specified in L2/04-045 in the phonetic extension block at U+1D7B - U+1D89 and request addition to ISO/IEC 10646:2003 amendment 1. The characters are:

U+1D7B LATIN SMALL LETTER B WITH PALATAL HOOK
U+1D7C LATIN SMALL LETTER D WITH PALATAL HOOK
U+1D7D LATIN SMALL LETTER F WITH PALATAL HOOK
U+1D7E LATIN SMALL LETTER G WITH PALATAL HOOK
U+1D7F LATIN SMALL LETTER K WITH PALATAL HOOK
U+1D80 LATIN SMALL LETTER L WITH PALATAL HOOK
U+1D81 LATIN SMALL LETTER M WITH PALATAL HOOK
U+1D82 LATIN SMALL LETTER N WITH PALATAL HOOK
U+1D83 LATIN SMALL LETTER P WITH PALATAL HOOK
U+1D84 LATIN SMALL LETTER R WITH PALATAL HOOK
U+1D85 LATIN SMALL LETTER S WITH PALATAL HOOK
U+1D86 LATIN SMALL LETTER ESH WITH PALATAL HOOK
U+1D87 LATIN SMALL LETTER V WITH PALATAL HOOK
U+1D88 LATIN SMALL LETTER X WITH PALATAL HOOK
U+1D89 LATIN SMALL LETTER Z WITH PALATAL HOOK

[98-A35] Action Item for Ken Whistler: Update the pipeline to include 15 phonetic symbols with palatal hook as specified in L2/04-045 [98-C17].

[98-A36] Action Item for Peter Constable: Forward document L2/04-045 to Mike Ksar for submission to WG2 for the June 2004 meeting; with the latest proposal summary form.

[98-A37] Action Item for Michel Suignard: Add the 15 phonetic symbols with palatal hook as specified in L2/04-045 to ballot comments for the current amendment of ISO/IEC 10646:2003.

B.14.16 Encoding Bangla Khanda-Ta With Ta+Virama [Sengupta, L2/04-060; Constable, L2/04-062]

[98-A38] Action Item for Peter Constable, Paul Nelson, Editorial Committee: Draft a Public Review Issue on Bangla Khanda Ta to close June 8, 2004.

[98-A39] Action Item for Peter Constable, Paul Nelson: Invite review and participation from the Indic list in the Public Review Issue about Khanda Ta.

[98-A40] Action Item for Rick McGowan: Post the Public Review Issue on Khanda Ta.

B.14.12 Revised Proposal to Encode Phonetic Symbols with Retroflex Hook [Constable, L2/04-046]

[98-C19] Consensus: Encode the 12 phonetic symbols with retroflex hook as in document L2/04-046 at U+1D8F through U+1D9A. The characters are:

U+1D8F LATIN SMALL LETTER A WITH RETROFLEX HOOK
U+1D90 LATIN SMALL LETTER ALPHA WITH RETROFLEX HOOK
U+1D91 LATIN SMALL LETTER D WITH HOOK AND TAIL
U+1D92 LATIN SMALL LETTER E WITH RETROFLEX HOOK
U+1D93 LATIN SMALL LETTER OPEN E WITH RETROFLEX HOOK
U+1D94 LATIN SMALL LETTER REVERSED OPEN E WITH RETROFLEX HOOK
U+1D95 LATIN SMALL LETTER SCHWA WITH RETROFLEX HOOK
U+1D96 LATIN SMALL LETTER I WITH RETROFLEX HOOK
U+1D97 LATIN SMALL LETTER OPEN O WITH RETROFLEX HOOK
U+1D98 LATIN SMALL LETTER ESH WITH RETROFLEX HOOK
U+1D99 LATIN SMALL LETTER U WITH RETROFLEX HOOK
U+1D9A LATIN SMALL LETTER EZH WITH RETROFLEX HOOK

[98-A41] Action Item for Peter Constable: Supply information on the difference between schwa with rhotic hook and schwa with retroflex hook, and forward to Julie Allen for a future edition of the Unicode book.

[98-A42] Action Item for Ken Whistler: Update the pipeline to include 12 phonetic symbols with retroflex hook as in document L2/04-046 at U+1D8F through U+1D9A.

[98-A43] Action Item for Peter Constable: Send a font for 12 phonetic symbols with retroflex hook as in document L2/04-046 to Asmus Freytag for printing.

[98-A44] Action Item for Peter Constable: Update proposal summary form for L2/04-046 to add codepoints and forward to Mike Ksar for June 2004 WG2 meeting.

[98-A45] Action Item for Michel Suignard: Add request for 12 phonetic symbols with retroflex hook as in document L2/04-046 at U+1D8F through U+1D9A to the ballot comments for ISO/IEC 10646:2003 amendment 1.

B.14.13 Revised Proposal to Encode Additional Phonetic Symbols [Constable, L2/04-047]

[98-C20] Consensus: Encode 8 phonetic symbols from L2/04-047 (not including C with stroke) at the following codepoints:

U+023A LATIN SMALL LETTER C WITH STROKE
U+0238 LATIN SMALL LETTER DB DIGRAPH
U+0239 LATIN SMALL LETTER QP DIGRAPH
U+1D7B LATIN SMALL CAPITAL LETTER I WITH STROKE
U+1D7C LATIN SMALL LETTER IOTA WITH STROKE
U+1D7D LATIN SMALL LETTER P WITH STROKE
U+1D7E LATIN SMALL CAPITAL LETTER U WITH STROKE
U+1D7F LATIN SMALL LETTER UPSILON WITH STROKE
U+1DC2 COMBINING SNAKE BELOW

[98-A46] Action Item for Peter Constable, Editorial Committee: Create a Public Review Issue for LATIN SMALL LETTER C WITH STROKE. Take a section out of the document L2/04-047 to make the PRI document.

[98-A47] Action Item for Rick McGowan: Post the Public Review Issue on LATIN SMALL LETTER C WITH STROKE to close June 8, 2004.

[98-A48] Action Item for Ken Whistler: Update the pipeline to include encoding of 8 phonetic symbols from L2/04-047 (not including C with stroke) from L2/04-047.

[98-A49] Action Item for Peter Constable: Update proposal summary form of L2/04-047 and forward document to Mike Ksar for June 2004 WG2 meeting.

[98-A50] Action Item for Peter Constable: Supply a font to Asmus for printing the characters in L2/04-047.

[98-A51] Action Item for Michel Suignard: Include 8 phonetic symbols from L2/04-047 (not including C with stroke) in ISO/IEC 10646:2003 ballot comments for the current amendment.

B.14.10 Revised Proposal to Encode Additional Phonetic Modifier Letters [Constable, L2/04-044]

[98-C21] Consensus: Encode 37 modifier letters from L2/04-044 (but not the modifier letter small glottal stop) and request that they be added to amendment 1. [codepoints TBD]

[98-A52] Action Item for Ken Whistler: Update pipeline to reflect addition of 37 modifier letters from L2/04-044 (but not the modifier letter small glottal stop).

[98-A53] Action Item for Peter Constable: Update proposal for 37 modifier letters from L2/04-044 (but not the modifier letter small glottal stop) and forward to Mike Ksar for June WG2 meeting.

[98-A54] Action Item for Peter Constable: Forward the font for 37 modifier letters from L2/04-044 (but not the modifier letter small glottal stop) to Asmus Freytag for printing.

Lunch until 1:00

[98-A55] Action Item for Lisa Moore: Restructure the UTC agenda for the first day to be a character encoding work group day. Publicize to people so that they know the must have their proposals in a week in advance. Documents not received by the deadline will be automatically deferred to the next UTC meeting.

B.14.3 Names of Phags-pa characters

B.15.2 Ignoring hyphens [Davis, L2/04-012]

[98-C22] Consensus: Adopt rule R1 in document L2/04-012 with clarification, in Unicode 4.0.1.

[98-A56] Action Item for Mark Davis, Editorial Committee: Document the change to accept rule R1 of L2/04-012 in Unicode 4.0.1.

[98-A57] Action Item for Mark Davis: Communicate to WG2 the change to accept rule R1 of L2/04-012 in Unicode 4.0.1.

[98-A58] Action Item for Michel Suignard: Include in ballot comments for amendment 1 of ISO/IEC 10646:2003 the change to accept rule R1 of L2/04-012 in Unicode 4.0.1.

B.14.6 Coptic [Anderson; Emmel, L2/04-053]

[98-C23] Consensus: Remove characters 2C99 and 2C9A from ISO/IEC 10646:2003 amendment 1 unless they are attested. Change the name of 2CBD to Coptic Symbol Stauros.

[98-A59] Action Item for Michel Suignard: Include in ballot comments the above consensus to remove characters 2C99 and 2C9A from ISO/IEC 10646:2003 amendment 1 unless they are attested; and change the name of 2CBD to Coptic Symbol Stauros.

[98-A60] Action Item for Ken Whistler: Update the pipeline with the above consensus to remove characters 2C99 and 2C9A from ISO/IEC 10646:2003 amendment 1 unless they are attested; and change the name of 2CBD to Coptic Symbol Stauros.

[98-A61] Action Item for Debbie Anderson: Reply to Dr Emmel giving him the results of the UTC decisions re Coptic and ask that he re-submit the rest of the chars as a well-formed proposal for additional Coptic characters.

B.14.14 Glagolitic [Anderson, L2/04-51; Cleminson, L2/04-052]

[98-C24] Consensus: Change the names of the following (PDAM) codepoints to these names as per L2/04-052:

U+2C0C GLAGOLITIC CAPITAL LETTER DJERVI
U+2C23 GLAGOLITIC CAPITAL LETTER YU
U+2C2D GLAGOLITIC CAPITAL LETTER TROKUSTASTI A
U+2C3C GLAGOLITIC SMALL LETTER DJERVI
U+2C5D GLAGOLITIC SMALL LETTER TROKUSTASTI A

[98-A62] Action Item for Michel Suignard: Add the proposed 5 Coptic name changes from Document L2/04-052 to ballot comments on ISO/IEC 10646:2003 amendment 1.

[98-A63] Action Item for Debbie Anderson: Respond to Ralph Cleminson requesting more information about the proposed Glagolitic Suspension Marker.

B.14.15 Proposal to encode Greek Zero [Mercier, L2/04-054]

[98-C25] Consensus: Encode "Greek Zero Sign" at U+1018A, and request that it be added to amendment 1, and give it the general category "No".

[98-A64] Action Item for Ken Whistler: Update the pipeline to include "Greek Zero Sign" at U+1018A.

[98-A65] Action Item for Michel Suignard: Include in ballot comments for ISO/IEC 10646:2003 amendment 1 "Greek Zero Sign" at U+1018A, and request that it be added.

[98-A66] Action Item for Debbie Anderson: Send a font to Asmus for printing "Greek Zero Sign" as in L2/04-054.

[98-A67] Action Item for Raymond Mercier, Debbie Anderson: Update the name and proposal and submit to WG2 in time for June meeting. Assign properties "number other" and explicitly name the properties in the propsal. ** Done

B.14.19 Additional Math characters [L2/03-410]

[98-C26] Consensus: Encode the four math characters from L2/03-410 (open superset, open subset, left s-shaped bag delimiter, right s-shaped bag delimiter) at codepoints 27C3, 27C4, 27C5, 27C6 respectively. Request addition to the amendment 1. Properties to follow the model for supersets and delimiters we already have encdoded.

[98-A68] Action Item for Ken Whistler: Update the pipeline to reflect encoding the four math characters from L2/03-410 (open superset, open subset, left s-shaped bag delimiter, right s-shaped bag delimiter) at codepoints 27C3, 27C4, 27C5, 27C6 respectively.

[98-A69] Action Item for Michel Suignard: Add four math characters from L2/03-410 (open superset, open subset, left s-shaped bag delimiter, right s-shaped bag delimiter) at codepoints 27C3, 27C4, 27C5, 27C6 respectively to the ballot comments for amendment 1 of 10646:2003.

[98-A70] Action Item for Asmus Freytag: Update proposal with the latest summary form and forward to WG2 for the June 2004 meeting.

B.14.2 Proposal to encode five Arabic characters [Kew, L2/04-025]

[98-C27] Consensus: Encode the 5 Arabic characters in L2/04-025 at 062D 062E 076B 076C 076D and request that they be added to amendment 1.

[98-A71] Action Item for Ken Whistler: Update the pipeline to reflect encoding the 5 Arabic characters in L2/04-025 at 062D 062E 076B 076C 076D.

[98-A72] Action Item for Michel Suignard: Add the 5 Arabic characters in L2/04-025 at 065D 065E 076B 076C 076D to the ballot comments for amendment 1 of 10646:2003.

[98-A73] Action Item for Jonathan Kew: Send font to Asmus for 5 Arabic characters in L2/04-025 at 062D 062E 076B 076C 076D.

[98-A74] Action Item for Jonathan Kew: Submit L2/04-025 to WG2 for the June 2004 meeting.

B.11.4 Issue 26: Update properties for Ethiopic and Tamil non-decimal digits [L2/04-026]

[98-A75] Action Item for Rick McGowan: Post resolution of the Tamil part of PRI #26 as "Added Tamil digit 0 as "Nd" and the other digits will remain "Nd" reflecting current practice."

[98-C28] Consensus: Change the Ethiopic digits U+1369 - U+1371 from "Nd" to "No" and change the numeric type to synchronize, in Unicode 4.0.1.

[98-A76] Action Item for Ken Whistler, Mark Davis: Update the Ethiopic properties and the numeric type in the UCD for 4.0.1.

[98-A77] Action Item for Rick McGowan: Close Ethiopic part of the PRI #26 with the resolution as above (changed numeric props/types).

B.11.8 Issue 28:

[98-C29] Consensus: Change the bidi properties of the following code points for Unicode 4.0.1:

U+00AD SOFT HYPHEN from ON to BN
All noncharacters from L to BN
All unassigned code points with the Default_Ignorable_Code_Point property from L to BN
     (U+2064..U+2069, U+FFF0..U+FFF8, U+E0000, U+E0002..U+E001F, U+E0080..U+E00FF, U+E01F0..U+E0FFF)
Annotation characters U+FFF9..U+FFFB from BN to ON

Note: The change for the annotation characters was already covered by a separate consensus from Meeting 97.

[98-A78] Action Item for Ken Whistler, Mark Davis: Update the data files to reflect the consensus to change bidi properties of U+00AD soft hyphen and noncharacters and the unassigned characters and DICP (2060 and block on plane 14) to boundary neutral (BN), and change FFF9 through FFFB from BN to ON for 4.0.1. Also update UCD.html.

[98-A79] Action Item for Rick McGowan: Close Public Review Issue #28 with the results of the above consensus to change bidi properties.

B.12.7 Unicode Standard Annex #15: Unicode Normalization Forms [Davis L2/04-094]

[98-C30] Consensus: On the basis of L2/04-094 create a Public Review Issue to close on June 8, 2004 for a corrigendum to UAX #15 Unicode Normalization Forms.

[98-A80] Action Item for Mark Davis, Editorial Committee: Draft Public Review Issue for a corrigendum to UAX #15 Unicode Normalization Forms (see L2/04-094).

[98-A81] Action Item for Rick McGowan: Post the Public Review Issue for a corrigendum to UAX #15 Unicode Normalization Forms closing June 8, 2004. This needs a higher-level notification as well to all of our liaison organizations with TLC.

[98-A82] Action Item for UTC Liaisons to other organizations: Contact their liaison organizations by phone regarding this Public Review Issue on corrigendum to UAX #15 Unicode Normalization Forms.

Adjourned for the day at 5:04.


February 5, 2004

PRESENT: Adobe Systems, Inc.; Apple Computer; Basis Technology; Hewlett Packard; IBM Corporation; Justsystem Corp; Microsoft Corporation; Peoplesoft; RLG; SAP; Sybase, Inc.

NOT PRESENT: India, MIT; Oracle Corporation; Pakistan, NLA; Sun Microsystems.

Total members represented: 11, Total not represented:  4

B.12.5.2 Request for Change Greek Collation Order for SAN [Anderson, L2/04-034]

No action.

B.12.5.3 Request for Change to Greek Collation Order for Koppa [Kirk, L2/04-030; Anderson, L2/04-055]

No action.

[98-A83] Action Item for Lisa Moore: Put [Anderson, L2/04-034] and [Kirk, L2/04-030; Anderson, L2/04-055] onto the agenda for next meeting (B.12.5.2 and B.12.5.3)

B.1.8 Alternate Format Characters [L2/03-336]

[98-A84] Action Item for Ken Whistler, Michel Suignard, Asmus Freytag, V S Umamaheswaran: Review L2/03-336 (Alternate Format characters) and draft UTC feedback before the next UTC meeting.

A.5.1 Approval of minutes of Joint Meeting UTC 97/L2 194 [L2/03-356]

"Failed action" -> "Invalid action". Enter note about the failed motions, which were thought to have passed during the meeting.

[98-C31] Consensus: Approved the minutes as amended in the meeting.

B.15.5 Corrections to script property values [Davis, L2/03-427, L2/04-043; Muller, L2/04-083]

[98-A85] Action Item for Benson Margulies: Produce a proposal for a model for a characterization of multi-valued script properties.

[98-C32] Consensus: Adopt the script change recommendations in L2/04-096, as modified in discussion, for Unicode 4.0.1.

[98-A86] Action Item for Mark Davis, Editorial Committee: Update the Scripts.txt file including some documentation based on L2/04-096 for Unicode 4.0.1.

[98-A87] Action Item for Mark Davis, Editorial Committee: Produce a proposed update to UAX #24 that captures the proposed changes in L2/04-096, for a post 4.0.1 release.

[98-A88] Action Item for Mark Davis: Provide to Rick McGowan an "R" version of L2/04-096 with corrections as in the meeting, for posting to the doc register.

B.11.1.1.1 Conformance clauses [Davis, L2/04-049R] B.11.1.1.2 Property changes

[98-M3] Motion: Change bidi classes of the following four characters:

U+002B PLUS SIGN from ET to ES
U+002D HYPHEN-MINUS from ET to ES
U+002F SOLIDUS (SLASH) from ES to CS
U+2044 FRACTION SLASH from ON to CS

Moved by Mark Davis, seconded by Ken Whistler

9 for
0 against
1 abstain (Apple).

[98-A89] Action Item for Ken Whistler, Mark Davis: Make the required changes in the data file to change bidi class of fraction slash U+2044 fraction slash from ON to CS and change the following 3 characters: + to ES, - to ES, / to CS respectively; for Unicode 4.0.1.

[98-A90] Action Item for Mark Davis, Editorial Committee: Make the corresponding changes in UAX #9 section 4.3 (see the above consensus: bidi class of fraction slash U+2044 fraction slash from ON to CS and change the following 3 characters: + to ES, - to ES, / to CS respectively), which may involve restructuring the clauses while keeping the implications for conformance constant. Make changes for Unicode 4.0.1.

B.11.5 Issue 27: Joiner/nonjoiner in combining character sequences [L2/04-026, L2/04-097]

[98-C33] Consensus: Allow ZWJ and ZWNJ in combining character sequences. The interpretation of joiner/nonjoiner between two combining marks is not yet defined.

[98-M4] Motion: Undertake actions defined by option B of document L2/04-097 (Public Review Issue #27).

Moved by Ken Whistler, seconded by V S Umamaheswaran

5 for (Adobe, IBM, Microsoft, Peoplesoft, Sybase)
0 against
5 abstain (Basis Apple RLG HP SAP)

[98-M5] Motion: Change the general category of ZWJ and ZWNJ to Mn.

Moved by Mark Davis, seconded by Rick McGowan

0 for
5 against
5 abstain

[98-A91] Action Item for Mark Davis, Editorial Committee: Make the minimal change to add "or joiner characters" to definitions D14 and D17 of the standard (with other minor edits allowed) for Unicode 4.0.1, to reflect the consensus above to allow ZWJ and ZWNJ in combining character sequences.

[98-A92] Action Item for Mark Davis, Editorial Committee: Change the script property of ZWJ and ZWNJ to inherited and change Grapheme_Extend to include ZWJ and ZWNJ. For Unicode 4.0.1.

[98-C34] Consensus: Issue a proposed update of Regex UTS #18 that makes the fixes described in dcument L2/04-097, and incorporates other feedback. (See above consensus re allowing ZWJ/ZWNJ.)

[98-A93] Action Item for Mark Davis, Editorial Committee: Issue a proposed update to UTS #18.

[98-A94] Action Item for Rick McGowan: Close Public Review Issue #27 and post the results of the consensus 98-C33 and 98-C34 above.

B.13.2 Proposed changes to the Unihan database [Jenkins, L2/04-038, L2/04-039]

[98-C35] Consensus: All the additions to Unicode 4.0.1 made by the committee [notes incomplete].

[98-A95] Action Item for Editorial Committee: Release 4.0.1 with updates decided at this meeting, with alacrity.

[98-A96] Action Item for John Jenkins, Editorial Committee: Prepare an erratum documenting the known errors in the Unihan database for Unicode 4.0.1.

B.1.4 ISO/IEC 10646:2003/Amd.1:2004 (PDAM 1) [Suignard, L2/04-021]

[98-A97] Action Item for Asmus Freytag: Provide a code chart for 10646:2003/Amd.1:2004 that matches the PDAM plus US comments.

[98-C36] Consensus: The UTC recommends a yes vote with comments to PDAM 1 of 10646:2003.

B.3.2 Position on 14651 ballot [Ksar, L2/03-441, L2/03-442, L2/04-027]

[98-C37] Consensus: The UTC recommends a YES vote with an editorial comment on 14651.

B.3.4 DTR 19769, Extensions for C to support new data types

B.1.3 WG2 - Action Items [Umamaheswaran]

[98-A98] Action Item for Rick McGowan: From WG2, send letter of appreciation to people who did the ancient Greek proposals (Pantelia, Richard, Nick).

B.1.1 Convenor's Report [Ksar]

Adjourned joint meeting at 3:20.


UTC Attendees
Attendees:Representing:
Joan AliprandRLG
Lloyd AndersonEcological Linguistics
Debbie AndersonU C Berkeley
Joe BeckerUnicode
Mark DavisIBM
Deborah GoldsmithApple
Cale JohnsonUCLA
Michael KaplanMicrosoft
Tatsuo KobayashiJustSystem
Mike KsarWG2
Benson MarguliesBasis
Rick McGowanUnicode
Lisa MooreIBM
Eric MullerAdobe
Gabriel PlumleePeoplesoft
Gary RobertsNCR
Lynn RugglesHP
Murray SargentMicrosoft
Markus SchererIBM
Bernhard SchillingSAP
Dean SnyderJohns Hopkins University
Michel SuignardMicrosoft
Steve TinneyUniversity of Pennsylvania
V S UmamaheswaranIBM
Ken WhistlerSybase
Cathy WissinkMicrosoft
Jianping YangOracle

UTC Full Member Attendance Roster
Member 2/2/04 2/3/04 2/4/04 2/5/04
1. Adobe Systems, Inc.
yes
yes
yes
yes
2. Apple Computer, Inc.
yes
yes
yes
yes
3. Basis Technology Corporation
yes
yes
yes
yes
4. India, MIT

 
 


5. Hewlett Packard yes yes yes yes
6. IBM Corporation
yes
yes
yes
yes
7. Justsystem Corporation yes

yes
 
 
8. Microsoft Corporation
yes
yes
yes
yes
9. Oracle Corporation
yes
yes
yes
 
10. Pakistan, NLA




11. PeopleSoft
yes
yes
yes
yes
12. RLG, Inc.
yes
yes (proxy)
yes
yes
13. SAP AG  yes
yes
yes
yes
14. Sun Microsystems, Inc.

 

yes (proxy)

 
 
15. Sybase, Inc.
yes
yes yes yes

Members not in regular attendance: India, MIT; Pakistan, NLA

Total members in regular attendance: 13

Quorum: 7