L2/06-187

Title: WG2 Consent Docket
Source: Ken Whistler
Date: May 10, 2006

Following my usual procedure, I have rolled up all items from the
latest WG2 meeting (WG2 #48, Mountain View, CA, April 24 - 27, 2006)
for which there is a synchronization issue that the UTC needs to
address.

The main outcome of the WG2 meeting was the resolution of
ballot comments on PDAM 3 and the decision to reissue Amendment 3
for another PDAM ballot, in part because of the agreement to
add significant new content to Amendment 3. This consent docket
is largely aimed at precise specification of how the content
agreed for Amendment 3 differs from what the UTC has approved
to date.

================================================================

A. Name Changes for Characters Accepted by the UTC

A.1 Malayalam

0D79 MALAYALAM ORDINAL INDICATOR

WG2 accepted a name change to:

0D79 MALAYALAM DATE MARK

This has been discussed on the unicore list, and I believe there
is consensus that the revised name is more accurate.


A.2 Latin

2C7A LATIN SMALL LETTER O WITH RING INSIDE DOWN

WG2 accepted a name change to:

2C7A LATIN SMALL LETTER O WITH LOW RING INSIDE

The UTC should accept this name change.


A.3 Saurashtra

A8B4 SAURASHTRA LETTER UPAKSHARA

WG2 accepted a name change to:

A8B4 SAURASHTRA CONSONANT SIGN HAARU

The UTC should accept this name change.

=================================================================

B. Myanmar Additions

B.1 Core additions for Burmese

WG2 accepted 7 additional Myanmar characters, on the basis of
WG2 N3043 (= L2/06-077). See also the report of the WG2 ad
hoc regarding Myanmar, WG2 N3099 (= L2/06-140).

102B MYANMAR VOWEL SIGN TALL AA
103A MYANMAR SIGN ASAT
103B MYANMAR CONSONANT SIGN MEDIAL YA
103C MYANMAR CONSONANT SIGN MEDIAL RA
103D MYANMAR CONSONANT SIGN MEDIAL WA
103E MYANMAR CONSONANT SIGN MEDIAL HA
103F MYANMAR LETTER GREAT SA

These have engendered much controversy and a long document trail,
but in my opinion should now be accepted by the UTC.


B.2 Glyph changes for existing core Burmese characters

WG2 also approved glyph changes for two Myanmar characters,
based on WG2 N3043 (= L2/06-077):

1039 MYANMAR SIGN VIRAMA
104E MYANMAR SYMBOL AFOREMENTIONED

The glyph change for 1039 is linked to item B.1, because of
the separation in function between the virama and the killer
(U+103A MYANMAR SIGN ASAT). Both glyph changes have consensus
among all relevant parties at this point, and should be approved
by the UTC.


B.3 Additions for Mon and S'gaw Karen

WG2 accepted 14 additional Myanmar characters for the minority
languages Mon and S'gaw Karen, on the basis of WG2 N3044
(= L2/06-078). See also the report of the WG2 ad
hoc regarding Myanmar, WG2 N3099 (= L2/06-140).

1028 MYANMAR LETTER MON E
1033 MYANMAR VOWEL SIGN MON II
1034 MYANMAR VOWEL SIGN MON O
105A MYANMAR LETTER MON NGA
105B MYANMAR LETTER MON JHA
105C MYANMAR LETTER MON BBA
105D MYANMAR LETTER MON BBE
105E MYANMAR CONSONANT SIGN MON MEDIAL NA
105F MYANMAR CONSONANT SIGN MON MEDIAL MA
1060 MYANMAR CONSONANT SIGN MON MEDIAL LA
1061 MYANMAR LETTER SGAW KAREN SHA
1062 MYANMAR VOWEL SIGN SGAW KAREN EU
1063 MYANMAR TONE MARK SGAW KAREN HATHI
1064 MYANMAR TONE MARK SGAW KAREN KE PHO

Assuming that the UTC accepts the repertoire in B.1, I think
that there is then consensus regarding these additional
14 characters, and the UTC should accept them. However, as
for all the Myanmar additions, other feedback documents should
be considered. (Cf. L2/06-161, etc.)

=================================================================

C. Sundanese Script

WG2 approved the Sundanese script for encoding, 55 characters
in the range 1B80..1BB9, in a new block, Sundanese (1B80..1BBF),
on the basis of WG2 N3022.

The UTC should also approve this script for encoding.

=================================================================

D. Lepcha Addition

WG2's approval of the Lepcha script includes one character not
yet approved by the UTC:

U+1C35 LEPCHA CONSONANT SIGN KANG

Further clarification about this character was provided by
the Irish NB in response to the query from the U.S. NB, and
at this point, I think the correct course is to approve this
addition, to bring the UTC and WG2 back in synch for Lepcha.

=================================================================

E. Combining Diacritical Marks Additions

E.1 Lithuanian dialectology

WG2 approved:

1DCB COMBINING BREVE-MACRON
1DCC COMBINING MACRON-BREVE

on the basis of WG2 N3048. I think the characters are justified,
and the UTC should go on record as approving them.


E.2 Medievalist combining marks

WG2 approved a series of 26 combining marks, U+1DCD..U+1DE6, of
various types, on the basis of WG2 N3027 (= L2/06-074). For details,
see WG2 N3059 (= L2/06-147). The UTC should go on record as
approving them.

=================================================================

F. Latin Extended Additions

F.1 Medievalist Latin characters

WG2 approved the following 9 characters in the Latin Extended
Additional block, on the basis of WG2 N3027 (= L2/06-074):

1E9C LATIN SMALL LETTER LONG S WITH STROKE
1E9D LATIN SMALL LETTER LONG S WITH HIGH STROKE
1E9F LATIN SMALL LETTER DELTA
1EFA LATIN CAPITAL LETTER MIDDLE-WELSH LL
1EFB LATIN SMALL LETTER MIDDLE-WELSH LL
1EFC LATIN CAPITAL LETTER MIDDLE-WELSH V
1EFD LATIN SMALL LETTER MIDDLE-WELSH V
1EFE LATIN CAPITAL LETTER Y WITH LOOP
1EFF LATIN SMALL LETTER Y WITH LOOP

The UTC should approve them.


F.2 Mayanist additions

The UTC had approved the following 4 characters in the Latin Extended-C
block:

2C6F LATIN LETTER TRESILLO
2C70 LATIN LETTER CUATRILLO
2C7B LATIN CAPITAL LETTER TZ
2C7C LATIN SMALL LETTER TZ

WG2 accepted, instead, a revised an extended repertoire of 10
Mayanist Latin additions, based on WG2 N3082 (= L2/06-121),
in the Latin Extended-D block:

A726 LATIN CAPITAL LETTER HENG 
A727 LATIN SMALL LETTER HENG
A728 LATIN CAPITAL LETTER TZ
A729 LATIN SMALL LETTER TZ
A72A LATIN CAPITAL LETTER TRESILLO
A72B LATIN SMALL LETTER TRESILLO
A72C LATIN CAPITAL LETTER CUATRILLO
A72D LATIN SMALL LETTER CUATRILLO
A72E LATIN CAPITAL LETTER CUATRILLO WITH COMMA
A72F LATIN SMALL LETTER CUATRILLO WITH COMMA

This constitutes a move of two characters already approved,
a move, name change, and case cloning for two more (tresillo
and cuatrillo), and the addition of four more characters.
This change has been quite controversial, but the emerging
consensus is that the case pairs, while marginal, are
justified. The UTC should discuss, but I recommend the
approval of the revised repertoire.


F.3 UPA additions

WG2 approved the following 3 character in the Latin Extended-C
block, on the basis of WG2 N3070:

2C7B LATIN LETTER SMALL CAPITAL TURNED E
2C7C LATIN SUBSCRIPT SMALL LETTER J
2C7D MODIFIER LETTER CAPITAL V

The UTC should approve them.


F.4 Medievalist Latin characters

WG2 approved 73 characters in the Latin Extended-D
block, on the basis of WG2 N3027 (= L2/06-074). These
are documented in WG2 N3059 (= L2/06-147). The UTC should go on record
as approving them.

=================================================================

G. CJK Strokes

WG2 accepted an additional set of 20 CJK strokes, in the
range 31D0..31E3 in the CJK Strokes block, to complete the
set of basic stroke type symbols.

The UTC should approve them.

=================================================================

H. Vai additions

H.1 Vai nasal vowel syllables for foreign sounds

WG2 accepted 4 additional Vai characters, based on WG2 N3081R
(= L2/06-120R):

A501 VAI SYLLABLE EEN
A525 VAI SYLLABLE IN
A572 VAI SYLLABLE OON
A596 VAI SYLLABLE UN

And as a result, the entire Vai block was rearranged slightly
to interpolate these values into the block.

The UTC should accept these four characters and the rearranged
code points for the rest of the Vai block, based on WG2 N3059
(= L2/06-147).

H.2 Vai digits

WG2 accepted 10 Vai digits, based on WG2 N3081R (= L2/06-120R):

A620 VAI DIGIT ZERO
...
A629 VAI DIGIT NINE

The UTC should accept these characters and also specify that the
Vai block extends from A500..A62F.

=================================================================

I. Kayah Li Script

WG2 approved the Kayah Li script for encoding, 64 characters
in the range A900..A92F, in a new block, Kayah Li (A900..A92F),
on the basis of WG2 N3038 (= L2/06-073).

The UTC should also approve this script for encoding.

=================================================================

J. Rejang Script

WG2 approved the Rejang script for encoding, 37 characters
in the range A930..A95F, in a new block, Rejang (A930..A95F),
on the basis of WG2 N3096 (= L2/06-139).

The UTC should also approve this script for encoding.

=================================================================

K. Phaistos Disc Symbols

WG2 approved the Phaistos Disc symbols for encoding, 46 characters
in the range 101D0..101FD, in a new block, Phaistos (101D0..101FF),
on the basis of WG2 N3066R (= L2/06-095).

The UTC should also approve this script for encoding.

=================================================================

L. Combining Marks for Old Cyrillic

The UTC approved 22 Old Cyrillic combining marks in the
range 2DE0..2DF5. The WG2 did not take these up, because
the proposal was considered premature. At some point, the
UTC will be getting a revised proposal for consideration,
and at that point may need to reconsider the already
approved repertoire.

My current recommendation on this is to take no action,
pending the submission of the revised proposal.

=================================================================

M. Named Character Sequences

Named Sequences added by WG2 (Lithuanian)

These are not uniquified yet, neither by name nor by sequence

LATIN CAPITAL LETTER A WITH OGONEK AND ACUTE; 0104 0301
LATIN SMALL LETTER A WITH OGONEK AND ACUTE; 0105 0301
LATIN CAPITAL LETTER A WITH OGONEK AND TILDE; 0104 0303
LATIN SMALL LETTER A WITH OGONEK AND TILDE; 0105 0303
LATIN CAPITAL LETTER E WITH OGONEK AND ACUTE; 0118 0301
LATIN SMALL LETTER E WITH OGONEK AND ACUTE; 0119 0301
LATIN CAPITAL LETTER E WITH OGONEK AND TILDE; 0118 0303
LATIN SMALL LETTER E WITH OGONEK AND TILDE; 0119 0303
LATIN CAPITAL LETTER E WITH DOT ABOVE AND ACUTE; 0116 0301
LATIN SMALL LETTER E WITH DOT ABOVE AND ACUTE; 0117 0301
LATIN CAPITAL LETTER E WITH DOT ABOVE AND TILDE; 0116 0303
LATIN SMALL LETTER E WITH DOT ABOVE AND TILDE; 0117 0303
LATIN SMALL LETTER I WITH DOT ABOVE AND GRAVE; 0069 0307 0300
LATIN SMALL LETTER I WITH DOT ABOVE AND ACUTE; 0069 0307 0301
LATIN SMALL LETTER I WITH DOT ABOVE AND TILDE; 0069 0307 0303
LATIN CAPITAL LETTER I WITH OGONEK AND ACUTE; 012E 0301
LATIN SMALL LETTER I WITH OGONEK AND DOT ABOVE AND ACUTE; 012F 0307 0301
LATIN CAPITAL LETTER I WITH OGONEK AND TILDE; 012E 0303
LATIN SMALL LETTER I WITH OGONEK AND DOT ABOVE AND TILDE; 012F 0307 0303
LATIN CAPITAL LETTER J WITH TILDE; 004A 0303; 
LATIN SMALL LETTER J WITH DOT ABOVE AND TILDE; 06A 0307 0303 
LATIN CAPITAL LETTER L WITH TILDE; 004C 0303
LATIN SMALL LETTER L WITH TILDE; 006C 0303
LATIN CAPITAL LETTER M WITH TILDE; 004D 0303
LATIN SMALL LETTER M WITH TILDE; 006D 0303
LATIN CAPITAL LETTER R WITH TILDE; 0052 0303
LATIN SMALL LETTER R WITH TILDE; 0072 0303
LATIN CAPITAL LETTER U WITH OGONEK AND ACUTE; 0172 0301
LATIN SMALL LETTER U WITH OGONEK AND ACUTE; 0173 0301
LATIN CAPITAL LETTER U WITH OGONEK AND TILDE; 0172 0303
LATIN SMALL LETTER U WITH OGONEK AND TILDE; 0173 0303
LATIN CAPITAL LETTER U WITH MACRON AND ACUTE; 016A 0301
LATIN SMALL LETTER U WITH MACRON AND ACUTE; 016B 0301
LATIN CAPITAL LETTER U WITH MACRON AND TILDE; 016A 0303
LATIN SMALL LETTER U WITH MACRON AND TILDE;016B 0303