L2/04-259R Title: Scripts subcommittee recommendations to UTC Source: Rick McGowan Date: June 17, 2004 On June 15 the scripts subcommittee of UTC met to discuss a number of proposals that have been submitted in recent months. This document contains the recommendations of the subcommittee to UTC. Included are recommendations for the June 16 evening ad hoc meeting as well. A.15 Tai Lue L2/04-147, L2/04-164, L2/04-154 Recommend that UTC: 1. support reordering of Tai Lue, as in L2/04-164 2. prefer not to decompose, but decomposing the tone mark is OK 3. decomposing the hat is OK; prefer the circle not be decomposed 4. instruct the UTC liaison to negotiate item 5 5. instruct liaison to be flexible on names. A.23 Combining right dot above L2/04-200 Recommend that UTC doesn't oppose this encoding, but otherwise no action is recommended. A.18 N'ko L2/04-172 The subcommittee welcomes the encoding of N'ko, and recommends that UTC take a position of not opposing the encoding in WG2. The subcommittee would like to see further work done regarding the possibile of unification of some characters, but is not adamant. Recommend to instruct UTC liaison not to object to putting N'ko into PDAM 2, if WG2 should decide to do that. A.6 Ethiopic L2/04-143 Recommend that UTC accept this proposal for encoding. A few items regarding collation are needed, and it needs to have codepoints worked out. Suggest to take 2 columns between Ethiopic and Cherokee, then put the rest somewhere in roadmap. An ad hoc session of the roadmap committee convened in the evening to look at the space and allocation issues. A separate recommendation will be forthcoming in an updated roadmap. A.27 Revision of Cuneiform L2/04-189 Recommend that UTC adopt the changes to the cuneiform encoding given in this document, and request that cuneiform be included in PDAM 2 of 10646. A.17.3, A.17.4, A.17.5 Hebrew issues L2/03-443: Recommend that UTC approve adding U+05A2 in a future version of Unicode and also change the glyph of U+05AA in an erratum to the standard. L2/04-089: Recommend that UTC approve adding U+05C5 and U+05C6. L2/04-150: Recommend that UTC accept Qamats Qatan for encoding at U+05BA, with annotations added referencing of U+05B8. See L2/04-237 for the suggested annotation. Someone will need to figure out how it collates: before or after qamats, or nearby, and how to break ties. It should have the same combining class as the existing qamats. A.17.1 Holam, A.17.2 Meteg The subcommittee could not decide on these, so the issues were moved into the UTC plenary agenda. A.19 Sencoten Recommend that UTC accept the 4 chars, in whatever amendment the lower case c with slash is coming. Rename "T WITH DIAGONAL STROKE" to differentiate from an existing character. Tack onto PDAM 1, as follows: approve 023A LATIN CAPITAL LETTER A WITH STROKE approve 023B LATIN CAPITAL LETTER C WITH STROKE move 023C LATIN SMALL LETTER C WITH STROKE approve 023D LATIN CAPITAL LETTER L WITH STROKE approve 023E LATIN CAPITAL LETTER T WITH DIAGONAL STROKE Suggest using "LONG STROKE" in the name as DIAGONAL is redundant. A.8 Coptic L2/04-130 Recommend that UTC change Coptic block repertoire to align with L2/04-130, and re-order it to be in line with modern user community. Start with doc L2/04-130 (N2744) and move ecclesiastic chars to front. A.22 Glagolitic suspension mark L2/04-171 Recommend that UTC accept it and encode at U+1DC3, not where the proposal has it. Change the name to "COMBINING SUSPENSION MARK". A.12 L2/04-139 Recommend that UTC accept both of the currency symbols, at the given codepoints. A.10 Phags pa Recommend that UTC generally support Ireland comments about the new Chinese counter-proposal N2745, and support what is now in the PDAM. Suggest that the UTC liaison be free to accept various names changes in accordance with naming rules, or accept re-ordering, but use the existing script desciption in the ballot as the default. In WG2, suggest an ad hoc, and use West's document L2/04-174 as a basis for the discussion, A.3 Bhutanese marks L2/04-007 Recommend that UTC accept the two Bhutanese marks U+0FD0 and U+0FD1 for encoding. The codepoints on page 2 are wrong, and should be moved. A.16 Indo-Eurpoean Laryngeals On the merits, the 5 subscripted small Latin letters are well attested and are recommended to UTC as candidates for encoding. Encode at U+2090 - U+2094. There is some need to discuss L2/04-214 in the plenary. A.2 Tifinagh Recommend that UTC accept the Tifinagh script for encoding. The codepoints need to change because it's being proposed for a place that has been reserved for RTL scripts. UTC should also discusse possible name improvements with WG2. It should be proposed for addition to amendment 2. Need to respond to the government of Morocco: Everson, McGowan, and Yergeau. A.11 Roman canopy Needs more discussion. Recommend that UTC actively oppose the addition of this character at this time, due to architectural problems. It is too complex for Latin layout. A.7 Combining marks, modifier letters, 5 degree countours L2/04-107 Recommend that UTC accept the repertoire of 23 characters in L2/04-107 (N2713), with changes to some code locations. The roadmap committee was suggested to look at possible placements, U+A700 - U+A71F. A.24 DPRK 106 compatibility ideographs The subcommittee has not been able to find anything wrong with these. Recommend that UTC empower the liaison to not oppose the addition of these 106 chars unless contrary evidence is brought forward. A.26 German umlaut/trema distinction Recommend that UTC to oppose the inclusion of a new character for this. Suggest to DIN that they use CGJ with COMBINING DIAERESIS to mark "diaresis". A.28 = C.16.6 (Two Africanist phonetic characters [L2/04-242]) Recommend that UTC encode the two africanist letters at U+023F LATIN SMALL LETTER S WITH SWASH TAIL and U+0240 LATIN SMALL LETTER Z WITH SWASH TAIL. C.15.12 Sri Lanka standard for Sinhala [Dias, L2/04-131, L2/04-231, L2/04-235, L2/04-239, L2/04-248] The issue left to resolve is how to write some of the stand-alone "right-side" forms of conjuncts. (Flip the order of the ZWJ and virama with respect to what they are recommending, to make it more consistent with other Indic scripts. They have X, virama, zwj, Y; needs to be X, zwj, virama, Y) Rick McGowan to write a response to Sri Lanka re the subcommittee recommendation. C.17.6 Proposal to encode five additional CJK symbols [West, L2/04-029] Black and White squares are already encoded, from CJK standards. These are already mapped from existing standards, and are represented as wide glyphs in implemented fonts on existing platforms. They should be rejected for encoding. The three ideographic iteration marks may be candidates for encoding and should be investigated with CJK experts. C.17.7 Chinese counting rod numerals [Jenkins, L2/04-227] Recommend that UTC accept the 18 counting rod numerals for encoding at U+1D360 - U+1D372, with block name "Counting Rod Numerals" U+1D360 - U+1D37F. Exclude the combining negative number sign, which should be unified with the combining solidus overlay. A.5 Proposal to encode orthographic glottal stops [Constable, L2/04-224] Recommend that UTC accept the upper-case glottal stop at U+0241 LATIN CAPITAL LETTER GLOTTAL STOP.