At 18:02 -0800 1998-11-11, Keld Jørn Simonsen wrote:
>> >Java is also going to get problems: "\u10208" would be mistaken as
>> >U+1020 <undefined Mongolian character> U+0038 DIGIT EIGHT instead
>> >of U-00010208 ETRUSCAN LETTER TH.
>>
>> \uD800\uDE08 is an obvious answer for Java, since Java's 16-bit data
>> type implies its use of UTF-16.
>
>You should not use \uxxxx notation for surrogates,
>as surrogates are not characters in either Unicode or 10646,
>and thus the short identifiers cannot be used.
WG2 has provisionally accepted and provisionally allocated Etruscan,
Gothic, Western Musical Symbols, and Byzantine Musical Symbols to Plane 1.
Yes, it hasn't been published or balloted or anything, but one has to have
a way of referring to those (provisional) code positions.
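
For anyone who wants to check the arithmetic quoted above, here is a minimal Java sketch of how the pair \uD800\uDE08 follows from the provisional code position U-00010208, and of why a five-digit escape is read as U+1020 followed by the digit 8. The class and method names (SurrogateSketch, toSurrogates, escape) are hypothetical and exist only for this illustration; it assumes nothing beyond the standard UTF-16 surrogate formula.

```java
// Illustrative sketch only: SurrogateSketch, toSurrogates and escape are
// hypothetical names, not part of any library.
class SurrogateSketch {

    // Split a code point in the range 0x10000..0x10FFFF into a UTF-16
    // surrogate pair (high surrogate first, then low surrogate).
    static char[] toSurrogates(int codePoint) {
        if (codePoint < 0x10000 || codePoint > 0x10FFFF) {
            throw new IllegalArgumentException("not a supplementary code point");
        }
        int offset = codePoint - 0x10000;                // 20-bit value
        char high = (char) (0xD800 + (offset >> 10));    // top 10 bits
        char low = (char) (0xDC00 + (offset & 0x3FF));   // bottom 10 bits
        return new char[] { high, low };
    }

    // Render the pair in Java source-escape notation.
    static String escape(int codePoint) {
        char[] pair = toSurrogates(codePoint);
        return String.format("\\u%04X\\u%04X", (int) pair[0], (int) pair[1]);
    }

    public static void main(String[] args) {
        // The compiler consumes exactly four hex digits after the escape
        // prefix, so this literal is U+1020 followed by the digit 8.
        String misread = "\u10208";
        System.out.println(misread.length());                      // prints 2
        System.out.println(Integer.toHexString(misread.charAt(0))); // prints 1020
        System.out.println(misread.charAt(1));                     // prints 8

        // The surrogate pair for the provisional Plane 1 position.
        System.out.println(escape(0x10208));
    }
}
```

The last line prints the twelve-character text \uD800\uDE08, which is the form suggested above for Java source.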
--
Michael Everson, Everson Gunn Teoranta ** http://www.indigo.ie/egt
15 Port Chaeimhghein Íochtarach; Baile Átha Cliath 2; Éire/Ireland
Guthán: +353 1 478-2597 ** Facsa: +353 1 478-2597 (by arrangement)
27 Páirc an Fhéithlinn; Baile an Bhóthair; Co. Átha Cliath; Éire