> >Java is also going to get problems: "\u10208" would be mistaken as
> >U+1020 <undefined Mongolian character> U+0038 DIGIT EIGHT instead
> >of U-00010208 ETRUSCAN LETTER TH.
> \uD800\uDE08 is an obvious answer for Java, since Java's 16-bit data
> type implies its use of UTF-16.
You should not use \uxxxx notation for surrogates,
as surrogates are not characters in either Unicode or 10646,
and thus the short identifiers cannot be used.
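
For reference, here is a minimal Java sketch of the two behaviours discussed
in the quoted text: how "\u10208" is actually parsed, and which UTF-16 code
units correspond to U-00010208. It assumes Java 5 or later (for
Character.toChars); the class name SurrogateDemo and the helper
toUtf16Escapes are made up for illustration only.

```java
public class SurrogateDemo {
    // Hypothetical helper: renders a code point as the backslash-u escapes
    // of its UTF-16 code units (one escape for BMP characters, two for
    // supplementary ones).
    static String toUtf16Escapes(int codePoint) {
        StringBuilder sb = new StringBuilder();
        for (char unit : Character.toChars(codePoint)) {
            sb.append(String.format("\\u%04X", (int) unit));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // The escape consumes exactly four hex digits, so this literal is
        // U+1020 followed by the digit '8', not a single character.
        String misparsed = "\u10208";
        System.out.println(misparsed.length());                 // 2
        System.out.println(misparsed.charAt(1));                // 8

        // Code units that UTF-16 uses for the supplementary code point.
        System.out.println(toUtf16Escapes(0x10208));
    }
}
```

The sketch only shows what the language does with these escapes; it does not
settle whether writing the code units as escapes is appropriate.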
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:42 EDT