Re: Displaying Plane 1 characters

From: Keld Jørn Simonsen (keld@dkuug.dk)
Date: Wed Nov 11 1998 - 21:02:10 EST


> >Java is also going to get problems: "\u10208" would be mistaken as
> >U+1020 <undefined Mongolian character> U+0038 DIGIT EIGHT instead
> >of U-00010208 ETRUSCAN LETTER TH.
>
> \uD800\uDE08 is an obvious answer for Java, since Java's 16-bit data
> type implies its use of UTF-16.

You should not use \uxxxx notation for surrogates,
as surrogates are not characters in either Unicode or 10646,
and thus the short identifiers cannot be used.
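
For reference, the arithmetic behind the D800 DE08 pair quoted above,
and the misreading of the five-digit escape, can be sketched in Java
roughly as follows. The class and method names are made up for this
illustration, and the sketch assumes a code point in the range
0x10000..0x10FFFF. It only shows how the two 16-bit values are derived;
it does not settle whether writing them with \uxxxx is appropriate.

```java
public class SurrogateSketch {
    // Hypothetical helper: leading (high) 16-bit value for a code point
    // in the range 0x10000..0x10FFFF. No range checking is done.
    static char highSurrogate(int codePoint) {
        return (char) (0xD800 + ((codePoint - 0x10000) >> 10));
    }

    // Hypothetical helper: trailing (low) 16-bit value for the same code point.
    static char lowSurrogate(int codePoint) {
        return (char) (0xDC00 + ((codePoint - 0x10000) & 0x3FF));
    }

    public static void main(String[] args) {
        int codePoint = 0x10208; // the Plane 1 value discussed in the quoted text

        // Prints "D800 DE08"
        System.out.printf("%04X %04X%n",
                (int) highSurrogate(codePoint), (int) lowSurrogate(codePoint));

        // A Java escape takes exactly four hex digits, so the literal below
        // is two chars: the char with value 0x1020 followed by the digit '8'.
        String misread = "\u10208";
        System.out.println(misread.length());                    // prints 2
        System.out.printf("%04X%n", (int) misread.charAt(0));    // prints 1020
        System.out.println(misread.charAt(1));                   // prints 8
    }
}
```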

Keld


