>UCS-4 range:
>0x00110000-0x3FFFFFFF
>Extended UTF-16 expression:
>U+DB7C + low surrogate + U+DB7E + low surrogate + U+DB7F +
>low surrogate
>
>UCS-4 range:
>0x40000000-0x7FFFFFFF
>Extended UTF-16 expression:
>U+DB7D + low surrogate + U+DB7E + low surrogate + U+DB7F +
>low surrogate

In addition to Geoffrey's comments, is there not also a more basic
problem with this? A sequence of three high-surrogate/low-surrogate
pairs can (and, in accordance with existing specifications, should)
be interpreted as a sequence of three surrogate pairs, i.e. three
separate characters in the range 0x10000-0x10FFFF, not as a single
character beyond 0x10FFFF.
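
To make the point concrete, here is a minimal sketch of how a decoder
that follows the standard UTF-16 surrogate rules would read the first
proposed sequence. The function name decode_utf16 and the low-surrogate
value 0xDC00 are assumptions chosen purely for illustration; the
proposal does not say which low surrogates would be used.

```python
def decode_utf16(units):
    """Decode 16-bit code units using the standard UTF-16 pairing rule."""
    scalars = []
    i = 0
    while i < len(units):
        u = units[i]
        is_high = 0xD800 <= u <= 0xDBFF
        has_low = i + 1 < len(units) and 0xDC00 <= units[i + 1] <= 0xDFFF
        if is_high and has_low:
            # One surrogate pair maps to one supplementary code point.
            scalars.append(0x10000 + ((u - 0xD800) << 10) + (units[i + 1] - 0xDC00))
            i += 2
        else:
            # BMP code unit (or an unpaired surrogate, passed through as-is).
            scalars.append(u)
            i += 1
    return scalars


# Hypothetical instance of the proposed three-pair sequence;
# 0xDC00 stands in for each unspecified low surrogate.
units = [0xDB7C, 0xDC00, 0xDB7E, 0xDC00, 0xDB7F, 0xDC00]
scalars = decode_utf16(units)
print([hex(s) for s in scalars])
```

With these placeholder values the decoder yields three code points,
each within 0x10000-0x10FFFF, rather than one value above 0x10FFFF.
Any existing conformant UTF-16 process would see the data the same way.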
Peter Constable
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:45 EDT