Re: [Proposal] Extended UTF-16 by using Plane 14

From: peter_constable@sil.org
Date: Mon Apr 12 1999 - 10:00:01 EDT


>UCS-4 range:
       0x00110000-0x3FFFFFFF

>Extended UTF-16 expression:
         U+DB7C + low surrogate + U+DB7E + low surrogate + U+DB7F +
         low surrogate

>UCS-4 range:
         0x40000000-0x7FFFFFFF

>Extended UTF-16 expression:
         U+DB7D + low surrogate + U+DB7E + low surrogate + U+DB7F +
         low surrogate

       In addition to Geoffrey's comments, is there not also a problem
       with this that the sequence of three pairs of high- and
       low-surrogates can (and, in accordance with existing
       specifications, should) be interpreted as a seqence of three
       surrogate pairs, i.e. three characters in the range x10000 -
       10FFFF?

       Peter Constable



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:45 EDT