Re: Private Use Surrogate Pairs

From: Peter_Constable@sil.org
Date: Wed May 08 2002 - 17:29:06 EDT


On 05/08/2002 02:03:07 PM "James E. Agenbroad" wrote:

> Wednesday, May 8, 2002
>On page 322 of version 3.0 of the Unicode Standard in describing the
>Private-Use High Surrogates: "This mechanism allows for a total of 131,068
>(= 128 x 1024 - 4) private-use characters representable by means of
>surrogate pairs." I understand that the 128 is the codes U+DB80 to U+DBFF
>for the high-surrogates. It would be helpful if it were stated that the
>low-surrogate codes are from the 1,024 at U+DC00 to U+DFFF and which four
>codes are excluded.

I think Jim is asking for clarification in the text of the Standard and not
just in a response to him, but in case anyone isn't sure, the four that are
excluded are U+FFFFE, U+FFFFF, U+10FFFE and U+10FFFF, i.e. the last two code
points of each of planes 15 and 16.
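
For anyone who wants to check the arithmetic, here is a minimal sketch in
Python. It pairs every private-use high surrogate with every low surrogate,
applies the standard UTF-16 decoding formula, and counts the results. The
names (decode_pair, is_plane_final_noncharacter, and so on) are made up for
illustration and come from neither the Standard nor any library, and the
check covers only the last two code points of a plane, which is all that
matters for this range.

```python
# Illustrative names only; none of these identifiers come from the Standard.
HIGH_PU_FIRST, HIGH_PU_LAST = 0xDB80, 0xDBFF   # 128 private-use high surrogates
LOW_FIRST, LOW_LAST = 0xDC00, 0xDFFF           # 1,024 low surrogates


def decode_pair(high, low):
    """Combine a UTF-16 surrogate pair into the code point it represents."""
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


def is_plane_final_noncharacter(cp):
    """True for the last two code points of a plane (xxFFFE and xxFFFF)."""
    return (cp & 0xFFFE) == 0xFFFE


# Every code point reachable through a private-use high surrogate.
code_points = [
    decode_pair(high, low)
    for high in range(HIGH_PU_FIRST, HIGH_PU_LAST + 1)
    for low in range(LOW_FIRST, LOW_LAST + 1)
]

# The ones that are excluded from private use.
excluded = [cp for cp in code_points if is_plane_final_noncharacter(cp)]
usable = len(code_points) - len(excluded)

print(f"first: U+{code_points[0]:X}, last: U+{code_points[-1]:X}")
print("pairs:", len(code_points))
print("excluded:", ", ".join(f"U+{cp:X}" for cp in excluded))
print("usable:", usable)
```

The pairs span U+F0000 to U+10FFFF, so 128 x 1024 = 131,072 code points,
less the four listed above, gives the 131,068 quoted from page 322.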

And don't bother asking for a good reason *why* they are excluded: there
isn't any good reason why; they just are.

- Peter

---------------------------------------------------------------------------
Peter Constable

Non-Roman Script Initiative, SIL International
7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
Tel: +1 972 708 7485
E-mail: <peter_constable@sil.org>


