Re: An Aburdly Brief Introduction to Unicode (was Re: Perception ...)

From: Markus Scherer (markus.scherer@jtcsv.com)
Date: Thu Feb 22 2001 - 15:12:25 EST


Tom Lord wrote:
> Two code points represent non-characters. These are U+FFFE and
> U+FFFF. Programs are free to give these values special meaning
> internally.

Unicode (2.0 and up?) has 34 non-characters at U+xxFFFE and U+xxFFFF where xx is 00, 01, .., 0F, 10.
Unicode 3.1 is adding another 32 non-characters on the BMP. See UTR 27 for details.

markus



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:19 EDT