RE: Perception that Unicode is 16-bit (was: Re: Surrogate space i

From: Marco Cimarosti (
Date: Thu Feb 22 2001 - 05:56:47 EST

There was a discussion about finding a short correction for the "widespread
belief" that Unicode is a 16-bit character set containing 65536 characters.

Now I have noticed this statement by Roman Czyborra (taken from the last
paragraph of, and I found that it
is one of the most compact, precise, and understandable explanations that I
have seen so far:
        "Unicode [...] encodes all the world's characters in a 16bit space
and a 20bit extension zone for everything that did not fit into the 16bit
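
As a rough illustration of the arithmetic behind that description, here is a minimal Python sketch. It assumes the "20bit extension zone" means the code points U+10000..U+10FFFF that UTF-16 reaches through surrogate pairs. The constant and function names are made up for this example and do not come from the quoted text.

```python
# Sizes implied by the quoted description (names are illustrative only).
BMP_SIZE = 1 << 16          # the "16bit space": U+0000..U+FFFF
EXTENSION_SIZE = 1 << 20    # the "20bit extension zone": U+10000..U+10FFFF
TOTAL_CODE_POINTS = BMP_SIZE + EXTENSION_SIZE   # 0x110000 code points


def to_surrogate_pair(code_point):
    """Split a code point above U+FFFF into a UTF-16 (high, low) surrogate pair."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("code point is outside the extension zone")
    offset = code_point - 0x10000      # a 20-bit value
    high = 0xD800 + (offset >> 10)     # top 10 bits select the high surrogate
    low = 0xDC00 + (offset & 0x3FF)    # bottom 10 bits select the low surrogate
    return high, low
```

For instance, to_surrogate_pair(0x1D11E) gives (0xD834, 0xDD1E), so one character outside the 16-bit space takes two 16-bit code units. That is why the count of 65536 understates the size of the code space.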

_ Marco
