Re: What's the BMP being saved for?

From: John Jenkins
Date: Thu Mar 18 2004 - 11:00:42 EST

    On Mar 18, 2004, at 8:33 AM, Arcane Jill wrote:

    > This probably is going to sound like a really dumb question, but ...
    > I'm curious. Why are characters being assigned codepoints > U+FFFF,
    > when there is still loads and loads of unused empty space below that
    > point. Is the BMP being saved for something? Are codepoints < U+010000
    > reserved for something of which I am unaware? If so, what? If not,
    > why are assignments being made up there in the astral planes?

    Check the roadmaps <>.

    > By my calculations, the total number of currently existent Unicode
    > characters is < 0x10000, which means that

    your calculations are way off. Unicode 4.0 has over 96,000 characters.
    The Han repertoire alone is larger than 65,536. (See

    John H. Jenkins
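
    For anyone who wants to check the arithmetic in this exchange, here is
    a minimal sketch in Python. The names plane_of and fits_in_bmp are
    hypothetical helpers made up for illustration, and 96,000 is only the
    rounded figure quoted above, not an exact character count.

```python
# Plane 0 (the BMP) spans U+0000..U+FFFF, i.e. 0x10000 = 65,536 code points.
BMP_SIZE = 0x10000
MAX_CODE_POINT = 0x10FFFF  # last code point in the Unicode code space


def plane_of(code_point: int) -> int:
    """Return the plane number (0..16) that contains code_point."""
    if not 0 <= code_point <= MAX_CODE_POINT:
        raise ValueError(f"not a Unicode code point: {code_point:#x}")
    # Each plane holds 0x10000 code points, so the plane is the value above bit 16.
    return code_point >> 16


def fits_in_bmp(character_count: int) -> bool:
    """Upper-bound check: could this many characters fit in plane 0 at all?

    This is generous, because part of the BMP is set aside for surrogates
    and private use and cannot hold assigned characters.
    """
    return character_count <= BMP_SIZE


if __name__ == "__main__":
    print(plane_of(0xFFFF))     # last code point of the BMP
    print(plane_of(0x10000))    # first code point beyond the BMP
    print(fits_in_bmp(96_000))  # rounded Unicode 4.0 figure from the message
```

    Even with the generous upper bound, a repertoire of that size cannot
    fit in a single plane, which is the point being made above.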
