Re: Surrogate pairs and UTF-8

From: Mike Ayers (mayers@celequest.com)
Date: Wed Jun 21 2006 - 13:31:22 CDT

  • Next message: Mike Ayers: "Re: Surrogate pairs and UTF-8"

    Pavils Jurjans wrote:

    > - The guides on unicode.org <http://unicode.org/> site talk only about
    > surrogate pair and UTF-16 conversion. How about the UTF-8?

            Surrogates do not exist in UTF-8. They are the mechanism by which
    UCS-2 (which encodes 16 bits) was simultaneously restricted and extend
    to become UTF-16 (which encodes 21 bits). Surrogates are not
    characters. They are UTF-16 code points only.

            HTH,

    /|/|ike



    This archive was generated by hypermail 2.1.5 : Wed Jun 21 2006 - 14:00:18 CDT