Re: Surrogate pairs and UTF-8

From: Otto Stolz
Date: Mon Jun 26 2006 - 04:18:50 CDT

    Peter Constable wrote:
    > UTF-16 Surrogate Pairs are basically doing the same
    > thing that multi-byte sequences in UTF-8 do.
    > They mainly differ only in details.

    One essential detail is that UTF-16 surrogates (D800..DFFF) are
    excluded from the valid Unicode codepoints, while the bytes of UTF-8
    multi-byte sequences, the UTF-8 "surrogates", have binary values
    that are also valid Unicode codepoints.
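
To make the difference concrete, here is a minimal Python sketch. The helper names (is_surrogate, utf16_units, can_stand_alone) and the choice of U+1D11E as the sample character are assumptions made for illustration, not anything from the thread. It encodes one supplementary character both ways, then checks whether each resulting code unit, taken as a number, could itself be encoded as a character.

```python
# Illustrative sketch only; the helper names below are hypothetical.

def is_surrogate(value: int) -> bool:
    """True if value lies in the UTF-16 surrogate range D800..DFFF."""
    return 0xD800 <= value <= 0xDFFF


def utf16_units(text: str) -> list:
    """Return the 16-bit code units of text in UTF-16 (big-endian, no BOM)."""
    data = text.encode("utf-16-be")
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


def can_stand_alone(value: int) -> bool:
    """True if the code point with this numeric value can be encoded as UTF-8.

    Python refuses to encode lone surrogates, so this returns False for
    D800..DFFF and True for other code points.
    """
    try:
        chr(value).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


clef = "\U0001D11E"                  # MUSICAL SYMBOL G CLEF, outside the BMP
units = utf16_units(clef)            # the surrogate pair
octets = list(clef.encode("utf-8"))  # the four-byte UTF-8 sequence

print([f"{u:04X}" for u in units])          # ['D834', 'DD1E']
print([f"{b:02X}" for b in octets])         # ['F0', '9D', '84', '9E']
print([can_stand_alone(u) for u in units])  # [False, False]
print([can_stand_alone(b) for b in octets]) # [True, True, True, True]
```

The two UTF-16 units fall in the reserved range and cannot be mistaken for characters, whereas each UTF-8 byte value (F0, 9D, 84, 9E) coincides with an ordinary codepoint in the range 0080..00FF, so a decoder needs the encoding context to tell them apart.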

    Best wishes,
       Otto Stolz
