Re: Surrogate pairs and UTF-8

From: Otto Stolz (Otto.Stolz@uni-konstanz.de)
Date: Mon Jun 26 2006 - 04:18:50 CDT

Next message: Erkki Kolehmainen: "Re: Finnegans Wake, was Re: comment on L2/06-215"

Previous message: Richard Cook: "Re: Finnegans Wake, was Re: comment on L2/06-215"
In reply to: Peter Constable: "RE: Surrogate pairs and UTF-8"
Next in thread: Peter Constable: "RE: Surrogate pairs and UTF-8"
Reply: Peter Constable: "RE: Surrogate pairs and UTF-8"

Peter Constable wrote:
> UTF-16 Surrogate Pairs are basically doing the same
> thing that multi-byte sequences in UTF-8 do
...
> They mainly differ only in details.

One essential detail is that the UTF-16 surrogates (D800..DFFF) are
excluded from the valid Unicode code points, while the UTF-8
"surrogates", i.e. the bytes of a multi-byte sequence (80..FF), have
binary values that are also valid Unicode code points.
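
A minimal Python sketch of this difference, reading "valid code point"
as "Unicode scalar value" (that reading is an assumption, as are the
helper names utf16_units and is_scalar_value, which are made up for
the illustration). It encodes U+10000 both ways and checks whether the
numeric value of each code unit could itself stand for a character:

```python
def utf16_units(ch: str) -> list:
    """Return the 16-bit code units of the UTF-16 encoding of ch."""
    data = ch.encode("utf-16-be")  # big-endian, no byte order mark
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


def is_scalar_value(n: int) -> bool:
    """True if n is a Unicode scalar value, i.e. in range and not a surrogate.

    Assumption: this is what "valid code point" means in the message above.
    """
    return 0 <= n <= 0x10FFFF and not (0xD800 <= n <= 0xDFFF)


ch = "\U00010000"                      # first code point beyond the BMP
units = utf16_units(ch)                # UTF-16: one surrogate pair
utf8_bytes = list(ch.encode("utf-8"))  # UTF-8: one four-byte sequence

# The UTF-16 code units fall in the reserved surrogate range ...
print([hex(u) for u in units], [is_scalar_value(u) for u in units])
# ... while each UTF-8 byte value coincides with a code point in U+0080..U+00FF.
print([hex(b) for b in utf8_bytes], [is_scalar_value(b) for b in utf8_bytes])
```

For U+10000 the UTF-16 units are D800 DC00, neither of which is a
scalar value, whereas the UTF-8 bytes F0 90 80 80 are all numerically
equal to ordinary code points.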

Best wishes,
Otto Stolz

This archive was generated by hypermail 2.1.5 : Mon Jun 26 2006 - 04:41:56 CDT