Re: UTF-16 inside UTF-8

From: David E. Hollingsworth (deh@fastanimals.com)
Date: Tue Nov 04 2003 - 09:51:59 EST

Next message: Philippe Verdy: "[hebrew] Re: variation selectors for combining characters (was: Hebrew composition model, with cantillation marks)"

Previous message: Philippe Verdy: "Re: [hebrew] Re: Hebrew composition model, with cantillation marks"
Next in thread: Philippe Verdy: "Re: UTF-16 inside UTF-8"
Reply: Philippe Verdy: "Re: UTF-16 inside UTF-8"

I believe this is described pretty well in sections 3.8 & 3.9 (plus
conformance requirement C12b) of Unicode 4.0.

Surrogate pairs are for UTF-16 only. In UTF-8 & UTF-32, code unit
sequences that encode surrogates (pairs or otherwise) are ill-formed,
and conformant processes must treat them as erroneous.
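
To make that concrete, here is a minimal sketch in Python 3 (my choice of
language, nothing the standard prescribes). It assumes the strict built-in
"utf-8" and "utf-32-be" codecs are a fair stand-in for a conformant process;
the helper name is_well_formed is hypothetical. U+10400 serves as the sample
supplementary character: its UTF-16 form is the surrogate pair D801 DC00.

```python
def is_well_formed(data: bytes, encoding: str) -> bool:
    """Return True if the strict decoder for `encoding` accepts `data`."""
    try:
        data.decode(encoding)  # default error handler is "strict"
        return True
    except UnicodeDecodeError:
        return False


# U+10400 encoded directly as a four-byte UTF-8 sequence: well-formed.
DIRECT_UTF8 = b"\xf0\x90\x90\x80"

# The surrogate pair D801 DC00, each half encoded as a three-byte
# sequence ("UTF-16 inside UTF-8"): ill-formed as UTF-8.
PAIR_IN_UTF8 = b"\xed\xa0\x81\xed\xb0\x80"

# A lone high surrogate D800 as a three-byte sequence: also ill-formed.
LONE_IN_UTF8 = b"\xed\xa0\x80"

# The same code point and a surrogate as big-endian UTF-32 code units.
DIRECT_UTF32 = (0x10400).to_bytes(4, "big")
SURROGATE_IN_UTF32 = (0xD801).to_bytes(4, "big")

if __name__ == "__main__":
    samples = [
        ("utf-8", DIRECT_UTF8),
        ("utf-8", PAIR_IN_UTF8),
        ("utf-8", LONE_IN_UTF8),
        ("utf-32-be", DIRECT_UTF32),
        ("utf-32-be", SURROGATE_IN_UTF32),
    ]
    for encoding, data in samples:
        print(encoding, data.hex(), is_well_formed(data, encoding))
```

Only the two direct encodings of U+10400 should be accepted. A decoder that
quietly accepts the other three is not treating them as erroneous.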

--deh!

This archive was generated by hypermail 2.1.5 : Tue Nov 04 2003 - 10:58:57 EST