Re: Utility to report and repair broken surrogate pairs in UTF-16 text

From: Markus Scherer (markus.icu@gmail.com)
Date: Fri Nov 05 2010 - 15:42:53 CST

Next message: Doug Ewell: "Re: Utility to report and repair broken surrogate pairs in UTF-16 text"

Previous message: Doug Ewell: "RE: Utility to report and repair broken surrogate pairs in UTF-16 text"
In reply to: Doug Ewell: "RE: Utility to report and repair broken surrogate pairs in UTF-16 text"
Next in thread: Doug Ewell: "Re: Utility to report and repair broken surrogate pairs in UTF-16 text"
Reply: Doug Ewell: "Re: Utility to report and repair broken surrogate pairs in UTF-16 text"

On Fri, Nov 5, 2010 at 1:56 PM, Doug Ewell <doug@ewellic.org> wrote:

> Right, but as I said, those downstream tasks shouldn't be consumers of
> UTF-16 code units anyway. They should be consumers of Unicode code
> points, which by definition excludes loose surrogates.
>

Code points include surrogates. Maybe you mean "UTF-32 code units" or
"Unicode scalar values".
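
To make the distinction concrete, here is a minimal Python sketch. The helper names are hypothetical and come from no library; only the numeric ranges are taken from the Unicode Standard. It assumes Python 3, where a str is a sequence of code points and can therefore hold a loose surrogate.

```python
# Ranges fixed by the Unicode Standard.
MAX_CODE_POINT = 0x10FFFF
SURROGATE_MIN = 0xD800
SURROGATE_MAX = 0xDFFF


def is_code_point(value: int) -> bool:
    """True for any integer in the Unicode codespace, surrogates included."""
    return 0 <= value <= MAX_CODE_POINT


def is_surrogate(value: int) -> bool:
    """True for the surrogate code points U+D800..U+DFFF."""
    return SURROGATE_MIN <= value <= SURROGATE_MAX


def is_scalar_value(value: int) -> bool:
    """True for code points that are not surrogates."""
    return is_code_point(value) and not is_surrogate(value)


def find_loose_surrogates(text: str) -> list:
    """Return the indices of surrogate code points in a Python str.

    Hypothetical helper for illustration: every surrogate found in a
    str is unpaired from the point of view of a code point consumer.
    """
    return [i for i, ch in enumerate(text) if is_surrogate(ord(ch))]
```

With these definitions, `is_code_point(0xD800)` holds while `is_scalar_value(0xD800)` does not, so a consumer that wants to exclude loose surrogates has to ask for scalar values rather than code points.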

markus

