Re: Utility to report and repair broken surrogate pairs in UTF-16 text

From: Markus Scherer (markus.icu@gmail.com)
Date: Fri Nov 05 2010 - 15:42:53 CST

  • Next message: Doug Ewell: "Re: Utility to report and repair broken surrogate pairs in UTF-16 text"

    On Fri, Nov 5, 2010 at 1:56 PM, Doug Ewell <doug@ewellic.org> wrote:

    > Right, but as I said, those downstream tasks shouldn't be consumers of
    > UTF-16 code units anyway. They should be consumers of Unicode code
    > points, which by definition excludes loose surrogates.
    >

    Code points include surrogates. Maybe you mean "UTF-32 code units" or
    "Unicode scalar values".

    markus



    This archive was generated by hypermail 2.1.5 : Fri Nov 05 2010 - 15:44:54 CST