RE: What does it mean to "not be a valid string in Unicode"?

From: Doug Ewell <doug_at_ewellic.org>
Date: Mon, 07 Jan 2013 11:48:08 -0700

Markus Scherer <markus dot icu at gmail dot com> wrote:

> Also, we commonly read code points from 16-bit Unicode strings, and
> unpaired surrogates are returned as themselves and treated as such
> (e.g., in collation). That would not be well-formed UTF-16, but it's
> generally harmless in text processing.

But still non-conformant.

--
Doug Ewell | Thornton, CO, USA
http://ewellic.org | @DougEwell ­
Received on Mon Jan 07 2013 - 12:50:28 CST

This archive was generated by hypermail 2.2.0 : Mon Jan 07 2013 - 12:50:29 CST