Re: Unicode Regular Expressions, Surrogate Points and UTF-8

From: Markus Scherer <markus.icu_at_gmail.com>
Date: Sat, 31 May 2014 19:24:09 -0700

On Sat, May 31, 2014 at 6:41 AM, Mark Davis ☕️ <mark_at_macchiato.com> wrote:

> I think you have a point here. We should probably change to:
>
> To meet this requirement, an implementation shall supply a mechanism for
> specifying any Unicode scalar value (from U+0000 to U+D7FF and U+E000 to
> U+10FFFF), using the hexadecimal code point representation.
>
> and then in the notes say that the same notation can be used for
> codepoints that are not scalar values, for implementation that handle them
> in Unicode strings.
>

This combination sounds good.
markus

_______________________________________________
Unicode mailing list
Unicode_at_unicode.org
http://unicode.org/mailman/listinfo/unicode
Received on Sat May 31 2014 - 21:25:09 CDT

This archive was generated by hypermail 2.2.0 : Sat May 31 2014 - 21:25:09 CDT