Re: Handling of Surrogates

From: Sam Mason (
Date: Fri Apr 17 2009 - 06:57:31 CDT

  • Next message: Doug Ewell: "Re: Handling of Surrogates"

    On Fri, Apr 17, 2009 at 03:00:05AM +0200, Bjoern Hoehrmann wrote:
    > I think you would be better of doing it similar to Perl, which uses ex-
    > plicit delimiters for the value. This has a number of benefits: you can
    > parse it case-insensitively, there is no confusion if you, for example,
    > want U+AFFE followed by the literal "AFFE", there is no confusion as to
    > what the required length is (some formats allow only six digits), it is
    > extendable (with Perl you can also use character names, alias names,
    > etc.), and the answer to your question is more obvious. Perl generates a
    > warning if you specify surrogate code points.

    I suggested this as well, but with a slightly different syntax:


    people seemed to prefer the familiarity of the Python style (i.e. \u and


    This archive was generated by hypermail 2.1.5 : Fri Apr 17 2009 - 06:59:00 CDT