Re: Invalid code points

From: Hans Aberg (haberg@math.su.se)
Date: Sun May 31 2009 - 14:45:04 CDT

    On 31 May 2009, at 19:42, Doug Ewell wrote:

    >> In particular, it would be great to know if the range U+0080, …, U+009F
    >> is invalid.
    >
    > That bit is especially wrong. I can at least imagine why there
    > might be confusion about the noncharacters and surrogate code
    > points, but not the C1 controls.

    It is a bit disappointing: I was looking for a leading (escape) byte
    sequence that would mark a string as not being UTF-8, to set it apart
    from valid UTF-8 strings.
    But perhaps it does not matter.
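
To illustrate the point under discussion, here is a minimal sketch in Python. It relies on Python's built-in strict "utf-8" codec; the names NEVER_IN_UTF8, is_valid_utf8 and has_non_utf8_marker are hypothetical and chosen only for this example. It shows that the C1 controls U+0080 to U+009F encode to ordinary two-byte sequences (C2 80 to C2 9F), so they cannot serve as a marker, whereas the bytes C0, C1 and F5 to FF never occur in well-formed UTF-8 and could in principle play that role.

```python
# Bytes that never appear in well-formed UTF-8: C0 and C1 would only start
# overlong forms, and F5..FF would encode values beyond U+10FFFF.
NEVER_IN_UTF8 = frozenset([0xC0, 0xC1]) | frozenset(range(0xF5, 0x100))


def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes under Python's strict UTF-8 codec."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def has_non_utf8_marker(data: bytes) -> bool:
    """Return True if data contains a byte that valid UTF-8 never uses."""
    return any(b in NEVER_IN_UTF8 for b in data)


# The C1 controls are valid code points and encode as C2 80 .. C2 9F.
c1_controls = "".join(chr(cp) for cp in range(0x80, 0xA0))
encoded = c1_controls.encode("utf-8")

print(encoded[:2].hex(), encoded[-2:].hex())
print(is_valid_utf8(encoded), has_non_utf8_marker(encoded))

# A string prefixed with 0xFF can never be mistaken for UTF-8.
tagged = b"\xff" + b"some legacy-encoded text"
print(is_valid_utf8(tagged), has_non_utf8_marker(tagged))
```

Whether a prefix byte such as 0xFF is a sensible marker is an assumption of this sketch, not something the thread settles; it only demonstrates which bytes a strict decoder will always reject.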

       Hans


