Re: Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

From: Hans Åberg via Unicode <unicode_at_unicode.org>
Date: Wed, 17 May 2017 23:21:54 +0200

> On 17 May 2017, at 23:18, Doug Ewell <doug_at_ewellic.org> wrote:
>
> Hans Åberg wrote:
>
>>> Far from solving the stated problem, it would introduce a new one:
>>> conversion from the "bad data" Unicode code points, currently
>>> well-defined, would become ambiguous.
>>
>> Actually not: just translate the invalid UTF-8 sequences into invalid
>> UTF-32.
>
> Far from solving the stated problem, it would introduce TWO new ones...

There is no good solution to the problem of illegal UTF-8 sequences, as the intent of those is not known.
Received on Wed May 17 2017 - 16:22:12 CDT

This archive was generated by hypermail 2.2.0 : Wed May 17 2017 - 16:22:12 CDT