Yes, I think that's what we (Netscape) did with our "to Unicode"
converters. In our "from Unicode" converters, we typically use '?'.

Markus Kuhn wrote:
> If I implement a UTF-8 -> UCS-2 converter, what shall I do with
> malformed UTF-8 sequences? ISO 10646-1 in section 2.3c and section R.7
> clearly requires that malformed UTF-8 sequences are indicated to the
> user. Is replacing any malformed UTF-8 sequence by 0xFFFD appropriate
> use of this character?
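
For illustration only, here is a minimal Python sketch of the two conventions mentioned in this thread: substituting U+FFFD for malformed input when converting to Unicode, and substituting '?' for unrepresentable characters when converting from Unicode. It is not the Netscape code. The function names `utf8_to_ucs2` and `from_unicode` are hypothetical, and the choice to map characters outside the BMP to U+FFFD is an assumption made here because UCS-2 cannot represent them. How many U+FFFD characters a longer malformed sequence produces depends on the decoder.

```python
def utf8_to_ucs2(data: bytes) -> bytes:
    """Convert UTF-8 bytes to big-endian UCS-2 (hypothetical helper).

    Malformed UTF-8 is replaced by U+FFFD via Python's built-in
    "replace" error handler.
    """
    text = data.decode("utf-8", errors="replace")
    # Assumption: UCS-2 has no surrogate pairs, so characters above
    # U+FFFF are also mapped to U+FFFD rather than encoded as pairs.
    text = "".join(ch if ord(ch) <= 0xFFFF else "\ufffd" for ch in text)
    return text.encode("utf-16-be")


def from_unicode(text: str, charset: str = "iso-8859-1") -> bytes:
    """Convert Unicode text to a legacy charset (hypothetical helper).

    Characters the target charset cannot represent become '?', which is
    what Python's "replace" error handler emits when encoding.
    """
    return text.encode(charset, errors="replace")


if __name__ == "__main__":
    # 0xFF can never appear in well-formed UTF-8.
    print(utf8_to_ucs2(b"A\xffB").hex())
    # The euro sign is not in ISO 8859-1.
    print(from_unicode("a\u20acb"))
```

A converter that must also signal the error to the user, as the quoted question asks, could decode with `errors="strict"` first and fall back to the replacing path only after reporting the `UnicodeDecodeError`.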