Re: Mixing UTF-8 and ISO 8859-1 (was: Normalization Form KC)

From: Alain LaBonté (alb@sct.gouv.qc.ca)
Date: Tue Aug 31 1999 - 08:53:27 EDT


At 22:48 99-08-30 -0700, Doug Ewell wrote:
>The problem is not that it is impossible to write such a tool (it isn't)
>but that it won't work 100 percent of the time. It is commonly pointed
>out that a byte in the range [0xC0, 0xDF] followed by a byte in the
>range [0x80, 0xBF] is unlikely to occur in Latin-1 text, but

[Alain] Such an assumption would indeed be extremely fragile...
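
To make the fragility concrete, here is a minimal Python sketch of the byte-pair check described in the quoted text. The function name looks_like_utf8_pair and the sample strings are illustrative assumptions, not anything from the original discussion, and the check covers only the two-byte pattern quoted above, not full UTF-8 validation.

```python
def looks_like_utf8_pair(data: bytes) -> bool:
    """Return True if any byte in [0xC0, 0xDF] is immediately followed
    by a byte in [0x80, 0xBF] (the pattern from the quoted message)."""
    return any(
        0xC0 <= lead <= 0xDF and 0x80 <= trail <= 0xBF
        for lead, trail in zip(data, data[1:])
    )


# "é" is 0xC3 0xA9 in UTF-8, so the pattern is present.
utf8_text = "Québec".encode("utf-8")

# "é" is the single byte 0xE9 in Latin-1, so the pattern is absent.
latin1_text = "Québec".encode("latin-1")

# Legitimate Latin-1 that still matches: "É" (0xC9) followed by "»" (0xBB).
tricky_latin1 = "«RÉSUMÉ»".encode("latin-1")

print(looks_like_utf8_pair(utf8_text))      # True
print(looks_like_utf8_pair(latin1_text))    # False
print(looks_like_utf8_pair(tricky_latin1))  # True (a false positive)
```

The last case is ordinary French text in Latin-1, an uppercase word closed by a guillemet, yet it matches the pattern. A tool relying on this check alone would treat that pair as UTF-8.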

Alain LaBonté
Québec



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:51 EDT