Re: Mixing UTF-8 and ISO 8859-1

From: Edward Cherlin (
Date: Wed Sep 01 1999 - 03:51:33 EDT

At 05:50 -0700 8/31/1999, Alain wrote:
>A 22:48 99-08-30 -0700, Doug Ewell a écrit :
>>The problem is not that it is impossible to write such a tool (it isn't)
>>but that it won't work 100 percent of the time. It is commonly pointed
>>out that a byte in the range [0xC0, 0xDF] followed by a byte in the
>>range [0x80, 0xBF] is unlikely to occur in Latin-1 text, but
>[Alain] Such an assumption would indeed be extremely fragile...
>Alain LaBonté

Could you stick a ZWNJ between them? Or a NUL?



Would there really be a problem if our questioner used Latin-1 for files
containing only Latin-1, and UTF-8 when he wanted to combine Chinese and
APL with Castilian and Catalonian? And mark the files to indicate the
different formats clearly?

Edward Cherlin
"It isn't what you don't know that hurts you, it's
what you know that ain't so."--Mark Twain, or else
some other prominent 19th century humorist and wit

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:51 EDT