At 05:50 -0700 8/31/1999, Alain wrote:
>A 22:48 99-08-30 -0700, Doug Ewell a écrit :
>>The problem is not that it is impossible to write such a tool (it isn't)
>>but that it won't work 100 percent of the time. It is commonly pointed
>>out that a byte in the range [0xC0, 0xDF] followed by a byte in the
>>range [0x80, 0xBF] is unlikely to occur in Latin-1 text, but
>[Alain] Such an assumption would indeed be extremely fragile...
Could you stick a ZWNJ between them? Or a NUL?
Would there really be a problem if our questioner used Latin-1 for files
containing only Latin-1, and UTF-8 when he wanted to combine Chinese and
APL with Castilian and Catalonian? And mark the files to indicate the
different formats clearly?
-- Edward Cherlin email@example.com "It isn't what you don't know that hurts you, it's what you know that ain't so."--Mark Twain, or else some other prominent 19th century humorist and wit
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:51 EDT