weird UTF-8 encoding in MS Exchange 2000 IM client

From: Eugene Mandel (kipodrach@hotmail.com)
Date: Thu May 15 2003 - 01:22:35 EDT

Next message: jarkko.hietaniemi@nokia.com: "RE: Decimal separator with more than one character?"

Previous message: Yael.Aharon@nokia.com: "RE: Unicode conformant character encodings and us-ascii"
Next in thread: Pim Blokland: "Re: weird UTF-8 encoding in MS Exchange 2000 IM client"
Reply: Pim Blokland: "Re: weird UTF-8 encoding in MS Exchange 2000 IM client"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

Microsoft Exchange 2000 Instant Messaging client claimes that message body's charset is UTF-8. However, it does not look like UTF-8 I'm familiar with. Every character encoding starts with 0xc3 and then is interlaced with 0xc2 bytes. So the encoding is twice the length of the usual UTF-8 encoded string. (e.g. Hebrew letter "alef" (05 D0) is "D7 90" in UTF-8 encoding, but "C3 97 C2 90" in this encoding).

Is it possible that there several flavors of UTF-8?
Please help ! :)

Thanks,
-Eugene

Next message: jarkko.hietaniemi@nokia.com: "RE: Decimal separator with more than one character?"
Previous message: Yael.Aharon@nokia.com: "RE: Unicode conformant character encodings and us-ascii"
Next in thread: Pim Blokland: "Re: weird UTF-8 encoding in MS Exchange 2000 IM client"
Reply: Pim Blokland: "Re: weird UTF-8 encoding in MS Exchange 2000 IM client"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.5 : Thu May 15 2003 - 02:06:59 EDT