Re: Unicode 4.0 BETA available for review

From: Stefan Persson (alsjebegrijptwatikbedoel@yahoo.se)
Date: Thu Feb 27 2003 - 13:51:44 EST

Next message: Yung-Fong Tang: "Re: Unicode 4.0 BETA available for review"

Previous message: John Hudson: "Announcement: new font technology association to be formed"
In reply to: Kenneth Whistler: "Re: Unicode 4.0 BETA available for review"
Next in thread: Yung-Fong Tang: "Re: Unicode 4.0 BETA available for review"
Reply: Yung-Fong Tang: "Re: Unicode 4.0 BETA available for review"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

Kenneth Whistler wrote:

>Unicode 3.0 defined non-shorted UTF-8 as *irregular* code value
>sequences. There were two types:
>
> a. 0xC0 0x80 for U+0000 (instead of 0x00)
> b. 0xED 0xA0 0x80 0xED 0xB0 0x80 for U+10000 (instead of 0xF0 0x90 0x80 0x80)
>
>
Ah, but encoding NULL as a surrogate character and then encoding those
two surrogates as three bytes, making totally 6 bytes a character, would
also be technically possible (though not legal), right?

Stefan

_____________________________________________________
Gå före i kön och få din sajt värderad på nolltid med Yahoo! Express
Se mer på: http://se.docs.yahoo.com/info/express/help/index.html

Next message: Yung-Fong Tang: "Re: Unicode 4.0 BETA available for review"
Previous message: John Hudson: "Announcement: new font technology association to be formed"
In reply to: Kenneth Whistler: "Re: Unicode 4.0 BETA available for review"
Next in thread: Yung-Fong Tang: "Re: Unicode 4.0 BETA available for review"
Reply: Yung-Fong Tang: "Re: Unicode 4.0 BETA available for review"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.5 : Thu Feb 27 2003 - 14:34:49 EST