From: Doug Ewell (dewell@adelphia.net)
Date: Wed Jul 02 2003 - 11:28:02 EDT
Kent Karlsson <kentk at cs dot chalmers dot se> wrote:
>> Believe it or not, the IJ and ij digraphs *were* included for
>> compatibility with an 8-bit legacy character set (ISO 6937).
>
> 6937 is a multibyte encoding (one or two bytes per character).
> There are no combining characters at all in 6937, even though
> there is a common misunderstanding that there are, since the
> lead bytes are (almost) systematically assigned.
It's still an 8-bit character set. Characters are defined in terms of
8-bit code units; some use one, others use two. This is just like the
double-byte character sets used for CJK.
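
To make the code-unit point concrete, here is a minimal sketch of how a reader of 6937 data might split a byte string into characters. It assumes that bytes 0xC1 through 0xCF are the lead bytes of two-byte characters and that everything else is a one-byte character; the names LEAD_BYTES and split_characters are made up for illustration and do not come from any standard or library.

```python
# Hypothetical sketch: split an ISO 6937 byte string into characters,
# each made of one or two 8-bit code units.
# Assumption: bytes 0xC1..0xCF are lead bytes of two-byte characters;
# every other byte is a complete one-byte character.

LEAD_BYTES = range(0xC1, 0xD0)


def split_characters(data: bytes) -> list:
    """Return the list of one- or two-byte sequences, one per character."""
    chars = []
    i = 0
    while i < len(data):
        # A lead byte and the byte after it together encode ONE character.
        width = 2 if data[i] in LEAD_BYTES else 1
        if i + width > len(data):
            raise ValueError("truncated two-byte character at offset %d" % i)
        chars.append(data[i:i + width])
        i += width
    return chars
```

Under that assumption, split_characters(b"a\xc2e\xe6") yields three characters: the single byte for "a", the two-byte sequence 0xC2 0x65, and the single byte 0xE6. The two-byte sequence is one character, not a base letter plus a combining mark, which is the same segmentation logic a double-byte CJK decoder uses.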
-Doug Ewell
Fullerton, California
http://users.adelphia.net/~dewell/