Re: ASCII control codes in sequences of multibyte character sets

From: Steffen <sdaoden_at_gmail.com>
Date: Mon, 02 Sep 2013 20:45:36 +0200

"Doug Ewell" <doug_at_ewellic.org> wrote:
 |How would you define the difference between multi-octet and
 |multi-byte?

hm, to me multi-octet is an encoding which uses a fixed amount of
octets (8-bit bytes) per character, e.g., UCS-2, UCS-4 etc.,
whereas a multi[-]byte character set is designed as a 8-bit
character set, but which may use multiple 8-bit bytes per
character, possibly even fixed.
I.e., in the end i think it comes out as "are embedded NUL octets
a regular part of the character set".

You're asking ... i'm sure there is an officially accepted
definition somewhere?

--steffen
Received on Mon Sep 02 2013 - 13:48:54 CDT

This archive was generated by hypermail 2.2.0 : Mon Sep 02 2013 - 13:48:56 CDT