Properties of multibyte encodings

From: Nitsan Seniak (
Date: Tue Nov 25 1997 - 10:25:51 EST


I'm not sure this is the right place for posting this question --
but I don't know of any other mailing list talking of i18n issues.

I'm currently working on the internationalization of a product
for asian countries, especially Japan. For implementation reasons,
I'm considering only supporting multibyte encodings with the
following properties:

  1. They are a superset of ASCII, which means that a character starting

     with a byte in the range [0x00, 0x7F] is a one-byte ASCII
  2. They don't use shift states (ie, a multibyte character can always
     be interpreted independently of the ones which precede it.)

Does anybody knows if these restrictions are reasonable? I know that EUC

and SJIS are OK, and that JIS isn't; will not supporting JIS cut a big
part of the market? Thanks for any advice.

-- Nitsan Seniak


Nitsan Seniak net:
ILOG S.A. tel: +33 1 49 08 35 00
9, rue de Verdun - BP 85 fax: +33 1 49 08 35 10
94253 Gentilly Cedex <URL:>
France or <URL:>

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:38 EDT