RE: Japanese EUC and Shift-JIS text samples?

From: Addison Phillips (
Date: Mon Oct 04 1999 - 12:28:42 EDT

Don't forget Host DBCS.


        Addison Phillips
        Director, Globalization Engineering
        SimulTrans, L.L.C.
        2606 Bayshore Parkway
        Mountain View, California 94043 USA

        +1 650-526-4652 (direct telephone)
        +1 650-969-9959 (facsimile) (Internet email) (website)

        "22 languages. One release date."

-----Original Message-----
From: Frank da Cruz []
Sent: Monday, October 04, 1999 9:17 AM
To: Unicode List
Subject: Japanese EUC and Shift-JIS text samples?

Does anybody have fairly large ftp-able samples of Shift-JIS
(Code Page 982) plain text containing a "typical" mixture of
halfwidth Roman, halfwidth Katakana, and Kanji? (Does anybody
have an idea what the typical mixture might be over a very
large sample of Japanese text?)

Same question for Japanese EUC.

And for that matter, also JIS-7.

As far as I know, these are the only three commonly-used
Japanese character sets (besides Unicode) that include both
single- and doublewidth characters.

(For working on conversion to/from Unicode, of course :-)

- Frank

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:53 EDT