RE: Japanese EUC and Shift-JIS text samples?

From: Addison Phillips (AddisonP@simultrans.com)
Date: Mon Oct 04 1999 - 12:28:42 EDT


Don't forget Host DBCS.

Addison
        __________________________________________

        Addison Phillips
        Director, Globalization Engineering
        SimulTrans, L.L.C.
        2606 Bayshore Parkway
        Mountain View, California 94043 USA

        +1 650-526-4652 (direct telephone)
        +1 650-969-9959 (facsimile)
        AddisonP@simultrans.com (Internet email)
        http://www.simultrans.com (website)

        "22 languages. One release date."
        __________________________________________

-----Original Message-----
From: Frank da Cruz [mailto:fdc@watsun.cc.columbia.edu]
Sent: Monday, October 04, 1999 9:17 AM
To: Unicode List
Subject: Japanese EUC and Shift-JIS text samples?

Does anybody have fairly large ftp-able samples of Shift-JIS
(Code Page 982) plain text containing a "typical" mixture of
halfwidth Roman, halfwidth Katakana, and Kanji? (Does anybody
have an idea what the typical mixture might be over a very
large sample of Japanese text?)

Same question for Japanese EUC.

And for that matter, also JIS-7.

As far as I know, these are the only three commonly-used
Japanese character sets (besides Unicode) that include both
single- and doublewidth characters.

(For working on conversion to/from Unicode, of course :-)

- Frank



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:53 EDT