Re: Sample CJK files...

From: Jake Morrison (Jacob.Morrison@cdc.com)
Date: Sun Mar 08 1998 - 20:44:18 EST

Next message: Mustafa Hasham: "Re: Sample CJK files..."
Previous message: Mustafa Hasham: "Sample CJK files..."
Maybe in reply to: Mustafa Hasham: "Sample CJK files..."
Next in thread: Mustafa Hasham: "Re: Sample CJK files..."
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

Mustafa,

An excellent source is the pages for the 10th International Unicode
Conference:
http://www.unicode.org/unicode/iuc10/languages.html

It also has the data in Unicode, so you can check your work.

Another option is to surf the home pages for the major Asian companies.

If you want lots of random text (sometimes very random :-), you can get
messages from Usenet news.

The tw.* hierarchy is from Taiwan
The hk.* hierarchy is from Hong Kong
The fj.* hierarchy is from Japan
The han.* hierarchy is from Korea

Regards,
Jake

On Sun, 8 Mar 1998, Mustafa Hasham wrote:

>
> Hi:
>
> As part of a project in a CS class, I intend to convert CJK encoded text
> files into Unicode. I am using Windows NT and program in Java. Does anyone
> out there know of any sample text files I can use? Any encoding scheme
> would be fine... Big5, Kanji, GB, etc.. I do not have access to an input
> editor.
>
> Thanks
>
> Mustafa
>
>

Next message: Mustafa Hasham: "Re: Sample CJK files..."
Previous message: Mustafa Hasham: "Sample CJK files..."
Maybe in reply to: Mustafa Hasham: "Sample CJK files..."
Next in thread: Mustafa Hasham: "Re: Sample CJK files..."
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:39 EDT