Re: cp 932 to UTF-8 conversion (for Java)

From: Adrian Havill (havill@turbolinux.co.jp)
Date: Mon Nov 15 1999 - 12:42:40 EST


iconv (Unix, GNU/Linux... comes with glibc 2.1+)

BasisTech's uniconv (Solaris, Win, HPUX, SunOS)
<URL:http://rosette.basistech.com/demo.html>

Plan 9's tcs (Unix, Plan-9)
<URL:ftp://plan9.bell-labs.com/plan9/unixsrc/>

and "ucconv" inside "fugu" (Windows, Unix, GNU/Linux)
<URL:ftp://www.turbolinux.co.jp/pub/fugu/>

should all convert CP932 to UTF-8 easily, respecting half-width
katakana. "ucconv" has an "native2ascii" compatibility mode,
and can input/output Java \u notation (including the multi-u
notation) which is compatible with Java resource files you mentioned.

On Mon, 15 Nov 1999, Peck, Jon wrote:
> Surprisingly, I haven't been able quickly to locate a (preferably batch)
> converter that will do this, but surely there must be many. Can anyone
> point me to a tool with this capability? I'd certainly prefer not to write
> one.



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:56 EDT