From: Uriah Eisenstein (email@example.com)
Date: Sun Sep 12 2010 - 17:23:59 CDT
OK, I should probably clarify: the program does not provide direct web
access to Unihan in any way, nor even contain it. It rather expects the user
to have downloaded Unihan.zip, and direct the program to the location of
that file, which it would then process. This shouldn't be more of a problem
than having a local copy of Unihan.zip, to the best of my understanding.
(It might also be possible to direct it to read Unihan.zip directly from the
Unicode site, but I'd be sure to ask permission before trying that out).
On Mon, Sep 13, 2010 at 12:03 AM, John H. Jenkins <firstname.lastname@example.org> wrote:
> I'll raise the possibility with the appropriate individuals, but I think it
> likely that the Consortium would prefer that third parties not host clones
> of the Unihan database.
> On Sep 12, 2010, at 9:57 AM, Uriah Eisenstein wrote:
> I'm nearing completion of a simple Java program which loads Unihan data
> from the source files into a DB, and provides SQL access to it.There's still
> at least a week or so of work on issues I consider essential, but once ready
> I'd be happy to make it available on the Internet if anyone's interested.
> So far I've used it to search for possibly erroneous data in Unihan; my
> latest find is that 73 characters have a kTaiwanTelegraph value of 0000,
> which seems doubtful. It can also be useful for various statistical
> information such as how many characters are listed under each radical, or
> which blocks include IICore characters.
> I'm also considering adding the contents of the Unicode Character Database
> as well at a later phase.
> Uriah Eisenstein
> Siôn ap-Rhisiart
> John H. Jenkins
This archive was generated by hypermail 2.1.5 : Sun Sep 12 2010 - 17:28:56 CDT