Re: Unihan SQL access

From: John H. Jenkins (
Date: Sun Sep 12 2010 - 17:03:56 CDT

  • Next message: Uriah Eisenstein: "Re: Unihan SQL access"

    I'll raise the possibility with the appropriate individuals, but I think it likely that the Consortium would prefer that third parties not host clones of the Unihan database.

    On Sep 12, 2010, at 9:57 AM, Uriah Eisenstein wrote:

    > Hello,
    > I'm nearing completion of a simple Java program which loads Unihan data from the source files into a DB, and provides SQL access to it.There's still at least a week or so of work on issues I consider essential, but once ready I'd be happy to make it available on the Internet if anyone's interested.
    > So far I've used it to search for possibly erroneous data in Unihan; my latest find is that 73 characters have a kTaiwanTelegraph value of 0000, which seems doubtful. It can also be useful for various statistical information such as how many characters are listed under each radical, or which blocks include IICore characters.
    > I'm also considering adding the contents of the Unicode Character Database as well at a later phase.
    > Regards,
    > Uriah Eisenstein

    Sin ap-Rhisiart
    John H. Jenkins

    This archive was generated by hypermail 2.1.5 : Sun Sep 12 2010 - 17:10:20 CDT