Re: Unihan SQL access

From: Uriah Eisenstein (
Date: Sun Sep 12 2010 - 17:23:59 CDT

  • Next message: John H. Jenkins: "Re: Unihan SQL access"

    OK, I should probably clarify: the program does not provide direct web
    access to Unihan in any way, nor even contain it. It rather expects the user
    to have downloaded, and direct the program to the location of
    that file, which it would then process. This shouldn't be more of a problem
    than having a local copy of, to the best of my understanding.
    (It might also be possible to direct it to read directly from the
    Unicode site, but I'd be sure to ask permission before trying that out).


    On Mon, Sep 13, 2010 at 12:03 AM, John H. Jenkins <> wrote:

    > I'll raise the possibility with the appropriate individuals, but I think it
    > likely that the Consortium would prefer that third parties not host clones
    > of the Unihan database.
    > On Sep 12, 2010, at 9:57 AM, Uriah Eisenstein wrote:
    > Hello,
    > I'm nearing completion of a simple Java program which loads Unihan data
    > from the source files into a DB, and provides SQL access to it.There's still
    > at least a week or so of work on issues I consider essential, but once ready
    > I'd be happy to make it available on the Internet if anyone's interested.
    > So far I've used it to search for possibly erroneous data in Unihan; my
    > latest find is that 73 characters have a kTaiwanTelegraph value of 0000,
    > which seems doubtful. It can also be useful for various statistical
    > information such as how many characters are listed under each radical, or
    > which blocks include IICore characters.
    > I'm also considering adding the contents of the Unicode Character Database
    > as well at a later phase.
    > Regards,
    > Uriah Eisenstein
    > =====
    > SiƓn ap-Rhisiart
    > John H. Jenkins

    This archive was generated by hypermail 2.1.5 : Sun Sep 12 2010 - 17:28:56 CDT