Re: Unihan SQL access

From: John H. Jenkins (
Date: Sun Sep 12 2010 - 17:36:27 CDT

  • Next message: Rick McGowan: "PRI #172 (UTS #46) - review period closing"

    Got it. Yes, that would be no problem at all. Go for it.

    On Sep 12, 2010, at 4:23 PM, Uriah Eisenstein wrote:

    > OK, I should probably clarify: the program does not provide direct web access to Unihan in any way, nor even contain it. It rather expects the user to have downloaded, and direct the program to the location of that file, which it would then process. This shouldn't be more of a problem than having a local copy of, to the best of my understanding.
    > (It might also be possible to direct it to read directly from the Unicode site, but I'd be sure to ask permission before trying that out).
    > Uriah
    > On Mon, Sep 13, 2010 at 12:03 AM, John H. Jenkins <> wrote:
    > I'll raise the possibility with the appropriate individuals, but I think it likely that the Consortium would prefer that third parties not host clones of the Unihan database.
    > On Sep 12, 2010, at 9:57 AM, Uriah Eisenstein wrote:
    >> Hello,
    >> I'm nearing completion of a simple Java program which loads Unihan data from the source files into a DB, and provides SQL access to it.There's still at least a week or so of work on issues I consider essential, but once ready I'd be happy to make it available on the Internet if anyone's interested.
    >> So far I've used it to search for possibly erroneous data in Unihan; my latest find is that 73 characters have a kTaiwanTelegraph value of 0000, which seems doubtful. It can also be useful for various statistical information such as how many characters are listed under each radical, or which blocks include IICore characters.
    >> I'm also considering adding the contents of the Unicode Character Database as well at a later phase.
    >> Regards,
    >> Uriah Eisenstein
    > =====
    > Sin ap-Rhisiart
    > John H. Jenkins

    Hoani H. Tinikini
    John H. Jenkins

    This archive was generated by hypermail 2.1.5 : Sun Sep 12 2010 - 17:38:57 CDT