RE: SQL version of the Unicode database?

From: Philippe Verdy (
Date: Sat Jul 07 2007 - 05:56:34 CDT

  • Next message: Brian Wilson: "RE: Phetsarat font, Lao unicode"

    You most probably don't need 13000 lines of C++ to load the UCD and other
    files into some SQL database, if you just use the database engine tools that
    can load a delimited text file into a table.
    So all you need to write is the script to create the schema, and possibly a
    small script to possibly command this table load.

    Once this is done, you can dump your database into a SQL script file. This
    does not even require writing any C++ application, just possibly a small SQL
    script that can be reused (a similar SQL script could generate your C/C++
    data source).

    For all that, you won't need more than about 200 lines of scripts, without
    even using a C++ compiler/linker.

    > -----Message d'origine-----
    > De : [] De la
    > part de Mike
    > Envoyé : vendredi 6 juillet 2007 21:40
    > À :
    > Objet : Re: SQL version of the Unicode database?
    > > The UTC has been working for some time now on an XML version
    > > of the Unicode Character Database.
    > I hope that the XML version will merely augment the UCD and
    > not replace all the .txt files. I've written a tool that
    > parses through all the .txt files and generates a .c file
    > implementing all of the properties. I would really hate to
    > have to rewrite it to parse XML. It's 13,000 lines of C++
    > as it is.
    > Mike

    This archive was generated by hypermail 2.1.5 : Sat Jul 07 2007 - 06:00:08 CDT