Re: printing IANA IDN tables

From: Neil Harris (neil@tonal.clara.co.uk)
Date: Fri Oct 13 2006 - 09:24:09 CST

  • Next message: Richard Wordingham: "Re: postal delivery efforts"

    JFC Morfin wrote:
    > From expeience, what would be the easiest way to generate PDF of the
    > different http://www.iana.org/assignments/idn/registered.htm tables
    > whch would display the concerned glyphs?
    > Thank you for the help.
    > jfc
    >
    >
    >
    >
    Get the files thus:

    wget -r -l1 http://www.iana.org/assignments/idn/registered.htm

    Now run this Python program in the directory with the downloaded HTML
    files in:

    -------------------------------
    import os, string, re

    filenames = [x for x in os.listdir(".") if ".html" in x]

    for filename in filenames:

        points = [int(string.join(x, ""), 16) for x in
                 re.findall(r"U\+([0-9A-Fa-f]+)|([0-9A-Fa-f]+)\(",
    open(filename).read())]
        points = {}.fromkeys(points).keys()
        points.sort()

        print "<h1>", string.replace(filename, ".html", ""), "</h1>"
        n = 0
        for p in points:
            print "&#%d;" % p
            n += 1
            if n % 40 == 0: print "<br>"

    -------------------------------

    and view the resulting output in your web browser, after installing CJK
    fonts, including both traditional and simplified Chinese. Print to PDF.

    -- Neil



    This archive was generated by hypermail 2.1.5 : Fri Oct 13 2006 - 09:42:47 CST