From: Neil Harris (neil@tonal.clara.co.uk)
Date: Fri Oct 13 2006 - 09:24:09 CST
JFC Morfin wrote:
> From experience, what would be the easiest way to generate PDF of the 
> different http://www.iana.org/assignments/idn/registered.htm tables 
> which would display the concerned glyphs?
> Thank you for the help.
> jfc
Get the files thus:
wget -r -l1 http://www.iana.org/assignments/idn/registered.htm
Now run this Python program in the directory that contains the downloaded 
HTML files:
-------------------------------
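import os, re
# Process every downloaded table whose name contains ".html"
filenames = [x for x in os.listdir(".") if ".html" in x]
for filename in filenames:
    # Read leniently: the tables may not all share one encoding
    with open(filename, encoding="utf-8", errors="replace") as f:
        text = f.read()
    # Code points appear as "U+XXXX" or as bare hex digits before "("
    matches = re.findall(r"U\+([0-9A-Fa-f]+)|([0-9A-Fa-f]+)\(", text)
    # Only one group is non-empty per match; join, parse as hex, dedupe, sort
    points = sorted(set(int("".join(m), 16) for m in matches))
    print("<h1>", filename.replace(".html", ""), "</h1>")
    for n, p in enumerate(points, 1):
        print("&#%d;" % p)  # numeric character reference for the glyph
        if n % 40 == 0: print("<br>")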
-------------------------------
Redirect the output to an HTML file and open it in your web browser, after 
installing CJK fonts that cover both traditional and simplified Chinese. 
Then print to PDF.
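If you want to check the extraction step before running it over all the tables, here is a minimal sketch of it as two functions, run on a made-up sample string. The names extract_code_points and render_section and the sample text are my own illustrations, not anything in the IANA files, and the check against 0x10FFFF is an extra guard I have assumed is useful. Note that the bare-hex alternative in the pattern can also catch ordinary words made of the letters a-f when they are followed by "(", so it is worth eyeballing the output.

```python
import re

# Same pattern as the program above: "U+XXXX", or bare hex digits before "(".
CODE_POINT = re.compile(r"U\+([0-9A-Fa-f]+)|([0-9A-Fa-f]+)\(")

def extract_code_points(text):
    """Return the sorted, de-duplicated code points mentioned in text."""
    found = set()
    for m in CODE_POINT.finditer(text):
        # Exactly one of the two groups is set for each match.
        value = int(m.group(1) or m.group(2), 16)
        if value <= 0x10FFFF:  # assumed guard: skip values outside Unicode
            found.add(value)
    return sorted(found)

def render_section(title, points, per_line=40):
    """Build an HTML fragment: a heading, then one character reference per point."""
    parts = ["<h1>%s</h1>" % title]
    for n, p in enumerate(points, 1):
        parts.append("&#%d;" % p)
        if n % per_line == 0:
            parts.append("<br>")  # line break after every per_line glyphs
    return "\n".join(parts)

# Hypothetical sample text, shaped loosely like a table entry list.
sample = "U+4E00 # one\nU+4E8C # two\n4E09(4E09) # three\nU+4E00 # repeated"
points = extract_code_points(sample)
print(render_section("sample", points))
```

The duplicate U+4E00 collapses to a single entry, and the points come out in code point order rather than in the order they appear in the file.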
-- Neil
This archive was generated by hypermail 2.1.5 : Fri Oct 13 2006 - 09:42:47 CST