Re: writing Chinese dialects

From: Arne Götje (高盛華) (arne@linux.org.tw)
Date: Sun Feb 04 2007 - 17:32:39 CST

  • Next message: vunzndi@vfemail.net: "Re: writing Chinese dialects"

    On Sunday 04 February 2007 23:53, vunzndi@vfemail.net wrote:
    > For Extension B the best is Mr Taichi Kawabata's ids_irg.txt which
    > includes all the cjkv characters presently in unicode at
    >
    > <http://www.cse.cuhk.edu.hk/~irg/irg/irg25/IRGN1183A_ids_irg.txt.gz>
    >
    > I usually just grep it, sometimes
    >
    > $ grep AB ids_irg.txt
    >
    > but more often the "fuzzy"
    >
    > $ grep A ids_irg.txt | grep B
    >
    >
    > For, the very much smaller, and still to be fully passed Extension C,
    > there is my "very much a work in progress"
    > ExtensionC_decomposed.txt, which gives only the IRG numbers since the
    > characters are not yet official. I hope to update this very soon. For
    > this please goto
    > http://east-chr-data.cvs.sourceforge.net/east-chr-data/ExtensionC/dat
    >a/tables/ExtensionC_decomposed.txt?view=log and download the latest
    > version.
    >
    > Accordiing to this at least 7 characters from your missing list are
    > apparently in Extension C ( File attached).
    >
    > John Knightley

    Thanks very much, both of you. I think this will help a lot for finding
    more "missing" characters... :)

    John, may I help you to update your Ext. C file to use the "correct" IDS
    instead of "/" and "+" ? ;) I would send you a diff then...

    Cheers
    Arne

    -- 
    Arne Götje (高盛華) <arne@linux.org.tw>
    PGP/GnuPG key: 1024D/685D1E8C
    Fingerprint: 2056 F6B7 DEA8 B478 311F  1C34 6E9F D06E 685D 1E8C
    Key available at wwwkeys.pgp.net.   Encrypted e-mail preferred.
    
    




    This archive was generated by hypermail 2.1.5 : Sun Feb 04 2007 - 17:34:47 CST