Re: Detecting encoding in Plain text

From: Katsuhiko Momoi (momoi@alumni.indiana.edu)
Date: Thu Jan 08 2004 - 21:04:01 EST

    Jungshik Shin wrote:

    >On Thu, 8 Jan 2004, Tex Texin wrote:
    >
    >>There were also papers on the subject at past unicode conferences.
    >>Look for one by Martin Duerst several years ago and one by Kat Momoi,
    >>Netscape only a few years back. I think both are on the web.
    >>
    >>Also look at the Netscape open source code. I believe it does some
    >>detection.
    >
    > It's mozilla (not netscape) :-). See
    >
    >http://lxr.mozilla.org/seamonkey/source/extensions/universalchardet/
    >http://lxr.mozilla.org/seamonkey/source/intl/chardet/
    >
    >Li and Momoi paper presented at the 19th IUC is available there.
    >
    The specific URL for our IUC 19 paper, which has an update note at
    the beginning, is:

    http://www.mozilla.org/projects/intl/UniversalCharsetDetection.html

    - Kat

    -- 
    Katsuhiko Momoi
    e-mail: katmomoi@pacbell.net
    

