Re: Detecting encoding in Plain text

From: D. Starner
Date: Thu Jan 08 2004 - 09:20:45 EST

    > Given any sizeable chunk of text, it ought to be possible to estimate
    > the statistical likelihood of its being in a certain
    > encoding/[language] even if it's in an unspecified 8859-* encoding.
    > It would be quite an interesting exercise, but I'd be surprised if
    > someone hasn't done it before. Perhaps someone here knows.

    [...] has a paper on the subject and an implementation in Perl.
    [...] has an alternate implementation in compiled code (called mguesser).
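
To make the idea in the quoted paragraph concrete, here is a minimal sketch of a statistical guesser. It is not the Perl implementation or mguesser mentioned above. It assumes a simple byte-bigram model with add-one smoothing, and the names SAMPLE, CANDIDATES, train, score and guess are made up for illustration. A real detector would need far more training text, and one model per language as well as per encoding.

```python
import math
from collections import Counter

# Hypothetical training text (lowercase Russian pangrams); a real system
# would use a large corpus per language.
SAMPLE = (
    "съешь же ещё этих мягких французских булок, да выпей чаю. "
    "в чащах юга жил бы цитрус? да, но фальшивый экземпляр. "
    "широкая электрификация южных губерний даст мощный толчок "
    "подъёму сельского хозяйства."
)

# Candidate single-byte Cyrillic encodings (standard Python codec names).
CANDIDATES = ["koi8_r", "cp1251", "iso8859_5"]


def bigrams(data: bytes) -> Counter:
    """Count overlapping two-byte sequences in data."""
    return Counter(data[i:i + 2] for i in range(len(data) - 1))


def train(text: str, encodings: list[str]) -> dict[str, Counter]:
    """Encode the same text in every candidate encoding and count its bigrams."""
    return {enc: bigrams(text.encode(enc)) for enc in encodings}


def score(data: bytes, model: Counter) -> float:
    """Log-likelihood of data under a bigram model with add-one smoothing."""
    total = sum(model.values())
    vocab = 256 * 256  # number of possible byte bigrams
    return sum(
        math.log((model[data[i:i + 2]] + 1) / (total + vocab))
        for i in range(len(data) - 1)
    )


def guess(data: bytes, models: dict[str, Counter]) -> str:
    """Return the encoding whose model gives data the highest likelihood."""
    return max(models, key=lambda enc: score(data, models[enc]))


models = train(SAMPLE, CANDIDATES)
unknown = "выпей чаю, да съешь булок".encode("koi8_r")
print(guess(unknown, models))
```

The sketch scores raw bytes rather than decoded characters, because every byte sequence decodes without error in most 8-bit encodings, so a failed decode cannot be used as a signal.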

    This archive was generated by hypermail 2.1.5 : Thu Jan 08 2004 - 10:04:13 EST