Re: Multi-lingual corpus?

From: Philippe Verdy (verdy_p@wanadoo.fr)
Date: Wed Aug 24 2005 - 13:07:56 CDT

    From: "Ken Krugler" <ken@transpac.com>
    > Hi all,
    >
    > Kevin Burton has created an open source language detector written in Java
    > (see http://www.feedblog.org/2005/08/ngram_language_.html)
    > and he's asking for contributions of sample data for additional languages.

    Besides his blog page and the existing SourceForge project name, he has not
    provided anything so far (there is no source and no demo available, not even
    an alpha version).
    I wonder whether it's a good idea to provide him with such data if he does not
    in fact want to publish anything (there may be legal issues with his source,
    notably if he used copyrighted materials such as the paper he cites).