We're looking for a multilingual search engine to use with our internal and
external web sites. The criteria I have in mind are those in Thierry Sourbier's
paper on "Keys to Building a Multlingual Search Engine":
- all text processing as normalized Unicode or UTF-8
- language-specific text parsing, in particular word breaking
- storage of indeces as Unicode or UTF-8
Does anyone know of an available search engine with these capabilities?
Failing this, what are some of the best search engines for monolingual searching
of a segmented web site -- one in which any query will be processing text in
a single language?
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:56 EDT