RE: traditional vs simplified chinese

From: Marco Cimarosti (marco.cimarosti@essetre.it)
Date: Thu Feb 13 2003 - 14:12:35 EST

Next message: Edward H Trager: "Re: traditional vs simplified chinese"

Previous message: Rick Cameron: "RE: traditional vs simplified chinese"
Maybe in reply to: Paul Hastings: "traditional vs simplified chinese"
Next in thread: Rick Cameron: "RE: traditional vs simplified chinese"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

Paul wrote:
> To: Edward H Trager
> > Marco Cimarosti has questioned, why do you need to classify
> > text as being simplified or traditional?
>
> if i understand their needs correctly, its to implement a
> search system with search phrases of either "type" of
> chinese--content would be in both types.

Still, I don't see what's the purpose of "classifying" the user input. What
they really need is rather a special collation algorithm that *ignores* the
difference between corresponding traditional and simplified characters for
the purpose of searching. This is somewhat analogous to making a "caseless"
search.

The easiest way to do it is "folding" both the user's query and the content
being sought to the same form (either traditional or simplified, it doesn't
matter). It may also help to "fold" also other kinds of variants beside
simplified and traditional.

This "folding" is much easy that implementing a full-fledged
simplified<->traditional conversion (which needs to be context sensitive and
dictionary-driven), because the result is just in a temporary buffer used
for comparison, and no one is going to see it.

_ Marco

Next message: Edward H Trager: "Re: traditional vs simplified chinese"
Previous message: Rick Cameron: "RE: traditional vs simplified chinese"
Maybe in reply to: Paul Hastings: "traditional vs simplified chinese"
Next in thread: Rick Cameron: "RE: traditional vs simplified chinese"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.5 : Thu Feb 13 2003 - 15:52:34 EST