Re: Ternary search trees for Unicode dictionaries

From: Jungshik Shin (jshin@mailaps.org)
Date: Wed Nov 19 2003 - 13:50:10 EST

  • Next message: Peter Kirk: "Re: Definitions"

    On Tue, 18 Nov 2003, Doug Ewell wrote:

    > That's not an absolute; it depends on the input text. In my experience,
    > SCSU usually does perform somewhat better than BOCU-1, but for some
    > scripts (e.g. Korean) the opposite often seems to be true.

      Just out of curiosity, which NF did you use for your
    uncompressed source Korean text, NFC or NFD when you got the above result?
    I guess I'll know in a week or so when your paper is out, but...

      Jungshik



    This archive was generated by hypermail 2.1.5 : Wed Nov 19 2003 - 14:45:02 EST