Re: Characters

From: Neil Harris (neil@tonal.clara.co.uk)
Date: Mon Feb 14 2011 - 04:37:13 CST

  • Next message: Charlie Ruland: "Re: Characters"

    William,

    I worry that you may be attempting to reinvent the wheel.

    For short strings, there are already carefully designed and standardized
    hand-tuned algorithms like SCSU.

    For long strings, general-purpose compression algorithms are typically
    very effective, and although these are complex, code for them is widely
    available under free licenses.

    All of this is covered in great detail by Doug Ewell in his Unicode
    Technical Note 14. which can be found at http://www.unicode.org/notes/tn14/

    -- Neil



    This archive was generated by hypermail 2.1.5 : Mon Feb 14 2011 - 04:40:54 CST