From: Neil Harris (firstname.lastname@example.org)
Date: Mon Feb 14 2011 - 04:37:13 CST
I worry that you may be attempting to reinvent the wheel.
For short strings, there are already carefully designed and standardized
hand-tuned algorithms like SCSU.
For long strings, general-purpose compression algorithms are typically
very effective, and although these are complex, code for them is widely
available under free licenses.
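
As a rough illustration of the long-string versus short-string point, here
is a minimal sketch using Python's standard zlib module (DEFLATE) as the
general-purpose compressor. The helper names and the sample strings are
made up for this example, and zlib is only one of several freely available
choices; this is not meant as a recommendation of a particular library.

```python
import zlib


def compressed_ratio(text: str, encoding: str = "utf-8") -> float:
    """Return compressed size divided by raw size (hypothetical helper)."""
    raw = text.encode(encoding)
    packed = zlib.compress(raw, 9)  # 9 = maximum compression level
    return len(packed) / len(raw)


def roundtrip(text: str, encoding: str = "utf-8") -> str:
    """Compress and decompress, to confirm the process is lossless."""
    raw = text.encode(encoding)
    return zlib.decompress(zlib.compress(raw, 9)).decode(encoding)


# Illustrative inputs only: a long, repetitive text and a very short one.
long_text = "Юникод is a character encoding standard. " * 200
short_text = "Юникод"

print(f"long:  {compressed_ratio(long_text):.3f}")
print(f"short: {compressed_ratio(short_text):.3f}")
```

On the long input the ratio should come out well below 1, while on the
short input the fixed overhead of the zlib container makes the output
larger than the input. That overhead is the gap that schemes designed for
short strings, such as SCSU, are meant to fill.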
All of this is covered in great detail by Doug Ewell in his Unicode
Technical Note 14, which can be found at http://www.unicode.org/notes/tn14/