New Unicode Encoding/Compression: BOCU-1

From: Markus Scherer (markus.scherer@jtcsv.com)
Date: Wed Feb 06 2002 - 13:22:20 EST


Hello,

Mark Davis and I developed a concrete, MIME-friendly version of the BOCU algorithm that we presented earlier (http://oss.software.ibm.com/icu/docs/papers/binary_ordered_compression_for_unicode.html).

We have a summary and spec with sample code at http://oss.software.ibm.com/cvs/icu/~checkout~/icuhtml/design/conversion/bocu1/bocu1.html

     BOCU-1:
     A MIME-compatible application of the
     Binary Ordered Compression for Unicode base algorithm.

     "... BOCU-1 combines the wide applicability of UTF-8
     with the compactness of SCSU.
     It is useful for short strings and
     maintains code point order. ... stateful ..."

Feedback is welcome.

Best regards,
markus



This archive was generated by hypermail 2.1.2 : Wed Feb 06 2002 - 13:14:23 EST