Re: Factor implements 24-bit string type for Unicode support

From: Hans Aberg (
Date: Mon Feb 04 2008 - 07:49:15 CST

    On 3 Feb 2008, at 22:45, Jeroen Ruigrok van der Werven wrote:

    > Personally I'd wonder about this. I can understand the desire to
    > shave bytes off in-memory, but given a lot of platforms having
    > issues with non-32 bit boundaries and the resulting performance or
    > alignment issues I seriously wonder if it is worth the trade off of
    > not just using UCS4 internally.

    I think that 32-bit is probably best for internal use in programs for
    speed, avoiding alignment problems; the best way to actually know is
    to do some profiling. Externally, for distributed files, UTF-8 seems
    best, because most agree on how to sort out the bits in the bytes.
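
    To make the size side of the trade-off concrete, here is a minimal
    sketch in Python. The helpers pack24 and get24 are hypothetical names
    for one possible 3-bytes-per-code-point layout; they are an assumption
    for illustration and not Factor's actual implementation. The sample
    string is also just an example.

```python
# Sketch only: compares storage for a hypothetical 24-bit packed layout,
# UTF-32 (fixed 4 bytes per code point), and UTF-8 (variable width).

def pack24(code_points):
    """Pack each code point into 3 little-endian bytes (assumed layout)."""
    out = bytearray()
    for cp in code_points:
        out += cp.to_bytes(3, "little")
    return bytes(out)

def get24(buf, index):
    """Read the code point at position index from a pack24 buffer."""
    start = 3 * index
    return int.from_bytes(buf[start:start + 3], "little")

text = "\u00c5berg \U0001D11E"          # 7 code points, one outside the BMP
code_points = [ord(ch) for ch in text]

packed24 = pack24(code_points)          # 3 bytes per code point
utf32_le = text.encode("utf-32-le")     # 4 bytes per code point
utf32_be = text.encode("utf-32-be")     # same data, other byte order
utf8 = text.encode("utf-8")             # 1 to 4 bytes per code point

if __name__ == "__main__":
    print(len(packed24), len(utf32_le), len(utf8))
```

    Note that the two UTF-32 byte strings differ while UTF-8 has a single
    byte sequence for the same text, which is the sense in which it is
    easier to agree on for files. The sketch says nothing about speed:
    whether unaligned 3-byte reads cost more than aligned 4-byte reads
    has to be measured on the target platform and language.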

       Hans Åberg
