Re: Roundtripping Solved

From: Marcin 'Qrczak' Kowalczyk (qrczak@knm.org.pl)
Date: Wed Dec 15 2004 - 06:53:43 CST

  • Next message: Marcin 'Qrczak' Kowalczyk: "Re: Roundtripping in Unicode"

    "Arcane Jill" <arcanejill@ramonsky.com> writes:

    > OBSERVATION - Requirement (4) is not met absolutely, however,
    > the probability of the UTF-8 encoding of this sequence occuring
    > "accidently" at an arbitrary offset in an arbitrary octet stream
    > is approximately one in 2^384;

    Assuming that the distribution of sequences of characters is uniform.
    But it's not! As soon as you start using this encoding somewhere,
    the probability of appearing of this sequence raises dramatically.
    If you convert UTF-8 -> UTF-32 using modified rules, and UTF-32 -> UTF-8
    using standard rules, then you get this sequence without waiting
    2^340 years.

    -- 
       __("<         Marcin Kowalczyk
       \__/       qrczak@knm.org.pl
        ^^     http://qrnik.knm.org.pl/~qrczak/
    


    This archive was generated by hypermail 2.1.5 : Wed Dec 15 2004 - 07:02:28 CST