Re: Roundtripping Solved

From: Marcin 'Qrczak' Kowalczyk
Date: Wed Dec 15 2004 - 06:53:43 CST

    "Arcane Jill" writes:

    > OBSERVATION - Requirement (4) is not met absolutely, however,
    > the probability of the UTF-8 encoding of this sequence occurring
    > "accidentally" at an arbitrary offset in an arbitrary octet stream
    > is approximately one in 2^384;

    That assumes the distribution of character sequences is uniform.
    But it's not! As soon as you start using this encoding somewhere,
    the probability of this sequence appearing rises dramatically.
    If you convert UTF-8 -> UTF-32 using the modified rules, and then
    UTF-32 -> UTF-8 using the standard rules, you get this sequence
    without waiting 2^340 years.
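
    A minimal sketch of that effect, in Python. The marker below is a
    hypothetical two-code-point stand-in, not the sequence from the
    proposal, and the way the invalid byte is carried (0xE100 + byte) is
    likewise an assumption made only for illustration. The "modified"
    decoder is modelled as a custom codec error handler.

```python
import codecs

# Hypothetical escape marker (two Private Use code points). This is NOT the
# sequence from the proposal under discussion; it only stands in for it.
MARKER = "\uE000\uE001"
MARKER_UTF8 = MARKER.encode("utf-8")


def _escape_invalid(err):
    """Error handler: replace each undecodable byte with MARKER + one code point."""
    if not isinstance(err, UnicodeDecodeError):
        raise err
    bad = err.object[err.start:err.end]
    # 0xE100 + byte value is an arbitrary, hypothetical way to carry the byte.
    return "".join(MARKER + chr(0xE100 + b) for b in bad), err.end


codecs.register_error("marker_escape", _escape_invalid)

raw = b"abc\xff"                                    # not valid UTF-8
text = raw.decode("utf-8", errors="marker_escape")  # "modified" decoding
stream = text.encode("utf-8")                       # standard encoding

print(MARKER_UTF8 in raw)     # False
print(MARKER_UTF8 in stream)  # True
```

    Under these assumptions, a single invalid byte in the input is enough
    to put the marker's UTF-8 encoding into an ordinary octet stream, so
    its occurrence there is no longer a matter of chance.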

       Marcin Kowalczyk

