Fri May 20 2005

    Dean Snyder <> writes:

    > If <0xD800 0xDF02> is interpreted differently than <0xD801 0xDF02>,
    > then the high surrogate is altering the interpretation of 0xDF02,
    > the low surrogate. I assert that that is stateful in the context of
    > discussing fragment fragility.

    It's a much more tractable kind of statefulness. Whatever the two
    kinds are called, they should be distinguished.

    In UTF-16, every boundary between characters has a corresponding
    boundary in the encoded text, so the text can be cut into fragments at
    those boundaries and the fragments physically put back together in a
    different order. This holds as far as surrogates are concerned (though
    not with respect to a BOM). The same applies to UTF-8, of course.
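
    As a rough illustration of the point about boundaries, here is a
    minimal Python sketch. It assumes well-formed UTF-16 without a BOM;
    the helper names (to_units, from_units, char_boundaries) are made up
    for this example. A character starts at every code unit that is not a
    low surrogate, so the boundaries can be found by looking at each unit
    on its own, and the resulting fragments still decode after being
    rearranged.

```python
def is_low_surrogate(unit):
    # Low (trailing) surrogates occupy 0xDC00..0xDFFF.
    return 0xDC00 <= unit <= 0xDFFF

def to_units(text):
    # Encode without a BOM and split into 16-bit code units.
    data = text.encode("utf-16-le")
    return [int.from_bytes(data[i:i + 2], "little") for i in range(0, len(data), 2)]

def from_units(units):
    # Inverse of to_units: pack the code units and decode them.
    return b"".join(u.to_bytes(2, "little") for u in units).decode("utf-16-le")

def char_boundaries(units):
    # Assuming well-formed input, a character starts at every unit
    # that is not a low surrogate.
    return [i for i, u in enumerate(units) if not is_low_surrogate(u)]

# U+10302 is <0xD800 0xDF02> and U+10702 is <0xD801 0xDF02>,
# the two pairs from the quoted message.
text = "a\U00010302b\U00010702"
units = to_units(text)

starts = char_boundaries(units)
ends = starts[1:] + [len(units)]
fragments = [units[s:e] for s, e in zip(starts, ends)]

# Put the fragments back together in the opposite order.
reordered = [u for frag in reversed(fragments) for u in frag]
result = from_units(reordered)
```

    Each surrogate pair stays inside one fragment, so the high surrogate
    never ends up affecting anything outside its own character.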

    This is not true for ISO-2022.
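
    A small sketch of the contrast, using Python's "iso2022_jp" codec as
    one member of the ISO-2022 family (the choice of codec and of the
    sample text is mine, purely for illustration). The escape sequence at
    the front of the encoded text selects the character set for
    everything after it, so a fragment cut at a character boundary loses
    its meaning once it is separated from that escape sequence.

```python
# Two kanji, U+65E5 and U+672C, written with escapes to keep the source ASCII.
text = "\u65e5\u672c"

data = text.encode("iso2022_jp")

# The encoded text opens with a 3-byte escape sequence selecting the
# two-byte character set; each kanji then takes 2 bytes.
ESC_LEN = 3
head, tail = data[:ESC_LEN + 2], data[ESC_LEN + 2:]

# Decoded on its own, the tail starts in the default (ASCII) state,
# so the two bytes of the second kanji are read as ASCII characters.
tail_alone = tail.decode("iso2022_jp")

# The same cut in UTF-16 (no BOM) leaves the second character intact.
utf16_tail_alone = text.encode("utf-16-le")[2:].decode("utf-16-le")
```

    Here tail_alone no longer contains the second kanji, while
    utf16_tail_alone does.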

