Re: UTF-8 stress test file?

From: D. Starner
Date: Tue Oct 12 2004 - 15:34:48 CST

    "Philippe Verdy" writes:
    > Examples of bad assumptions that a reader could make:
    > - [quote](...) Experience so far suggests
    > that most first-time authors of UTF-8 decoders find at least one
    > serious problem in their decoder by using this file.[/quote]
    > This suggests to the reader that if its browser or editor does not display
    > the contained test text as indicated, there's a problem in that application.

    If you're a reader, not an "author[...] of [a] UTF-8 decoder", then I don't
    see where that statement gives you cause to assume anything. It is indeed
    a bad assumption on the part of the reader.

    > So who's puzzling here? Not me! It's the content of the text itself.

    Funny; I've never been puzzled by the text of the document. It's obviously
    designed to test the edge cases and the failure cases of a UTF-8 decoder.
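
To make "edge cases and failure cases" concrete, here is a minimal sketch of the kind of check a decoder author might run. It is not taken from the test file itself: the helper name `is_well_formed` and the sample byte strings are illustrative assumptions, and Python's built-in strict UTF-8 codec stands in for the decoder under test.

```python
def is_well_formed(data: bytes) -> bool:
    """Return True if `data` decodes as strict UTF-8 (hypothetical helper)."""
    try:
        data.decode("utf-8", errors="strict")
        return True
    except UnicodeDecodeError:
        return False


# Edge cases: boundary values that a correct decoder should accept.
boundary_cases = [
    b"\xc2\x80",          # U+0080, smallest two-byte sequence
    b"\xe0\xa0\x80",      # U+0800, smallest three-byte sequence
    b"\xf4\x8f\xbf\xbf",  # U+10FFFF, largest code point
]

# Failure cases: malformed input that a correct decoder should reject.
malformed_cases = [
    b"\x80",              # lone continuation byte
    b"\xe2\x82",          # truncated three-byte sequence
    b"\xc0\xaf",          # overlong encoding of "/"
    b"\xed\xa0\x80",      # encoded surrogate U+D800
    b"\xf4\x90\x80\x80",  # value above U+10FFFF
]

for sample in boundary_cases:
    print(sample, "accepted" if is_well_formed(sample) else "REJECTED")
for sample in malformed_cases:
    print(sample, "rejected" if not is_well_formed(sample) else "ACCEPTED")
```

A decoder that accepts any of the malformed samples, or rejects any of the boundary samples, has exactly the sort of problem the file is meant to expose.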

    This archive was generated by hypermail 2.1.5 : Tue Oct 12 2004 - 15:35:47 CST