Re: Stateful encoding mechanisms

From: Alexander Kh. (
Date: Thu May 19 2005 - 21:42:06 CDT

    From: "Philippe Verdy" <>
    > The BOM is NOT a character.

    Hm. Is that relevant in terms of statelessness?

    > In a UTF-16 encoding *scheme* the leading BOM is fully ignorable.
    > But in a UTF-16 encoding form, there's simply NO BOM and the
    > codepoint U+FEFF is legal and represents ZWNBSP.
    > You are mixing several levels in the Unicode character model.

    In that case, let's split UTF-8 into several levels and put control
    characters on a different level from codepoints. That will make the
    encoding stateless :-) The text will probably have to be stored in
    separate chunks, each starting with a control sequence, though.
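
    For anyone following along, the scheme/form distinction quoted above
    can be seen with a small sketch. It uses Python's built-in codecs,
    which is purely my choice of illustration and not something from the
    thread; "utf-16-le" stands in here for the level where no BOM is
    interpreted, and the variable names are made up.

```python
# Bytes for a little-endian BOM (FF FE) followed by "A" in UTF-16LE.
data = b"\xff\xfe" + "A".encode("utf-16-le")

# "utf-16" treats the input as the BOM-aware encoding scheme:
# the leading BOM selects the byte order and is then dropped.
as_scheme = data.decode("utf-16")

# "utf-16-le" has a fixed byte order and does not interpret a BOM,
# so the same two bytes come back as the character U+FEFF.
as_le = data.decode("utf-16-le")

print(repr(as_scheme))               # 'A'
print([hex(ord(c)) for c in as_le])  # ['0xfeff', '0x41']
```

    The same bytes give one character in the first case and two in the
    second, which is the sense in which the BOM belongs to the scheme
    rather than to the text.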

    Best Regards.

    Alexander Kh

    This archive was generated by hypermail 2.1.5 : Thu May 19 2005 - 21:42:54 CDT