Re: Stateful encoding mechanisms

From: Alexander Kh. (alexkh@writeme.com)
Date: Thu May 19 2005 - 21:42:06 CDT

  • Next message: Dean Snyder: "Re: Stateful encoding mechanisms"

    From: "Philippe Verdy" <verdy_p@wanadoo.fr>
    >
    > The BOM is NOT a character.

    Hm. Is that relevant in terms of statelessness?

    > In a UTF-16 encoding *scheme* the leading BOM is fully ignorable.
    > But in a UTF-16 encoding form, there's simply NO BOM and the
    > codepoint U+FEFF is legal and represents ZWNSP.
    >
    > You are mixing several levels in the Unicode character model.

    In this case let's split UTF-8 into several levels and put control
    characters on different level from codepoints. That will make that
    encoding stateless :-) The text will probably have to be stored in
    separate chunks starting with control sequence each, though.

    Best Reagrds.

    Alexander Kh

    -- 
    ___________________________________________________________
    Sign-up for Ads Free at Mail.com
    http://promo.mail.com/adsfreejump.htm
    


    This archive was generated by hypermail 2.1.5 : Thu May 19 2005 - 21:42:54 CDT