BOM as WJ?

From: Pim Blokland (pblokland@planet.nl)
Date: Wed Nov 19 2003 - 04:49:17 EST

  • Next message: Philippe Verdy: "Re: Ternary search trees for Unicode dictionaries"

    In the online 4.0 book, chapter 15

    http://www.unicode.org/versions/Unicode4.0.0/ch15.pdf

    the definition for Word Joiner says:

    > Until Unicode 3.1.1, U+FEFF was the only code point with word
    > joining semantics, but because it is more commonly used as
    > byte order mark, the use of U+2060 [word joiner] to indicate
    > word joining is strongly preferred for any new text.

    However, a couple of paragraphs up, the definition for No-Break
    Space says:

    > U+00A0 [No-Break Space] behaves like the following coded
    > character sequence: U+FEFF [Zero Width No-Break Space] +
    > U+0020 [Space] + U+FEFF [Zero Width No-Break Space].

    Is this something that has slipped by the editors? Or am I missing
    something?

    Pim Blokland



    This archive was generated by hypermail 2.1.5 : Wed Nov 19 2003 - 05:34:22 EST