Re: BOM as WJ?

From: Philippe Verdy (verdy_p@wanadoo.fr)
Date: Wed Nov 19 2003 - 18:18:15 EST

  • Next message: Frank Yung-Fong Tang: "Re: What does i18n mean?"

    From: "Peter Kirk" <peterkirk@qaya.org>
    > Does this equivalence hold when combining characters are applied to the
    > NBSP? Is the sequence <NBSP, CC> (recommended for spacing diacritics,
    > where CC is any sequence of combining characters) equivalent to <ZWNBS,
    > SP, ZWNBS, CC>? Or should the equivalence be to <ZWNBS, SP, CC, ZWNBS>?
    > Is it legal to combine combining characters with ZWNBS, or WJ, and how
    > should this be rendered?

    This is not an equivalence: despite <NBSP> should be treated as if it was
        <WJ,SP,WJ>
    when it is found isolately, this does not apply when it is followed by a
    combining character (CC). So, <NBSP,CC> must not be treated as if it was:
        <WJ,SP,WJ,CC>
    but really rather as:
        <WJ,SP,CC,WJ>
    Note here the inversion. Note also that all these sequences are NOT
    canonically equivalent, meaning that it is impossible to define a formal
    equivalence between <NBSP> and <ZW,SP,WJ>.



    This archive was generated by hypermail 2.1.5 : Wed Nov 19 2003 - 19:28:58 EST