Re: TR29 Word Break awkwardness

From: Andy Heninger (andyh@jtcsv.com)
Date: Tue Sep 14 2004 - 16:44:34 CDT

    Peter Kirk wrote:
    > I have in mind certain situations found in Hebrew (Ketiv/Qere blended
    > forms) in which anomalous (but quite frequently found) word forms
    > begin with a spacing combining character. The currently specified way
    > of supporting this situation is to use SPACE or NBSP followed by the
    > combining character (as these combining characters do not have
    > non-spacing clones). It would be highly undesirable to make a change
    > here which would allow word breaks, line breaks etc. after the
    > combining character but before the rest of the word.

    The proposed change to word boundaries would have no effect on the case
    you describe, but word boundaries may already not be doing what you
    want. If a SPACE or NBSP precedes the combining character, the grapheme
    cluster formed by the space plus the combining character behaves as
    just a space, and so is split off from the remainder of the word.
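
    A rough illustration of that behavior, as a sketch only: the helper
    simple_clusters below is hypothetical and merely glues nonspacing marks
    (general category Mn) onto the preceding character, which is a
    simplification of the TR29 grapheme cluster rules rather than an
    implementation of them. The sample string is made up for the example.

```python
import unicodedata

def simple_clusters(text):
    """Group each character with the nonspacing marks (Mn) that follow it.

    Simplified stand-in for TR29 grapheme clusters; not the full rule set.
    """
    clusters = []
    for ch in text:
        if clusters and unicodedata.category(ch) == "Mn":
            clusters[-1] += ch   # the mark joins whatever precedes it
        else:
            clusters.append(ch)  # any other character starts a new cluster
    return clusters

# Hypothetical sample: ALEF, SPACE, SHEVA (U+05B0), BET, GIMEL.
sample = "\u05D0 \u05B0\u05D1\u05D2"
clusters = simple_clusters(sample)

# The sheva lands in the cluster that starts with SPACE, so a word breaker
# that classifies a cluster by its first character sees a space here, not
# the start of the following word.
base_categories = [unicodedata.category(c[0]) for c in clusters]
```

    Here the second cluster is SPACE plus SHEVA and its base category is
    Zs (space separator), while the letters that follow form their own
    clusters.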

    I found 16 Hebrew characters that would be affected by the change:
    \u05B0 HEBREW POINT SHEVA through \u05C2 HEBREW POINT SIN DOT, with a
    couple of holes in the middle of the range.
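
    One way to reproduce a count like this, as a sketch under stated
    assumptions: it assumes the affected characters are the nonspacing
    marks (general category Mn) in the range, which is a guess at the
    criterion and not something stated above. It uses Python's frozen
    Unicode 3.2 table (unicodedata.ucd_3_2_0) because later Unicode
    versions assigned U+05BA, one of the holes, so a current table would
    report one more mark.

```python
import unicodedata

# Frozen Unicode 3.2 character database shipped with Python's standard library.
ucd = unicodedata.ucd_3_2_0

first, last = 0x05B0, 0x05C2  # HEBREW POINT SHEVA .. HEBREW POINT SIN DOT

# Assumption: "affected" means general category Mn (nonspacing mark).
marks = [cp for cp in range(first, last + 1)
         if ucd.category(chr(cp)) == "Mn"]

# Everything else in the range is a hole: unassigned or punctuation.
holes = [cp for cp in range(first, last + 1) if cp not in marks]

for cp in marks:
    print(f"U+{cp:04X} {ucd.name(chr(cp), '<unnamed>')}")
print(len(marks), "marks; holes:", [f"U+{cp:04X}" for cp in holes])
```

    With the 3.2 table this gives 16 marks, and the holes are U+05BA
    (unassigned at the time), U+05BE HEBREW PUNCTUATION MAQAF and U+05C0
    HEBREW PUNCTUATION PASEQ.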

    To have these characters attach to a following word, an alphabetic base
    character is needed.

       -- Andy Heninger
          heninger@us.ibm.com


