Re: ZWNJ in IDN (Burmese Issues)

From: Richard Wordingham (
Date: Wed Nov 23 2005 - 17:18:11 CST

  • Next message: Kenneth Whistler: "Re: How many characters?"

    Javier SOLA wrote:

    > The character for separation of words should be ZWSP (Zero Width Space),
    > and not ZWNJ. ZWNJ should normally be used for modification of rendering
    > of clusters.

    It so happens that a visible virama ('atha' in Burmese) often terminates a
    word. It doesn't always, because tone marks (some may not like the term)
    such as U+1038 MYANMAR SIGN VISARGA may follow. How appropriate would ZWSP
    be in the middle of words like 'Myanma(r)' and 'Yangon'? I believe there
    are unanalysable words where use of a subscript consonant (or kinzi) would
    be an inappropriate alternative, but I know far too little of Burmese to say
    for sure. A possible example might be THA, RA, NA, VIRAMA, ZWNJ, CA, CA,
    VIRAMA, CA, TA, AA 'transistor'.

    Moreover, Chapter 10 of the Unicode Standard (Version 4.x) gives U+1039,
    U+200C as the encoding of visible virama - perhaps it is an act of faith to
    assume U+1039 will be rendered as 'atha' unless immediately followed by a


    This archive was generated by hypermail 2.1.5 : Wed Nov 23 2005 - 17:19:32 CST