Re: Arabic Joining Classes

From: Rick McGowan (rick@unicode.org)
Date: Fri Jun 03 2005 - 12:08:44 CDT

  • Next message: Chris Jacobs: "Re: UTF-8 text files"

    On 02 June, Andreas Priolop asked about joining classes:

    > In http://www.unicode.org/Public/UNIDATA/ArabicShaping.txt
    > however, the trailing characters U+0649
    > http://ppewww.ph.gla.ac.uk/~flavell/unicode/unidata06.html#x0649
    > and U+06BA
    > http://ppewww.ph.gla.ac.uk/~flavell/unicode/unidata06.html#x06BA
    > are classified as dual-joining.
    > Why?

    I posted this question to the Bidi list and got 2 answers...

    1. Paul Nelson:

    U+0649 (ALEF MAKSURA) is used with Uighur in the middle of words.
    Basically, this is a tooth seat without dots. If you only look at this from
    an Arabic language point of view, then right joining would correct.
    Unicode is not an encoding for only Arabic language and therefore the ALEF
    MAKSURA is not only an Arabic language character.

    U+06BA - There is an initial and medial form of the NOON GHUNNA. So, it
    needs to be dual joining.

    The fact that he web pages may not display properly on some versions of
    fonts, or with some browsers or operating systems does not make Unicode's
    classification incorrect. There were defective implementations.

    2. Thomas Milo:

    U+0649 (ALEF MAKSURA) is used in the Arabic Qur'an in the middle of words.

    Hope that's useful.

            Rick



    This archive was generated by hypermail 2.1.5 : Fri Jun 03 2005 - 12:10:15 CDT