Collating nonconjunct and conjunct forms of words

From: N. Ganesan (
Date: Tue May 10 2005 - 10:10:49 CDT

  • Next message: Dominikus Scherkl: "AW: AW: help required"

    In Indian languages, ZWJ or ZWNJ are used
    to produce conjunct and nonconjunct forms
    of identical words.

    Interestingly, identical words appear in
    conjuncts form in some places in a book while
    nonconjunct forms of the same words appear elsewhere
    in that particular book. In Tamil, this situation
    exists for Sanskrit loan words.

    Also in the first half of 20th century,
    Islamic names were written with a conjunct ksha,
    are now universally written with a nonconjunct ksha.

    Linguistically, it makes sense to place the
    identical words next to each other while sorting
    a book words, if that book has the same word
    has both conjunct and conjunct letters at different places.

    How does Unicode treat collation of conjunct
    and nonconjunct forms of identical words?
    Are they next to each other? Since North Indian
    languages have possibly this situation many times,
    any general rule or policy?

    N. Ganesan

    This archive was generated by hypermail 2.1.5 : Tue May 10 2005 - 10:14:58 CDT