From: Vinod Kumar (rigvinod@gmail.com)
Date: Tue Sep 20 2005 - 04:45:09 CDT
> Kent Karlsson Sun Sep 04 2005 - 04:56:34 CDT
> Then there are two orthographic syllables here, per definition.
R15 (Chap 9.1 TUS4): When the dependent vowel I_vs is used to override
the inherent vowel of a syllable, it is always written to the extreme
left of the orthographic syllable. If the orthographic syllable contains
a consonant cluster, then this vowel is always depicted to the left
of that cluster.
The important word here is ortho. The rule R15 has to be interpreted
and modified in terms of the mathematical meaning of being orthogonal
or independent of each other. From my familiarity with Indic scripts,
and experience in designing and implementing the shaping engine
and OpenType tables for the Saral series of fonts for all nine Indic scripts,
I modify and interpret the Unicode rule R15 for glyphs as follows:
First the orthogonal glyph cluster is defined.
Orthogonal glyph cluster:
An orthogonal glyph cluster is the minimal sequence of glyphs
that can occur stand-alone in displayed text between two spaces.
Example: Each of <Virama-Glyph>, <I_vs-Glyph>, <Half-TTA-Glyph> are
not valid as stand alone forms in text. Each of these is not an
orthogonal glyph cluster.
Some orthogonal glyph clusters would be
<TT-MA-Glyph>, <Half-TTA-Glyph MA-Glyph>,
<TTA-Glyph Virama-Glyph> and <MA-Glyph>.
<TTA-Glyph Virama-Glyph MA-Glyph> is not an orthogonal glyph cluster.
Even though it can stand alone between two spaces, its subsequences
<TTA-Glyph Virama-Glyph> and <MA-Glyph>
can stand alone and so it is not a minimal sequence.
The importance of an orthogonal glyph cluster is that it can be
produced, pronounced and interpreted independent of other
glyph clusters.
(Modified R15) When the dependent vowel I_vs is used to override
the inherent vowel of a syllable, it is always written to the
left of the previous orthogonal glyph cluster.
I_vs repositioning using orthogonal glyph clusters
The character sequence <TTA Virama MA> could take several forms
in the glyph domain, depending on how rich the font is or because
the writer chooses (using ZWJ, ZWNJ) a particular form.
-----------------------------------------------------------------------------
3 Forms of Visual syllable for <TTA Virama MA>
Full Visual syllable Orthogonal glyph clusters
-----------------------------------------------------------------------------
<TT-MA-Glyph> <TT-MA-Glyph>
<Half-TTA-Glyph MA-Glyph> <Half-TTA-Glyph MA-Glyph>
<TTA-Glyph Virama-Glyph MA-Glyph> <TTA-Glyph Virama-Glyph >,
<MA-Glyph>
-----------------------------------------------------------------------------
The explicit halant form (third) of the visual syllable consists of
two orthogonal glyph clusters. The other two consist of a single
orthogonal glyph cluster.
Now consider the full syllable <TTA Virama MA I_vs>.
The correct reordering of I_vs for the 3 Forms will be
<I_vs-Glyph TT-MA-Glyph>
<I_vs-Glyph Half-TTA-Glyph MA-Glyph>
<TTA-Glyph Virama-Glyph I_vs-Glyph MA-Glyph>
Look at the third form. The I_vs-Glyph should not move to left of
the first orthogonal glyph cluster <TTA-Glyph Virama-Glyph>
but to the left of the previous orthogonal glyph cluster <MA-Glyph>.
A form like <I_vs-Glyph TTA-Glyph Virama-Glyph MA-Glyph> is
incorrect. I will not be surprised if such incorrect forms have been
created first by buggy shaping software. Then the mistake is
propagated by other influential
people as "two orthographies" or as "two writing conventions"
and implemented in software like Windows Vista!
Sometimes, people say that text display should be driven by convention
and not by logic. I could not find any instance in Devanagari text
(searching in several pages of a Jnaneswari exposition in Marathi)
where the I_vs-Glyph crosses to the left of the Virama-Glyph.
I could find only two syllables with explicit halant consonants
and in both cases there was no I_vs in the syllable.
Prof R K Joshi too does not recollect any syllable
with explicit Halant and a left moving I_vs that crosses it.
In Malayalam text, I could locate (again in a Jnaneswari exposition)
several instances of E-matra (or Ee-matra) that moves left
but each one of them stopped before crossing the Chandrakala (halant)
in the syllable's form.
When conventional instances cannot be seen, one has to fall back
on logic to deduce forms that can be justified and defended.
The form
<TTA-Glyph Virama-Glyph I_vs-Glyph MA-Glyph> will be pronounced
properly as TT followed by MI, maybe with an inordinate delay
between TT and MI. The incorrect form
<I_vs-Glyph TTA-Glyph Virama-Glyph MA-Glyph>
tends to be pronounced as TTI followed by MA at first glance.
Later the Virama will be noticed and the pronouncer may scratch
her head wondering how an animal like <I_vs-Glyph TTA-Glyph
Virama-Glyph> should sound. Such a confusion does not arise with
<I_vs-Glyph Half-TTA-Glyph> because <Half-TTA-Glyph> is not a stand
alone symbol and has to be seen together with the succeeding glyph.
Vinod Kumar
Project IndiX
This archive was generated by hypermail 2.1.5 : Tue Sep 20 2005 - 04:45:47 CDT