RE: Indic scripts, visual-order vs phonetic-order

Date: Thu Jun 06 2002 - 13:33:43 EDT

On 06/06/2002 12:45:15 AM "Maurice Bauhahn" wrote:

>In Khmer one of the problems visual order brings up for computer
>implementations is the large variety of character orders this could
>There are two-glyph vowels with pre and post consonant placement,
>vowels which preceed, and one-glyph vowels which follow (super or sub or
>post). Failure to lock those into a standard order would result in quite a
>bit of preprocessing for sorting, not to mention the problems of
>searchin/spell checking.

It seems to me that this is a non-issue in relation to searching and spell
checking since both of those processes are sensitive only to sequences of
encoded characters and do no need to know what any given character is used
to represent (unless you're doing something akin to sound-based searching).
As for sorting, the preprocessing is not necessarily a big deal -- at
least, Thai and Lao have visually-ordered encoding that requires a bit of
reordering before creating sort keys (or as part of the process of creating
sort keys), but the preprocessing is pretty trivial: Vp C > C Vp.

- Peter

Peter Constable

Non-Roman Script Initiative, SIL International
7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
Tel: +1 972 708 7485
E-mail: <>

This archive was generated by hypermail 2.1.2 : Thu Jun 06 2002 - 12:05:36 EDT