From: Agnieszka Kasprzyk (a.e.kasprzyk@uw.edu.pl)
Date: Thu May 24 2007 - 07:00:33 CDT
Hello,
I work for the union catalog of Polish libraries. Our contributors use ISO
transliteration standards.
Could you explain me how to deal with those characters from transliteration
standards that do not exist as precomposed characters in Unicode but they
are combined of others BUT they may be combined in a number of different
ways. Which is the correct way?
Example:
ISO 259: 1984 Transliteration of Hebrew characters into Latin characters
requires us to enter letter t with dot below and above and letter s with dot
below and above.
Now each of these characters may be built of:
a) letter t/s (U+0073/U+0074) + combining dot below (U+0323) + combining dot
above (U+0307)
b) letter t/s with dot below (U+1E6D/U+1E63)+ combining dot above (U+0307)
c) letter t/s with dot above (U+1E6B/U+1E61) + combining dot below (U+0323)
Other cases are for instance letters with two diacritics one over the other.
Should it be base letter + upper character + lower character, or base letter
+ character which is closer + character which is further from the base
letter, or if it's possible, base letter with one diacritic as one character
+ the other diacritic as the combining character?
What is the rule to follow in such cases? Is there any document specifying
what to do?
I would really appreciate your help with this,
thank you,
Agnieszka Kasprzyk
mail: a.e.kasprzyk@uw.edu.pl
NUKAT Center, Warsaw University Library, Poland
http://www.nukat.edu.pl
This archive was generated by hypermail 2.1.5 : Thu May 24 2007 - 07:05:01 CDT