UCS and ISCII mapping, unofficial table

From: Michael Everson (everson@indigo.ie)
Date: Mon Mar 25 1996 - 13:39:54 EST


To follow up on Dominik Wujastyk's query about ISCII and UCS, I have
produced the following mapping table based primarily on the ISSCII-83 to
ISCII table in ISCII 1991. I have included 0020 SPACE in this list because
it is on the same table (there is a unique Indic space (DSP) in ISSCII-83
which is unified with SPACE in ISCII and in UCS).

UCS ISCII UCS Name
0901 A1 DEVANAGARI SIGN CANDRABINDU
0902 A2 DEVANAGARI SIGN ANUSVARA
0903 A3 DEVANAGARI SIGN VISARGA
0020 20 SPACE
0905 A4 DEVANAGARI LETTER A
0906 A5 DEVANAGARI LETTER AA
0907 A6 DEVANAGARI LETTER I
0908 A7 DEVANAGARI LETTER II
0909 A8 DEVANAGARI LETTER U
090A A9 DEVANAGARI LETTER UU
090B AA DEVANAGARI LETTER VOCALIC R
090C A6 + E9 DEVANAGARI LETTER VOCALIC L
090D AE DEVANAGARI LETTER CANDRA E
090E AB DEVANAGARI LETTER SHORT E
090F AC DEVANAGARI LETTER E
0910 AD DEVANAGARI LETTER AI
0911 B2 DEVANAGARI LETTER CANDRA O
0912 AF DEVANAGARI LETTER SHORT O
0913 B0 DEVANAGARI LETTER O
0914 B1 DEVANAGARI LETTER AU
0915 B3 DEVANAGARI LETTER KA
0916 B4 DEVANAGARI LETTER KHA
0917 B5 DEVANAGARI LETTER GA
0918 B6 DEVANAGARI LETTER GHA
0919 B7 DEVANAGARI LETTER NGA
091A B8 DEVANAGARI LETTER CA
091B B9 DEVANAGARI LETTER CHA
091C BA DEVANAGARI LETTER JA
091D BB DEVANAGARI LETTER JHA
091E BC DEVANAGARI LETTER NYA
091F BD DEVANAGARI LETTER TTA
0920 BE DEVANAGARI LETTER TTHA
0921 BF DEVANAGARI LETTER DDA
0922 C0 DEVANAGARI LETTER DDHA
0923 C1 DEVANAGARI LETTER NNA
0924 C2 DEVANAGARI LETTER TA
0925 C3 DEVANAGARI LETTER THA
0926 C4 DEVANAGARI LETTER DA
0927 C5 DEVANAGARI LETTER DHA
0928 C6 DEVANAGARI LETTER NA
0929 C7 DEVANAGARI LETTER NNNA
092A C8 DEVANAGARI LETTER PA
092B C9 DEVANAGARI LETTER PHA
092C CA DEVANAGARI LETTER BA
092D CB DEVANAGARI LETTER BHA
092E CC DEVANAGARI LETTER MA
092F CD DEVANAGARI LETTER YA
0930 CF DEVANAGARI LETTER RA
0931 D0 DEVANAGARI LETTER RRA
0932 D1 DEVANAGARI LETTER LA
0933 D2 DEVANAGARI LETTER LLA
0934 D3 DEVANAGARI LETTER LLLA
0935 D4 DEVANAGARI LETTER VA
0936 D5 DEVANAGARI LETTER SHA
0937 D6 DEVANAGARI LETTER SSA
0938 D7 DEVANAGARI LETTER SA
0939 D8 DEVANAGARI LETTER HA
093C E9 DEVANAGARI SIGN NUKTA
093D EA + E9 DEVANAGARI SIGN AVAGRAHA
093E DA DEVANAGARI VOWEL SIGN AA
093F DB DEVANAGARI VOWEL SIGN I
0940 DC DEVANAGARI VOWEL SIGN II
0941 DD DEVANAGARI VOWEL SIGN U
0942 DE DEVANAGARI VOWEL SIGN UU
0943 DF DEVANAGARI VOWEL SIGN VOCALIC R
0944 DF + E9 DEVANAGARI VOWEL SIGN VOCALIC RR
0945 E3 DEVANAGARI VOWEL SIGN CANDRA E
0946 E0 DEVANAGARI VOWEL SIGN SHORT E
0947 E1 DEVANAGARI VOWEL SIGN E
0948 E2 DEVANAGARI VOWEL SIGN AI
0949 E7 DEVANAGARI VOWEL SIGN CANDRA O
094A E4 DEVANAGARI VOWEL SIGN SHORT O
094B E5 DEVANAGARI VOWEL SIGN O
094C E6 DEVANAGARI VOWEL SIGN AU
094D E8 (+ E8) DEVANAGARI SIGN VIRAMA
0950 A1 + E9 DEVANAGARI OM
0951 F0 + B5 DEVANAGARI STRESS SIGN UDATTA
0952 F0 + B8 DEVANAGARI STRESS SIGN ANUDATTA
0953 -- DEVANAGARI GRAVE ACCENT
0954 -- DEVANAGARI ACUTE ACCENT
0958 B3 + E9 DEVANAGARI LETTER QA
0959 B4 + E9 DEVANAGARI LETTER KHHA
095A B5 + E9 DEVANAGARI LETTER GHHA
095B BA + E9 DEVANAGARI LETTER ZA
095C BF + E9 DEVANAGARI LETTER DDDHA
095D C0 + E9 DEVANAGARI LETTER RHA
095E C9 + E9 DEVANAGARI LETTER FA
095F CE DEVANAGARI LETTER YYA
0960 AA + E9 DEVANAGARI LETTER VOCALIC RR
0961 A7 + E9 DEVANAGARI LETTER VOCALIC LL
0962 DB + E9 DEVANAGARI VOWEL SIGN VOCALIC L
0963 DC + E9 DEVANAGARI VOWEL SIGN VOCALIC LL
0964 EA DEVANAGARI DANDA
0965 EA + EA DEVANAGARI DOUBLE DANDA
0966 F1 DEVANAGARI DIGIT ZERO
0967 F2 DEVANAGARI DIGIT ONE
0968 F3 DEVANAGARI DIGIT TWO
0969 F4 DEVANAGARI DIGIT THREE
096A F5 DEVANAGARI DIGIT FOUR
096B F6 DEVANAGARI DIGIT FIVE
096C F7 DEVANAGARI DIGIT SIX
096D F8 DEVANAGARI DIGIT SEVEN
096E F9 DEVANAGARI DIGIT EIGHT
096F FA DEVANAGARI DIGIT NINE
0970 F0 + BF DEVANAGARI ABBREVIATION SIGN
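
As a rough illustration of how the table might be applied, here is a minimal Python sketch that decodes a handful of ISCII bytes to UCS. The names ISCII_TO_UCS, ISCII_PAIRS and iscii_to_ucs are hypothetical, and only a small excerpt of the table is included; the greedy "try a two-byte sequence first" strategy is an assumption about one reasonable implementation, not something ISCII 1991 prescribes.

```python
# Hypothetical excerpt of the table above: single ISCII bytes -> UCS code points.
ISCII_TO_UCS = {
    0x20: 0x0020,  # SPACE
    0xA1: 0x0901,  # DEVANAGARI SIGN CANDRABINDU
    0xB3: 0x0915,  # DEVANAGARI LETTER KA
    0xB4: 0x0916,  # DEVANAGARI LETTER KHA
    0xBA: 0x091C,  # DEVANAGARI LETTER JA
    0xC9: 0x092B,  # DEVANAGARI LETTER PHA
    0xDA: 0x093E,  # DEVANAGARI VOWEL SIGN AA
    0xE8: 0x094D,  # DEVANAGARI SIGN VIRAMA
    0xE9: 0x093C,  # DEVANAGARI SIGN NUKTA
    0xEA: 0x0964,  # DEVANAGARI DANDA
}

# Two-byte ISCII sequences that the table maps to a single UCS code point.
ISCII_PAIRS = {
    (0xA1, 0xE9): 0x0950,  # DEVANAGARI OM
    (0xB3, 0xE9): 0x0958,  # DEVANAGARI LETTER QA
    (0xBA, 0xE9): 0x095B,  # DEVANAGARI LETTER ZA
    (0xC9, 0xE9): 0x095E,  # DEVANAGARI LETTER FA
    (0xEA, 0xE9): 0x093D,  # DEVANAGARI SIGN AVAGRAHA
    (0xEA, 0xEA): 0x0965,  # DEVANAGARI DOUBLE DANDA
}


def iscii_to_ucs(data: bytes) -> str:
    """Decode ISCII bytes, preferring a two-byte match over a single byte."""
    out = []
    i = 0
    while i < len(data):
        pair = tuple(data[i:i + 2])
        if pair in ISCII_PAIRS:
            out.append(chr(ISCII_PAIRS[pair]))
            i += 2
        else:
            # Raises KeyError for bytes outside this excerpt.
            out.append(chr(ISCII_TO_UCS[data[i]]))
            i += 1
    return "".join(out)
```

For example, iscii_to_ucs(bytes([0xBA, 0xE9, 0xDA])) yields ZA (095B) followed by VOWEL SIGN AA (093E). A fuller converter would need the whole table and a policy for unmapped bytes.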

There are some other features of the transformation which are dependent on
the implementation. The examples on pages 54-55 of the Unicode Standard
Vol. 1 would be encoded thus in Unicode:

KKI: KA(0915) + VIRAMA(094D) + KA(0915) + -I(093F)
RKKA: RA(0930) + VIRAMA(094D) + KA(0915) + VIRAMA(094D) + KA(0915)
KKA: KA(0915) + VIRAMA(094D) + KA(0915)
K-KA: KA(0915) + VIRAMA(094D) + ZWNJ(200C) + KA(0915)

And thus in ISCII:

KKI: KA(B3) + VIRAMA(E8) + KA(B3) + -I(DB)
RKKA: RA(CF) + VIRAMA(E8) + KA(B3) + VIRAMA(E8) + KA(B3)
KKA: KA(B3) + VIRAMA(E8) + KA(B3)
K-KA: KA(B3) + VIRAMA(E8) + VIRAMA(E8) + KA(B3)
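
The one place where these examples differ structurally is K-KA, where the doubled VIRAMA (E8 + E8) in ISCII corresponds to VIRAMA + ZWNJ in Unicode. A minimal sketch of that rule in Python, covering only the four characters used in the examples (the function name convert_conjuncts is hypothetical):

```python
KA, RA, VIRAMA, SIGN_I = 0xB3, 0xCF, 0xE8, 0xDB

# Only the characters used in the four examples above.
UCS = {KA: "\u0915", RA: "\u0930", VIRAMA: "\u094D", SIGN_I: "\u093F"}
ZWNJ = "\u200C"


def convert_conjuncts(data: bytes) -> str:
    """Map ISCII bytes to Unicode, turning E8 + E8 into VIRAMA + ZWNJ."""
    out = []
    i = 0
    while i < len(data):
        if data[i] == VIRAMA and i + 1 < len(data) and data[i + 1] == VIRAMA:
            # Doubled VIRAMA in ISCII: emit VIRAMA followed by ZWNJ.
            out.append(UCS[VIRAMA] + ZWNJ)
            i += 2
        else:
            out.append(UCS[data[i]])
            i += 1
    return "".join(out)


# K-KA: KA + VIRAMA + VIRAMA + KA in ISCII
k_ka = convert_conjuncts(bytes([KA, VIRAMA, VIRAMA, KA]))
```

Here k_ka is the sequence 0915 094D 200C 0915, matching the Unicode encoding of K-KA given above; the other three examples pass through byte for byte.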

The ATR character (EF) followed by a script code (42 Devanagari, 43 Bengali,
44 Tamil, 45 Telugu, 46 Bengali(Assamese), 47 Oriya, 48 Kannada, 49 Malayalam,
4A Gujarati, 4B Gurmukhi) and placed before a text selects the corresponding
script-specific block and its codes in UCS.
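
One way to sketch the script switching in Python is shown below. The block start values are the UCS block origins for each script. The names SCRIPT_BLOCK_BASE, rebase and split_runs are hypothetical, and the default of Devanagari when no ATR sequence has been seen is an assumption made for the sketch. Rebasing relies on the UCS Indic blocks being laid out in parallel; not every position is assigned in every block, so a real converter would have to check each result.

```python
ATR = 0xEF

# ISCII script code -> start of the corresponding UCS block.
SCRIPT_BLOCK_BASE = {
    0x42: 0x0900,  # Devanagari
    0x43: 0x0980,  # Bengali
    0x44: 0x0B80,  # Tamil
    0x45: 0x0C00,  # Telugu
    0x46: 0x0980,  # Bengali (Assamese)
    0x47: 0x0B00,  # Oriya
    0x48: 0x0C80,  # Kannada
    0x49: 0x0D00,  # Malayalam
    0x4A: 0x0A80,  # Gujarati
    0x4B: 0x0A00,  # Gurmukhi
}


def rebase(devanagari_cp: int, script_code: int) -> int:
    """Move a Devanagari code point to the same offset in another block."""
    return SCRIPT_BLOCK_BASE[script_code] + (devanagari_cp - 0x0900)


def split_runs(data: bytes, default_script: int = 0x42):
    """Split ISCII bytes into (script_code, payload) runs at ATR sequences.

    Assumption: text before any ATR sequence is treated as Devanagari.
    """
    runs = []
    script = default_script
    current = bytearray()
    i = 0
    while i < len(data):
        if data[i] == ATR and i + 1 < len(data) and data[i + 1] in SCRIPT_BLOCK_BASE:
            if current:
                runs.append((script, bytes(current)))
                current = bytearray()
            script = data[i + 1]
            i += 2
        else:
            current.append(data[i])
            i += 1
    if current:
        runs.append((script, bytes(current)))
    return runs
```

For instance, rebase(0x0915, 0x44) gives 0B95, the Tamil counterpart of Devanagari KA.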

Michael Everson, Everson Gunn Teoranta
15 Port Chaeimhghein Íochtarach; Baile Átha Cliath 2; Éire (Ireland)
Gutháin: +353 1 478-2597, +353 1 283-9396
http://www.indigo.ie/egt
27 Páirc an Fhéithlinn; Baile an Bhóthair; Co. Átha Cliath; Éire


