Re: ISCII-Unicode Conversion

From: Ram Viswanadha (ram@jtcsv.com)
Date: Tue Nov 06 2001 - 16:33:21 EST


Hi Etienne,

> Hi,
>
> Would anybody be able to point me to possible ISCII-Unicode conversion
> utilities/APIs?
ICU 2.0 will have an ISCII-Unicode converter. Please see
http://oss.software.ibm.com/icu

> How reliable is the conversion?
No conversion is fully reliable unless it goes from a charset x to a charset y
of which x is a subset.
In ISCII, the codes that declare the script are optional, and a default script
is assumed when they are absent. The conversion may therefore not yield the
expected results if the data does not contain the declaratory code and the
script is not specified by the caller. There are also codes that declare font
styles, which will not be converted.
But in general, an ISCII byte stream can be converted to Unicode without
much loss of information for display purposes. I have converted a 10-page
document in ISCII to UTF-8, compared the output displayed by MS Word for
UTF-8 with that displayed by ILeap for ISCII, and did not find a noticeable
difference.
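
To make the point about the default script concrete, here is a rough sketch in Python. It is not the ICU converter and not a complete ISCII table. It assumes the attribute code 0xEF followed by a script code (0x42 Devanagari, 0x43 Bengali, 0x44 Tamil, 0x45 Telugu), and it maps only three hand-picked ISCII bytes. The function name and the default_script parameter are made up for illustration.

```python
# Illustrative sketch only: a tiny hand-picked subset of ISCII, not a full converter.
ATR = 0xEF  # ISCII attribute code; the next byte selects a script or a font style

# Script codes that may follow ATR, mapped to the start of the Unicode block
# (assumed values, and only four of the scripts ISCII covers).
SCRIPT_BASE = {
    0x42: 0x0900,  # Devanagari
    0x43: 0x0980,  # Bengali
    0x44: 0x0B80,  # Tamil
    0x45: 0x0C00,  # Telugu
}

# A few ISCII bytes expressed as offsets within a Unicode Indic block
# (assumed subset; a real table covers the whole 0xA1-0xFA range).
OFFSET = {
    0xA4: 0x05,  # letter A
    0xB3: 0x15,  # letter KA
    0xE8: 0x4D,  # halant (virama)
}


def iscii_to_unicode(data: bytes, default_script: int = 0x42) -> str:
    """Convert ISCII bytes to a str, starting in the caller's default script."""
    base = SCRIPT_BASE[default_script]
    out = []
    i = 0
    while i < len(data):
        b = data[i]
        if b == ATR and i + 1 < len(data):
            code = data[i + 1]
            if code in SCRIPT_BASE:
                base = SCRIPT_BASE[code]  # explicit script declaration
            # Any other attribute (for example a font style) is dropped.
            i += 2
            continue
        if b < 0x80:
            out.append(chr(b))  # the lower half is plain ASCII
        elif b in OFFSET:
            out.append(chr(base + OFFSET[b]))
        else:
            out.append("\ufffd")  # byte not covered by this sketch
        i += 1
    return "".join(out)
```

With this sketch the single byte 0xB3 comes out as Devanagari KA (U+0915) under the default, but as Bengali KA (U+0995) if the caller passes default_script=0x43. That is the ambiguity described above when the data carries no declaratory code.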

Regards

Ram Viswanadha
IBM


