Re: Cp1256 (Windows Arabic) Characters not supported by UTF8

From: Theo Veenker (Theo.Veenker@let.uu.nl)
Date: Thu Aug 11 2005 - 06:31:36 CDT

  • Next message: Philippe Verdy: "Re: Cp1256 (Windows Arabic) Characters not supported by UTF8"

    Ritesh wrote:
    > Hi,
    >
    > Thanks a lot for prompt reply.
    >
    > Issue is like. We have one application where user can upload a file in
    > tab delimited or xls file.
    >
    > Now we have few user who upload a file which can contain English and
    > other language characters(Here it is Arabic).
    >
    > This files can have different combinations as below,
    > 1. File is a UTF-8 and have English and Arabic Characters.
    > 2. File is a UTF-16 (LE) and have English and Arabic Characters.
    > 3. File is UTF-8 and Have only Arabic Characters
    > 4. File is UTF-8 and Have only English Characters
    > 5. File is UTF-16 and Have only Arabic Characters
    > 6. File is UTF-16 and Have only English Characters
    > 7. File can be in ASCII format.
    >
    > Once the file is uploaded it will be displayed on the browser.
    >
    > Now, we are using following logic while uploading file.

    [snip]

    Did you check this one, it is a java port of mozilla's automatic charset
    detection algorithm. The original C++ sources are provided as well.

    http://www.i18nfaq.com/chardet.html

    Theo



    This archive was generated by hypermail 2.1.5 : Thu Aug 11 2005 - 06:33:14 CDT