Re: Unicode conversion to normal asci file

From: António Martins-Tuválkin (antonio@tuvalkin.web.pt)
Date: Thu Jun 07 2007 - 15:15:54 CDT

    On 2007.06.06, 18:52, Saltel, Dan <Dan.Saltel@gwl.ca> wrote:

    > I want a program that will read in a Unicode file, convert it to ASCII
    > so that we can upload it into our database.
    > Example:
    > If the input file looks like:
    > último año de carrera
    > It would convert it to
    > ultimo ano de carrera

    You must mean 7-bit ASCII, not ANSI (whatever that exactly means here).
    FWIW, in "ANSI" (taking this to mean either Windows cp-1252 or ISO
    8859-1) it would still be "último año de carrera", since both encodings
    include equivalents of U+00FA (ú) and U+00F1 (ñ).
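
A quick way to check this claim, as a minimal sketch in Python 3 (the helper name `can_encode` is made up for illustration and is not from the thread): the sentence round-trips through both candidate "ANSI" encodings, while 7-bit ASCII rejects it.

```python
def can_encode(text: str, codec: str) -> bool:
    """Return True if every character of text exists in the given codec."""
    try:
        text.encode(codec)
        return True
    except UnicodeEncodeError:
        return False


sentence = "último año de carrera"

# Both single-byte encodings hold ú (0xFA) and ñ (0xF1) at the same positions.
for codec in ("cp1252", "iso-8859-1"):
    encoded = sentence.encode(codec)
    assert encoded.decode(codec) == sentence  # lossless round trip
    print(codec, can_encode(sentence, codec), encoded)

# 7-bit ASCII has no slot for U+00FA or U+00F1, so encoding fails.
print("ascii", can_encode(sentence, "ascii"))
```

So if the target database accepts either of those 8-bit encodings, transcoding alone may be enough and nothing has to be stripped.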

    Anyway, this sentence is an excellent example of why "stripping accents"
    is not the way to go. Do you know what "ano" means in Spanish? Do you? ;-)
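
To make the loss concrete, here is a rough sketch of what "stripping accents" usually amounts to, using Python's standard `unicodedata` module. The function `strip_accents` is a hypothetical helper written for this illustration, not a recommendation: it decomposes each character (NFD) and drops the combining marks, after which "año" and "ano", two different Spanish words, become the same string.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Decompose to NFD, then drop combining marks (accents, tildes)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


print(strip_accents("último año de carrera"))  # ultimo ano de carrera

# The distinction between the two words is gone and cannot be recovered.
print(strip_accents("año") == strip_accents("ano"))  # True
```

The result is valid 7-bit ASCII, but the conversion is one-way: nothing in the output says which "ano" was meant.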

    --
    António MARTINS-Tuválkin
    <antonio@tuvalkin.web.pt>
    PT-1500-111 LISBOA
    +351 934 821 700, +351 217 150 939
    ICQ:193279138 http://tuvalkin.web.pt/

    Não me invejo de quem tem
    carros, parelhas e montes
    só me invejo de quem bebe
    a água em todas as fontes


