Re: Utility to report and repair broken surrogate pairs in UTF-16 text

From: Bjoern Hoehrmann (derhoermi@gmx.net)
Date: Thu Nov 04 2010 - 07:51:26 CST


    * Jim Monty wrote:
    >Unfortunately, I'm not a good enough programmer to write such a utility in C or
    >even Perl, the language I know best. Is this a project that interests you, by
    >chance?
    >
    >I'm surprised I'm having difficulty finding an existing utility to repair broken
    >UTF-16 text. I thought this was something many programmers would need,
    >especially Web developers.

    I see. Well, it seems `uconv`, which ships with ICU (http://icu-project.org/),
    can do this: `uconv --callback substitute -f utf-16 -t utf-16 ...`.
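For anyone who would rather see the logic than rely on a tool, here is a minimal Python sketch of the "report and repair" idea. It is not how `uconv` is implemented; it only assumes that replacing each unpaired surrogate with U+FFFD is an acceptable repair. The function names (`repair_units`, `repair_utf16_le`) are made up for illustration, and the byte-level helper assumes little-endian input without a BOM.

```python
import struct

REPLACEMENT = 0xFFFD  # U+FFFD REPLACEMENT CHARACTER (assumed repair policy)


def repair_units(units):
    """Return (repaired_units, broken_indexes) for a sequence of UTF-16 code units.

    A high surrogate (D800..DBFF) is valid only when immediately followed by
    a low surrogate (DC00..DFFF); any other surrogate is reported and replaced.
    """
    repaired = []
    broken = []
    i = 0
    n = len(units)
    while i < n:
        unit = units[i]
        if 0xD800 <= unit <= 0xDBFF:
            if i + 1 < n and 0xDC00 <= units[i + 1] <= 0xDFFF:
                # Well-formed pair: copy both units and skip past them.
                repaired.extend((unit, units[i + 1]))
                i += 2
                continue
            # High surrogate with no low surrogate after it.
            broken.append(i)
            repaired.append(REPLACEMENT)
        elif 0xDC00 <= unit <= 0xDFFF:
            # Low surrogate not preceded by a high surrogate.
            broken.append(i)
            repaired.append(REPLACEMENT)
        else:
            repaired.append(unit)
        i += 1
    return repaired, broken


def repair_utf16_le(data):
    """Repair little-endian UTF-16 bytes (no BOM handling in this sketch)."""
    if len(data) % 2:
        raise ValueError("UTF-16 data must have an even number of bytes")
    units = struct.unpack("<%dH" % (len(data) // 2), data)
    repaired, broken = repair_units(units)
    return struct.pack("<%dH" % len(repaired), *repaired), broken
```

The returned index list gives the positions (in code units, not bytes) of every surrogate that was replaced, which covers the "report" half of the request.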

    -- 
    Björn Höhrmann · mailto:bjoern@hoehrmann.de · http://bjoern.hoehrmann.de
    Am Badedeich 7 · Telefon: +49(0)160/4415681 · http://www.bjoernsworld.de
    25899 Dagebüll · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/ 
    

