From: Edward H. Trager (ehtrager@umich.edu)
Date: Thu Jun 17 2004 - 10:07:03 CDT
On Thursday 2004.06.17 00:46:31 -0400, John Cowan wrote:
> Edward H. Trager scripsit:
>
> > What about vim (vi clone: http://www.vim.org)? I just converted
> > a very large UTF-8-encoded HTML document (more than 15000
> > lines) to standards-compliant XHTML-1.0 and found the advanced
> > regular-expression-based substitution facilities in vi(m) absolutely
> > indispensable for adding all of the closing tags that XML requires,
> > which were missing from my original document.
>
> HTML Tidy or TagSoup would probably have served you better, rather than
> groveling over the code bit by bit. (HTML Tidy can do more cleaning,
> but it sometimes loops or delivers garbage if the HTML is sufficiently
> broken. TagSoup never gives up and never loops, but doesn't know
> as much about HTML.)
Actually, I did use Tidy too.
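
To illustrate the kind of regular-expression-based substitution mentioned above, here is a minimal sketch. It uses Python's re module as a stand-in for vi(m)'s substitute command, and it assumes a simple layout with one <li> item per line. The function name close_list_items and the sample markup are hypothetical, not taken from the document that was actually converted.

```python
import re

# Assumption: each list item sits on its own line. The pattern matches a
# line that opens an <li> element and contains no </li> anywhere after it.
UNCLOSED_LI = re.compile(r"^([ \t]*<li>(?:(?!</li>).)*?)[ \t]*$", re.MULTILINE)

def close_list_items(html: str) -> str:
    """Append </li> to every <li> line that lacks a closing tag."""
    # Trailing spaces and tabs on the matched line are dropped.
    return UNCLOSED_LI.sub(r"\1</li>", html)

sample = "<ul>\n  <li>one\n  <li>two</li>\n  <li>three  \n</ul>"
print(close_list_items(sample))
```

Lines that already contain </li> are left alone, so the substitution can be run more than once. A line-oriented pattern like this only handles markup that is already fairly regular; badly broken HTML is where Tidy or TagSoup would still be needed.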
>
> --
> Said Agatha Christie / To E. Philips Oppenheim John Cowan
> "Who is this Hemingway? / Who is this Proust? cowan@ccil.org
> Who is this Vladimir / Whatchamacallum, http://www.reutershealth.com
> This neopostrealist / Rabble?" she groused. http://www.ccil.org/cowan
> --author unknown to me; any suggestions?