RE: Coloured diacritics (Was: Transcoding Tamil in the presence of markup)

From: Philippe Verdy (verdy_p@wanadoo.fr)
Date: Tue Dec 09 2003 - 09:54:15 EST

  • Next message: Arcane Jill: "RE: Text Editors and Canonical Equivalence (was Coloured diacritics)"

    > You might as well say that C code is not plain text because it too is
    > subject to special canons of interpretation.

    C, C++ and Java source files are not plain text as well (they have their own
    "text/*" MIME type, which is NOT "text/plain" notably because of the rules
    associated with end-of-lines, notably in presence of comments).

    > But both XML/HTML/SGML and the various programming languages are plain
    text.

    See "text/xml", "text/html" and "text/sgml" MIME types. They also aren't
    "text/plain" so they have their own interpretation of Unicode characters
    which is not the one found in the Unicode standard.

    __________________________________________________________________
    << ella for Spam Control >> has removed Spam messages and set aside
    Newsletters for me
    You can use it too - and it's FREE! http://www.ellaforspam.com





    This archive was generated by hypermail 2.1.5 : Tue Dec 09 2003 - 10:49:49 EST