Re: UTF-8 text files

From: Antoine Leca (
Date: Tue Jun 07 2005 - 03:44:03 CDT

  • Next message: "Re: Arabic Joining Classes"

    On Monday, June 6th, 2005 19:07Z Philippe VERDY wrote:

    > Instead of focusing on the trademark or registered symbols, just
    > consider the case of the non-breaking space (U+00A0) which may follow
    > lots of uppercase ISO 8859-1 Letters (U+00C0..U+00DF).

    Remember that Lasse's idea is to check _all_ the text; so while NBSP
    certainly can occur after an capital accentuated letter (or an eszet), for
    this to misbehave, it would require _all_ the NBSPs to occur after
    accentuated capitals, _never_ after unaccentuated letters or lower case
    letters or punctuation.
    Which I would find less probable.


    This archive was generated by hypermail 2.1.5 : Tue Jun 07 2005 - 03:46:13 CDT