Re: character entities in UTF-8 files

From: Chris Jacobs (chris.jacobs@freeler.nl)
Date: Tue Jul 12 2005 - 18:28:59 CDT

Next message: Gregg Reynolds: "Re: character entities in UTF-8 files"

Previous message: Asmus Freytag: "Re: character entities in UTF-8 files"
In reply to: Peter Constable: "RE: character entities in UTF-8 files"
Next in thread: Gregg Reynolds: "Re: character entities in UTF-8 files"
Reply: Gregg Reynolds: "Re: character entities in UTF-8 files"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

----- Original Message -----
From: "Peter Constable" <petercon@microsoft.com>
To: <unicode@unicode.org>
Sent: Tuesday, July 12, 2005 11:03 PM
Subject: RE: character entities in UTF-8 files

> > From: unicode-bounce@unicode.org [mailto:unicode-bounce@unicode.org]
> > On Behalf Of Chris Jacobs
>
> > > We have an XML based application...
>
> > Only it does not stand for e acute, as far as unicode is involved it
> > just stands for itself, for é.
> >
> > Of course you are allowed to have agreements with your users about
> > replacing é by e acute or by whatever you want to replace it by.
>
> Since this is an XML application, then at the level of XML parsing,
> &#233 must be interpreted as e-acute; he is not allowed to have
> agreements with his users about replacing &#233 with anything else.

Except that not: specifies UTF-8 files as source, but: "specifies UTF-8
files as input".
So this é is not in the XML source, but in the input which the XML
reads.
The &#233 will then not be parsed as XML, just like when you write in BASIC
a text editing program the edited text will not be scanned for BASIC key
words unless you for whatever reason program it to do so.

Next message: Gregg Reynolds: "Re: character entities in UTF-8 files"
Previous message: Asmus Freytag: "Re: character entities in UTF-8 files"
In reply to: Peter Constable: "RE: character entities in UTF-8 files"
Next in thread: Gregg Reynolds: "Re: character entities in UTF-8 files"
Reply: Gregg Reynolds: "Re: character entities in UTF-8 files"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.5 : Tue Jul 12 2005 - 18:33:27 CDT