Re: Question regarding HTML 4.0 &#x??; syntax

From: John Cowan (john_cowan@hotmail.com)
Date: Fri Jul 18 1997 - 13:42:24 EDT


Due to network problems, I can read mail at cowan@ccil.org, but
can't post/reply/send from there. Please direct all private replies
to cowan@ccil.org , not the HotMail address. Thanks.

David Pope wrote:

> I assume &#x??, where ??=hex value, will display the character
> associated [by] the assigned charset with the ?? hex value.

Not so. &#????; (and &#x????;) always refer to ISO 10646/Unicode,
not the character encoding used to transmit the document. This is
true no matter how many decimal or hex digits there are. If this were
not so, then anyone transcoding the document would have to convert
(or discard as unconvertable) all the numeric character references;
as it is, they are invariant.

This does not prevent certain losing HTML composers from outputting
the illegal € through Ÿ using CP1252 (Windows) instead of
the proper Unicode values. When you meet one of these on the
road, shoot it.

John Cowan cowan@ccil.org
        Please do not use "Reply"
        e'osai ko sarji la lojban.
______________________________________________________
Get Your Private, Free Email at http://www.hotmail.com



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:36 EDT