Re: FW: Web Form: Other Question: British pound sign - U+00A3

From: Peter Kirk (peterkirk@qaya.org)
Date: Wed Oct 01 2003 - 13:50:35 CST


>>-----Original Message-----
>>Date/Time: Wed Oct 1 05:19:00 EDT 2003
>>Contact: jim.leek@admin.ox.ac.uk
>>Report Type: Other Question, Problem, or Feedback
>>
>>Hi,
>>
>>I'm a web developer at Oxford University in the UK, and we
>>are considering encoding all our websites in Unicode to allow
>>support of non-western languages.
>>
>>However, we have a problem.
>>
>>[...]
>>
>>Our problem is the representation of the £ sign (British
>>pound sign - U+00A3). When we type this character into our
>>pages and then set the character encoding in our pages to
>>Unicode (UTF-8) (either by setting it directly in the HTTP
>>header, or setting it using the <meta
>>http-equiv="Content-Type" content="text/html; charset=utf-8">
>>tag), when we view the pages we see the standard ASCII set of
>>characters, but the Pound sign displays as an error.
>>
>>...
>>
>
>
Jim, I hope this doesn't sound too obvious, but you could have missed
it: In your UTF-8 encoded file, do you have a single byte 0xA3 or 163?
It sounds like this is what you are doing if you first type in the pound
sign then declare the encoding to be UTF-8. But 0xA3 alone is invalid
UTF-8, so this would explain the error. The pound sign needs to be
encoded as two bytes according to the UTF-8 definition, which would be
0xC2 0xA3 - or as "&#163;" as Marco says.

-- 
Peter Kirk
peter@qaya.org (personal)
peterkirk@qaya.org (work)
http://www.qaya.org/


This archive was generated by hypermail 2.1.5 : Thu Jan 18 2007 - 15:54:24 CST