Re: Fun with UDCs in Shift-JIS

From: Lars Marius Garshol (larsga@garshol.priv.no)
Date: Sat Jan 19 2002 - 12:21:26 EST


* Thomas Chan
|
| - NTT-DoCoMo pictographs[1] in webpages for cell phones
| http://www.nttdocomo.co.jp/i/tag/emoji/ .

* Marco Cimarosti
|
| Something is definitely weird in this page.
|
| They suggest using Shift-JIS codes for numeric character
| references, but such a thing is not allowed by the current
| definition of HTML: NCRs unambiguously and exclusively represent
| Unicode/ISO-10646 code points, regardless of the character set used
| in the page.
|
| How can such a thing work?

In HTML and XML it won't work. In both languages NCRs refer to Unicode
code points, and there is no way to change that.
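
To make that concrete, here is a minimal sketch, assuming Python's
standard html module as a stand-in for a conforming HTML parser. The
choice of HIRAGANA LETTER A and the variable names are mine, purely for
illustration; they are not taken from the DoCoMo page.

```python
import html

# Assumption for illustration: the page author wants HIRAGANA LETTER A,
# which is U+3042 in Unicode and the byte pair 0x82 0xA0 in Shift-JIS.
wanted = "\u3042"
sjis_value = int.from_bytes(wanted.encode("shift_jis"), "big")

# Writing the Shift-JIS value into an NCR does not select that character:
# the number is read as a Unicode code point, here U+82A0.
from_sjis_ncr = html.unescape(f"&#x{sjis_value:X};")

# Writing the Unicode code point into the NCR gives the intended character.
from_unicode_ncr = html.unescape(f"&#x{ord(wanted):X};")

print(hex(sjis_value), repr(from_sjis_ncr), repr(from_unicode_ncr))
```

The encoding declared for the page plays no part in either lookup,
which is why a Shift-JIS value in an NCR ends up as some unrelated
character.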

In SGML, which is formally the parent standard of both, this is
possible by changing the document character set in the SGML
declaration. These people aren't doing that, however, and even if they
did it would probably be of little use to anyone.

| BTW, ironically, U+F9A1 (as well as U+8AAA) means "to SCOLD, to
| UPBRAID", which is what ANGRY people do all the time... :-)

It is further ironic that U+F9A1 describes what people who use NCRs
in this way need. :-)

--Lars M.


