In HTML 4.0, I can code the character for Go (aka Baduk or Wei-ch'i) in the following ways:
| Character encoding | in UTF-8 | E7 A2 81 | 碁 |
|---|---|---|---|
| Numeric character reference | decimal | 碁 | 碁 |
| hexadekadic | 碁 | 碁 | |
| 碁 | 碁 |
What does your browser display for each? Note also the document title in your browser windows title bar, or in the "Document Info" window.
Beware: the last cell in the first row of the table contains three non-ASCII bytes, which will be MIME-q-p encoded, in this letter. If theis cell does not display properly on your system, it may have been distorted by the mail-transferring process. In this case, insert three characters with the hex values given in the 3rd column of the same row. The title, the header, and the last row of the table should contain the same han/kanji character. The following table (please use a mono-pitch font to display it) summarizes how these browsers display the various encodings of a han/kanji character in various contexts. The encodings are denoted by the abbreviations UTF for UTF-8 dec for a decimal numerical character reference, hex for a hexadekadic, lower-case NCR, HEX for a hexadekadic upper-case NCR. Browser | enc | text window | title bar | menus | source -------------+-----+-------------+------------+------------+------------ Alis | UTF | ok | ok | ok | ok Tango | dec | ok | not tested | not tested | N/A | hex | ok | not tested | not tested | N/A | HEX | missing | not tested | not tested | N/A -------------+-----+-------------+------------+------------+------------ Microsoft | UTF | ok | N/A | N/A | wrong Internet | dec | ok | N/A | not tested | N/A Explorer | hex | wrong | N/A | not tested | N/A | HEX | wrong | N/A | not tested | N/A -------------+-----+-------------+------------+------------+----------- Netscape | UTF | ok | repl.char. | repl.char. | repl.char. Communicator | dec | ok | not tested | not tested | N/A | hex | repl.char. | not tested | not tested | N/A | HEX | repl.char. | not tested | not tested | N/A where wrong: encoding not recognized, byte values interpreted as Latin-1 characters N/A: not applicable repl.char: encoding recognized but character not available; rather, a question-mark or an open box ist displayed missing: blank space is displayed rather than the character