RE: Japanese Web pages in Unicode?

From: Marco Cimarosti (marco.cimarosti@essetre.it)
Date: Fri Jun 28 2002 - 03:45:25 EDT


juuitchan@hotmail.com wrote:
> It seems to me that the Japanese prefer to use Shift-JIS or EUC-JP to
> Unicode in their Web pages. Why on earth is this?
> [...] hankaku katakana take up twice as many bytes?

UTF-8
        ASCII: 1 bytes
        hankaku: 3 bytes
        other: 3 bytes

UTF-16
        ASCII: 2 bytes
        hankaku: 2 byte
        other: 2 bytes

EUC-JP:
        ASCII: 1 byte
        hankaku: 1 byte
        other: 2 bytes

Shift-JIS uses the same number of bytes as EUC-JP, plus a random number of
bytes to shift modes.

_ Marco



This archive was generated by hypermail 2.1.2 : Fri Jun 28 2002 - 02:24:33 EDT