RE: japanese xml

From: Marco Cimarosti (marco.cimarosti@essetre.it)
Date: Wed Aug 29 2001 - 05:21:43 EDT


Viranga Ratnaike wrote:
>I was hunting for examples of japanese xml and came across the
>following, which looks rather cool. Except that it doesn't seem
>to actually be unicode. I thought XML had mandated unicode?
> http://java.sun.com/xml/jaxp-1.1/examples/samples/weekly-euc-jp.xml

Not at all! Any encoding can be used in XML documents. The requirement is
that the encoding is declared inside each document.

In fact, that document begins with:

        <?xml version="1.0" encoding="euc-jp" ?>

"euc-jp" means the Japanese character set (JIS) serialized in EUC ("Extended
Unix Code"). EUC is what Unicoders would call a "transformation format", and
it is very popular with the three main CJK character sets (JIS=Japan,
GB=China, KCS=Korea).

_ Marco



This archive was generated by hypermail 2.1.2 : Wed Aug 29 2001 - 06:39:25 EDT