RE: unknown encoding

From: Stephen Holmes (stephen.holmes@eircom.net)
Date: Thu Oct 28 1999 - 11:18:22 EDT


It's a UTF-8 stream encoding Unicode code points. You can get some UTF-8
processing code from the Unicode web site, or alternatively convert this
UTF-8 stream to UCS-2 (or a local encoding scheme) using one of the many
tools available on the net. Try http://unicode.basistech.com/demo.html for
one such tool.
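
If you would rather do the conversion yourself, here is a minimal sketch
in Python. The function name utf8_to_ucs2 is made up for illustration, and
it assumes the input really is well-formed UTF-8. UCS-2 only covers the
Basic Multilingual Plane, so anything above U+FFFF is rejected rather than
silently mangled.

```python
def utf8_to_ucs2(data):
    """Decode UTF-8 bytes and return a list of 16-bit code units.

    Hypothetical helper: raises UnicodeDecodeError on malformed UTF-8 and
    ValueError on code points that do not fit in UCS-2.
    """
    text = data.decode("utf-8")
    units = []
    for ch in text:
        cp = ord(ch)
        if cp > 0xFFFF:
            # UCS-2 has no representation outside the BMP.
            raise ValueError("U+%X does not fit in UCS-2" % cp)
        units.append(cp)
    return units


# The bytes E5 9C 8B E7 AB 8B are the UTF-8 form of the first two
# characters of the sample text in the question.
sample = b"\xe5\x9c\x8b\xe7\xab\x8b"
print(["%04X" % u for u in utf8_to_ucs2(sample)])  # ['570B', '7ACB']
```

For a byte-oriented UCS-2 file you would still need to pick a byte order
when writing these 16-bit values out.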

Cheers
Steve.

-----Original Message-----
From: Yung-Fong Tang [mailto:ftang@netscape.com]
Sent: Thursday, October 28, 1999 8:05 AM
To: Unicode List
Cc: unicode@unicode.org; nink@24h.co.jp; bobv@nl.compuware.com
Subject: Re: unknown encoding

read http://www.w3.org/TR/REC-html40/charset.html#h-5.3.1
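
That section of the HTML 4.0 spec describes numeric character references.
Assuming the page source contains decimal references of the form &#22283;
(an assumption here, since the archived message shows only the rendered
characters), a minimal Python sketch for turning them back into Unicode
code points could look like the following. The function name
ncr_to_code_points is hypothetical; html.unescape is the standard-library
call that does the actual work.

```python
import html


def ncr_to_code_points(source):
    """Expand HTML character references and return Unicode code points.

    Hypothetical helper for illustration. Values up to 0xFFFF can be
    stored directly as 16-bit units.
    """
    text = html.unescape(source)
    return [ord(ch) for ch in text]


# Assumed form of the page source for the first two sample characters.
source = "&#22283;&#31435;"
print(["%04X" % cp for cp in ncr_to_code_points(source)])  # ['570B', '7ACB']
```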

Chak Ng wrote:

> Dear Sir/Madam,
>
> We are a software company developing a Chinese search engine.
> Recently, we have been integrating Unicode support, and have run
> into the following question.
> I found some webpages do display Chinese characters, but the source of
> them is not in Big5, GB, or Unicode. It is something like the following:
> 國立台南師範學院
> 附設實驗國民小學
>
> Here are the example webpages that have these coding.
> http://home.ust.hk/~ce_htc/
> http://210.70.37.2/
>
> Would you have any idea what this coding is?
> Is this a kind of Unicode?
> How can I convert this coding into 16-bit Unicode?
>
> Thank you for your attention and help.
>
> /Chak Ng
> IPO Inc.
>
> ______________________________________________________
> Get Your Private, Free Email at http://www.hotmail.com



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:54 EDT