Re: Question...

From: Joon Kang (ykang@verity.com)
Date: Thu May 06 1999 - 19:47:29 EDT


Hello Addison,

Thank you for your answering my question.
I am a Win-guru but our products support various platforms.

Anyway, my original curiosity comes from since UCS2 has 65K code points, it
could adopt almost Asian characters without having one byte NULL.
If I restate my question then, why UCS2 allows the one byte NULL on its
codeset design/implementation?
Is it simply just because to provide codepoint integrity or ?

Thanks in advance for your kind reply.
-Joon
International Engineering at Verity, Inc.

----- Original Message -----
From: Addison Phillips <AddisonP@simultrans.com>
To: <ykang@verity.com>
Cc: <unicode@unicode.org>
Sent: Thursday, May 06, 1999 4:17 PM
Subject: RE: Question...

>Hi Joon,
>
>By "NULL represent a character", do you mean that character NULL 0x0000? Or
>do you mean NULL has a half character (e.g. one byte of NULL such as 0x0032
>or 0x3200)?
>
>If the former, the NULL character gives compatibility for programs written
>in C that use NULL as a string terminator.
>If the latter, NULL represents half of the character and cannot (well,
>should not) be eliminated without changing the encoding.
>
>If you wish to send data that contains no nulls (other than string
>terminators), consider converting your data to UTF-8 or UTF-7. The
>particular encodings were created for file-system and mail-system safety in
>particular and have the advantage of containing no NULLs. UTF-8 is commonly
>used for compatibility purposes such as this.
>
>The downside of using these encodings is that, while they "compress"
English
>UNICODE data, your Asian characters will average MORE THAN 2 bytes per
>character.
>
>The encoding function for UTF-7 and UTF-8 are out on the web and both are
>small functions.
>The definition and encoding for UCS-2 is on Unicode's web site (well,
>conversion tables are... the definition is long and you should buy the
>book). You can find some sample code that might help on our FTP site
>(ftp://ftp.simultrans.com/anonymous). You're not going to write your own
>character converter, though, are you? I thought you guys were a Windows
>shop?
>
>Addison
> __________________________________________
>
> Addison Phillips
> Director, Globalization Services
> SimulTrans, L.L.C.
> 2606 Bayshore Parkway
> Mountain View, California 94043 USA
>
> +1 650-526-4652 (direct telephone)
> +1 650-969-9959 (facsimile)
> AddisonP@simultrans.com (Internet email)
> http://www.simultrans.com (website)
>
> "22 languages. One release date."
> __________________________________________



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:46 EDT