From: Addison Phillips [wM] (aphillips@webmethods.com)
Date: Tue Nov 23 2004 - 11:14:06 CST
If you are writing a C program, then the null character can be used to indicate the end of a string.
One of the nice things about UTF-8 is that the ASCII bytes from 0 to 7F hex (including the C0 control characters from \x00 through \x01f---including NULL) represent the ASCII characters from 0 to 7F hex. That is, amoung other things UTF-8 was designed specifically to be compatible with C language strings.
Addison
Addison P. Phillips
Director, Globalization Architecture
http://www.webMethods.com
Chair, W3C Internationalization Working Group
http://www.w3.org/International
Internationalization is an architecture.
It is not a feature.
> -----Original Message-----
> From: unicode-bounce@unicode.org
> [mailto:unicode-bounce@unicode.org]On Behalf Of Harshal Trivedi
> Sent: 2004年11月23日 3:42
> To: unicode@unicode.org
> Subject: My Querry
>
>
> How can i make sure that UTF-8 format string has terminated while
> encoding it, as compared to C program string which ends with '\0'
> (NULL) character?
>
> -> Is there any special symbol or procedure to determine end of UTF-8
> string OR just ASCII NULL '\0' is used as it is to indicate that.
>
> --
> Harshal P. Trivedi
> Software Engineer
This archive was generated by hypermail 2.1.5 : Tue Nov 23 2004 - 11:24:56 CST