RE: My Querry

From: Addison Phillips [wM] (aphillips@webmethods.com)
Date: Tue Nov 23 2004 - 11:14:06 CST

  • Next message: Kenneth Whistler: "Re: My Querry"

    If you are writing a C program, then the null character can be used to indicate the end of a string.

    One of the nice things about UTF-8 is that the ASCII bytes from 0 to 7F hex (including the C0 control characters from \x00 through \x01f---including NULL) represent the ASCII characters from 0 to 7F hex. That is, amoung other things UTF-8 was designed specifically to be compatible with C language strings.

    Addison

    Addison P. Phillips
    Director, Globalization Architecture
    http://www.webMethods.com

    Chair, W3C Internationalization Working Group
    http://www.w3.org/International

    Internationalization is an architecture.
    It is not a feature.

    > -----Original Message-----
    > From: unicode-bounce@unicode.org
    > [mailto:unicode-bounce@unicode.org]On Behalf Of Harshal Trivedi
    > Sent: 2004年11月23日 3:42
    > To: unicode@unicode.org
    > Subject: My Querry
    >
    >
    > How can i make sure that UTF-8 format string has terminated while
    > encoding it, as compared to C program string which ends with '\0'
    > (NULL) character?
    >
    > -> Is there any special symbol or procedure to determine end of UTF-8
    > string OR just ASCII NULL '\0' is used as it is to indicate that.
    >
    > --
    > Harshal P. Trivedi
    > Software Engineer



    This archive was generated by hypermail 2.1.5 : Tue Nov 23 2004 - 11:24:56 CST