Re: UTF8 and COntrol Characters

From: YTang0648@aol.com
Date: Wed Nov 05 2003 - 13:43:10 EST

  • Next message: Language Analysis Systems, Inc. Unicode list reader: "RE: ZWJ/ZWNJ in combining mark sequences"

    In a message dated 11/4/2003 9:13:17 PM Pacific Standard Time,
    Abdij.Bhat@kshema.com writes:
    Hi,
    Thanks a lot in advance for having a look into my problem.
    We have an VC++ Application that interacts with a hardware. We recently
    Globalized the Application. The hardware however is ASCII only.
    However, there are a couple of strings that are sent to the hardware. We
    have used UTF8 to convert all strings to be stored in the hardware. All
    these strings and other data are passed to the hardware as a TCL file. The
    hardware parses this file for all the information. The parser uses Control
    Characters and escape sequences information as delimiter to data (name pair
    values).
    What code GENERATE such control characters? Are they generated and insert
    into the data before "convert to UTF-8", or you are inserting those control code
    INTO the UTF-8 string?
    What is the value of the control code? C0 [0x00-0x1F] or C1 [0x80-0x9F] or
    some other thing?

    The query we have is:
    If a UNICODE strings is converted to UTF8, will the UTF8 encoded string
    contain and control character or escape sequences?
    It depend what VALUE do you use in the control code sequence. and where do
    you insert into the data, insert into the UNICODE or insert into the UTF8
    If so, is it possible to
    eliminate the same?
    We use the MultiByteToWideChar() and WideCharToMultiByte() to achieve the
    encoding and decoding.
    This is a MS specific implementation issue. once you find out the valid
    control code point range, it should be easy for you to write small program to test
    it.

    Thanks and Regards,
    Abdij Bhat
    Kshema Technologies
    mailto:abdij.bhat@kshema.com
    www.kshema.com
    Phone:+91 80 860 3600 (Extension 2102)
    Fax: +91 80 860 3372

    ==================================
    Frank Yung-Fong Tang
    System Architect, Itrntinl Dvlpmet, AOL Intrtv Srvies
    AIM:yungfongta mailto:ytang0648@aol.com Tel:650-937-2913
    Yahoo! Msg: frankyungfongtan

    John 3:16 "For God so loved the world that he gave his one and only Son, that
    whoever believes in him shall not perish but have eternal life.

    Does your software display Thai language text correctly for Thailand users?
    -> Basic Conceptof Thai Language linked from Frank Tang's
    Itrntinliztin Secrets
    Want to translate your English text to something Thailand users can
    understand ?
    -> Try English-to-Thai machine translation at
    http://c3po.links.nectec.or.th/parsit/



    This archive was generated by hypermail 2.1.5 : Wed Nov 05 2003 - 14:41:20 EST