RE: AL32UTF8 Vs UTF8

From: Carl W. Brown (cbrown@xnetinc.com)
Date: Sun Aug 10 2003 - 13:46:34 EDT

  • Next message: Peter Kirk: "Re: Display of Isolated Nonspacing Marks (was Re: Questions on ZWNBS...)"

    Jay,

    Oracle's UTF-8 is not really a valid encoding. It encodes surrogates as if
    they were characters. The kept the old Unicode 2.x code that only supports
    BMP to provide sort key compatibility for clients who never upgraded to
    Unicode 3.0 support and are using 16 bit character encoding improperly.
    UTF8 sorts in the same way as the old 16 bit Unicode before surrogates. Do
    not use UTF8 because it is really not Unicode conformant with any Unicode
    standard. Instead use AL32UTF8.

    Carl

      -----Original Message-----
      From: unicode-bounce@unicode.org [mailto:unicode-bounce@unicode.org]On
    Behalf Of Jay Chandru
      Sent: Sunday, August 10, 2003 8:58 AM
      To: unicode@unicode.org
      Subject: AL32UTF8 Vs UTF8

      Greetings,

      We are using Oracle9i with application tier as 11i.

      I wanted to know the differences between AL32UTF8 and UTF8. My database
    (oracle) will be in AL32UTF8 format. Will the applications that require
    multibyte characters work as they are functionin in UTF8 format.

      Would be great if anybody can gimme a comparision on AL32UTF8 and UTF8

      Also pls list requirement of any 3rd party softwares for code page
    conversions in case of AL32UTF8

      Thanks in advance,
      -Jay

    ----------------------------------------------------------------------------

    --
      Do you Yahoo!?
      Yahoo! SiteBuilder - Free, easy-to-use web site design software
    


    This archive was generated by hypermail 2.1.5 : Sun Aug 10 2003 - 14:21:13 EDT