Re: utf-8 and databases

From: Markus Scherer (markus.scherer@jtcsv.com)
Date: Tue Jul 09 2002 - 16:58:08 EDT


Databases use table definitions that usually define what encoding is used in which parts of the database.
Encodings can be set per database, per table, or per column, and the definition syntax seems to vary widely among vendors and products.

Generally, "UTF-8" or "1208" or "unicode" or similar is available as an encoding name for char types (byte-oriented).
"UTF-16" or "1200" or "unicode" similar is available as an encoding name for graphic/DBCS types.

You can check your database definition or submit a query to the database definition catalog to find out what encoding is set.

Encoding specifications are necessarily observed in text matching, substringing, character searching, conversions between client and server, lower-/uppercasing, etc.

markus



This archive was generated by hypermail 2.1.2 : Tue Jul 09 2002 - 15:14:31 EDT