Re: Encodings for SQL Databases

From: Peter_Constable@sil.org
Date: Mon Aug 07 2000 - 17:38:41 EDT


On 08/07/2000 03:45:42 PM addison wrote:

>Actually, the way surrogates work is: one high surrogate followed by one
>low surrogate. The second value would never, ever, coincide with a valid
>character (in the same way that bytes in UTF-8 multibyte characters never
>collide with valid ASCII values).

A slight correction: The second value *should* never, ever be anything but
a low surrogate, but that doesn't mean it won't happen in data you're asked
to process.
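
To make that concrete, here is a minimal sketch of the kind of check a consumer of UTF-16 data might run before trusting that every high surrogate is followed by a low surrogate. The function name and the choice to report offsets (rather than raise or substitute U+FFFD) are illustrative assumptions, not anything from a particular database or library. The input is assumed to be a sequence of 16-bit code unit values.

```python
# Hypothetical helper: surrogate ranges are from the Unicode Standard,
# everything else (name, return shape) is an assumption for illustration.
HIGH_START, HIGH_END = 0xD800, 0xDBFF   # high (leading) surrogates
LOW_START, LOW_END = 0xDC00, 0xDFFF     # low (trailing) surrogates


def find_unpaired_surrogates(units):
    """Return the indices of code units that are not part of a valid pair."""
    bad = []
    i = 0
    n = len(units)
    while i < n:
        u = units[i]
        if HIGH_START <= u <= HIGH_END:
            # A high surrogate is only valid if a low surrogate follows it.
            if i + 1 < n and LOW_START <= units[i + 1] <= LOW_END:
                i += 2  # well-formed pair, skip both units
                continue
            bad.append(i)
        elif LOW_START <= u <= LOW_END:
            # A low surrogate with no preceding high surrogate.
            bad.append(i)
        i += 1
    return bad
```

For example, `find_unpaired_surrogates([0xD800, 0x0041])` reports index 0, because the high surrogate is followed by an ordinary character rather than a low surrogate. What to do with such data (reject it, replace it, pass it through) is a policy decision this sketch leaves open.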

- Peter

---------------------------------------------------------------------------
Peter Constable

Non-Roman Script Initiative, SIL International
7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
Tel: +1 972 708 7485
E-mail: <peter_constable@sil.org>


