On 08/07/2000 03:45:42 PM addison wrote:
>Actually, the way surrogates work is: one high surrogate followed by one
>low surrogate. The second value would never, ever, coincide with a valid
>character (in the same way that bytes in UTF-8 multibyte characters never
>collide with valid ASCII values).
A slight correction: The second value *should* never, ever be anything but
a low surrogate, but that doesn't mean it won't happen in data you're asked
to process.
- Peter
---------------------------------------------------------------------------
Peter Constable
Non-Roman Script Initiative, SIL International
7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
Tel: +1 972 708 7485
E-mail: <peter_constable@sil.org>
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:06 EDT