Re: Unicode to UTF-8

From: John Cowan (jcowan@reutershealth.com)
Date: Wed Mar 15 2000 - 16:48:34 EST


John O'Conner wrote:
>
> So, to represent a surrogate pair requires 12 bytes?

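No, four bytes (eight nybbles). A surrogate pair is first combined into the
single scalar value it stands for, and that one value is then encoded. By
"all others" I mean "all other Unicode scalar values", namely U+10000 to
U+10FFFF.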

> > All others: 1111 0--- 10-- ---- 10-- ---- 10-- ----
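As a rough illustration of the four-byte pattern quoted above, here is a minimal Python sketch. It assumes the input is a well-formed UTF-16 surrogate pair. The function name utf8_from_surrogates is made up for this example and is not taken from any library.

```python
def utf8_from_surrogates(high: int, low: int) -> bytes:
    """Encode one UTF-16 surrogate pair as a single 4-byte UTF-8 sequence.

    Hypothetical helper for illustration only.
    """
    if not (0xD800 <= high <= 0xDBFF):
        raise ValueError("high surrogate must be in D800..DBFF")
    if not (0xDC00 <= low <= 0xDFFF):
        raise ValueError("low surrogate must be in DC00..DFFF")

    # Combine the pair into one scalar value in the range 10000..10FFFF.
    scalar = 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)

    # Spread the 21 bits over the pattern
    # 1111 0--- 10-- ---- 10-- ---- 10-- ----  (3 + 6 + 6 + 6 bits).
    return bytes([
        0xF0 | (scalar >> 18),
        0x80 | ((scalar >> 12) & 0x3F),
        0x80 | ((scalar >> 6) & 0x3F),
        0x80 | (scalar & 0x3F),
    ])


if __name__ == "__main__":
    print(utf8_from_surrogates(0xD800, 0xDC00).hex())  # f0908080 (U+10000)
    print(utf8_from_surrogates(0xDBFF, 0xDFFF).hex())  # f48fbfbf (U+10FFFF)
```

The result is four bytes for the whole pair, not a separate multi-byte sequence for each surrogate. In practice Python's built-in str.encode("utf-8") does the same job; the manual bit shuffling is spelled out here only to show where each bit lands.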

-- 

Schlingt dreifach einen Kreis vom dies! || John Cowan <jcowan@reutershealth.com>
Schliesst euer Aug vor heiliger Schau,  || http://www.reutershealth.com
Denn er genoss vom Honig-Tau,           || http://www.ccil.org/~cowan
Und trank die Milch vom Paradies.            -- Coleridge (tr. Politzer)


