Re: "A Programmer's Introduction to Unicode"

From: Philippe Verdy <verdy_p_at_wanadoo.fr>
Date: Tue, 14 Mar 2017 16:35:48 +0100

Per definition yes, but UTC-4 is not Unicode.
As well (any UCS-4 code unit) & 0xFFE00000 == 0 (i.e. 21 bits) is not
Unicode, UTF-32 is Unicode (more restrictive than just 21 bits which would
allow 32 planes instead of just the 17 first ones).
I suppose he meant 21 bits, not 11 bits which covers only a small part of
the BMP.

2017-03-14 16:14 GMT+01:00 Doug Ewell <doug_at_ewellic.org>:

> Steffen Nurpmeso wrote:
>
> >> I didn’t say you never needed to work with code points. What I said
> >> is that there’s no advantage to UCS-4 as an encoding, and that
> >
> > Well, you do have eleven bits for flags per codepoint, for example.
>
> That's not UCS-4; that's a custom encoding.
>
> (any UCS-4 code unit) & 0xFFE00000 == 0
>
> --
> Doug Ewell | Thornton, CO, US | ewellic.org
>
>
Received on Tue Mar 14 2017 - 10:36:30 CDT

This archive was generated by hypermail 2.2.0 : Tue Mar 14 2017 - 10:36:30 CDT