From: Ernest van den Boogaard ([email protected])
Date: Tue Oct 12 2010 - 03:14:43 CDT
FW to Unicode ml
From: [email protected]
To: [email protected]
Subject: RE: statistics
Date: Tue, 12 Oct 2010 10:13:17 +0200
In 5.2, Chapter 2.4 table 2-3 is listed which General Categories are "characters". Out are: Surrogates, Private Use, Non-characters and Reserved code points. Note that Format characters (Cf) are included as characters. The code points with formatting aspects in C0 and C1 are Controls ("Cc"), so excluded.
Total number of characters in 6.0 is 109,242+142=109,384.
Regards,
Ernest van den Boogaard
> From: [email protected]
> To: [email protected]
> CC: [email protected]
> Subject: Re: statistics
> Date: Tue, 12 Oct 2010 09:14:21 +0200
>
> On Mon, 11 Oct 2010 Asmus Freytag <[email protected]> wrote:
>
> > On 10/11/2010 9:49 PM, Janusz S. "Bień" wrote:
> >> On Mon, 11 Oct 2010 [email protected] wrote:
> >>
> >>> The newly finalized Unicode Version 6.0 adds 2,088 characters,
> >> What is the current total? Are other statistic informations available
> >> somewhere?
> > The announcement gives a link to click through.
> >
> > There you will find more statistics.
>
> I guess you mean "Character Assignment Overview" at
>
> http://www.unicode.org/versions/Unicode6.0.0/
>
> However it does not provide the precise answer to my primary question,
> which is not purely arithmetic but depends on the definition of the
> character. In particular, do noncharacters belong to characters?
>
> Regards
>
> JSB
>
> --
> ,
> dr hab. Janusz S. Bien, prof. UW - Uniwersytet Warszawski (Katedra Lingwistyki Formalnej)
> Prof. Janusz S. Bien - Warsaw University (Department of Formal Linguistics)
> [email protected], [email protected], http://fleksem.klf.uw.edu.pl/~jsbien/
>
>
This archive was generated by hypermail 2.1.5 : Tue Oct 12 2010 - 03:18:06 CDT