On Mon, 22 Apr 2002, Doug Ewell wrote:
> Zsigri Gyula <email@example.com> wrote:
> > How many printable characters are there in Unicode 3.2.0? I tried
> > desperately to find the answer at the Unicode web site but could
> > not.
> There are 95,156 total assigned characters.
> To find the number of "printable" characters, you must first determine
> what you mean by "printable" and then subtract the number of characters
> that fall outside that definition. This is where it might get tricky.
> Control characters, formatting characters,
> and such are obviously not "printable," but what about things like
> spaces? (Unicode 3.1 had about 20 of them.)
> You might try subtracting those characters in
> http://www.unicode.org/Public/UNIDATA/UnicodeData.txt that have specific
> General Category values, such as "Cc" (control). Again, though, which
> categories are to be excluded is up to you.
> -Doug Ewell
> Fullerton, California
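
As a rough illustration of the subtraction Doug describes, here is a minimal Python sketch that tallies General Category values from lines in the UnicodeData.txt format and then excludes a chosen set. The function names, the default exclusion list, and the six-line SAMPLE are assumptions made for the example, not anything defined by the Unicode site; which categories to exclude remains the reader's decision. The sketch also assumes that ranges are written as a pair of entries whose names end in ", First>" and ", Last>".

```python
from collections import Counter

def count_by_category(lines):
    """Tally characters per General Category from UnicodeData.txt-style lines."""
    counts = Counter()
    range_start = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        fields = line.split(";")
        code = int(fields[0], 16)   # field 0: code point in hex
        name = fields[1]            # field 1: character name
        category = fields[2]        # field 2: General Category, e.g. "Cc"
        if name.endswith(", First>"):
            # Start of a range; the matching "Last" entry closes it.
            range_start = code
        elif name.endswith(", Last>") and range_start is not None:
            counts[category] += code - range_start + 1
            range_start = None
        else:
            counts[category] += 1
    return counts

def count_printable(counts, excluded=("Cc", "Cf", "Cs", "Co", "Zs", "Zl", "Zp")):
    """Sum every category not excluded. The default list is only one possible choice."""
    return sum(n for cat, n in counts.items() if cat not in excluded)

# Tiny hand-made sample in the UnicodeData.txt layout (not the real file).
SAMPLE = [
    "0000;<control>;Cc;0;BN;;;;;N;NULL;;;;",
    "0020;SPACE;Zs;0;WS;;;;;N;;;;;",
    "0041;LATIN CAPITAL LETTER A;Lu;0;L;;;;;N;;;;0061;",
    "0303;COMBINING TILDE;Mn;230;NSM;;;;;N;NON-SPACING TILDE;;;;",
    "4E00;<CJK Ideograph, First>;Lo;0;L;;;;;N;;;;;",
    "4E02;<CJK Ideograph, Last>;Lo;0;L;;;;;N;;;;;",
]

if __name__ == "__main__":
    counts = count_by_category(SAMPLE)
    print(dict(counts))
    print(count_printable(counts))
```

To try it on the real data, one would pass the lines of a downloaded copy of UnicodeData.txt to count_by_category() in place of SAMPLE. Keeping or dropping "Zs" in the exclusion list is exactly the "what about spaces?" question raised above.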
Tuesday, April 23, 2002
There are also various combining characters. For instance, a tilde could
be printed with any of the letters A-Z and a-z, among others. Arabic and
various South and Southeast Asian scripts have many combinations of
letters that look different from a simple linear string of those letters.
How many such combinations count as distinct printable forms depends on
the level of typographic quality one wants to achieve.
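
A small Python sketch of the tilde case, assuming the standard library's unicodedata module (which reflects whatever Unicode version ships with the interpreter, not necessarily 3.2.0). The helper name composed_forms is made up for the example. It shows that some base-plus-tilde sequences have a single precomposed code point while others exist only as a two-code-point sequence, so a count of assigned characters understates the number of distinct printed forms.

```python
import unicodedata

TILDE = "\u0303"  # COMBINING TILDE

def composed_forms(base_letters, mark):
    """Map each base letter to the NFC form of base + combining mark."""
    results = {}
    for base in base_letters:
        sequence = base + mark
        # NFC composes the pair into one code point when a precomposed
        # character exists; otherwise the two-code-point sequence remains.
        results[base] = unicodedata.normalize("NFC", sequence)
    return results

if __name__ == "__main__":
    print(unicodedata.category(TILDE))   # General Category of the mark
    print(unicodedata.combining(TILDE))  # canonical combining class
    for base, form in composed_forms("ANX", TILDE).items():
        print(base, [f"U+{ord(ch):04X}" for ch in form])
```

Here A and N with a tilde each compose to one code point, while X with a tilde stays as two, yet all three are equally printable by a renderer that supports combining marks.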
Jim Agenbroad ( jage@LOC.gov )
"It is not true that people stop pursuing their dreams because they
grow old, they grow old because they stop pursuing their dreams." Adapted
from a letter by Gabriel Garcia Marquez.
The above are purely personal opinions, not necessarily the official
views of any government or any agency of any.
Addresses: Office: Phone: 202 707-9612; Fax: 202 707-0955; US
mail: I.T.S. Sys.Dev.Gp.4, Library of Congress, 101 Independence Ave. SE,
Washington, D.C. 20540-9334 U.S.A.
Home: Phone: 301 946-7326; US mail: Box 291, Garrett Park, MD 20896.