RE: Request for information - Subset of Unicode character set

From: Carl W. Brown
Date: Sun Mar 18 2001 - 17:15:27 EST


Since you appear to be targeting the Windows platform, have you considered
using MLang font linking? Font linking effectively concatenates fonts,
so that if a character does not appear in one font, it will look to the next
font. If your characters do not appear in any standard font, you can
download a special font that has only these specific characters. You can
use MLang either from an application or through IE.
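
The fallback idea behind font linking can be sketched in a few lines of
Python. This is only a conceptual illustration and does not call MLang;
the font names and coverage sets are made up for the example.

```python
from typing import List, Optional, Set, Tuple

# Ordered fallback chain: (font name, set of covered code points).
# Both entries are hypothetical and exist only for illustration.
FONT_CHAIN: List[Tuple[str, Set[int]]] = [
    ("BaseFont", set(range(0x0020, 0x007F))),    # Basic Latin only
    ("StmSymbols", set(range(0x2200, 0x2300))),  # Mathematical Operators
]


def pick_font(ch: str, chain: List[Tuple[str, Set[int]]] = FONT_CHAIN) -> Optional[str]:
    """Return the first font in the chain that covers ch, or None."""
    cp = ord(ch)
    for name, coverage in chain:
        if cp in coverage:
            return name
    return None
```

A character found in no font of the chain returns None, which is the case
where a small supplementary font (or an image) would be needed.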

Carl W. Brown
X.Net, Inc.
Globalization Consulting

-----Original Message-----
From: Magda Danish (Unicode)
Sent: Friday, March 16, 2001 11:05 AM
To: Unicode List
Subject: FW: Request for information - Subset of Unicode character set

-----Original Message-----
From: Bob Hecht
Sent: Tuesday, March 13, 2001 3:54 PM
To: ''
Cc: David Dekker; Joanne Jay; Marc Schneider
Subject: Request for information

Dekker, a publisher of scholarly works in the scientific, technical &
medical fields, is about to launch a new web site which will provide
access to all of our current journal content, and we will be adding
support for encyclopedia and book content over time. The content on our
site will be delivered either as full text or via PDF for print output.
The full text will be dynamically pulled from an XML database and
rendered on screen via XSL style sheets. We are hoping that you can
assist us in solving the problem(s) with the full text rendering part of
the application.

Our long term solution is to provide to our users an "STM subset" of the
Unicode character set. We have searched everywhere for a character set
that resolves the 400-500 characters that we need to properly build both
characters (via character entity references) and display math (via Math
The Microsoft Arial Unicode font would be perfect, except that it is a
13 MB download and takes about 23 MB to install, which makes it somewhat
of a problem to provide via the internet. We are hoping we can come up
with (or create) a character set that encompasses the following code pages:

* Greek Extended 1F00 - 1FFF
* Superscripts and Subscripts 2070 - 209F
* Number Forms 2150 - 218F
* Arrows 2190 - 21FF
* Mathematical Operators 2200 - 22FF
* Miscellaneous Technical 2300 - 23FF

and likely 2 or 3 other sets that we have to evaluate. While some of
our staff think that we should find a way to incorporate Klingon somewhere,
management has discouraged this endeavor. Perhaps there are some
opportunities for a part of the code set that we could investigate.
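
The six ranges listed above can be written down as a small table to test
whether a given character falls inside the proposed subset. This is a
minimal Python sketch; the names STM_BLOCKS, block_of and
total_code_points are chosen for illustration only. Note that the total
counts code point positions in the ranges, not assigned characters.

```python
from typing import Optional

# Ranges as listed in the message (inclusive, hexadecimal).
STM_BLOCKS = {
    "Greek Extended": (0x1F00, 0x1FFF),
    "Superscripts and Subscripts": (0x2070, 0x209F),
    "Number Forms": (0x2150, 0x218F),
    "Arrows": (0x2190, 0x21FF),
    "Mathematical Operators": (0x2200, 0x22FF),
    "Miscellaneous Technical": (0x2300, 0x23FF),
}


def block_of(ch: str) -> Optional[str]:
    """Return the name of the listed range containing ch, or None."""
    cp = ord(ch)
    for name, (lo, hi) in STM_BLOCKS.items():
        if lo <= cp <= hi:
            return name
    return None


def total_code_points() -> int:
    """Count code point positions covered by the listed ranges."""
    return sum(hi - lo + 1 for lo, hi in STM_BLOCKS.values())
```

The six ranges together span 992 code point positions, so a font limited
to them would be far smaller than a full Unicode font.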

As a short term fall-back position, we are planning on mapping special
characters that appear inline (within a paragraph) for which there is no
available TrueType font to a graphic image. We recognize that
has some number of these already created. Is there some way of
this set of GIFs for use in a commercial endeavor?
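
The short term fall-back described above, replacing inline characters
that have no available font with a graphic, might look roughly like this
in Python. It is a sketch under assumptions: the has_glyph predicate, the
"glyphs" directory, and the U<hex>.gif file naming are all hypothetical.

```python
from html import escape
from typing import Callable


def render_inline(text: str,
                  has_glyph: Callable[[str], bool],
                  image_dir: str = "glyphs") -> str:
    """Emit HTML, swapping characters without font support for <img> tags."""
    out = []
    for ch in text:
        if has_glyph(ch):
            # Character can be displayed as text; escape HTML specials.
            out.append(escape(ch))
        else:
            # Hypothetical naming scheme: one GIF per code point.
            cp = ord(ch)
            out.append(f'<img src="{image_dir}/U{cp:04X}.gif" alt="U+{cp:04X}">')
    return "".join(out)
```

For example, with a predicate that accepts only ASCII, the arrow in
"a→b" would be replaced by an image reference while "a" and "b" stay as
text.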

I hope you can provide us with some insight into how we may proceed in
our short and long term endeavors, either through The Unicode Consortium
or with another organization with similar goals. As publishers, we
recognize the pressing need in the internet community to find a common
solution to issues associated with language and character resolution on
the web, and look forward to working with your organization in reaching
that objective.

Most sincerely,

Bob Hecht
Director - Electronic Publishing
Marcel Dekker, Inc.
270 Madison Ave.
New York, NY 10016
(212) 696-9000 x-384

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:20 EDT