Re: "Missing character" glyph

From: Peter_Constable@sil.org
Date: Thu Aug 01 2002 - 14:24:55 EDT


On 08/01/2002 10:42:58 AM "Doug Ewell" wrote:

>I think this is exactly what they have done by creating the
>"noncharacters" from U+FDD0 through U+FDEF. These code points are
>guaranteed never to be assigned to real characters.

But that doesn't you can use them for content, as Martin seems to want.
They are non-characters so that programmers can have codes available for
proprietary internal-only use. Using one of these in data can have
unpredictable results.

>> Otto Stolz suggested U+03A2, which would be equally valid. However,
>> U+03A2 is quite obviously the code for GREEK CAPITAL LETTER FINAL
>> SIGMA.

It may have been left blank at one time to maintain a certain pattern, but
that doesn't mean that's what U+03A2 means now or will always mean in the
future. Note, for instance, that U+2071 was left unassigned and "meant"
superscript 1, but now it has been assigned to a character, SUPERSCRIPT
LATIN SMALL LETTER I.

Don't assume anything when it comes to unassigned characters.

>My recommendation: Use the noncharacters. That's what they're there
>for.

I would strongly advise against that.

- Peter

---------------------------------------------------------------------------
Peter Constable

Non-Roman Script Initiative, SIL International
7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
Tel: +1 972 708 7485
E-mail: <peter_constable@sil.org>



This archive was generated by hypermail 2.1.2 : Thu Aug 01 2002 - 12:34:37 EDT