On 08/01/2002 10:42:58 AM "Doug Ewell" wrote:
>I think this is exactly what they have done by creating the
>"noncharacters" from U+FDD0 through U+FDEF. These code points are
>guaranteed never to be assigned to real characters.
But that doesn't you can use them for content, as Martin seems to want.
They are non-characters so that programmers can have codes available for
proprietary internal-only use. Using one of these in data can have
unpredictable results.
>> Otto Stolz suggested U+03A2, which would be equally valid. However,
>> U+03A2 is quite obviously the code for GREEK CAPITAL LETTER FINAL
>> SIGMA.
It may have been left blank at one time to maintain a certain pattern, but
that doesn't mean that's what U+03A2 means now or will always mean in the
future. Note, for instance, that U+2071 was left unassigned and "meant"
superscript 1, but now it has been assigned to a character, SUPERSCRIPT
LATIN SMALL LETTER I.
Don't assume anything when it comes to unassigned characters.
>My recommendation: Use the noncharacters. That's what they're there
>for.
I would strongly advise against that.
- Peter
---------------------------------------------------------------------------
Peter Constable
Non-Roman Script Initiative, SIL International
7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
Tel: +1 972 708 7485
E-mail: <peter_constable@sil.org>
This archive was generated by hypermail 2.1.2 : Thu Aug 01 2002 - 12:34:37 EDT