What you think of as gaps are actually assigned characters; moreover,
even unassigned characters cannot be used for your purposes, except for
the Private Use Characters (U+E000 - U+F8FF). You also ask about
delimiters in the standard; you would have to explain what you mean by
that (i.e., what are you delimiting? Words? Paragraphs? Database
For an update on the state of the upcoming version of the standard, look
It also has new html documentation of the database files that I think
would clarify the structure of the database files, at
"Magda Danish (Unicode)" wrote:
> -----Original Message-----
> From: Wong Ee Kian [mailto:email@example.com]
> Sent: Thursday, July 29, 1999 6:00 AM
> To: firstname.lastname@example.org
> Subject: Characters suitable for delimiters
> Hi there!
> I was running thru UnicodeData-Latest.txt, looking for characters
> suitable for use as delimiters. Noticed huge gaps between 4E00 and
> 9FA5, AC00 and D7A3, for example.
> 1. Would characters in these gaps work as delimiters? Or does the
> Unicode consortium have other plans for these unused ranges?
> 2. Are there already designated delimiters in the Unicode standard?
> Many thanks!
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:50 EDT