From: Mark Davis (email@example.com)
Date: Wed Mar 23 2005 - 16:29:11 CST
What I'd like to do is collect *specific* information for the following
categories on http://unicode.org/reports/tr36/draft/idn-chars.html: Which
characters are used as parts of words in some modern language (and the name
of that language).
We can then see which should be taken into consideration in different
----- Original Message -----
From: "Peter Kirk" <firstname.lastname@example.org>
To: "Mark Davis" <email@example.com>
Cc: "Unicode Mailing List" <firstname.lastname@example.org>
Sent: Wednesday, March 23, 2005 12:27
Subject: Re: Security Issues
> On 23/03/2005 18:08, Mark Davis wrote:
> >C. Characters with no uppercase in bicameral scripts may be suspect, and
> >disallowed or flagged. Which of these really need to be allowed?
> >U+04C0 ( Ӏ ) PALOCHKA?)
> Palochka is certainly needed in identifiers as it is an integral part of
> the alphabet of some languages, especially of Dagestan. Words missing
> their palochka are simply misspelled. But it might be possible to treat
> palochka as equivalent to U+0406 for IDN purposes.
> Similarly there are alphabets in the same region which use apostrophe
> (perhaps it should be MODIFIER LETTER APOSTROPHE) as part of their
> Is it reasonable to disallow letters which are part of alphabets in
> regular use?
> Peter Kirk
> email@example.com (personal)
> firstname.lastname@example.org (work)
> No virus found in this outgoing message.
> Checked by AVG Anti-Virus.
> Version: 7.0.308 / Virus Database: 266.8.0 - Release Date: 21/03/2005
This archive was generated by hypermail 2.1.5 : Wed Mar 23 2005 - 16:30:05 CST