From: Mark Davis (mark.davis@jtcsv.com)
Date: Wed Mar 23 2005 - 16:29:11 CST
What I'd like to do is collect *specific* information for the following
categories on http://unicode.org/reports/tr36/draft/idn-chars.html: Which
characters are used as parts of words in some modern language (and the name
of that language).
Categories
- Atomic-no-uppercase
- Non-ID
We can then see which should be taken into consideration in different
proposals.
Mark
----- Original Message -----
From: "Peter Kirk" <peterkirk@qaya.org>
To: "Mark Davis" <mark.davis@jtcsv.com>
Cc: "Unicode Mailing List" <unicode@unicode.org>
Sent: Wednesday, March 23, 2005 12:27
Subject: Re: Security Issues
> On 23/03/2005 18:08, Mark Davis wrote:
>
> >...
> >
> >C. Characters with no uppercase in bicameral scripts may be suspect, and
> >disallowed or flagged. Which of these really need to be allowed?
(Example:
> >U+04C0 ( Ӏ ) PALOCHKA?)
> >
> >
> >
> Palochka is certainly needed in identifiers as it is an integral part of
> the alphabet of some languages, especially of Dagestan. Words missing
> their palochka are simply misspelled. But it might be possible to treat
> palochka as equivalent to U+0406 for IDN purposes.
>
> Similarly there are alphabets in the same region which use apostrophe
> (perhaps it should be MODIFIER LETTER APOSTROPHE) as part of their
> alphabets.
>
> Is it reasonable to disallow letters which are part of alphabets in
> regular use?
>
> --
> Peter Kirk
> peter@qaya.org (personal)
> peterkirk@qaya.org (work)
> http://www.qaya.org/
>
>
>
> --
> No virus found in this outgoing message.
> Checked by AVG Anti-Virus.
> Version: 7.0.308 / Virus Database: 266.8.0 - Release Date: 21/03/2005
>
>
>
This archive was generated by hypermail 2.1.5 : Wed Mar 23 2005 - 16:30:05 CST