Re: U+02D0 and U+01C3

From: <vanisaac_at_boil.afraid.org>
Date: Mon, 21 May 2012 16:51:55 -0700

From: Chigurupati, Nagesh <nchigurupati_at_verisign.com>
>
> Hello Unicode Folks,
>
> U+01C3 looks like an 'Exclamation Mark' and it is categorized as 'Letter Other'.
> U+02D0 looks like two inverted triangles and it is categorized as 'Letter
> Modifier'.
>
> These code points being categorized as they are would not prevent them in an
> IDN. However, similar looking code points which are categorized as symbols or
> punctuation marks (U+0021 and U+003A) would be disallowed in an IDN (IDNA 2008).
>
> My question is:
>
> Should the code points be categorized differently so that they are disallowed
> by IDNA2008 RFCs, or
> Should the individual end user applications prevent such kind of characters in
> an IDN.
>
> Thank you.
>
> Regards,
> Nagesh

These characters are used as letters in different orthographies; Ok, U+02D0 is only IPA, AFAIK, so it's an edge case. Nevertheless, the whole idea behind IDNs is that they allow domain names in languages other than English, so eliminating characters necessary for writing a particular language, simply based on their /looking/ like unallowed punctuation characters, would be counterproductive to the entire point of IDNs. Does it present challenges? Sure. But Unicode was developed to meet just those kinds of challenges.

-Van
Received on Mon May 21 2012 - 18:54:14 CDT

This archive was generated by hypermail 2.2.0 : Mon May 21 2012 - 18:54:15 CDT