Re: Unicode abuse

From: Erik van der Poel (erik@vanderpoel.org)
Date: Mon Mar 07 2005 - 15:13:20 CST

Next message: Peter Kirk: "Re: Languages using multiple scripts"

Previous message: Patrick Andries: "Re: Languages using multiple scripts"
In reply to: Mark Davis: "Re: Unicode abuse"
Next in thread: David Starner: "Re: Unicode abuse"
Reply: David Starner: "Re: Unicode abuse"

Mark Davis wrote:
> If that link is clicked on, some user agent will eventually
> have to show http://badplace.com, and at that point it should be normalized
> in appearance. So double-struck C is not really a problem -- and it wouldn't
> be anyway, unless it looked like something *other* than a C.

Hi Mark,

I realize that Stringprep and Nameprep adopted the whole Unicode 3.2
character set partly because the General Category values in the Unicode
Character Database did not appear to be stable at the time. I also
realize that Normalization Form KC was adopted as is, partly because
applications may already implement it and a separate table for Nameprep
would take up more memory. But now that we are discussing limiting IDNs
to Letters, Digits and Hyphen (LDH), which would require a separate
table anyway, it seems appropriate to question the original inclusion of
unnecessary characters like double-struck C, and to discuss ways to
exclude them. No?
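
To make the point concrete, here is a rough sketch in Python of how
double-struck C (U+2102) behaves. It is only an illustration under some
assumptions: it uses the Unicode data bundled with the Python interpreter
rather than the Unicode 3.2 tables that Nameprep references, and the
helper names (nfkc, is_letter_digit_or_hyphen, survives_nfkc,
allowed_in_sketch) are hypothetical, not part of any IDN specification.
The General Category test is also deliberately crude (it ignores
combining marks, for instance).

```python
import unicodedata

DOUBLE_STRUCK_C = "\u2102"  # DOUBLE-STRUCK CAPITAL C


def nfkc(text: str) -> str:
    """Apply Normalization Form KC using the interpreter's Unicode data."""
    return unicodedata.normalize("NFKC", text)


def is_letter_digit_or_hyphen(ch: str) -> bool:
    """Hypothetical category test: any Letter (L*), decimal digit (Nd), or '-'."""
    category = unicodedata.category(ch)
    return ch == "-" or category.startswith("L") or category == "Nd"


def survives_nfkc(ch: str) -> bool:
    """True if NFKC leaves the character unchanged."""
    return nfkc(ch) == ch


def allowed_in_sketch(ch: str) -> bool:
    """Hypothetical stricter rule: right category AND already NFKC-stable."""
    return is_letter_digit_or_hyphen(ch) and survives_nfkc(ch)


if __name__ == "__main__":
    # NFKC folds the double-struck C into a plain capital C.
    print(repr(nfkc(DOUBLE_STRUCK_C)))
    # Its General Category is still an uppercase letter.
    print(unicodedata.category(DOUBLE_STRUCK_C))
    # A category-only filter admits it; adding the stability check excludes it.
    print(is_letter_digit_or_hyphen(DOUBLE_STRUCK_C))
    print(allowed_in_sketch(DOUBLE_STRUCK_C))
```

In other words, a table built only from General Category values would
still let double-struck C in, since it counts as a Letter. Excluding it
would need some additional criterion, such as rejecting characters that
NFKC maps to something else.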

Erik


This archive was generated by hypermail 2.1.5 : Mon Mar 07 2005 - 15:15:53 CST