Re: Can NFKC turn valid UAX 31 identifiers into non-identifiers?

From: Hans Åberg via Unicode <unicode_at_unicode.org>
Date: Fri, 8 Jun 2018 14:39:09 +0200

> On 8 Jun 2018, at 11:07, Henri Sivonen via Unicode <unicode_at_unicode.org> wrote:
>
> My question is:
>
> When designing a syntax where tokens with the user-chosen characters
> can't occur next to each other without some syntax-reserved characters
> between them, what advantages are there from limiting the user-chosen
> characters according to UAX #31 as opposed to treating any character
> that is not a syntax-reserved character as a character that can occur
> in user-named tokens?

It seems best to stick to the canonical forms and add the sequences one deems useful and safe, as treating inequivalent characters as equal is likely to be confusing. But this requires more work; it seems that the use of the compatibility forms is aimed at something simple to implement.
Received on Fri Jun 08 2018 - 07:39:27 CDT

This archive was generated by hypermail 2.2.0 : Fri Jun 08 2018 - 07:39:27 CDT