Re: Character folding in text editors from Doug Ewell on 2016-02-20 (Unicode Mail List Archive)

From: Doug Ewell <doug_at_ewellic.org>
Date: Sat, 20 Feb 2016 14:43:15 -0700

Eli Zaretskii wrote:

> What about language-independent character-folding: where in the
> Unicode database is the data for that?

The OP kind of alluded to that: there is no such thing really as
language-independent character folding.

About the closest approximation you can get using Unicode data alone
(not CLDR) is to normalize to NFD, then ignore the combining diacritics.
But that still doesn't work for a character like ø, which doesn't
decompose to o + anything, and more importantly, it still won't meet
expectations because of the n/ñ and o/ö/ø language-dependency problems.

As Mark and Philippe said, the real solution is to use CLDR, because
that is where language-dependent information like this lives.

--
Doug Ewell | http://ewellic.org | Thornton, CO 🇺🇸

Received on Sat Feb 20 2016 - 15:44:29 CST

This archive was generated by hypermail 2.2.0 : Sat Feb 20 2016 - 15:44:29 CST