Re: ID_Start, ID_Continue, and stability extensions

From: Karl Williamson <>
Date: Fri, 25 Apr 2014 12:53:27 -0600

On 04/24/2014 01:56 PM, Steffen Nurpmeso wrote:
> Markus Scherer <> wrote:
> |I strongly recommend you parse the derived properties rather than trying to
> |follow the derivation formula, because that can change over time.
> ..this file includes only those core properties that have
> themselves a derivation-may-change property?
> (I long hesitated to write this though.)
> --steffen
> _______________________________________________

Somewhere it says that the derived property files are subservient to the
other files. And in fact in some Unicode releases, they contained
errors. I therefor changed my parser to populate my internal db first
with the derived files, and then to populate using the non-derived
files. Any conflicts were thus automatically resolved in favor of the
non-derived. But if the derived files contained things not in the
non-derived ones, they would be used.

I think that Unicode is doing a better job of making their files
consistent and accurate these days, but I haven't had to worry since I
made that change. (I no longer remember any details of what the
problems were.)

If I were starting from scratch, I would try the xml version first.

Unicode mailing list
Received on Mon Apr 28 2014 - 23:46:14 CDT

This archive was generated by hypermail 2.2.0 : Mon Apr 28 2014 - 23:46:15 CDT