From: Martin v. Löwis (martin@v.loewis.de)
Date: Wed Aug 15 2007 - 06:05:44 CDT
> I glean this as the algorithm:
>
> Add middle dot to ID_CONTINUE
>
> If an ID_START or ID_CONTINUE character has a decomposition containing a
> character other than middle dot that's not in ID_CONTINUE, then remove
> that character from ID_START or ID_CONTINUE.
>
> If an ID_START has a decomposition that begins with a character that's
> not an ID_START, remove it from ID_START.
Thanks, this is exactly what I was looking for - at least for Unicode
4.1, this algorithm produces an outcome equal to the published tables.
Could that be added to UAX#31?
Regards,
Martin
This archive was generated by hypermail 2.1.5 : Wed Aug 15 2007 - 06:07:42 CDT