Re: Parsers for the UnicodeSet notation? from Roozbeh Pournader on 2014-07-23 (Unicode Mail List Archive)

From: Roozbeh Pournader <roozbeh_at_unicode.org>
Date: Wed, 23 Jul 2014 15:28:51 -0700

On Wed, Jul 23, 2014 at 3:23 PM, Eric Muller <emuller_at_adobe.com> wrote:

> I would like to work with the exemplarCharacters data in the CLDR. That
> uses the UnicodeSet notation. Is there somewhere a parser for that
> notation, that would return me just the list of characters in the set?

Note that it's a set of strings, not characters.

I suspect that the exemplarCharacters use a restricted form of the
> UnicodeSet notation (e.g. do not use property values). Is that correct, and
> if so, what's the subset?
>

I have an Apache-licensed parser in Python here:
https://code.google.com/p/noto/source/browse/nototools/generate_website_data.py#180

_______________________________________________
Unicode mailing list
Unicode_at_unicode.org
http://unicode.org/mailman/listinfo/unicode
Received on Wed Jul 23 2014 - 17:29:46 CDT

This archive was generated by hypermail 2.2.0 : Wed Jul 23 2014 - 17:29:46 CDT