Re: Emoji and Annotation data

From: Mark Davis ☕️ <mark_at_macchiato.com>
Date: Fri, 24 Jun 2016 18:04:40 +0200

You should never be scraping *any* Unicode HTML files. They are not made
for that, and there is no guarantee of stability.

The emoji files are built from data which is described in
http://www.unicode.org/reports/tr51/
(plus CLDR annotations and collation)

Mark

On Fri, Jun 24, 2016 at 7:21 AM, Takao Fujiwara <tfujiwar_at_redhat.com> wrote:

> Hi,
>
> I'm working on IBus - the input method framework for Linux.
> I parse http://unicode.org/emoji/charts/emoji-list.html and create a
> dictionary between the annotations and the Emoji characters.
> Since the file size is large and it's often updated, I'm thinking how to
> maintain the file.
>
> I copied the file as http://ibus.github.io/files/ibus/emoji-list.html for
> the build at the moment.
>
> I have questions:
> - if unicode.org provides the tarball of the stable html files or other
> data.
> - what is the license of the html files.
>
> Do you have any ideas?
>
> Thanks,
> Fujiwara
>
Received on Fri Jun 24 2016 - 11:05:19 CDT

This archive was generated by hypermail 2.2.0 : Fri Jun 24 2016 - 11:05:19 CDT