L2/05-051

Public Review Issue #63: POSIX Data for CLDR

There is a new tool that creates POSIX locale data files from CLDR. It has been used to generate draft POSIX locale data files for public review. We encourage review of this data; any feedback can be filed at http://unicode.org/cldr/filing_bug_reports.html. (Note: the CLDR 1.3 freeze data has been extended to allow for feedback on this and other locale data.)

The draft files are available in http://unicode.org/cldr/data/common/posix/. Because POSIX locale data files are specific to charset, there are two kinds of files:

  1. generated with the UTF-8 charset, such as http://unicode.org/cldr/data/common/posix/hi_IN.UTF-8.src
  2. generated with other charsets, such as http://unicode.org/cldr/data/common/posix/de_DE.ISO8859-15.src

The main remaining issue at this point appears to be the repertoire of characters to be used for the UTF-8 locales. Currently the mechanism is to use the following heuristic:

Feedback on this and other issues is welcome.

Notes: