Re: CLDR: Bad exemplar chars for some locales

From: Asmus Freytag (
Date: Wed Apr 05 2006 - 18:40:09 CST

  • Next message: Kenneth Whistler: "Re: The Phaistos Disc"

    On 4/5/2006 5:03 PM, Peter Edberg wrote:
    > We are planning to use CLDR exemplar character set data for various
    > purposes, and so I have been looking into some current exemplar sets
    > (as indicated by the survey tool at
    > <>). The exemplar sets are
    > "supposed to represent the set of characters needed to write the form
    > of the language that is currently in use" (per Deborah Goldsmith). By
    > this criterion, several of them include characters that seem to me to
    > be inappropriate. I wanted to get some feedback on these, and then try
    > to get them changed ASAP (via bugs etc.). In all cases below I am
    > referring to the standard exemplar set, not the auxiliary characters.
    > 1. Arabic (ar) & Persian (fa):
    > - Both of these include 200C and 200D (ZWNJ and ZWJ). I would argue
    > that these characters are not required in order to write Arabic or
    > Persian.

    I believe that extensive discussion on the bidi list has established
    that ZWNJ is indeed needed for Persian.


    This archive was generated by hypermail 2.1.5 : Wed Apr 05 2006 - 18:45:47 CST