From: Mark Davis (firstname.lastname@example.org)
Date: Sun May 24 2009 - 12:40:34 CDT
I don't believe the Ethnologue does so. If it did, it would disagree with
ISO and IETF BCP 47, in which en means any English; en-US, en-UK, ...
Similarly, fr means any French.
Often one has to make a choice; for example, if I ask for an 'en' web page,
I need to get either en-US or en-UK, or en-CA, or en-AU, etc. If you know
other information about the user, you may be able to pick the best one. In
the absence of such information, the typical choice is to go with the
variant with the most users: en-US for English, fr-FR for French, etc.
Unicode CLDR provides some information on this that people may use if they
However, at a more fundamental level, the IDs used in the transforms are not
BCP 47 ids. So "en-ipa" is not a BCP 47 language tag. For more, see
(And, I want to emphasize again, the data is draft; fixes are welcome.)
On Sun, May 24, 2009 at 00:25, Julian Bradfield
> On 2009-05-23, Mark Davis <email@example.com> wrote:
> > FYI, there's a draft ipa transform at
> > ɪts frɑm səm opən-sors dætə, so hæz səm glɪtʃɛz (ænd no strɛs).
> It says: "en-ipa". The ISO standards don't define the languages, but
> Ethnologue (which is what most people rely on, I think) defines "en" as
> the language spoken in the UK. No _English_ dialect pronounces fox as
> /fɑks/! (Or "data" as [dæta].)
> At least put en_US-ipa, but even then that's unfair to a number of
> American dialects.
> The University of Edinburgh is a charitable body, registered in
> Scotland, with registration number SC005336.
This archive was generated by hypermail 2.1.5 : Sun May 24 2009 - 12:43:00 CDT