Re: New extension for transformed languages

From: Philippe Verdy <verdy_p_at_wanadoo.fr>
Date: Sun, 4 Mar 2012 18:33:44 +0100

If the "t" singleton subtag is used to indicate a transliteration into the
letters used by another language, it is not clear where the tag should
indicate the script from which those letters are taken.

Since the "t" singleton (whose value is another language code) is an
extension, it should appear at the end of the tag, so that "ru-t-it"
actually means "ru-Latn-t-it" (if the Latin script is implied for Italian).
That form is preferable to "ru-t-it-Latn", which is not ordered correctly.
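
To make the ordering point concrete, here is a minimal Python sketch. It is
not a full BCP 47 parser: it assumes a well-formed tag with no private-use
or grandfathered parts, and the function names split_t_extension and
explicit_script are made up for illustration. It only shows which part of
the tag a script subtag ends up in, depending on where it is written.

```python
def split_t_extension(tag):
    """Split a tag into (main subtags, subtags of the 't' extension).

    Simplified sketch: assumes a well-formed tag without private-use
    ('x') or grandfathered parts.
    """
    subtags = tag.split("-")
    main = [subtags[0]]
    i = 1
    # Everything before the first one-character subtag (a singleton)
    # belongs to the main language tag.
    while i < len(subtags) and len(subtags[i]) != 1:
        main.append(subtags[i])
        i += 1
    t_ext = []
    current = None
    # After that point, each singleton opens a new extension.
    while i < len(subtags):
        s = subtags[i]
        if len(s) == 1:
            current = s.lower()
        elif current == "t":
            t_ext.append(s)
        i += 1
    return main, t_ext


def explicit_script(main):
    """Return the script subtag (four letters) of the main part, if any."""
    for s in main[1:]:
        if len(s) == 4 and s.isalpha():
            return s.title()
    return None


main, t_ext = split_t_extension("ru-Latn-t-it")
print(main, t_ext, explicit_script(main))   # ['ru', 'Latn'] ['it'] Latn

main, t_ext = split_t_extension("ru-t-it-Latn")
print(main, t_ext, explicit_script(main))   # ['ru'] ['it', 'Latn'] None
```

In the second call the "Latn" subtag is consumed by the extension, so the
main part of the tag carries no explicit script at all.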

One problem is that there's no mechanism to imply the script from the
target language of the transliteration. The implicit script is only derived
from the first language code, not from an extension.

This means that "ru-t-it" can't be made formally equivalent
to "ru-Latn-t-it", because the implied script for Russian is still Cyrillic!

So "ru-t-it" still actually means the same as "ru-Cyrl-t-it", i.e. Russian
transliterated into the Cyrillic letters of Italian (sic!).
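
A rough Python sketch of that rule, under stated assumptions: the
SUPPRESS_SCRIPT table is a two-entry excerpt of the Suppress-Script fields
in the IANA Language Subtag Registry (Cyrl for "ru", Latn for "it"), the
function name effective_script is hypothetical, and the parsing is
deliberately simplified. The point is only that the lookup never reads
anything after the first singleton.

```python
# Two-entry excerpt of Suppress-Script values from the IANA registry;
# a real implementation would load the whole registry.
SUPPRESS_SCRIPT = {"ru": "Cyrl", "it": "Latn"}


def effective_script(tag):
    """Script of the tagged content: explicit if present, else implied.

    Only the main part of the tag is inspected; the scan stops at the
    first singleton, so extension subtags never influence the result.
    """
    subtags = tag.split("-")
    language = subtags[0].lower()
    for s in subtags[1:]:
        if len(s) == 1:
            break  # extensions (such as "t") start here
        if len(s) == 4 and s.isalpha():
            return s.title()  # explicit script subtag
    return SUPPRESS_SCRIPT.get(language)


print(effective_script("ru-t-it"))        # Cyrl
print(effective_script("ru-Latn-t-it"))   # Latn
print(effective_script("ru-t-it-Latn"))   # Cyrl
```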

So you'll need to use "ru-Latn-t-it" instead, which is compatible (through
the "inheritance" mechanism) with "ru-Latn", a tag that indicates a
transliteration of Russian into the Latin script (with an unspecified
alphabet, the default following the Russian standard, even if that
requires extended Latin letters not used in Italian).
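
Assuming "inheritance" here means fallback by truncation, in the style of
the RFC 4647 lookup algorithm, the chain can be sketched in Python as
below. The function name fallback_chain is made up for illustration, and
the sketch ignores private-use and grandfathered tags.

```python
def fallback_chain(tag):
    """List the tags tried when falling back by progressive truncation.

    Each step removes the last subtag; a singleton left dangling at the
    end (such as "t") is removed as well, since it cannot end a tag.
    """
    subtags = tag.split("-")
    chain = []
    while subtags:
        chain.append("-".join(subtags))
        subtags.pop()
        while subtags and len(subtags[-1]) == 1:
            subtags.pop()
    return chain


print(fallback_chain("ru-Latn-t-it"))   # ['ru-Latn-t-it', 'ru-Latn', 'ru']
print(fallback_chain("ru-t-it"))        # ['ru-t-it', 'ru']
```

With the script written in the main part, the first fallback is "ru-Latn";
without it, the tag falls straight back to "ru", whose implied script is
Cyrillic.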

On 24 February 2012 17:45, Mark Davis ☕ <mark_at_macchiato.com> wrote:

> Just got a new spec published that for the first time gives a standard way
> to specify transliterations between languages. So "Italian transliterated
> into Russian letters" can be requested or tagged with the code "ru-t-it".
> It is now in the standard for identifying languages on the internet (
> tools.ietf.org/html/bcp47).
>
> [FYI, the page below is an example of how transliteration is used at
> Google.]
>
> Greetings from Santa Kurara, Kariforunia - Google Open Source Blog<http://google-opensource.blogspot.com/2010/10/greetings-from-santa-kurara-kariforunia.html>
>
>
> The spec is at tools.ietf.org/html/rfc6497, and supported in the latest
> version of the Unicode Locale Data specification (LDML,
> unicode.org/reports/tr35/).
>
> Thanks to +Yoshito Umaoka <https://plus.google.com/109545292842309917629>
> , +Addison Phillips <https://plus.google.com/113702471488101841815> ,
> Courtney Falk, Doug Ewell, +Pete Resnick<https://plus.google.com/113725870361689375954>,
> and others for helping to put this together!
>
> [Original post:
> https://plus.google.com/u/0/114199149796022210033/posts/6SiRyvHN33J]
>
> ------------------------------
> Mark <https://plus.google.com/114199149796022210033>
> *— Il meglio è l’inimico del bene —*
>
>
Received on Sun Mar 04 2012 - 11:44:25 CST
