The following illustrates some of the CLDR transliterations to/from the Latin script.
Note: These charts do not show the available script and language transliterations that are not to/from Latin. It also does not show other transforms that are available in CLDR, such as the casing transformations, full-width and half-width transformations, and specialized transformations such as IPA-XSampa. For the latest snapshot of the data files, see Transform XML Data. For more information, see below.
- A cell with a blue background indicates a case that doesn't roundtrip.
- A cell with a red background indicates a missing case.
- Hovering over each cell should show the character name, if enabled on your browser.
Known Data Issues
- The CLDR data currently does not contain many language-specific transliterations. So, for example, the Cyrillic transliteration is not very natural for English speakers.
- The unmarked script transliterations to Latin are generally designed to be reversible, thus some of the transliterations use extra accents to provide for a round-trip. (Implementations like ICU allows those to be easily stripped.). Variant transliterations (such as BGN) typically only go one direction.
- Less common characters may be missing; as may be some characters that don't appear in isolation.
- Some transliterations only work in context, which won't be visible. For example, an isolated 'a' transliterates to
- There are known bugs in some of the charts, such as Hangul.
Known Chart Issues
- Some browsers will not show combinations of accents correctly. See Display Problems?
- Because the context is not taken into account, significant combinations will not show in the charts. For example: For greek, Ψ shows as 'PH', when the transliteration rules will change it to 'Ph' in front of lowercase letters. Characters that are not normally used in isolation, such as ぁ, will show as an odd format (eg with extra punctuations marks or accents).
- Only the script-script charts are shown.
- The display in some of the charts needs to be improved, such as Greek, Indic, and Kana. In particular, the nonLatin-Latin display needs to be merged.