Supported Scripts
The Unicode Standard encodes scripts rather than
languages per se. When writing systems for more than one language
share sets of graphical symbols that have historically related
derivations, the union of all of those graphical symbols is treated
as a single collection of characters for encoding and is identified
as a single script. Each script then serves as as an inventory of
graphical symbols, which are drawn upon for the writing systems of
particular languages. In many cases, a single script, such as the
Latin script, may be used to
write tens or even hundreds of languages. In other cases, only one language employs a
particular script—for example, Hangul, which is typically used only
to write the Korean language. The writing systems for some languages
may also use more than one script; for example, Japanese
traditionally makes use of the Han (Kanji), Hiragana, and Katakana
scripts, and modern Japanese usage commonly mixes in the Latin
script as well.
The scripts currently supported by Unicode 6.2.0 are listed below. Tool tips for
each entry specify the version of the Unicode Standard which first supported the script.
- Arabic
- Aramaic, Imperial
- Armenian
- Avestan
- Balinese
- Bamum
- Batak
- Bengali
- Bopomofo
- Brahmi
- Buginese
- Buhid
- Canadian Syllabics
- Carian
- Chakma
- Cham
- Cherokee
- Coptic
- Cypriot
- Cyrillic
- Deseret
- Devanagari
- Egyptian Hieroglyphs
- Ethiopic
- Georgian
- Glagolitic
- Gothic
- Greek
- Gujarati
- Gurmukhi
- Han
- Hangul
- Hanunóo
- Hebrew
|
- Hiragana
- Javanese
- Kaithi
- Kannada
- Katakana
- Kayah Li
- Kharoshthi
- Khmer
- Lao
- Latin
- Lepcha (Rong)
- Limbu
- Linear B
- Lisu
- Lycian
- Lydian
- Malayalam
- Mandaic
- Meetei Mayek
- Meroitic Cursive
- Meroitic Hieroglyphs
- Miao
- Mongolian
- Myanmar
- New Tai Lue
- N'Ko
- Ogham
- Ol Chiki
- Old Italic (Etruscan)
- Old Persian Cuneiform
- Old South Arabian
- Old Turkic
- Osmanya
- Oriya
|
- Pahlavi, Inscriptional
- Parthian, Inscriptional
- Phags-pa
- Phoenician
- Rejang
- Runic
- Saurashtra
- Samaritan
- Sharada
- Shavian
- Sinhala
- Sora Sompeng
- Sumero-Akkadian Cuneiform
- Sundanese
- Syloti Nagri
- Syriac
- Tagalog
- Tagbanwa
- Tai Le
- Tai Tham
- Tai Viet
- Takri
- Tamil
- Telugu
- Thaana
- Thai
- Tibetan
- Tifinagh (Berber)
- Ugaritic
- Vai
- Yi
|
In addition to the above scripts, a number of other collections
of symbols are also encoded by Unicode. These collections consist of
the following:
- Numbers
- General Diacritics
- General Punctuation
- General Symbols
- Mathematical Symbols (Western and Arabic)
- Musical Symbols (Western, Byzantine, and Ancient Greek)
- Technical Symbols
- Emoji
- Dingbats
- Arrows, Blocks, Box Drawing Forms, and Geometric Shapes
- Game Symbols
- Miscellaneous Symbols
- Presentation Forms
- Braille Patterns
- Kangxi Radicals
|