Re: Language Tagging And Unicode

From: John Cowan
Date: Thu Jan 20 2000 - 13:31:05 EST

Michael Everson wrote:

> Both derive directly from Old Slavonic letter tvrdo.

That proves too much: they also derive directly from tau,
as does Latin "t".

Serbian has, AFAIK, a unique position* among the world's
written languages: it has two scripts but only one writing
system (unlike Mongolian or Javanese, where there are two
completely separate writing systems). It is common,
I am told, for a manuscript to be submitted in Latin
script even though it is to be printed in Cyrillic.
Transliteration is completely mechanical, requiring
no knowledge of Serbian spelling rules.
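
To make "completely mechanical" concrete, here is a minimal sketch
of the Cyrillic-to-Latin direction as a plain table lookup, one
letter at a time. The names CYR_TO_LAT and to_latin are made up
for illustration, and the handling of capitals (title-casing the
digraphs, so that "Љ" gives "Lj") is a simplifying assumption.
The Latin-to-Cyrillic direction would additionally have to
recognize the digraphs lj, nj, and dž before single letters; that
is not shown here.

```python
# Illustrative sketch: Serbian Cyrillic -> Latin by table lookup.
# The table covers the 30 letters of the Serbian Cyrillic alphabet.
CYR_TO_LAT = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "ђ": "đ",
    "е": "e", "ж": "ž", "з": "z", "и": "i", "ј": "j", "к": "k",
    "л": "l", "љ": "lj", "м": "m", "н": "n", "њ": "nj", "о": "o",
    "п": "p", "р": "r", "с": "s", "т": "t", "ћ": "ć", "у": "u",
    "ф": "f", "х": "h", "ц": "c", "ч": "č", "џ": "dž", "ш": "š",
}


def to_latin(text):
    """Transliterate Serbian Cyrillic text to Latin, letter by letter."""
    out = []
    for ch in text:
        lat = CYR_TO_LAT.get(ch.lower())
        if lat is None:
            # Not a Serbian Cyrillic letter (space, digit, punctuation):
            # pass it through unchanged.
            out.append(ch)
        elif ch.isupper():
            # Assumption: a capital maps to a title-cased result,
            # so a capital digraph letter yields e.g. "Lj", not "LJ".
            out.append(lat.capitalize())
        else:
            out.append(lat)
    return "".join(out)


print(to_latin("Београд"))
```

Nothing in the function consults a dictionary or a spelling rule;
each letter is looked up on its own, with no regard to context.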

So in the Serbian context it actually makes sense to
say that U+0411 and U+0042 are mere glyphic variants
of the same underlying character!


* I will not enter into the discussion of how many
languages are named by the labels "Serbian" and "Croatian"
and other more recently applied names. I am using
"Serbian" in this posting for convenience.


