RE: Genesis v. UDHR?

From: Frank da Cruz (fdc@columbia.edu)
Date: Sat May 26 2001 - 11:09:01 EDT


> As I am interested in finding any texts in Unicode (UTF8) in any language.
> I must admit that in most cases the more interesting scripts (LR, such as
> Hebrew, Arabic, Farsi, Urdu, or combining such as Khmer) do not have the
> source available as UTF8 text, but only as images. In order to test
> various typographic systems I really would have liked to have a short but
> representative source text also in these "more complex" scripts. If
> somebody has such a text I would very much appreciate being able to use it.
>
There is a "many languages in UTF-8" sampler here:

  http://www.columbia.edu/kermit/utf8.html

but of course it lacks some of the scripts you mentioned. I too would like
to have Urdu, Khmer, Arabic, Korean, and other missing versions of the short,
silly phrase repeated there. (A new language trickles in every couple weeks,
recent additions include Cornish, Walloon, and my own no doubt fractured
renditions of Anglo Saxon and Middle English -- it's always nice to find a
use for Yogh :-) I'm sure some readers of this list can send a few more --
at least Lombard and Glaswegian? Extra points for Pictish and Papiamentu.

- Frank



This archive was generated by hypermail 2.1.2 : Fri Jul 06 2001 - 00:18:17 EDT