Re: browsers and unicode surrogates

From: David Starner (starner@okstate.edu)
Date: Mon Apr 22 2002 - 19:55:10 EDT


On Mon, Apr 22, 2002 at 12:19:15PM -0400, jshin@mailaps.org wrote:
> My test pages don't have yet characters beyond BMP(I just
> recycled a page I made a long time ago for Korean testing) . I may later
> add them. (Tex, can I use your sample page? I'd rather put up a page
> with some content instead of just a list of characters.)

Mark 1:1 Anastodeins aiwaggeljons Iesuis Xristaus sunaus gudis. 2 Swe
gameliþ ist in Esaïin praufetau: sai ik insandja aggilu meinana faura
þus, saei gamanweþ wig þeinana faura þus. ...

You of course have to transcribe it into Gothic (my technology isn't up
to handling non-BMP characters.) The names of the characters are pretty
much obvious; e = U+10334, þ = U+10338, i = 10339, ï = U+10339 U+0308, ƕ
= U+10348, x (should be chi) = U+10347. The rest of the Gothic gospels
are available from http://extranet.ufsia.ac.be/wulfila/Index.html; they
have transcrbed it into Latin-1, though. Part of it is also available
from http://dvdeug.dhis.org/gutenberg/misc/Gotische_Grammatik, if you
don't mind paging through the scans.

-- 
David Starner - starner@okstate.edu
"It's not a habit; it's cool; I feel alive. 
If you don't have it you're on the other side." 
- K's Choice (probably referring to the Internet)



This archive was generated by hypermail 2.1.2 : Mon Apr 22 2002 - 20:32:51 EDT