Re: Private Use Area

From: Mark Davis (mark.davis@jtcsv.com)
Date: Sun Apr 27 2003 - 20:32:29 EDT

  • Next message: Doug Ewell: "[OT] Re: Country codes"

    But it misses English, such as the name "Zoë"

    Märk Dāvĭs
    ________
    mark.davis@jtcsv.com
    IBM, MS 50-2/B11, 5600 Cottle Rd, SJ CA 95193
    (408) 256-3148
    fax: (408) 256-0799

    ----- Original Message -----
    From: "Eric Muller" <emuller@adobe.com>
    To: "Kevin Brown" <graphity@adelaide.on.net>
    Cc: "Public email list" <unicode@unicode.org>
    Sent: Saturday, April 26, 2003 10:46
    Subject: Re: Private Use Area

    >
    >
    > Kevin Brown wrote:
    >
    > >Is there a database anywhere that cross-references Latin-based glyphs (eg
    > >E WITH DIAERESIS) to all the Latin-based writing systems they're used in
    > >(eg Dutch, French, Albanian, Cyrillic and Welsh) and vice versa? It would
    > >be really useful.
    > >
    > www.eki.ee, which has already been mentioned a few times, reports the
    > following for U+00CB Ë LATIN CAPITAL LETTER E WITH DIAERESIS:
    >
    > name: LATIN CAPITAL LETTER E WITH DIAERESIS
    > old name: LATIN CAPITAL LETTER E DIAERESIS
    > Adobe glyph name: Edieresis
    > mnemonic name(s): <E:>
    > HTML 4 mnemonic name:&Euml;
    > category: Lu (Letter, Uppercase)
    > combining: 0
    > decomposition info: 0045 0308
    > comment:
    > found in charsets: 8859-1 (CB); 8859-10 (CB); 8859-14 (CB); 8859-15
    > (CB); 8859-2 (CB); 8859-3 (CB); 8859-4 (CB); 8859-9 (CB); CP1250 (CB);
    > CP1252 (CB); CP1254 (CB); CP1258 (CB); CP850 (D3); CP852 (D3); CP857
    > (D3); CP863 (94); ROMAN (E8); CP1116 (D3); CP1122 (73); SAMI_WIN (CB);
    > SAMI_MAC (E8); 8859-16 (CB);
    > found in languages: af [Afrikaans]; cy [Welsh]; fr [French]; fy
    > [Frisian]; lb [Luxembourgian]; nl [Dutch]; sq [Albanian]; sv [Swedish];
    > wo [Wolof]; yap [Yapese]; dink [Dinka]; sla [Kashubian];
    > used in romanization of: be_r [Belarusian (cyrillic)]; ru_r [Russian
    > (cyrillic)];
    > lowercase: 00EB
    >
    >
    > This database also covers Cyrillic and the languages written in it.
    >
    > The underlying data can be accessed a number of ways: e.g. “what is
    > needed beyond basic Cyrillic to write Altai?” reports U+0458 ј CYRILLIC
    > SMALL LETTER JE, U+04A5 ҥ CYRILLIC SMALL LIGATURE EN GHE, U+04E7 ӧ
    > CYRILLIC SMALL LETTER O WITH DIAERESIS and U+04F1 ӱ CYRILLIC SMALL
    > LETTER U WITH DIAERESIS.
    >
    > As far as I can tell, the underlying data is of good quality; of course,
    > there the devil is in the details, and there are certainly decisions
    > that could be argued. A wonderful resource.
    >
    > Eric.
    >
    >



    This archive was generated by hypermail 2.1.5 : Sun Apr 27 2003 - 21:10:48 EDT