Re: IPA chart in Unicode

From: Peter_Constable@sil.org
Date: Tue Aug 28 2001 - 14:16:50 EDT


Roozbeh:

>Does anyone have a copy of IPA charts:
>
> http://www2.arts.gla.ac.uk/IPA/ipachart.html
>
>in Unicode? I may be able to produce one, but I'm a novice in IPA, and
>given the number of similiar looking Latin characters in Unicode, ....

If you're wondering what Unicode characters to use for each IPA symbol,
that's listed in the Handbook of the International Phonetic Association:
http://uk.cambridge.org/order/WebBook.asp?ISBN=0521637511.

This following isn't exactly what you've asked for, but it may give you
the info that you need -- it's the best I can offer in a hurry. It's taken
from the release notes for the IPA font we're just about to release --
once we have someone free to update the web pages. (This was copied out of
a PDF doc and simply pasted into my mail client, and the formatting hasn't
survived.) Note that this font is designed to support IPA (rev. 1996) as
well as MS codepage 1252.

Technical Details
This section contains details of what characters are covered by the font,
and how the font interacts
with the various smart rendering technologies.

Character Coverage
The font supports the following character ranges:

U+0020 ? U+007F: All characters, excluding U+007F.
U+00A0 ? U+00FF: All characters
U+0100 ? U+017F: The following characters: U+0127, U+0131, U+014B, U+0152,
U+0153,
U+0161, U+0161, U+0178, U+017D, U+017E.
U+0180 ? U+024F: The following characters: U+0192, U+01C0 ? U+01C3.
U+0250 ? U+02AF: All characters except: U+0269, U+0277, U+027C, U+027F,
U+0285 ?
U+0287, U+0293, U+0296, U+0297, U+029A, U+02A0, U+02A3 ?
U+02AF.
U+02B0 ? U+02FF: The following characters: U+02B0, U+02B2, U+02B7, U+02BC,
U+02C6 ?
U+02C8, U+02CC, U+02D0, U+02D1, U+02D6 ? U+02DE, U+02E0,
U+02E1, U+02E3 ? U+02E9.
U+0300 ? U+036F: The following characters: U+0300 ? U+0304, U+0306,
U+0308, U+030A ?
U+030C, U+030F, U+0316 ? U+031A, U+031C ? U+0320, U+0324,
U+0325, U+0329, U+032A, U+032C, U+032F, U+0330, U+0334, U+0339 ?
U+033D, U+0361.
U+0370 ? U+03FF: The following characters: U+03A9, U+03B2, U+03B8, U+03C0,
U+03C7
U+2000 ? U+206F: U+2013 ? U+2044 excepting: U+2015, U+2017, U+201B,
U+201F, U+2023
? U+2025, U+2027 ? U+202F, U+2031 ? U+2038, U+203B ? U+203E,
U+2040 ? U+2043.
U+2070 ? U+209F: The following characters: U+2070 ? U+2079, U+207B,
U+207F.
U+20A0 ? U+20CF: The following character: U+20AC
U+2100 ? U+214F: The following character: U+2122
U+2190 ? U+21FF: The following characters: U+2191, U+2193, U+2197, U+2198
U+2200 ? U+22FF: The following characters: U+2202, U+2206, U+220F, U+2211,
U+221A,
U+221E, U+222B, U+2248, U+2260, U+2264, U+2265.
U+25A0 ? U+25FF: The following character: U+25CA.
U+FB00 ? U+FB4F: The following characters: U+FB01, U+FB02.

Private Use Area
To provide backward compatibility with the SILIPA93 encoding, the
following characters are
included as Private Use characters:
Code Character UnicodeData entry
U+F180 Superscript m (U+006D) F180;MODIFIER LETTER SMALL M;Lm;0;L;<super>
006D;;;;N;;;;
U+F181 Superscript nya (U+0272) F181;MODIFIER LETTER SMALL N WITH LEFT
HOOK;Lm;0;L;<super>
0272;;;;N;;;;;
U+F182 Superscript eng (U+014B) F182;MODIFIER LETTER SMALL
ENG;Lm;0;L;<super> 014B;;;;N;;;;;

Hope this helps.

- Peter

---------------------------------------------------------------------------
Peter Constable

Non-Roman Script Initiative, SIL International
7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
Tel: +1 972 708 7485
E-mail: <peter_constable@sil.org>



This archive was generated by hypermail 2.1.2 : Tue Aug 28 2001 - 15:32:50 EDT