RE: Level of Unicode support required for various languages

From: Peter Constable (petercon@microsoft.com)
Date: Thu Oct 25 2007 - 09:14:52 CDT

  • Next message: Lorna_Priest@sil.org: "Re: Cost of no OCR for extended Latin"

    Ideographic Description Sequence characters are not meant for, and should not be used for, representing text elements in running text other than ideographic description sequences. In other words, descriptions of characters that would be used in simple sentences of Zhuang can be represented in Unicode, but simple sentences of Zhuang cannot.

    Peter

    -----Original Message-----
    From: unicode-bounce@unicode.org [mailto:unicode-bounce@unicode.org] On Behalf Of vunzndi@vfemail.net
    Sent: Wednesday, October 24, 2007 9:06 PM
    To: Andrew West
    Cc: Kenneth Whistler; tarmes@fr.imaje.com; unicode@unicode.org
    Subject: Re: Level of Unicode support required for various languages

    Since it is difficult to answer the question does uniocde support more
    than half of the 4000 languages of the world, and who knows how many
    dialects. The list of what is required grows the more one thinks about
    it.

    For example talking about CJKV characters at present many characters
    can only be repesented by IDS, one therefore really needs IDS to
    character display to write even simple sentance in some languages like
    Zhuang.

    While Europe has quite good coverage ( though not hungarian runes) the
    coverage of asian and african scripts is sparse. Current scripts under
    WG2 review include, but are by no means restricted to Javanese, Lisu,
    Miao, Nushu, and Tangut. Some scripts like Egyptians Heiroglyphics
    will need a higher protocol outside of the scope of uniocde to display
    properly.

    John Knightley

    Quoting Andrew West <andrewcwest@gmail.com>:

    -------------------------------------------------
    This message sent through Virus Free Email
    http://www.vfemail.net



    This archive was generated by hypermail 2.1.5 : Thu Oct 25 2007 - 09:16:51 CDT