character groupings in various languages

From: Ben Dougall (bend@freenet.co.uk)
Date: Thu May 15 2003 - 18:03:07 EDT

  • Next message: Philippe Verdy: "Re: Decimal separator with more than one character?"

    would it be the uca / collation
    <http://www.unicode.org/unicode/reports/tr10/> that will allow me to do
    this? :

    having specified which language is being used, compare one character to
    another and find out which various groupings they may or may not share.
    such as comparing in english, an 'F' and 'W' would match on case (and
    consonants even). case catagories i'm sure don't exist in some other
    languages, but then i'm sure there are many other types of
    catagorisations in other languages that english doesn't have.

    i'd like to have access to any kind of character catagories / groupings
    that maybe applicable to whichever language is initially specified.

    is it the uca that's what i need to look into for that type of thing?

    also i notice icu <http://oss.software.ibm.com/icu/> has a lot of
    collation stuff. how does that compare to unicode's collation?, (if
    collation is even what i'm after, that is). how is icu different from
    unicode's collation?

    thanks.



    This archive was generated by hypermail 2.1.5 : Thu May 15 2003 - 19:03:05 EDT