Re: base character

From: Mark Davis (mark.davis@jtcsv.com)
Date: Thu Jun 10 2004 - 10:11:30 CDT

  • Next message: Michael Everson: "Re: Bantu click letters"

    Whenever you have questions about these sorts of issues, make very sure that you
    consult the latest version of the Unicode standard, in this case, 4.0.1
    <http://www.unicode.org/versions/Unicode4.0.1/>

    You will find there the following -- much more explicit -- text, that amends the
    text from 4.0:

    D13a Graphic character: A character with the General Categories of Letter (L),
    Combining Mark (M), Number (N), Punctuation (P), Symbol (S), or Space Separator
    (Zs).
    ...
    D13b Base character: Any graphic character except for those with the General
    Category of Combining Mark (M).

    Mark
    __________________________________
    http://www.macchiato.com
    ► शिष्यादिच्छेत्पराजयम् ◄

    ----- Original Message -----
    From: "Theo Veenker" <Theo.Veenker@let.uu.nl>
    To: "unicode" <unicode@unicode.org>
    Sent: Thu, 2004 Jun 10 07:05
    Subject: base character

    > According to the definition a base character is:
    >
    > "A character that does not graphically combine with
    > preceding characters, and that is neither a control
    > nor a format character."
    >
    > What is this expressed in terms of properties?
    > Something like this? cc==0 AND GG!=Cc AND GC!=Cf AND GC!=Cn
    >
    > Theo
    >
    >
    >



    This archive was generated by hypermail 2.1.5 : Thu Jun 10 2004 - 10:12:05 CDT