Re: Text Editors and Canonical Equivalence (was Coloured diacriti cs)

From: jon@hackcraft.net
Date: Wed Dec 10 2003 - 05:41:58 EST

  • Next message: Peter Kirk: "Re: Qumran Greek"

    Quoting Peter Kirk <peterkirk@qaya.org>:

    > OK, as a C function handling wchar_t arrays it is not expected to
    > conform to Unicode. But if it is presented as a function available to
    > users for handling Unicode text, for determining how many characters (as
    > defined by Unicode) are in a string, it should conform to Unicode,
    > including C9.

    If a function is presented as a function available to users for handling
    Unicode text then it should do whatever it claims to do.

    There are concepts of "code units", "code points", "characters", and "default
    grapheme clusters" in Unicode. Functions which count either of these are
    perfectly conformant with Unicode, as long as the perform their task correctly.

    --
    Jon Hanna                   | Toys and books
    <http://www.hackcraft.net/> | for hospitals:
                                | <http://santa.boards.ie/>
    


    This archive was generated by hypermail 2.1.5 : Wed Dec 10 2003 - 06:26:16 EST