Re: Folding algorithm and canonical equivalence

From: Peter Kirk (peterkirk@qaya.org)
Date: Mon Jul 19 2004 - 12:52:46 CDT


    On 19/07/2004 03:20, Mark E. Shoulson wrote:

    > ...
    >
    > Jony's right: when it's down to brass tacks in Hebrew, it's consonants
    > and whitespace (and punctuation, I guess).
    >
    Agreed. But then there are a few characters which are not combining
    marks but which are really part of the accent system and so should
    perhaps be stripped when points are removed: 05C0 paseq/legarmeh, which
    should be deleted; and 05BE maqaf, which should be replaced by a (word
    dividing) space. The reason is that 05C0 is an annotation which
    certainly has no place in an unpointed text; and in an accented text,
    whether two words are separated by maqaf or space depends on their
    accentuation, whereas space is always used in unaccented texts.

    Within the biblical text it would also be logical to delete 05C3 sof
    pasuq, but its use elsewhere as punctuation suggests otherwise.
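
The folding described above could be sketched roughly as follows in Python. This is only an illustration under stated assumptions, not an agreed algorithm: the function name fold_unpointed and the drop_sof_pasuq flag are hypothetical, "points" is taken to mean every nonspacing mark (general category Mn) in the range U+0591..U+05C7, and the text is decomposed with NFD first so that precomposed presentation forms expose their marks.

```python
import unicodedata

MAQAF = "\u05BE"      # HEBREW PUNCTUATION MAQAF
PASEQ = "\u05C0"      # HEBREW PUNCTUATION PASEQ (paseq/legarmeh)
SOF_PASUQ = "\u05C3"  # HEBREW PUNCTUATION SOF PASUQ


def fold_unpointed(text, drop_sof_pasuq=False):
    """Strip Hebrew points and accents, plus the related punctuation.

    Assumption: 'points' means nonspacing marks (Mn) in U+0591..U+05C7.
    drop_sof_pasuq is a hypothetical switch for biblical text only.
    """
    out = []
    # Decompose first so canonically equivalent inputs fold alike.
    for ch in unicodedata.normalize("NFD", text):
        if "\u0591" <= ch <= "\u05C7" and unicodedata.category(ch) == "Mn":
            continue  # vowel point, accent, dagesh, shin/sin dot, etc.
        if ch == PASEQ:
            continue  # annotation with no place in unpointed text
        if ch == MAQAF:
            out.append(" ")  # maqaf becomes a word-dividing space
            continue
        if ch == SOF_PASUQ and drop_sof_pasuq:
            continue
        out.append(ch)
    # Recompose so that any non-Hebrew text is returned in NFC.
    return unicodedata.normalize("NFC", "".join(out))
```

Sof pasuq is kept by default because of its use as ordinary punctuation outside the biblical text. The sketch does not collapse the doubled space that can remain when a paseq standing between two spaces is deleted.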

    -- 
    Peter Kirk
    peter@qaya.org (personal)
    peterkirk@qaya.org (work)
    http://www.qaya.org/
    

