Re: Yerushala(y)im - or Biblical Hebrew

From: Philippe Verdy (verdy_p@wanadoo.fr)
Date: Tue Jul 08 2003 - 12:16:07 EDT


    On Tuesday, July 08, 2003 5:14 PM, John Cowan <jcowan@reutershealth.com> wrote:
    > Peter Kirk scripsit:
    > Such a character could only be encoded if it were put into the list
    > of composition exceptions, because it would upset the stability of
    > normalization.

    Even if it were listed in the Canonical Composition Exclusion list,
    this would not work: that list only covers characters that are
    canonically decomposable into a character pair, and that MUST be
    decomposed and MUST NOT be recomposed when creating *either* an
    NFC or an NFD form.
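
For illustration, here is a minimal sketch of what a composition exclusion does in practice. It uses Python's standard `unicodedata` module, which follows the Unicode version bundled with the interpreter rather than the one current in 2003. The choice of U+FB1D HEBREW LETTER YOD WITH HIRIQ is mine, not the poster's: it is an existing character that has a canonical decomposition and is listed in CompositionExclusions.txt. The helper `code_points` is a hypothetical name introduced only for display.

```python
import unicodedata

# U+FB1D HEBREW LETTER YOD WITH HIRIQ: canonically decomposable and
# listed in CompositionExclusions.txt (illustrative choice, not from the post).
YOD_WITH_HIRIQ = "\uFB1D"
YOD_THEN_HIRIQ = "\u05D9\u05B4"  # HEBREW LETTER YOD + HEBREW POINT HIRIQ


def code_points(s: str) -> str:
    """Hypothetical display helper: render a string as U+XXXX code points."""
    return " ".join(f"U+{ord(ch):04X}" for ch in s)


nfd = unicodedata.normalize("NFD", YOD_WITH_HIRIQ)
nfc = unicodedata.normalize("NFC", YOD_WITH_HIRIQ)

# The excluded character is decomposed in NFD and is not recomposed in NFC,
# so both forms contain the two-character sequence.
print("NFD:", code_points(nfd))
print("NFC:", code_points(nfc))
```

Both lines should show the decomposed pair, which is the behaviour the exclusion list exists to guarantee. It says nothing about a new character that has no canonical decomposition at all.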

    There's a requirement that if two strings are canonically equivalent,
    they have identical NFC forms *and* identical NFD forms.

    The reason is that Unicode algorithms must produce identical
    results on the NFC and NFD forms of the same text.
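
The equivalence requirement can be checked with a short sketch, again using Python's `unicodedata`. As an assumed stand-in for the vowel sequence under discussion, it takes a lamed carrying patah and hiriq in both orders; the variable names are mine. The two points have different non-zero combining classes, so the two orders are canonically equivalent and must normalize to the same string.

```python
import unicodedata

LAMED = "\u05DC"  # HEBREW LETTER LAMED
HIRIQ = "\u05B4"  # HEBREW POINT HIRIQ
PATAH = "\u05B7"  # HEBREW POINT PATAH

# Assumed stand-in for the sequence in the thread: patah, then hiriq.
typed_order = LAMED + PATAH + HIRIQ
# The same marks sorted by canonical combining class.
canonical_order = LAMED + HIRIQ + PATAH

# Canonical combining classes decide the order after normalization.
hiriq_ccc = unicodedata.combining(HIRIQ)
patah_ccc = unicodedata.combining(PATAH)
print("ccc(hiriq) =", hiriq_ccc, "ccc(patah) =", patah_ccc)

for form in ("NFC", "NFD"):
    a = unicodedata.normalize(form, typed_order)
    b = unicodedata.normalize(form, canonical_order)
    # Canonically equivalent inputs give identical output in each form.
    print(form, "identical:", a == b, "| typed order kept:", a == typed_order)
```

If the combining classes are as I expect (hiriq lower than patah), the typed order is not preserved in either form.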

    Only the compatibility decompositions would work as expected,
    i.e. the NFKD decomposition of the "abnormal" sequence of
    vowels MUST still be given with the two vowels in canonical order,
    meaning that the NFKD or NFKC transformation would swap
    the vowels into their canonical order.
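
A small sketch of the reordering under the compatibility forms, using Python's `unicodedata` and, as an assumption on my part, lamed followed by patah and hiriq as the "abnormal" sequence. It only shows how today's normalizers treat existing characters; it does not model any proposed new character.

```python
import unicodedata

# Assumed example of the out-of-order sequence: lamed, patah, hiriq.
abnormal = "\u05DC\u05B7\u05B4"
# The same marks in canonical order: lamed, hiriq, patah.
canonical = "\u05DC\u05B4\u05B7"

results = {}
for form in ("NFKD", "NFKC"):
    results[form] = unicodedata.normalize(form, abnormal)
    # Compatibility normalization still applies canonical reordering,
    # so the two vowels come out swapped relative to the input.
    print(form, [f"U+{ord(ch):04X}" for ch in results[form]])
```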

    -- Philippe.


