Re: ZWJ, ZWNJ, CGJ and combination

From: Peter Kirk (
Date: Sun Nov 09 2003 - 18:15:17 EST

  • Next message: John Hudson: "Re: Berber/Tifinagh (was: Swahili & Banthu)"

    On 09/11/2003 14:55, Philippe Verdy wrote:

    > ...
    >And canonical normalization _guarantees_ to preserve *only* "starter
    >sequences" (defective or not), but not necessarily "combining character
    >sequences" (defective or not), and that's where care must be taken when
    >encoding text...
    Surely not. A combining character sequence consists of an optional base
    character followed by one or more combining characters. Canonical
    normalisation preserves the sequence of combining characters only,
    although it may reorder this sequence. It also preserves without
    reordering the juxtaposition of this seuqence to the optional base
    character. Therefore the combining character sequence is preserved.

    Peter Kirk (personal) (work)

    This archive was generated by hypermail 2.1.5 : Sun Nov 09 2003 - 18:52:10 EST