Re: Compression through normalization

From: Doug Ewell (dewell@adelphia.net)
Date: Sat Dec 06 2003 - 14:24:29 EST

  • Next message: Doug Ewell: "Re: Transcoding Tamil in the presence of markup"

    Peter Kirk <peterkirk at qaya dot org> wrote:

    >> Subprocesses within a closed system may be able to make certain
    >> assumptions for efficiency. Process B, for example, may know that
    >> its only source of input is Process A, which is guaranteed always to
    >> produce NFC. ...
    >
    > Does C9 actually allow this? Well, perhaps within a closed system, but
    > then standardisation and so Unicode is irrelevant to data transfer
    > between sub-processes within a closed system.

    Within a closed system, Unicode text can be transformed in any way
    imaginable. As long as it comes out of the "black box" intact, there is
    no problem.

    > Outside a closed system, Process B's best assumption for efficiency
    > may be that Process A has *probably* normalised, and so it is worth
    > performing first a quick check before a full normalisation.

    ... using the algorithm (though not necessarily the exact code) provided
    in UAX #15.

    -Doug Ewell
     Fullerton, California
     http://users.adelphia.net/~dewell/



    This archive was generated by hypermail 2.1.5 : Sat Dec 06 2003 - 15:09:13 EST