Re: NFD on u+AC00 contradicts NormalisationData.txt ?

From: Richard Wordingham (
Date: Wed Jun 14 2006 - 18:51:12 CDT

    Eric Muller wrote on Wednesday, June 14, 2006 at 10:43 PM

    > Theodore H. Smith wrote:
    >> On 14 Jun 2006, at 21:09, Eric Muller wrote:
    >>> Theodore H. Smith wrote:
    >>>> Does AC00 actually decompose?

    >> But why isn't it listed in UnicodeData.txt?

    > Because that would double the size of the file, and because the
    > decompositions are algorithmic and are most often implemented that way
    > (rather than being driven by tables).
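
For readers who have not seen the algorithmic approach mentioned above, here is a minimal Python sketch of Hangul syllable decomposition using the constants from the Unicode Standard's conjoining jamo section. The function name `decompose_hangul` and the choice to return a list of code points are assumptions made for illustration, not anything taken from this thread.

```python
# Constants from the Unicode Standard's Hangul syllable decomposition algorithm.
S_BASE = 0xAC00   # first precomposed Hangul syllable
L_BASE = 0x1100   # first leading consonant (choseong)
V_BASE = 0x1161   # first vowel (jungseong)
T_BASE = 0x11A7   # one before the first trailing consonant (jongseong)
L_COUNT, V_COUNT, T_COUNT = 19, 21, 28
N_COUNT = V_COUNT * T_COUNT   # 588 syllables per leading consonant
S_COUNT = L_COUNT * N_COUNT   # 11172 syllables in total


def decompose_hangul(cp):
    """Return the canonical decomposition of code point cp as a list.

    Code points outside U+AC00..U+D7A3 are returned unchanged; this
    sketch handles only the Hangul case, not table-driven mappings.
    """
    s_index = cp - S_BASE
    if not 0 <= s_index < S_COUNT:
        return [cp]
    lead = L_BASE + s_index // N_COUNT
    vowel = V_BASE + (s_index % N_COUNT) // T_COUNT
    trail = T_BASE + s_index % T_COUNT
    # A trailing index of 0 means the syllable has no final consonant.
    if trail == T_BASE:
        return [lead, vowel]
    return [lead, vowel, trail]


if __name__ == "__main__":
    print([f"U+{c:04X}" for c in decompose_hangul(0xAC00)])
```

So U+AC00 does decompose, into U+1100 U+1161, even though no decomposition field is listed for it in UnicodeData.txt.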

    But why aren't the full entries given for the first and last characters in
    the excluded ranges? At present it's very easy for a parser to treat them
    as valid lines and load duff data.
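
To illustrate the pitfall, here is a hedged sketch of a parser that recognises the `<..., First>` / `<..., Last>` range markers instead of loading them as ordinary character entries. The function name `parse_unicode_data`, the return shape, and the three-line sample are assumptions for illustration; the sample lines follow the semicolon-separated UnicodeData.txt layout.

```python
def parse_unicode_data(lines):
    """Split UnicodeData.txt-style lines into single entries and ranges.

    Returns (singles, ranges):
      singles maps a code point to its remaining fields;
      ranges is a list of (first, last, label, properties) tuples.
    """
    singles = {}
    ranges = []
    pending = None  # holds the "First" half of a range until "Last" arrives
    for line in lines:
        line = line.strip()
        if not line:
            continue
        fields = line.split(";")
        cp = int(fields[0], 16)
        name = fields[1]
        if name.startswith("<") and name.endswith(", First>"):
            label = name[1:-len(", First>")]
            pending = (cp, label, fields[2:])
        elif name.startswith("<") and name.endswith(", Last>"):
            if pending is None:
                raise ValueError(f"range end without start at {fields[0]}")
            first, label, props = pending
            ranges.append((first, cp, label, props))
            pending = None
        else:
            singles[cp] = fields[1:]
    return singles, ranges


# Sample input: one ordinary entry plus the Hangul syllable range markers.
SAMPLE = [
    "0041;LATIN CAPITAL LETTER A;Lu;0;L;;;;;N;;;;0061;",
    "AC00;<Hangul Syllable, First>;Lo;0;L;;;;;N;;;;;",
    "D7A3;<Hangul Syllable, Last>;Lo;0;L;;;;;N;;;;;",
]

if __name__ == "__main__":
    singles, ranges = parse_unicode_data(SAMPLE)
    for first, last, label, _ in ranges:
        print(f"U+{first:04X}..U+{last:04X} {label}")
```

A parser that skips this check would record U+AC00 and U+D7A3 as two isolated characters with empty decomposition fields, which is the duff data in question.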

