Re: Does Unicode 4.1 change NFC?

From: Marcin 'Qrczak' Kowalczyk (qrczak@knm.org.pl)
Date: Sat Apr 02 2005 - 10:24:13 CST

  • Next message: Michael Everson: "Re: Sindhi characters proposed"

    Elliotte Rusty Harold <elharo@metalab.unc.edu> writes:

    > Do the recent additions to Unicode 4.1 make any changes to NFC? i.e.
    > does a program that correctly performs normalization on Unicode 4.0
    > data need any updates, to data tables or algorithms, to normalize
    > Unicode 4.1 data in normalization form C?

    Yes. New CJK compatibility ideographs U+FA70..U+FAD9 have canonical
    decompositions into single characters. For example NFC(U+FACF) =
    U+2284A (for the first time a BMP character is normalized to something
    outside BMP).

    These are the only differences in NFC/NFD between Unicode 4.0.1 and 4.1.0.

    There are 48 more differences in NFKC/NFKD.

    -- 
       __("<         Marcin Kowalczyk
       \__/       qrczak@knm.org.pl
        ^^     http://qrnik.knm.org.pl/~qrczak/
    


    This archive was generated by hypermail 2.1.5 : Sat Apr 02 2005 - 10:25:39 CST