Re: Small Java implementation of NFC

From: Marcin 'Qrczak' Kowalczyk (qrczak@knm.org.pl)
Date: Fri Mar 04 2005 - 09:32:12 CST

  • Next message: Andrew C. West: "Re: Small Java implementation of NFC"

    unicode.org server still doesn't like my mail. Please do something
    with this. I'm sending from IP 217.96.225.87.

    The contents of my actual message are at the end.

    ----
    A message that you sent could not be delivered to one or more of its
    recipients. This is a permanent error. The following address(es) failed:
      unicode@unicode.org
        SMTP error from remote mailer after MAIL FROM:<qrczak@knm.org.pl>:
        host unicode.org [69.13.187.164]: 550 5.0.0 Spam access denied.
    ------ This is a copy of the message, including all the headers. ------
    Return-path: <qrczak@knm.org.pl>
    Received: from qrczak by qrnik.knm.org.pl with local (Exim 3.36 #1)
    	id 1D7EdK-0007Os-00
    	for unicode@unicode.org; Fri, 04 Mar 2005 16:22:30 +0100
    To: unicode@unicode.org
    Subject: Re: Small Java implementation of NFC
    X-Face: OW>RV&gN+&b-aiNY|U)f=S%w+/rK!);f>/W9IXg})]&F>ht.1Up8@04+_!gOp(_/l_-+E^.
     2\vI)1=D,%HWiq)r(M/V~dr^5T^KF/[w5YZ4<0Sus3+O>l3uA/&W_21m?.s,Po8{pb0@
    References: <20050304142249.16250.fh035.wm@smtp.sc0.cp.net>
    From: Marcin 'Qrczak' Kowalczyk <qrczak@knm.org.pl>
    Mail-Followup-To: unicode@unicode.org
    Date: Fri, 04 Mar 2005 16:22:30 +0100
    In-Reply-To: <20050304142249.16250.fh035.wm@smtp.sc0.cp.net> (Andrew C.
     West's message of "Fri, 04 Mar 2005 06:22:48 -0800 (PST)")
    Message-ID: <87zmxj4ert.fsf@qrnik.zagroda>
    User-Agent: Gnus/5.1006 (Gnus v5.10.6) Emacs/21.3 (gnu/linux)
    MIME-Version: 1.0
    Content-Type: text/plain; charset=us-ascii
    Sender: Marcin 'Qrczak' Kowalczyk <qrczak@knm.org.pl>
    "Andrew C. West" <andrewcwest@alumni.princeton.edu> writes:
    > Unicode Standard Annex #15 (http://www.unicode.org/reports/tr15/)
    > specifies that precomposed characters that are added after Unicode
    > 3.0 are excluded from composition (i.e. not recomposed when NFC is
    > applied to them). As all characters beyond the BMP were added in
    > Unicode 3.1 or later, you can effectively ignore any character
    > greater than U+FFFF (or any surrogates if you are processing UTF-16)
    > when applying NFC to a text stream.
    The last sentence is not true: precomposed characters above U+FFFF
    must be *decomposed* by NF*C*.
    -- 
       __("<         Marcin Kowalczyk
       \__/       qrczak@knm.org.pl
        ^^     http://qrnik.knm.org.pl/~qrczak/
    


    This archive was generated by hypermail 2.1.5 : Fri Mar 04 2005 - 09:33:27 CST