Re: Questionable lines on LineBreakTest.txt

From: Asmus Freytag (asmusf@ix.netcom.com)
Date: Mon Jun 07 2010 - 19:11:23 CDT

  • Next message: Mark Davis ☕: "Re: Questionable lines on LineBreakTest.txt"

    On 6/7/2010 4:26 PM, Masaaki Shibata wrote:
    > I'm studying the UAX #14 (5.2.0) and testing my code against
    > LineBreakTest.txt. And I found some test cases on this text file seem
    > to be contradictory to the rules on the document.
    >
    > For example, LB25 explicitly prohibits breaking between CP and PO,
    > while LineBreakTest.txt says " [0.2] RIGHT PARENTHESIS (CP) [999.0]
    > PERCENT SIGN (PO) [0.3]" (l. 1137).
    >
    > I'm not a Unicode expert; which rules lead to the result like this?
    > Did I miss any important descriptions on the document?
    >
    Probably not. The test file has been known to be wrong before.

    The spec clearly states that breaks are only allowed if there are spaces,
    as in:

    CP SP+ OP

    So this line in the "test" file appears incorrect.

    A./
    >
    >
    >



    This archive was generated by hypermail 2.1.5 : Mon Jun 07 2010 - 19:15:20 CDT