Re: Questionable lines on LineBreakTest.txt

From: Asmus Freytag (
Date: Mon Jun 07 2010 - 19:11:23 CDT

  • Next message: Mark Davis ☕: "Re: Questionable lines on LineBreakTest.txt"

    On 6/7/2010 4:26 PM, Masaaki Shibata wrote:
    > I'm studying the UAX #14 (5.2.0) and testing my code against
    > LineBreakTest.txt. And I found some test cases on this text file seem
    > to be contradictory to the rules on the document.
    > For example, LB25 explicitly prohibits breaking between CP and PO,
    > while LineBreakTest.txt says " [0.2] RIGHT PARENTHESIS (CP) [999.0]
    > PERCENT SIGN (PO) [0.3]" (l. 1137).
    > I'm not a Unicode expert; which rules lead to the result like this?
    > Did I miss any important descriptions on the document?
    Probably not. The test file has been known to be wrong before.

    The spec clearly states that breaks are only allowed if there are spaces,
    as in:

    CP SP+ OP

    So this line in the "test" file appears incorrect.


    This archive was generated by hypermail 2.1.5 : Mon Jun 07 2010 - 19:15:20 CDT