Re: Questionable lines on LineBreakTest.txt

From: Masaaki Shibata (shibatamasaaki@gmail.com)
Date: Tue Jun 08 2010 - 02:55:58 CDT

    Asmus, Mark, thank you for replying.

    I'm very surprised. This document and test file must have been public
    for years, and I couldn't find any caution or note about this on
    the site. That is very misleading. Most developers will reasonably
    expect this text file to be useful.

    I agree with Mark. I hope some UTC people will notice our argument.

    For reference, I've found 17 cases of the same kind of contradiction
    in LineBreakTest.txt. They all seem to contradict LB25:

    l.1137: [0.2] RIGHT PARENTHESIS (CP) [999.0] PERCENT SIGN (PO) [0.3]
    l.1139: [0.2] RIGHT PARENTHESIS (CP) [9.0] COMBINING DIAERESIS
    (CM) [999.0] PERCENT SIGN (PO) [0.3]
    l.1141: [0.2] RIGHT PARENTHESIS (CP) [999.0] DOLLAR SIGN (PR) [0.3]
    l.1143: [0.2] RIGHT PARENTHESIS (CP) [9.0] COMBINING DIAERESIS
    (CM) [999.0] DOLLAR SIGN (PR) [0.3]
    l.2569: [0.2] COMMA (IS) [999.0] DIGIT ZERO (NU) [0.3]
    l.2571: [0.2] COMMA (IS) [9.0] COMBINING DIAERESIS (CM) [999.0]
    DIGIT ZERO (NU) [0.3]
    l.3869: [0.2] PERCENT SIGN (PO) [999.0] LEFT PARENTHESIS (OP) [0.3]
    l.3871: [0.2] PERCENT SIGN (PO) [9.0] COMBINING DIAERESIS (CM)
    [999.0] LEFT PARENTHESIS (OP) [0.3]
    l.4013: [0.2] DOLLAR SIGN (PR) [999.0] LEFT PARENTHESIS (OP) [0.3]
    l.4015: [0.2] DOLLAR SIGN (PR) [9.0] COMBINING DIAERESIS (CM)
    [999.0] LEFT PARENTHESIS (OP) [0.3]
    l.4441: [0.2] SOLIDUS (SY) [999.0] DIGIT ZERO (NU) [0.3]
    l.4443: [0.2] SOLIDUS (SY) [9.0] COMBINING DIAERESIS (CM)
    [999.0] DIGIT ZERO (NU) [0.3]
    l.5226: [0.2] LATIN SMALL LETTER E (AL) [28.0] LATIN SMALL LETTER
    Q (AL) [28.0] LATIN SMALL LETTER U (AL) [28.0] LATIN SMALL LETTER
    A (AL) [28.0] LATIN SMALL LETTER L (AL) [28.0] LATIN SMALL LETTER
    S (AL) [7.01] SPACE (SP) [13.02] FULL STOP (IS) [999.0] DIGIT
    THREE (NU) [25.03] DIGIT FIVE (NU) [7.01] SPACE (SP) [18.0]
    LATIN SMALL LETTER C (AL) [28.0] LATIN SMALL LETTER E (AL) [28.0]
    LATIN SMALL LETTER N (AL) [28.0] LATIN SMALL LETTER T (AL) [28.0]
    LATIN SMALL LETTER S (AL) [0.3]

    Note that these are only the cases I've found. There may be more.
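
    A rough way to look for further cases is to script the check. The
    following is only a sketch, assuming the comment part of each test
    line has the "[rule] NAME (CLASS)" layout quoted above. The function
    name lb25_conflicts is hypothetical, the pair table is transcribed
    from my reading of LB25 in UAX #14 and should be verified against
    the spec, and skipping CM is a simplification of LB9.

```python
import re

# Class pairs that LB25 lists as "do not break between".
# Assumption: transcribed from UAX #14 by hand; verify against the spec.
LB25_PAIRS = {
    ("CL", "PO"), ("CP", "PO"), ("CL", "PR"), ("CP", "PR"),
    ("NU", "PO"), ("NU", "PR"), ("PO", "OP"), ("PO", "NU"),
    ("PR", "OP"), ("PR", "NU"), ("HY", "NU"), ("IS", "NU"),
    ("NU", "NU"), ("SY", "NU"),
}

# One token is "[rule] CHARACTER NAME (CLASS)". The trailing "[0.3]"
# has no name after it, so it is never matched.
TOKEN = re.compile(r"\[(\d+(?:\.\d+)?)\]\s+[^\[\]()]*\((\w+)\)")


def lb25_conflicts(comment):
    """Return (left, right) class pairs where the comment records a
    break by rule 999.0 although LB25 lists the pair as non-breaking."""
    conflicts = []
    prev = None  # class of the last non-CM character seen
    for rule, cls in TOKEN.findall(comment):
        # Simplification of LB9: a combining mark is treated as part
        # of the preceding character and does not change its class.
        if cls == "CM" and prev is not None:
            continue
        if rule == "999.0" and (prev, cls) in LB25_PAIRS:
            conflicts.append((prev, cls))
        prev = cls
    return conflicts
```

    Applied to the comment quoted for l.1137, this reports the pair
    ("CP", "PO"). A line where every boundary is decided by a numbered
    rule other than 999.0 reports nothing.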

    I also took a glance at LineBreakTest-6_0_0d4.txt and found the same
    contradictions there.

    Thanks.


