New Public Review Issue: UAX #14 proposed update

From: Rick McGowan (
Date: Fri Jan 20 2006 - 11:44:33 CST

  • Next message: Guy Steele: "Re: Ellipsis"

    The Unicode Technical Committee has posted a new issue for public review
    and comment. Details are on the following web page:


    Review period for the new item closes on January 30, 2006.

    Please see the page for links to discussion and relevant documents.
    Briefly, the new issue is:

    Proposed Update UAX #14: Line Breaking Properties

    The UTC has modified the conformance clauses of UAX #14 and the text they
    reference. These changes clarify precisely what is tailorable in conformant
    implementations and what is not. The non-tailorable results are limited to
    interactions among a small set of well-defined core characters, such as
    CR, LF, NBSP, SP, and so on, where the semantics of the characters is bound
    up in how they linebreak.

    The rules of the algorithm have been reordered and restated to allow free
    tailorability of most rules and the character properties they depend upon,
    except for those rules involving the core set.

    Reviewers should note that all the rules were renumbered, with a
    correspondence table between rule numbers in the Modifications section.

    The default assignments to linebreak classes for certain South East Asian
    scripts have been adjusted to make it easier to interface with the kind of
    external algorithm needed to handle these scripts correctly. A few rules
    have been added or modified to better reflect the behavior of currency
    characters (and similar characters) in numbers, and the use of words like

    Other minor adjustments have been made to line break classes, rules and text.

    If you have comments for official UTC consideration, please post them by
    submitting your comments through our feedback & reporting page:

    If you wish to discuss issues on the Unicode mail list, then please use
    the following link to subscribe (if necessary). Please be aware that
    discussion comments on the Unicode mail list are not automatically recorded
    as input to the UTC. You must use the reporting link above to generate
    comments for UTC consideration.

            Rick McGowan
            Unicode, Inc.

    This archive was generated by hypermail 2.1.5 : Fri Jan 20 2006 - 11:47:29 CST