|215||Proposed Update UAX #29: Unicode Text Segmentation||2012.07.23|
|Informal Discussion:||Unicode Mail List (Join)|
|Formal Feedback:||Contact Form|
|Resolution:||The UAX will be updated with final content and published as part of Unicode 6.2.|
Description of Issue:
This Unicode Standard Annex will be updated for Unicode 6.2. The proposed update is now available for general public review and comment.
The text of UAX #29 is being changed to address certain segmentation issues for symbols in Grapheme_Cluster_Break and Word_Break, disallowing breaking within the sequence <regional indicator symbol, Zero Width Joiner, regional indicator symbol>. This involves the introduction of new property values, new and modified rules, and changes in property values for certain characters. The reasons for these changes are discussed in the following background document:
The corresponding property value changes are incorporated in the data files available for beta review of the UCD for Unicode 6.2.
For information about how to discuss this issue and how to supply formal feedback, please see the feedback and discussion instructions. The accumulated feedback received so far on this issue is shown below, or you can look at a full page view.