229 Linebreaking Changes for Pictographic Symbols 2012.07.23
Status: Closed
Originator: UTC
Resolution: Fourteen characters (chess symbols, two telephone symbols) were removed from the main inclusion set. The remainder of the main inclusion set will have its Line_Break property updated for Unicode 6.2. The regional indicator set was not affected by the resolution of this PRI.

Description of Issue:

Emoji are now used in many contexts other than Japanese e-mail and text messages. The current Line_Break property value of most Unicode characters used for emoji is AL (alphabetic letter), which allows no line break between such a character and an adjacent letter and is causing problems in many of these contexts. For example, here are short lines (adapted from examples promoting an emoji application) that mix Latin characters and emoji without spaces; there is no line break opportunity anywhere in the sentences:

[Figure: Bad line break in short lines with emoji]
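
The effect described above can be illustrated with a toy model. The sketch below is not the UAX #14 algorithm and does not read LineBreak.txt. It assumes only two classes, with no break between two AL characters and a break allowed on either side of an ID (ideographic) character. ID as the replacement class is an assumption made for illustration, since this notice does not name the new value. The helper names `is_pictograph` and `break_opportunities`, the code point ranges, and the sample string are all hypothetical.

```python
def is_pictograph(ch):
    """Crude, hypothetical stand-in for a property lookup (not LineBreak.txt data)."""
    cp = ord(ch)
    return 0x1F300 <= cp <= 0x1F5FF or 0x1F600 <= cp <= 0x1F64F


def break_opportunities(text, pictograph_class):
    """Return the indexes before which a break is allowed in this toy model.

    pictograph_class is the class assumed for pictographs: "AL" or "ID".
    Every other character is treated as AL, so the input should contain
    only letters and pictographs (no spaces or punctuation).
    """
    classes = [pictograph_class if is_pictograph(ch) else "AL" for ch in text]
    # Toy rule: AL next to AL never breaks; any pair involving ID may break.
    return [i for i in range(1, len(text)) if "ID" in (classes[i - 1], classes[i])]


# Hypothetical sample: Latin letters and two emoji, no spaces.
sample = "I\U0001F60Dcats\U0001F431today"

print(break_opportunities(sample, "AL"))  # pictographs treated as AL
print(break_opportunities(sample, "ID"))  # pictographs treated as ID
```

With the pictographs classed as AL, the model finds no break opportunity in the sample. With ID, it finds one on each side of each emoji.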

For details of the proposal to change the Line_Break property values for certain symbols, please see the background document.

The changes to LineBreak.txt associated with this proposal are not part of the beta review of the UCD for Unicode 6.2, but are posted separately for review in LineBreakPRI229.txt.

