[Unicode] Unicode Corrigenda Tech Site | Site Map | Search
 

Corrigendum #7: UAX #14, Unicode Line Breaking Algorithm, rule LB8

 

Corrigendum Effective Date Applicable Versions Fixed Version Result Documented In:
Corrigendum #7: UAX #14, Unicode Line Breaking Algorithm, rule LB8 2010-Mar-15
[121-C5]
5.0.0 to 5.2.0 6.0.0 UAX #14

Background

In UAX #14: Unicode Line Breaking Algorithm (citing Version 5.2), the formulation of Rule LB8 does not match its intent, nor does it match the suggested implementation in the pair table. The effect of rule LB8 before correction is that there is a break in the sequence ZW ÷ CL, but no break in ZW SP × CL. This is not consistent with similar situations where the addition of one or more spaces does not remove break opportunities. With this corrigendum, there is a break in ZW SP ÷ CL.

Changes to the Text of UAX #14

Rule LB8 is changed to:

LB8. Break before any character following a zero-width space, even if one or more spaces intervene.

ZW SP* ÷


Access to Copyright and terms of use