Re: Minor flaw in rules for locating text element boundaries

From: Mark Davis (markdavis@ispchannel.com)
Date: Tue May 16 2000 - 00:37:37 EDT


Good catch. We will be preparing the errata for 3.0.1 soon, so keep those bugs coming...

Mark

Timothy Partridge wrote:

> On page 125 of Unicode 3.0, rule 4 says
>
>   No overlapping sets. [snip] A later character set definition will override a
>   previous one, removing its characters from the previous set.
>
> In the Line Boundaries section a large number of sets are defined on pages
> 129-130. Unfortunately the last set to be defined is
>
>   All     All Unicode characters
>
> Surely by strict interpretation of rule 4 this sucks all the characters out
> of the previous sets? I know what you mean, but you don't mean what you say.
>
> Tim
>
> P.S. This significantly increases the efficiency of implementations - line
> breaks can occur before and after every character :-)
>
> --
>
> Tim Partridge. Any opinions expressed are mine only and not those of my employer
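
Tim's point can be made concrete with a small sketch of rule 4 read
literally. This is only an illustration, not anything from the standard:
the function name define_sets, the set names other than "All", and the
five-character stand-in for "all Unicode characters" are hypothetical.

```python
def define_sets(definitions):
    """Apply set definitions in order under a literal reading of rule 4:
    each later definition removes its characters from every earlier set."""
    sets = {}
    for name, chars in definitions:
        chars = set(chars)
        for earlier in sets.values():
            earlier -= chars  # later definition overrides earlier ones
        sets[name] = chars
    return sets


# Tiny stand-in for the full repertoire (assumption for illustration only).
universe = set("ab 1-")

# Hypothetical set names; only "All" comes from the quoted text.
result = define_sets([
    ("Space", " "),
    ("Digit", "1"),
    ("Hyphen", "-"),
    ("All", universe),  # defined last, as on pages 129-130
])

# Defining "All" first instead leaves the specific sets intact.
reordered = define_sets([
    ("All", universe),
    ("Space", " "),
    ("Digit", "1"),
])
```

With "All" defined last, every earlier set in result ends up empty and
"All" holds everything. With "All" defined first, as in reordered, it
shrinks to whatever the later sets do not claim, which is presumably the
intended meaning.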


