UAX #29 6.2

From: Zack Newman via Unicode <unicode_at_unicode.org>
Date: Fri, 6 Mar 2020 20:36:31 -0700

According to 6.2, "thus ignoring Extend is sufficient to disallow breaking
within a grapheme cluster." However the sequence of Unicode scalar values
(U+0600, U+0020) is considered a single grapheme cluster due to rule GB9,
but the sequence is parsed into two words according to 4.1.1. While it
would be ideal to not have sequences of Unicode scalar values that can be
parsed into more words than grapheme clusters, I think it's more
understandable if section 6.2 didn't explicitly state that this isn't
possible.
Received on Sat Mar 07 2020 - 09:48:27 CST

This archive was generated by hypermail 2.2.0 : Sat Mar 07 2020 - 09:48:28 CST