Line breaking status of emoji modifiers

From: Simon Cozens <simon_at_simon-cozens.org>
Date: Sun, 6 Dec 2015 09:08:59 +0900

My renderer just got hit with an interesting, if possibly obscure, bug.

UTR#51 says "A supported emoji modifier sequence should be treated as a
single grapheme cluster for editing purposes (cursor movement, deletion,
etc.); word break, line break, etc." However, the modifier codepoints
have line break category AL.

So you have an emoji (line break ID) and its modifier (line break AL),
and ICU (quite correctly) inserts a line break opportunity between the
two. That split the cluster, and everything went downhill from there.

If a line break is not expected here, wouldn't the modifiers be better
off as CM for line breaking purposes rather than AL?
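
To make the difference concrete, here is a toy sketch in Python. It is
not ICU and not the full UAX #14 algorithm: it assumes only three
pair rules (no break before CM, no break between AL and AL, break
allowed everywhere else), and the function names break_allowed and
opportunities are made up for illustration.

```python
# Toy model of line break opportunities between adjacent classes.
# Assumption: only three UAX #14 behaviours are modelled here.
# The real algorithm has many more rules and exceptions.

def break_allowed(before: str, after: str) -> bool:
    """Return True if a break opportunity exists between two classes."""
    if after == "CM":
        # A combining mark attaches to the preceding base (cf. LB9);
        # the exceptions for spaces and hard breaks are ignored here.
        return False
    if before == "AL" and after == "AL":
        # No break between two alphabetic characters (cf. LB28).
        return False
    # Default: a break is allowed (cf. LB31).
    return True


def opportunities(classes):
    """Return the indices at which a break may occur in a class sequence."""
    return [
        i
        for i in range(1, len(classes))
        if break_allowed(classes[i - 1], classes[i])
    ]


# Emoji (ID) followed by a modifier classed as AL: a break is allowed.
print(opportunities(["ID", "AL"]))  # [1]

# Same pair with the modifier classed as CM: no break inside the cluster.
print(opportunities(["ID", "CM"]))  # []
```

Under these assumptions the ID/AL pair yields a break opportunity in
the middle of the sequence, while ID/CM keeps the pair together.
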
Received on Sun Dec 06 2015 - 11:03:25 CST
