Re: unicode Digest V12 #108

From: Jukka K. Korpela
Date: Wed, 06 Jul 2011 10:16:50 +0300

2011-07-06 9:25, Asmus Freytag wrote:

> Because accidental digraphs (in Danish) happen at word boundaries in a
> compound, the SHY is an elegant way to mark them.

It may often be a practical trick, given the current repertoire of
characters in Unicode and the way they are handled in different
programs. But I don’t see any elegance in it, and it can easily turn
into an impractical method.

What you really want to say is not “this is an allowable word division
point” but “these two (or more!) characters are not to be treated as one
unit of text, even in a context where they normally would be so treated.”
You _might_ also want to explicitly allow word division, but that’s
coincidental.

Allowing word division just to say that some characters do not
constitute a digraph (or trigraph…) is not practical, e.g., when the text
otherwise has no word divisions, for one reason or another, or when the
particular word division point is typographically suboptimal or even bad.
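
The coupling described above can be illustrated with a minimal Python
sketch. It assumes a toy model in which "aa" is the only digraph and in
which SHY (U+00AD) both blocks the digraph and marks a division point.
The sample word "ekstraarbejde" (ekstra + arbejde) and the helper names
units and break_points are illustrative assumptions, not part of any
real library or of the Unicode algorithms.

```python
SHY = "\u00ad"  # SOFT HYPHEN: its defined meaning is "allowable word division point"


def units(word, digraphs=("aa",)):
    """Split a word into text units, treating listed digraphs as one unit.

    Toy model (assumption): a SHY between two letters prevents them from
    being read as a digraph. The SHY itself is not emitted as a unit.
    """
    out = []
    i = 0
    while i < len(word):
        if word[i] == SHY:
            i += 1
            continue
        pair = word[i:i + 2]
        if pair.lower() in digraphs:
            out.append(pair)
            i += 2
        else:
            out.append(word[i])
            i += 1
    return out


def break_points(word):
    """Return offsets in the visible text where the word may be divided."""
    points = []
    visible = 0
    for ch in word:
        if ch == SHY:
            points.append(visible)
        else:
            visible += 1
    return points


plain = "ekstraarbejde"               # "aa" straddles the compound boundary
marked = "ekstra" + SHY + "arbejde"   # SHY used to mark the accidental digraph

print(units(plain))          # contains the unit "aa"
print(units(marked))         # only single letters
print(break_points(marked))  # the mark also licenses a division here
print(units(marked.replace(SHY, "")) == units(plain))
```

In this sketch the marked word cannot say “not a digraph” without also
saying “may be divided here”, and removing SHY to suppress word division
silently restores the digraph reading. A dedicated mark with only the
first meaning would avoid both effects.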

Received on Wed Jul 06 2011 - 02:18:59 CDT
