Re: ZWJ, ZWNJ and Markup languages.

From: Asmus Freytag (t) <>
Date: Fri, 27 Nov 2015 18:49:31 -0800
On 11/27/2015 11:55 AM, Plug Gulp wrote:

The Unicode standard 8.0 states in chapter 23, section titled "Cursive
Connection and Ligatures"(printed page #814, PDF page #850) that:

"The zero width joiner and non-joiner characters are designed for use
in plain text; they should not be used where higher-level ligation and
cursive control is available. (See Uni-code Technical Report #20,
“Unicode in XML and Other Markup Languages,” for more information.) "

I went through TR#20 and did not find any mention that ZWJ and ZWNJ
are not suitable for use with markup languages. On the contrary, ZWJ
and ZWNJ are listed in TR#20 under section 4 titled "Format Characters
Suitable for Use with Markup".

So are ZWJ and ZWNJ characters suitable for use with markup languages
such as HTML and XML?


PS: the kind of "per-instance" ligation control isn't generally available, although one
could design special XML schemas where it would be.

PPS: when it is desired, for example, to suppress all ligation, it would be better to
do that with some overal control rather than to use ZWNJ at all possible instances.

PPPS: there's always a tension between stylistic and orthographic ligation; the rules
for the latter are changing though, because people are adapting them to the limitations
of software....

Thanks and kind regards,


Received on Fri Nov 27 2015 - 20:50:34 CST

This archive was generated by hypermail 2.2.0 : Fri Nov 27 2015 - 20:50:35 CST