Re: Another take on the English apostrophe in Unicode

From: David Starner <>
Date: Fri, 05 Jun 2015 01:31:09 +0000

On Thu, Jun 4, 2015 at 2:38 PM Markus Scherer <> wrote:

> "don’t" is a contraction of two words, it is not one word.

But as he points out, it's not a contraction of don and t; it is, at best,
a contraction of do and n't. It's eliding, not punctuating. In the
comments, he also brings up the examples of "Don’t you mind?" being okay
but not *"Do not you mind?", and "fo’c’sle".

> You can't use simple regular expressions to find word boundaries.

Who uses _simple_ regular expressions? You can't use any code to reliably
find word boundaries in English, and that's a problem.
Received on Thu Jun 04 2015 - 20:32:53 CDT

This archive was generated by hypermail 2.2.0 : Thu Jun 04 2015 - 20:32:54 CDT