Re: Specification of Encoding of Plain Text

From: Richard Wordingham <>
Date: Fri, 13 Jan 2017 09:02:32 +0000

On Thu, 12 Jan 2017 21:03:29 +0100
Mark Davis ☕️ <> wrote:

> Latin is not a complex script,...

Unlike the common script, which notably has U+2044 FRACTION SLASH.

That statement is actually dubious from a typographical point of view.

> it was only an illustration.

But it's good for looking for the non-obvious issues.

> A more serious effort would look at some of the issues from
>, for example.

I don't think we want to have to repeat them all for each script.
Putting common-script punctuation and numbers in the regex will add
obscurity, and possibly be a maintainability issue.

Received on Fri Jan 13 2017 - 03:03:10 CST

This archive was generated by hypermail 2.2.0 : Fri Jan 13 2017 - 03:03:11 CST