Re: CSUR Tonal

From: Luke Dashjr <luke_at_dashjr.org>
Date: Sat, 14 Mar 2015 16:17:44 +0000

On Friday, August 06, 2010 9:02:39 AM Andrew West wrote:
> Looking at the examples shown on
> <http://en.wikipedia.org/wiki/John_W._Nystrom>, it seems to me that
> 0-8 are ordinary digits, and the symbols for 9 through 15 are inverted
> or inverted+modified forms of the digits '7' through '1', so that
> there is some sort of imperfect bilateral symmetry on the clock and
> compass faces, with '0' and '8' as the axis of symmetry. Thus the '9'
> is an inverted '6' (as 16-6=10) not an ordinary '9'. So except for the
> odd glyph forms for 9, 11, 12 and 15 (would be be expected to be
> simple inversions of '7', '5', '4' and '1') it makes sense as a system
> to me.
>
> Anyhow, I do not think the ordinary digits '0' through '8' should be
> encoded in the CSUR.

The question was recently raised whether this reasoning is sound, considering
that the digits '0' through '8' are not the same when being "rendered" by
text-to-speech. A number '100' (written in tonal digits) ought not be spoken
the same as ASCII '100' although it is visually identical when displayed as
writing. Does Unicode give any relevance to non-visual rendering, or do TTS
just need to settle for environmental hints (eg, the user explicitly telling
it tonal numbers are in use)?

For additional context, the pronunciation vs visual differentiation came up
when trying to get the BTC "B with double lines" encoded in Ubuntu's fonts.
The font designers argue that because the BTC version should be pronounced
"bitcoins", it is not appropriate to use the existing encoding which might be
pronounced differently[1].

Thank you for any insight on this matter,

Luke

1. https://bugs.launchpad.net/ubuntu-font-family/+bug/1061115

P.S. Can someone disable (or soften) Spamhaus RBL for the Unicode mailing list
(and/or private email servers for some of those CC'd)? Spamhaus has recently
been abusing their RBL for putting political pressure on entire ISPs including
the one my mail server is within the IP subnet of, despite that my mail server
has never been involved in any spam.
_______________________________________________
Unicode mailing list
Unicode_at_unicode.org
http://unicode.org/mailman/listinfo/unicode
Received on Sat Mar 14 2015 - 12:19:15 CDT

This archive was generated by hypermail 2.2.0 : Sat Mar 14 2015 - 12:19:17 CDT