Re: Dealing with Unencodeable Characters

From: Doug Ewell <doug_at_ewellic.org>
Date: Thu, 06 Oct 2016 12:06:07 -0700

Charlotte Buff wrote:

> Private use characters are an obvious choice but of course their
> meaning is user-defined, so while all other emoji in my Shift JIS
> document would receive an unambiguous Unicode mapping, Shibuya 109
> would remain vague and very limited in interchange options.

But that's exactly what private-use characters were invented for: so you
can represent characters in a given character encoding framework which
are not encoded for some reason.

Of course you need a private agreement of some kind, but it can be as
simple as "Hey, everybody, in the attached document (or in any documents
I create) U+FF109 means SHIBUYA 109." Private agreements don't have to
be secret or limited-distribution, and they don't have to be excessively
formal.
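
As a rough sketch of how little such an agreement needs, here it is as a published lookup table in Python. The names PRIVATE_AGREEMENT, is_private_use and describe are hypothetical, invented for this illustration; only the unicodedata calls come from the standard library.

```python
import unicodedata

# Hypothetical private agreement: one published table of private-use
# assignments. The single entry is the example from this message.
PRIVATE_AGREEMENT = {
    0xFF109: "SHIBUYA 109",
}


def is_private_use(ch: str) -> bool:
    """Return True if ch has General_Category Co (private use)."""
    return unicodedata.category(ch) == "Co"


def describe(ch: str) -> str:
    """Name a character, consulting the private agreement for PUA code points."""
    cp = ord(ch)
    if is_private_use(ch):
        # Unicode assigns no meaning here; only the agreement can supply one.
        return PRIVATE_AGREEMENT.get(cp, f"<private use U+{cp:04X}, no agreement>")
    return unicodedata.name(ch, f"<unnamed U+{cp:04X}>")


if __name__ == "__main__":
    text = "Meet me at \U000FF109 at noon."
    for ch in text:
        if is_private_use(ch):
            print(f"U+{ord(ch):04X} -> {describe(ch)}")
```

A recipient who has the table gets SHIBUYA 109 back; one who does not still receives a valid, round-trippable code point, just without a meaning attached.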

Unicode rejected the "compatibility symbols" because they would have
amounted to private-use characters defined by Unicode, where the formal
names and definitions of the characters were not specified but, shhh, we
all know what they REALLY mean. This would have been the Wrong Thing to
Do on many levels.

 

--
Doug Ewell | Thornton, CO, US | ewellic.org
Received on Thu Oct 06 2016 - 14:06:27 CDT
