Combining Triple Diacritics (N3915) not accepted by UTC #125

From: Karl Pentzlin (karl-pentzlin@acssoft.de)
Date: Wed Nov 10 2010 - 11:11:08 CST

Next message: Murray Sargent: "RE: Combining Triple Diacritics (N3915) not accepted by UTC #125"

Previous message: Karl Pentzlin: "Re: Are Latin and Cyrillic essentially the same script?"
Next in thread: Murray Sargent: "RE: Combining Triple Diacritics (N3915) not accepted by UTC #125"
Reply: Murray Sargent: "RE: Combining Triple Diacritics (N3915) not accepted by UTC #125"
Reply: Khaled Hosny: "Re: Combining Triple Diacritics (N3915) not accepted by UTC #125"
Maybe reply: Khaled Hosny: "Re: Combining Triple Diacritics (N3915) not accepted by UTC #125"
Maybe reply: Benjamin M Scarborough: "Re: Combining Triple Diacritics (N3915) not accepted by UTC #125"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

From the Pre-Preliminary minutes of UTC #125 (L2/10-416):

> C.4 Preliminary Proposal to enable the use of Combining Triple
> Diacritics in Plain Text (WG2 N3915) [Pentzlin, L2/10-353]
- see http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3915.pdf

> [125-A13] ... UTC does not believe that either solution A or solution B
> represents an appropriate encoding solution for the text
> representation problem shown in this document. Appropriate
> technology involving markup should be applied to the problem of
> representation of text at this level.

This will not happen.
Linguists will continue to use their PUA code points (or even their
8-bit fonts), which employ these characters perfectly (albeit using
precomposed glyphs for the used combinations).

> This is not plain text.

It *is*, at least for the applications in dialectology where groups of
three characters linked by one of the proposed triple diacritics have a
well-defined and documented meaning.

This is also proven by the fact that the existing PUA characters
fulfill perfectly the needs of the relevant academic communities,
except being interchangeable without using special fonts containing
these PUA characters (a request which could be overcome when these
characters are contained in Unicode).

> Processes such as line-breaking do not know about these, or the
> double diacritics, and this creates problems for processes.

Problems are there to be solved, and they are solvable.
E.g., simply state that no line break may occur in the realm of a
diacritic spanning over three letters.

Latin *is* a complex script, anyway.

- Karl Pentzlin

Next message: Murray Sargent: "RE: Combining Triple Diacritics (N3915) not accepted by UTC #125"
Previous message: Karl Pentzlin: "Re: Are Latin and Cyrillic essentially the same script?"
Next in thread: Murray Sargent: "RE: Combining Triple Diacritics (N3915) not accepted by UTC #125"
Reply: Murray Sargent: "RE: Combining Triple Diacritics (N3915) not accepted by UTC #125"
Reply: Khaled Hosny: "Re: Combining Triple Diacritics (N3915) not accepted by UTC #125"
Maybe reply: Khaled Hosny: "Re: Combining Triple Diacritics (N3915) not accepted by UTC #125"
Maybe reply: Benjamin M Scarborough: "Re: Combining Triple Diacritics (N3915) not accepted by UTC #125"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.5 : Wed Nov 10 2010 - 11:15:12 CST