The term "complex script" in OOXML spec (Re: [unicode] Re: Character set cluelessness)

From: suzuki toshiya <>
Date: Wed, 03 Oct 2012 14:46:57 +0900

Dear Richard,

There had been long discussion about OOXML's "complex script" in JTC1/SC34,
since 2009.
I expect the next corrigendum or amendment will describe more about it.
Unfortunately, the implementation referred by "complex script" would be
different what Unicode experts remind from this word.


Martin J. Dürst wrote (2012/10/03 9:37):
> Richard - Complex script usually refers to scripts where rendering isn't
> just simply putting glyphs side by side. That includes stuff with
> combining marks, ligatures, reordering, stacking, and the like.
> Regards, Martin.
> On 2012/10/03 7:09, Richard Wordingham wrote:
>> On Tue, 02 Oct 2012 09:14:08 -0700
>> "Doug Ewell"<> wrote:
>>> It's 2012. How does one get through to folks like this?
>> Even people who should know better can get confused about character
>> sets. Does anyone know what 'a complex script Unicode range' is? It's
>> a term that occurs in the Office Open XML specification, but I
>> can't find a definition for it.
>> It's just possible that it means a range where hypothetically unassigned
>> characters would not be left-to-right, but I've a feeling it ought to
>> include Vietnamese characters for all that they're Latin script.
>> Possibly the definitions have not been provided because the concept
>> ought to involve the tricky task of breaking text runs into script
>> runs. (Lots of people feel one should be able to add script-specific
>> combining marks to U+25CC DOTTED CIRCLE, U+2013 EN DASH and U+00D7
>> U+0964 DEVANAGARI DANDA is used with the Latin, Devanagari and Tamil
>> scripts, to name but a few.)
>> Richard.
Received on Wed Oct 03 2012 - 00:52:50 CDT

This archive was generated by hypermail 2.2.0 : Wed Oct 03 2012 - 00:52:51 CDT