From: Shawn Steele (Shawn.Steele@microsoft.com)
Date: Wed Mar 02 2011 - 11:26:23 CST
It might be wise to point out that title casing might be weird in English. ("Book Of Things" isn't normal), but for other languages it's even more bizarre because they don't title case at all when "we" would. So a developer using text-transform to title case stuff because it "looks better" might be actually making it look worse in many places.
From: Koji Ishii [mailto:email@example.com]
Sent: Wednesday, March 02, 2011 12:00 AM
To: Asmus Freytag; Shawn Steele
Cc: CE Whitehead; firstname.lastname@example.org
Subject: RE: Titlecasing words starting with numeric glyphs and period as word separator
Thank you everyone for the responses.
I was not aware that titlecasing is this complicated when I started discussion here and at CSS ML, sorry about that, but now I knew much better thanks to everyone.
The text-transform property in CSS is designed as a very limited, dumb and simple feature in CSS1, so what we want here is to improve its international support using what Unicode has defined while keeping the scope of the feature intact. It may not be as useful as many would expect, but is still usable for some cases, and can protect existing pages from broken.
The scope of the feature, at least as of now, doesn't cover cases like "of" or "Of", which require language-dependent dictionaries.
For cases like "a.m.", "'Tis", "L'Arbre", "!Kung", or "49ers", the spec took a little different approach from 5.18 Case Mapping in Unicode. As you can see, the spec says "first *character* of each word" and recommends UAX #29 for separating words, but allows implementations to change it to optimize for titlecasing. For instance, U+002E FULL STOP and U+0027 APOSTROPHE are MidNumLet in UAX #29 but implementations can use them as word separators for titlecasing to solve all the cases listed above. Doing so may have undesired side effects, I'm not sure, but the point is in the text-transform CSS property, the exact behavior for punctuation is UA dependent.
I'm not suggesting or requesting anything here, but I posted the message here to let you know about the current activities in CSS, and to let you know that if Unicode can define something new, I will try to incorporate the efforts into the text-transform property.
This archive was generated by hypermail 2.1.5 : Wed Mar 02 2011 - 11:28:02 CST