Titlecasing words starting with numeric glyphs and period as word separator

From: Koji Ishii (kojiishi@gluesoft.co.jp)
Date: Tue Feb 22 2011 - 01:15:46 CST

  • Next message: Mark Davis ☕: "Re: Titlecasing words starting with numeric glyphs and period as word separator"


    There's a discussion going on in W3C CSS mailing list[1] about specifications of the text-transform property[2], specifically how the "capitalize" value that titlecase specified span of text.

    During the discussion, two cases were presented:

    1. Titlecasing words starting with numeric glyphs (e.g., "99ers") can be "99Ers" if we follow the rules defined in 5.18 Case Mappings. Is this discussed here and it's up to implementations to define which words to apply titlecasing, or should this be fixed in Unicode spec?

    2. We're thinking to use UAX #24 to separate words and then apply Titlecase_Mapping to every word. But doing so makes "a.m." to be "A.m." and it contradicts with the general publication rules[3]. While I understand both separating words and titlecasing are ambiguous, cannot be perfect, and we must make compromises. But since Unicode defines these two rules separately, I guess there's a possibility that "word separating rules optimized for titlecasing" could be slightly different from general word separating rules. I haven't thought much about counter-cases for not doing so, but I wonder if anyone in this ML could have idea including whether we should do it or not, or we should include more other cases.

    Any feedback is greatly appreciated.


    [1] http://lists.w3.org/Archives/Public/www-style/2011Feb/0621.html
    [2] http://dev.w3.org/csswg/css3-text/#text-transform
    [3] http://www.businesswritingblog.com/business_writing/2009/06/what-is-the-correct-time-am-pm-am-pm-am-pm-.html

    This archive was generated by hypermail 2.1.5 : Tue Feb 22 2011 - 01:20:25 CST