Re: Definition of titlecase

From: Mark Davis (mark@macchiato.com)
Date: Sun Jul 25 1999 - 21:03:17 EDT


Titlecase is defined to be a format where the first letter of each word is
capitalized. This is also known as initial capitals. This format is not always
appropriate for titles: even in simpler cases than your example, not every word
in a title is titlecased: e.g. from the film world (www.imdb.com):

L' Humanite (1999)
Shakespeare in Love (1998)
The Silence of the Lambs (1991)
Dances with Wolves (1990)
Kramer vs. Kramer (1979)
etc.

A format like "vederLa" or "McGowan" is known as mixed case. For more
information, see UTR #21, Case Mappings at
http://www.unicode.org/unicode/reports/tr21/

As to the word boundaries, thanks for the tip; we will check on it in the 3.0
text to see if it is still there.

Mark

Timothy Partridge wrote:

> On page 4-2 of the 2.0 standard it says that titlecase capitalises the first
> letter of each word. Is this intended to be a definition of how an
> implementation should do titlecasing? If so is word meant to be script /
> language / implementation specific or is a more precise definition of word
> intended, such as the algorithm for determining word boundaries on page
> 5-22?
>
> As an exercise to the reader, try titlecasing the following rather contrived
> example which is hard for a computer:
>
> "chapter 5 in which mr o'brien and miss hadden-smith are cock-a-hoop when
> they discover that 'tis better to leap o'er trouble than to face danger's
> jaws."
>
> As a separate issue I notice that the suggested word boundary algorithm
> defines a set called Mid but doesn't use it. Is this an error?
>
> Tim



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:50 EDT