Re: Question about “Uppercase” in DerivedCoreProperties.txt

From: Mike FABIAN <mfabian_at_redhat.com>
Date: Sat, 08 Nov 2014 10:22:10 +0100

Philippe Verdy <verdy_p_at_wanadoo.fr> さんはかきました:

> note that tolower() and toupper() can only work one 1-character level, it
> is not recommended for use for changing case of plain text.
>
> For correct handling of locales, to upper and toupper should be replaced by
> strtolower and strtoupper (or their aliases) which will be able to process
> character clusters and contextual casing rules needed for a language or
> orthographic style

Yes, thank you for explaining this.

But these details of upper and lower casing cannot be expressed in the
“i18n” file of glibc:

https://sourceware.org/git/?p=glibc.git;a=blob;f=localedata/locales/i18n

For toupper and tolower, this file just has character -> character
mapping tables, for example the “tolower” table contains only

(<U03A3>,<U03C3>)

(i.e. mapping Σ U+03A3 -> σ U+03C3, never to the final sigma ς
U+03C2).

More correct, detailed information about upper and lower case must come
from elsewhere, not from this “i18n” file in glibc. Using only the
information from this “i18n” file, not even the Greek sigma can be
handled correctly.

Pravin and me want to update this “i18n” file to the latest
data from Unicode 7.0.0, doing it as correct as possible within
the limitations caused by this file and the ISO C standard.

-- 
Mike FABIAN <mfabian_at_redhat.com>
☏ Office: +49-69-365051027, internal 8875027
睡眠不足はいい仕事の敵だ。
_______________________________________________
Unicode mailing list
Unicode_at_unicode.org
http://unicode.org/mailman/listinfo/unicode

Received on Sat Nov 08 2014 - 03:23:24 CST

This message: [ Message body ]
Next message: Mark Davis ☕️: "Re: Emoji skin tone modifiers on the website of a leading German daily newspaper"
Previous message: Karl Williamson: "Re: New Unicode Emoji draft, available for review"
In reply to: Philippe Verdy: "Re: Question about “Uppercase” in DerivedCoreProperties.txt"
Next in thread: Philippe Verdy: "Re: Question about “Uppercase” in DerivedCoreProperties.txt"
Reply: Philippe Verdy: "Re: Question about “Uppercase” in DerivedCoreProperties.txt"
Reply: Christopher Vance: "Re: Question about “Uppercase” in DerivedCoreProperties.txt"

Mail actions: [ respond to this message ] [ mail a new topic ]
Contemporary messages sorted: [ by date ] [ by thread ] [ by subject ] [ by author ] [ by messages with attachments ]

This archive was generated by hypermail 2.2.0 : Sat Nov 08 2014 - 03:23:25 CST