Re: Character classification and casing and locales

From: Alain LaBont/e'/ (alb@sct.gouv.qc.ca)
Date: Tue Nov 26 1996 - 15:04:35 EST


At 11:52 96-11-26 -0800, unicode@Unicode.ORG wrote:
>There are two separate questions:
>
>1. Is anyone working on Locale support for Unicode encoded strings?
>
>I suppose people are doing such things, but I don't have a definitive
answer.
>There are some tables that the consortium provides for DEFAULT character
>classification, etc.
>
>2. Are there any better ways to do [upper/lower-casing, etc], and other
locale
>dependent character operations?
>
>The Unicode FTP site, and the standard, provides a default upper/lower case
>table. I think that this particular operation is typically the same
>everywhere, with the exception of the dotted upper-case "I" in Turkish. You
>could use the table as a default, if you don't have other information you'd
>prefer to use.
>
> Rick

You probably already know that ISO/IEC JTC1/SC22/WG20 is working on a
default tailorable ordering standard (ISO/IEC 14651) for the UCS which
defines the ordering in terms of LOCALES with slight extensions to POSIX
syntax, don't you?

Keld Simonsen is also working on the complement of this standard (ISO/IEC
14652) which will define the syntax and the rest of the default "universal
LOCALE" for such things as upper/lower.

Alain LaBonti
editor, ISO/IEC CD 14651 standard
Quibec



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:32 EDT