Re: regular expressions

From: John Cowan (cowan@locke.ccil.org)
Date: Tue Feb 04 1997 - 16:56:47 EST


Alain LaBonté wrote:

> That's the idea of a traditional locale. However in ISO/IEC CD 14651 we
> completely redefine the comparison operation (we introduce the concept of
> equivalence at different levels of comparison) and that allows you to do all
> that with a single operation without changing locales...
>
> Hence "La Bonté" is equivalent to "labonte" at level 1, it is not at level 2
> or higher... it is equivalent to "labonté" at level 2, it is not at level 3
> or higher (note that there is a space in the reference, none in the
> comparand, that is intentional), it is equivalent to "LaBonté" at level 3
> and it is not at higher levels and finally it is absolutely identical (or
> equivalent at level 4) to "La Bonté" (note that the standard makes this
> code-independent too!)

How about locales that use two different alphabets? Historically
in Yugoslavia the strings "znati" (= to know) and
"\x0437\x043D\x0430\x0442\x0438" were taken as "the same thing"
in a fairly strong sense: a manuscript received by a publisher
in Latin script might wind up being typeset in Cyrillic.
(I don't know if Latin orthography is deprecated in
Yugoslavia these days.) This certainly would not be
true of Russian or English text.

-- 
John Cowan						cowan@ccil.org
			e'osai ko sarji la lojban



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:33 EDT