Re: Regular expressions in Unicode (Was: Ethiopic text)

From: Peter Westlake (peter@harlequin.co.uk)
Date: Fri Mar 13 1998 - 18:22:23 EST


At 09:27 1998/03/13 -0800, John Cowan wrote:
>Peter Westlake wrote:
>
>> Kolbjørn Aambø@unicode.org wrote:
>> > Would not something like:
>> >
>> > Aa:á:Àà:â:Ãã:Ææ:Ää:Åå,Bb,Cc:Çç,Dd,Ee:Ééèêë,Ff,Gg,Hh,
>> > I:¡iíìîï,Jj,Kk,Ll,Mm,Nn:Ññ,O
>> > o:óòô:Õõ:‘¦:Øø:Öö,Pp,Qq,Rr,Ss,Tt,Uu:úùû,Vv,Ww,Xx,Yy:Üü,Zz.
>> >
>> > be apropriate for english searching?
>>
>> Yes. In fact, that could be the value of the ordered equivalence
>> class for letters in English, except that I think you are including
>> extra information about how letters sort within each class.
>
>How's that again? The Y to Ü equivalence would seem to be
>purely Nordic, certainly not English. English would probably
>expect to see Ü collate with U.

Make that, "those bits that I bothered to read looked English" :-)

Peter.



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:39 EDT