Re: Regular expressions in Unicode (Was: Ethiopic text)

From: John Cowan ([email protected])
Date: Fri Mar 13 1998 - 12:35:39 EST


Peter Westlake wrote:

> Kolbj�rn Aamb�@unicode.org wrote:
> > Would not something like:
> >
> > Aa:�:��:�:��:��:��:��,Bb,Cc:��,Dd,Ee:�����,Ff,Gg,Hh,
> > I:�i����,Jj,Kk,Ll,Mm,Nn:��,O
> > o:���:��:��:��:��,Pp,Qq,Rr,Ss,Tt,Uu:���,Vv,Ww,Xx,Yy:��,Zz.
> >
> > be apropriate for english searching?
>
> Yes. In fact, that could be the value of the ordered equivalence
> class for letters in English, except that I think you are including
> extra information about how letters sort within each class.

How's that again? The Y to � equivalence would seem to be
purely Nordic, certainly not English. English would probably
expect to see � collate with U.

But then see my .sig for the pitfalls of such reasoning!

-- 
John Cowan	http://www.ccil.org/~cowan		[email protected]
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (FW 16.5)



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:39 EDT