Re: (i18n.422) Regular expressions in Unicode (Was: Ethiopic text)

From: (ache@nagual.pp.ru)
Date: Thu Mar 12 1998 - 10:22:36 EST


On Thu, Mar 12, 1998 at 09:25:47AM -0500, Alain LaBonté wrote:
> At 02:37 98-03-12 -0800, Hallvard B Furuseth wrote:
> >I wrote:
> >
> >>> In particular, I wonder about
> >>> character ranges: If the user says "[-]" in his 8-bit charset (not
> >>> latin-1),

FYI: in practice, I patch the whole regex family in the FreeBSD tree to use
collation sequence data from the locale for [a-z]-type national ranges.
Using Unicode, for example, does not help here, because its letters are not
sorted alphabetically (e.g. the Russian letter YO is out of order), and
[a-z]-type ranges assume alphabetical order in most cases.
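
To illustrate the YO problem, here is a minimal Python sketch. It is not the
FreeBSD patch itself: the function names and the hard-coded RUSSIAN_ALPHABET
table are assumptions made for the example, standing in for the collation
data a real implementation would read from the locale. It compares a range
test by code point value with a range test by position in the alphabet.

```python
import re

# Assumption for illustration: the lowercase Russian alphabet in dictionary
# order, hard-coded here in place of real locale collation data.
RUSSIAN_ALPHABET = "абвгдеёжзийклмнопрстуфхцчшщъыьэюя"


def in_codepoint_range(ch, lo, hi):
    """Range test by raw code point value."""
    return ord(lo) <= ord(ch) <= ord(hi)


def in_collation_range(ch, lo, hi, alphabet=RUSSIAN_ALPHABET):
    """Range test by position in the alphabet (hypothetical helper).

    Assumes lo and hi are both present in the alphabet table.
    """
    pos = alphabet.find(ch)
    return pos != -1 and alphabet.find(lo) <= pos <= alphabet.find(hi)


# YO (U+0451) lies after YA (U+044F) in Unicode, so it falls outside the
# code point range even though it is the seventh letter of the alphabet.
print(in_codepoint_range("ё", "а", "я"))
print(in_collation_range("ё", "а", "я"))

# Python's re module also compares code points inside a bracket range.
print(re.fullmatch("[а-я]", "ё"))
```

The table lookup is only the simplest way to show the idea; with locale data
available, the same comparison would be made on collation weights instead.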

-- 
Andrey A. Chernov
http://www.nagual.pp.ru/~ache/
MTH/SH/HE S-- W-- N+ PEC>+ D A a++ C G>+ QH+(++) 666+>++ Y


