regular expressions with unicode situation?

From: Ben Dougall (bend@freenet.co.uk)
Date: Tue Apr 22 2003 - 15:38:36 EDT


    i'm just wondering if anyone can tell me what the general state of play
    is at the moment regarding using regular expressions with unicode?

    i'm not even sure whether, or how, the two fit together successfully.
    i've used regex in php, which was a version of posix regex, and found
    it very useful. i'm now working on a mac, os x (cocoa), and am starting
    work on an app that will analyse and dissect text, and am wondering if
    i can make use of regular expressions. i want the app to work equally
    well in all languages / character subsets. if regex in general only
    covers small portions of unicode, i don't think it'll be so useful.
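
    to make the worry concrete, here is a minimal sketch in python (picked
    purely for illustration; it is not the php or cocoa regex mentioned
    above, and the function names are made up). it assumes python 3, where
    \w on text strings matches unicode word characters, and compares an
    ascii-only character class with a unicode-aware one:

```python
import re

# Hypothetical helper names, used only for this illustration.

def ascii_words(text):
    # ASCII-only class: matches runs of unaccented Latin letters and nothing else.
    return re.findall(r"[A-Za-z]+", text)

def unicode_words(text):
    # In Python 3, \w on str patterns covers Unicode word characters.
    # [^\W\d_] narrows that to letters by excluding digits and underscore.
    return re.findall(r"[^\W\d_]+", text)

# Latin with an accented letter, Cyrillic, and CJK in one string.
sample = "na\u00efve текст 日本語"

print(ascii_words(sample))    # the ASCII class splits or skips the non-ASCII words
print(unicode_words(sample))  # the Unicode-aware class keeps each word whole
```

    if the regex engine available only offers something like the first
    function, that is the "small portion of unicode" case i'm worried about.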

    any general info regarding regex in conjunction with unicode would be
    much appreciated. thanks.


