RE: Question about \uxxxx etc. for 21-bit code points - need advi ce

From: jarkko.hietaniemi@nokia.com
Date: Wed May 24 2000 - 11:15:06 EDT


> > If the context is C, the suggested \x{yyyy} would be the
> nicest: because it
> > would
> > be easily extensible, it's because unambiguous, and because
> Perl already
> > does the same :-)
>
> "would" means that Perl does not have any syntax for Unicode
> escapes, or no syntax for escapes above U+ffff?

Perl 5.6 has \x{YYYY}. I cannot find an explicit documentation of being
limited
to just four hex digits (hexits...?), and peeking at the source code seems
to
confirm that there is no such limitation.

One can also just use the Unicode names:

        use charnames;
        $a = "\N{GREEK SMALL LETTER SIGMA}";

        use charnames ':short';
        $b = "\N{greek:Sigma}"; # uppercase Sigma

        use charnames 'greek';
        $c = "\N{Sigma}";

> > --
> > Jarkko Hietaniemi <jarkko.hietaniemi@nokia.com>
>
> markus

-- 
Jarkko Hietaniemi <jarkko.hietaniemi@nokia.com>



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:03 EDT