RE: Question about \uxxxx etc. for 21-bit code points - need advi ce

From: [email protected]
Date: Wed May 24 2000 - 11:15:06 EDT


> > If the context is C, the suggested \x{yyyy} would be the
> nicest: because it
> > would
> > be easily extensible, it's because unambiguous, and because
> Perl already
> > does the same :-)
>
> "would" means that Perl does not have any syntax for Unicode
> escapes, or no syntax for escapes above U+ffff?

Perl 5.6 has \x{YYYY}. I cannot find an explicit documentation of being
limited
to just four hex digits (hexits...?), and peeking at the source code seems
to
confirm that there is no such limitation.

One can also just use the Unicode names:

        use charnames;
        $a = "\N{GREEK SMALL LETTER SIGMA}";

        use charnames ':short';
        $b = "\N{greek:Sigma}"; # uppercase Sigma

        use charnames 'greek';
        $c = "\N{Sigma}";

> > --
> > Jarkko Hietaniemi <[email protected]>
>
> markus

-- 
Jarkko Hietaniemi <[email protected]>



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:03 EDT