Re: Unicode Lookahead in Parsers?

From: Keld Jørn Simonsen
Date: Sun Sep 01 1996 - 20:17:40 EDT

unicode@Unicode.ORG writes:

> The entire trick is in specifying the identifier correctly. The
> implementation guidelines published in the Unicode Standard 2.0
> include a section which spells out a complete suggested BNF syntax
> for identifiers which can be used to generate an efficient one-step
> table lookup underneath an isIdentifierPart() implementation.
> Check with the Java implementers. They're not complaining about
> combining characters causing inefficiencies in the lexer.

Would that not be due to Java only implementing the 8859-1
subset of Unicode?
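
For reference, the "one-step table lookup" described in the quoted text could look roughly like the sketch below. This is only an illustration under stated assumptions: the class name IdentifierTable and its methods are hypothetical, the table is filled from the present-day java.lang.Character methods isUnicodeIdentifierStart() and isUnicodeIdentifierPart() rather than from the BNF in the Unicode Standard 2.0, and it covers only 16-bit code units. It does not show what the 1996 Java implementation actually did.

```java
import java.util.BitSet;

// Hypothetical sketch: precomputed identifier tables over all 16-bit code units.
public class IdentifierTable {
    // One bit per code unit, filled once when the class is loaded.
    private static final BitSet START = new BitSet(0x10000);
    private static final BitSet PART = new BitSet(0x10000);

    static {
        for (int c = 0; c < 0x10000; c++) {
            // Assumption: the JDK's classification stands in for the BNF rules.
            if (Character.isUnicodeIdentifierStart((char) c)) START.set(c);
            if (Character.isUnicodeIdentifierPart((char) c)) PART.set(c);
        }
    }

    // Each test is a single table lookup; no lookahead is needed.
    static boolean isIdentifierStart(char c) { return START.get(c); }
    static boolean isIdentifierPart(char c) { return PART.get(c); }

    // Returns the index just past the identifier starting at 'from',
    // or 'from' itself if no identifier starts there.
    static int scanIdentifier(String s, int from) {
        if (from >= s.length() || !isIdentifierStart(s.charAt(from))) return from;
        int i = from + 1;
        while (i < s.length() && isIdentifierPart(s.charAt(i))) i++;
        return i;
    }

    public static void main(String[] args) {
        // "résumé" spelled with combining acute accents (U+0301), then " = 1".
        String src = "re\u0301sume\u0301 = 1";
        int end = scanIdentifier(src, 0);
        System.out.println("identifier length in code units: " + end);
    }
}
```

In this sketch a combining character is just another entry in the "part" table, so the scanner consumes it in the same loop as any letter, and characters outside 8859-1 (Greek, for example) are handled by the same lookup.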


