RE: Unicode in source code. WHY?

From: Jonathan Rosenne (rosenne@qsm.co.il)
Date: Thu Jul 22 1999 - 11:21:52 EDT


> -----Original Message-----
> From: G. Adam Stanislav [mailto:adam@whizkidtech.net]
> Sent: Wednesday, July 21, 1999 10:29 PM
> To: Unicode List
> Cc: Unicode List; mohrin@sharmahd.com
> Subject: Re: Unicode in source code. WHY?
>
>
> On Wed, Jul 21, 1999 at 10:08:14AM -0700, Addison Phillips wrote:
> > Clearly text editors (which means programming environments) should
> > support all of Unicode.
> >
> > Identifiers are a thornier issue. Combining marks versus precomposed
> > clearly presents a problem in this area.
>
> Why is it a problem? As long as each identifier is represented by the same
> sequence of bytes every time it is used, why should a compiler care whether
> combining marks or precomposed characters were used? For all the compiler
> needs to know, it is just a unique sequence of bytes.

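The two cases - combining and precomposed - are visually indistinguishable,
so a programmer cannot tell which one was typed. If the compiler does not
treat them as identical, two identifiers that look the same on screen will
refer to different things, and there will be great confusion.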
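
To make the point concrete, here is a minimal sketch in Python (my choice
for illustration; nothing in this thread is tied to it). The identifier
"café" and the helper same_identifier() are made up for the example, and
using NFC as the canonical form is an assumption: a real compiler might
pick a different normalization form.

```python
import unicodedata

# Two spellings of the same visible identifier (hypothetical example).
precomposed = "caf\u00e9"   # 'é' as the single code point U+00E9
combining = "cafe\u0301"    # 'e' followed by U+0301 COMBINING ACUTE ACCENT


def same_identifier(a: str, b: str) -> bool:
    """Compare two identifiers after normalizing both to NFC.

    Assumption: NFC is the chosen canonical form for comparison.
    """
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


# A comparison of the raw sequences treats them as different names.
print(precomposed == combining)
print(precomposed.encode("utf-8"), combining.encode("utf-8"))

# A comparison after normalization treats them as the same name.
print(same_identifier(precomposed, combining))
```

A compiler that compares raw bytes behaves like the first comparison; one
that normalizes identifiers before looking them up behaves like the last.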

Jony


