RE: UTF-8, C1 controls, and UNIX

From: Frank da Cruz (fdc@columbia.edu)
Date: Fri Mar 02 2001 - 10:07:50 EST


"Ayers, Mike" <Mike_Ayers@bmc.com>
> From: Frank da Cruz [mailto:fdc@columbia.edu]
> > "Ayers, Mike" <Mike_Ayers@bmc.com>
> > > Just to be sure: ISO 2022 has two modes, 7 bits and 8 bits, hasn't
> > > it? And in 7 bit mode (I know it's obsolescent), then C1 controls
> > > are not supposed to be interpreted as controls, are they?
> >
> > Nor as graphics.
>
> Clarification: If I connect a terminal to an arbitrary machine and
> use 7-bit mode, there will be no interpretation of characters and thus no C1
> problems, yes? However, I will also not be able to send UTF-8, yes? Is
> there therefore (so far) no easy way to go all-Unicode?
>
In 7-bit mode, you're not supposed to send C1 controls. If you do, the 8th
bit is assumed to be a parity bit and it is stripped, and therefore each C1
control becomes its C0 counterpart, e.g. 0x83 becomes 0x03.
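
To make the arithmetic concrete, here is a minimal sketch in Python of what
stripping the 8th bit does to a C1 byte. The function name strip_parity is
only illustrative, and the sketch assumes the receiver simply clears the high
bit without checking parity:

```python
def strip_parity(byte: int) -> int:
    """Clear the 8th (high) bit, as a 7-bit receiver that treats it as parity would."""
    return byte & 0x7F


# C1 controls occupy 0x80-0x9F; clearing the high bit lands each one
# on the C0 control 0x80 below it (0x00-0x1F).
for c1 in (0x80, 0x83, 0x9B):
    print(f"0x{c1:02X} -> 0x{strip_parity(c1):02X}")
```

Note the last case: 0x9B (CSI) comes out as 0x1B (ESC), so the receiver sees
a different control, not just a damaged one.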

- Frank


