Re: U+xxxx, U-xxxxxx, and the basics

From: Jonathan Coxhead (jonathan@doves.demon.co.uk)
Date: Fri Mar 03 2000 - 22:04:21 EST


 | 3. Code values, or "code units", are numbers that computers use to
 | represent abstract objects, such as Unicode characters. Code values
 | are typically 8-bit, 16-bit, or 32-bit wide non-negative integers.

    Some computers also use negative integers to represent
characters. If you write portable code in C, you have to be aware of
this possibility. (In other words, that the type "char" may be
signed.) One common possibility is to represent the characters of
Latin-1 as the values -128--127. This can be important, though I
don't know whether it is relevant to your purposes.

        /|
 o o o (_|/
        /|
       (_/



This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:59 EDT