From: Michael \(michka\) Kaplan (email@example.com)
Date: Sat Dec 07 2002 - 17:29:13 EST
This is an expected behavior. Ii has to do with the "Endian-ness" of the
processor on which you are running. From the glossary
Little-endian. A computer architecture that stores multiple-byte numerical
values with the least significant byte (LSB) values first.
Big-endian. A computer architecture that stores multiple-byte numerical
values with the most significant byte (MSB) values first.
You can also find several thousand web pages describing the issue by
searching in google for these two terms.
----- Original Message -----
From: "Smith, Mike" <SMITHM@fish.govt.nz>
Sent: Saturday, December 07, 2002 1:16 PM
Subject: Order in which unicode charactoers displayed
> I am attempting to utilise the unicode values from the CJK Unified
> Ideographs to undertake searches for the occurance of the corresponding
> characters on a hard disk drive.
> When I look at a chinese character with a hex editor I get a certain
> order for the hex or unicode value for the character. For example, the
> english word 'abalone' in chinese has a code of '8D9C 7C9C' when viewed
> in a hex editor, but when I referred to the CJK unicode table, the value
> came out as '9C8D 9C7C'.
> Can you explain the different ordering of the code?
> When I conducted a test search of the contents of a hard drive known to
> contain the chinese characters for 'abalone' I only found hits on the
> hex values not onth e CJK unicode values.
> Any assistance woudl be appreciated.
> Mike Smith
This archive was generated by hypermail 2.1.5 : Sat Dec 07 2002 - 17:58:20 EST