Tuesday, May 2, 2000
In table 3-1 on UTF-8 bit distribution a note about conversion of
surrogate pairs says: "Where uuuuu = wwww + 1 (to account for addition of
10000(16) as in Section 3.7, Surrogates)." I would appreciate some help
with this. Does it mean: Always add one to the binary value of wwww; if
wwww was 1111, uuuuu becomes 10000, otherwise uuuuu becomes 0 followed by
the sum of wwww +1. For example, if wwww were 1010, uuuuu becomes 01011.
Some alternatives: 1. If wwww is 1111 make uuuuu = 10000; add 1 only
if wwww = 1111; otherwise precede wwww by zero. 2. To create uuuuu always
insert a 1 before the value of wwww. 3. To create uuuuu always insert a 1
after the value of wwww. My uncertainty is due to understanding that
uuuuu should always be one bit longer than wwww, but that adding 1 to wwww
will only increase its length when the value of wwww is 1111.
Thanks in advance.
Jim Agenbroad ( jage@LOC.gov )
The above are purely personal opinions, not necessarily the official
views of any government or any agency of any.
Phone: 202 707-9612; Fax: 202 707-0955; US mail: I.T.S. Dev.Gp.4, Library
of Congress, 101 Independence Ave. SE, Washington, D.C. 20540-9334 U.S.A.
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:02 EDT