There is a bug in the examples in the TR6. It's been on my list to fix them
for a long while now, and I the time has come to do it. There has been
renewed interest in this TR.
The default assignments were moved late in the editing, not all changes were
caught. That explains the incosistency with the examples.
I don't recall having recently received any substantial bug reports against
specification itself or the sample implementation. There was some mail this
summer, but it contained only a URL to an archive in a UNIX format that I
cannot read. If anyone has plain text (or pdf or html formatted) bug
reports that they wish me to address I will do so in the next week or so.
At 01:06 AM 9/30/98 -0700, you wrote:
>I have been studying Technical Report #6 on the Standard Compression
>Scheme for Unicode*, and I am running into a problem that perhaps one
>of the gurus on this list can explain for me.
>In the example for Russian, the compressed data begins with an SC7
>tag (0x17), which maps the subsequent characters 0x80 through 0xFF
>into the default position of (dynamic) window 7, as the accompanying
>text points out.
>However, according to Table X-5, the default offset for window 7 is
>0xFF00. Window 2, on the other hand, does default to offset 0x0400
>and would seem to be the correct window for Cyrillic (and is
>identified as such in the table). The proper tag would then be SC2
>Am I missing something, or is there an error in the technical report?
>* What's wrong with the shorter and more straightforward "Standard
>Unicode Compression Scheme," anyway? Someone got a problem with
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:42 EDT