Re: Nonsense in http://www.unicode.org/Public/PROGRAMS/CVTUTF/CVTUTF.C?

From: Marcin 'Qrczak' Kowalczyk (qrczak@knm.org.pl)
Date: Sat Aug 25 2001 - 16:26:16 EDT


Wed, 22 Aug 2001 15:59:15 -0700, Michael (michka) Kaplan <michka@trigeminal.com> pisze:

>> Functions ConvertUCS4toUTF8 and ConvertUTF8toUCS4 use surrogates
>> in UCS4. In particular ConvertUTF8toUCS4 converts a character above
>> U+FFFF into two UCS4 words. Why is this absurd there?!
>
> UCS-4 has no knowledge of surrogate code points or their significance; it is
> ap urely algorithmic conversion. Not sure why the results would be so
> surprising, given this?

I don't understand. I'm talking about characters above U+FFFF, not
about characters from the range U+D800..DFFF. They are represented
as themselves in UCS-4. But the said routine represents them as pairs
of surrogates.

-- 
 __("<  Marcin Kowalczyk * qrczak@knm.org.pl http://qrczak.ids.net.pl/
 \__/
  ^^                      SYGNATURA ZASTĘPCZA
QRCZAK



This archive was generated by hypermail 2.1.2 : Sat Aug 25 2001 - 18:34:25 EDT