At 1999-10-21 17:54, G. Adam Stanislav wrote:
>Yes, we can type "ch" using the GLYPHS "c" and "h", but Unicode prides
>itself in being a character encoding, not a glyph encoding.
Yes, but it does not in general use one codepoint per character.
>To us, "ch" is a character. Period.
That's correct. The single Slovak character 'ch' is represented by a
sequence of two codepoints U+0063 U+0068, just as the single French
character 'à' (a grave) is represented by a sequence of two codepoints,
U+0061 U+0300.
If you need to specify that it's a particularly Slovak 'ch', you can use
Plane 14 language tags.
>In our dictionaries the "ch" follows the "h" and
>precedes the "i". We would never dream of looking for "ch" after "cg" and
>before "ci".
Up to you to write a sensible Slovak sorting algorithm. That's not
Unicode's job.
-- Ashley Yakeley, Seattle WA
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:54 EDT