"First six ideographs" problem considered bogus

From: John Cowan (cowan@locke.ccil.org)
Date: Wed Jun 09 1999 - 12:06:32 EDT

The text of IS 17455 says:

# As there are no digits available beyond 9, the first 6 letters
# of the Latin alphabet (or of any alphabet if the Latin script
# is not used) are used to represent the extra hexadecimal
# "digits" [...].

Han script is not an alphabet, and need not be handled by an
implementor of this standard, however internationalized.
Therefore, there is an issue only for the Western scripts
(Greek, Cyrillic, Armenian, Georgian), the West Asian scripts
(Arabic, Hebrew), and the South Asian scripts (Devanagari,
Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada,
Malayalam, Thai, Lao, Tibetan), plus Bopomofo.

For each of these, there is a canonical list of basic letters
and a canonical ordering. ("Ch" may be a letter in Spanish
for some purposes, but it is not a *basic* letter.)

Hiragana and Katakana are syllabaries, but can be handled by
the same principles.

This list needs extending for Unicode 3.0.

John Cowan	http://www.ccil.org/~cowan		cowan@ccil.org
	You tollerday donsk?  N.  You tolkatiff scowegian?  Nn.
	You spigotty anglease?  Nnn.  You phonio saxo?  Nnnn.
		Clear all so!  'Tis a Jute.... (Finnegans Wake 16.5)

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:46 EDT