>It is expected that no characters will ever be assigned in
Unicode that require the five-byte and six-byte UTF-8 forms, so
you can consider the maximum for UTF-8 to be four bytes.
You can, in fact, state this more strongly: *No characters will
ever be assigned* in Unicode that require the five-byte and
six-byte UTF-8 forms. Based on recent WG2 decisions (I think
they made this decision last month), the same is true for ISO
10646. All that's left now would be to formally change the
definition for UTF-8 to eliminate the five- and six-byte forms.
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:02 EDT