From: John Cowan (jcowan@reutershealth.com)
Date: Wed Mar 05 2003 - 11:35:46 EST
Pim Blokland scripsit:
> Then why does UnicodeData break them down as (e.g.) 0064 030C rather than
> 0064 0315?
To keep the upper case and lower case characters in sync for decomposition,
they always have the same combining characters. For another example, G with
cedilla gets the cedilla on top when it's a capital, but it still decomposes
to the ordinary combining cedilla. These are essentially font-ligaturing
issues.
-- John Cowan http://www.ccil.org/~cowan jcowan@reutershealth.com To say that Bilbo's breath was taken away is no description at all. There are no words left to express his staggerment, since Men changed the language that they learned of elves in the days when all the world was wonderful. --The Hobbit
This archive was generated by hypermail 2.1.5 : Wed Mar 05 2003 - 12:10:45 EST