The "K" forms of the normalization, whether KD or KC indiscriminantly throw
away information. When preparing TR 20 I initially thought that the
information in question would be easily captured in style markup, but when
I really did the work, it turned out that there are many instances where
it's not worth the effort to get it right, and other instances where it
runs counter to the way the encoded entities are intended to be used.
PS: Incidentally, the *public* draft version of TR 20 that is accessible at
http://www.unicode.org/unicode/reports/tr20 is by now quite out of date and
will soon be replaced with a new version.
At 10:39 AM 4/25/00 -0800, John McConnell wrote:
>I'm looking for software and standards that require or prefer Unicode data
>to be in Normalization Forms D or KD. Does anyone have such a list? If
>not, I'd be happy to create one based on replies to this mail. Thanks.
This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:21:02 EDT