Re: Normalization Form KC for Linux

From: Rick McGowan (
Date: Tue Aug 24 1999 - 16:12:44 EDT

This is probably dredging up an old thread by now, but I just got in from a
few days off...

At 01:34 AM 8/18/99 -0700, Markus Kuhn wrote:
> One day, combining characters will surely be supported under Linux, but
> even then the precomposed characters should be preferred over combining
> character sequences where available. More formally, the preferred way of
> encoding text in Unicode under Linux should be Normalization Form KC as
> defined in Unicode Technical Report #15

>I hope this recommendation meets general approval.

Gosh, I don't approve. And I've been using Unix systems for many years.
The most flexible kind of implementation would prefer decomposed sequences.
In any case, enlightened systems would accept anything and massage as needed
to fit the particular application instead of forcing (or "suggesting") the
user to run everything through the meat grinder first...

In any case, I think Unix community tends in general to be very very
confused about the distinction between how data exists in storage and what
appears on one's screen/window/emulator. Until that is cleared up, I don't
see much hope for serious progress in Unix systems with graphicsl UI.


This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:51 EDT