Normalization Form KC for Linux

From: Markus Kuhn (Markus.Kuhn@cl.cam.ac.uk)
Date: Wed Aug 18 1999 - 04:41:23 EDT


I was never too happy with the UCS implementation levels. After reading
Unicode Technical Report #15, I think I have now seen the light, and I
have just added to

http://www.cl.cam.ac.uk/~mgk25/unicode.html

in the section "How should Unicode be used under Linux?" the following
paragraph:

  One day, combining characters will surely be supported under Linux, but
  even then the precomposed characters should be preferred over combining
  character sequences where available. More formally, the preferred way of
  encoding text in Unicode under Linux should be Normalization Form KC as
  defined in Unicode Technical Report #15
  <http://www.unicode.org/unicode/reports/tr15/>.
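
As a rough illustration of what this recommendation means in practice,
here is a minimal sketch in Python, assuming the standard library module
unicodedata is an acceptable stand-in for a TR #15 implementation. The
helper name to_nfkc is made up for this example. It shows a combining
character sequence being replaced by its precomposed character, and a
compatibility character being replaced by its plain equivalent.

```python
import unicodedata


def to_nfkc(text: str) -> str:
    """Return text in Normalization Form KC (hypothetical helper name)."""
    return unicodedata.normalize("NFKC", text)


# "e" followed by U+0301 COMBINING ACUTE ACCENT: two code points.
decomposed = "e\u0301"
composed = to_nfkc(decomposed)

# The precomposed U+00E9 is preferred where it is available.
print(composed == "\u00e9")             # True
print(len(decomposed), len(composed))   # 2 1

# Form KC also folds compatibility characters such as U+FB01 (fi ligature).
print(to_nfkc("\ufb01"))                # fi
```

Text that is already in Form KC, such as plain ASCII, passes through
unchanged.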

I hope this recommendation meets general approval. I would even suggest
extending programs such as less and ls so that, when they find characters
in file names or text files that do not conform to Normalization Form KC,
they replace them on output with \xx hex escape sequences. Users could
then spot these potential troublemakers more easily.

It might be a very nice idea to have all the Unicode Normalization forms
added to GNU recode or iconv.

Markus

-- 
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org,  WWW: <http://www.cl.cam.ac.uk/~mgk25/>

This archive was generated by hypermail 2.1.2 : Tue Jul 10 2001 - 17:20:51 EDT