Re: The mother of all collation schemes

From: Tex Texin (texin@progress.com)
Date: Thu Jun 15 2000 - 16:54:02 EDT


However, with digits treated as letters, "10" still comes before "2"...
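
To illustrate, here is a minimal Python sketch. It assumes a simplified version of the alphabet quoted below (space, then digits, then A to Z, with "I" included even though the quoted list omits it) and skips punctuation, since the thread does not settle how to order it. The names `letterwise_key` and `numeric_key` are made up for this example. The second function is only one possible workaround, not something proposed in the thread: it compares each run of digits as a single number.

```python
import re

# Assumed alphabet: space, digits, then letters (punctuation is ignored here).
ALPHABET = " 0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ"
RANK = {ch: i for i, ch in enumerate(ALPHABET)}


def letterwise_key(title):
    """Compare character by character, treating each digit as a letter."""
    return [RANK[ch] for ch in title.upper() if ch in RANK]


def numeric_key(title):
    """Hypothetical alternative: compare each run of digits as one number."""
    key = []
    for part in re.findall(r"[0-9]+|[^0-9]", title.upper()):
        if part[0] in "0123456789":
            key.append((1, int(part)))      # whole number, after space
        elif part == " ":
            key.append((0, 0))              # space sorts first
        elif part in RANK:
            key.append((2, RANK[part]))     # letters sort after numbers
    return key


titles = ["Track 2", "Track 98", "Track 10", "Track 97"]

print(sorted(titles, key=letterwise_key))
# ['Track 10', 'Track 2', 'Track 97', 'Track 98']

print(sorted(titles, key=numeric_key))
# ['Track 2', 'Track 10', 'Track 97', 'Track 98']
```

The letterwise key does put "97" before "98", as intended, but only because both numbers have the same number of digits.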

rampshot@usa.net wrote:
>
> I am trying to think of a collation scheme for the purpose of ordering a set
> of CDs. Let's say you have CD titles you want to order. They are in different
> languages, with a few accented letters, and even some non-Roman letters.
>
> 1) Romanise all non-Roman names. For Japanese, I'd use "fu" and "chi" and
> "shi" and "tsu" and DEFINITELY indicate long vowels (so Tokyo would come out
> as "Toukyou").
> 2) My alphabetical order: (digits are treated as letters):
> [sp] [other punc.] 0 1 2 3 4 5 6 7 8 9 A B C D E F G H I J K
> L M N O P Q R S T U V W X Y (why couldn't I find this in
> uppercase?) Z
> The reason digits are treated as letters is so "97" will come before "98".
> I'm not sure how to treat names like "Ranma 1/2". Any ideas? Also, this system
> is very sensitive to things such as misspelling "DJ" as "D.J."
> Does anyone have any ideas for ordering punctuation?
> Of course, if it was just anime CDs the order would be 0 1 2 3 4 5 6 7 8 9 a i
> u e o ka ki ku ke ko sa shi su, etc.
>

-- 
------------------------------------------------------------------------------------------------
Tex Texin                     Director, International Products
                                 
Progress Software Corp.       +1-781-280-4271
14 Oak Park                   +1-781-280-4655 (Fax)
Bedford, MA 01730  USA        texin@bedford.progress.com

http://www.progress.com        The #1 Embedded Database
http://www.SonicMQ.com         JMS Compliant Messaging - Best Middleware Award
http://www.aspconnections.com  Leading provider in the ASP marketplace

Progress Globalization Program (New URL)
http://www.progress.com/partners/globalization.htm
------------------------------------------------------------------------------------------------
Come to the Panel on Open Source Approaches to Unicode Libraries
at the Sept. Unicode Conference
http://www.unicode.org/iuc/iuc17


