The Unicode Consortium Discussion Forum

The Unicode Consortium Discussion Forum

 Forum Home  Unicode Home Page Code Charts Technical Reports FAQ Pages 
 
It is currently Sun Aug 31, 2014 1:22 am

All times are UTC - 6 hours [ DST ]





Post new topic Reply to topic  [ 3 posts ] 
Author Message
 Post subject: Unicode (Sindhi Char) Sorting Problem
PostPosted: Wed Nov 16, 2011 11:22 pm 
Offline

Joined: Wed Nov 16, 2011 1:56 am
Posts: 1
I and my team working onSindhi Language sorting issues. We needto ask that can we change sort order by assigning different code weights ofSindhi Unicode Collection. Current Unicode char collection of Sindhi languagehaving many sorting issues. For example
Random Data for checking sort order
name sortorder
ٽوپي 7
ڊيل 23
هاري 50
غالب 36
موت 46
ڪردار 39
گھارو 43
Accordingto above input the required result of names by following sort order should comelike 7,23,36,39,43,46,50.
Case 1: Query execution without sort order –Default sortorder
Query :“SELECT names.name, names.sortorder FROM [names]order by name asc”
Results :
name sortorder
غالب 36
گھارو 43
موت 46
هاري 50
ٽوپي 7
ڊيل 23
ڪردار 39
Above results are really not following Sindhi alphabet sort order .
We need your comments andfeedback regarding concern issues. Your feedback wills highly appricated.


Top
 Profile  
 
 Post subject: Re: Unicode (Sindhi Char) Sorting Problem
PostPosted: Thu Nov 17, 2011 11:52 am 
Offline
Forum Admin

Joined: Tue Dec 01, 2009 4:05 pm
Posts: 40
Reading your query, I believe the answer is to support sorting that is not based on code points but on sort-weights, using the UCA (UTS#10). This will likely need tailoring to your language, which would be the domain of CLDR. I'm hoping someone here can help you.


Top
 Profile  
 
 Post subject: Re: Unicode (Sindhi Char) Sorting Problem
PostPosted: Thu Nov 17, 2011 6:50 pm 
Offline
Forum Admin

Joined: Fri Dec 04, 2009 9:13 pm
Posts: 32
The DUCET (the default ordering for the Unicode Collation algorithm) is primarily useful for supplying a base ordering, and is rarely suitable for specific languages without a language tailoring. For the language tailorings, you need to look at DUCET plus CLDR.

The language tailorings are at http://unicode.org/repos/cldr/trunk/common/collation/ (also beta for CLDR v21). If additions/changes are needed for Singhi, please file a bug at http://unicode.org/cldr/trac/newticket. Collation ordering can be quite tricky to specify: please see the guidelines at http://cldr.unicode.org/index/cldr-spec ... guidelines.

BTW, there is a chart for the next DUCET at http://www.unicode.org/charts-6.1.0beta/collation/

See also:

http://unicode.org/collation/
http://unicode.org/faq/collation.html
http://unicode.org/collation/ducet-changes.html
http://unicode.org/collation/ducet-criteria.html


Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 3 posts ] 

All times are UTC - 6 hours [ DST ]


Who is online

Users browsing this forum: No registered users and 1 guest


Quick-mod tools:
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Jump to:  
cron
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group
Template made by DEVPPL.com