The Unicode Consortium Discussion Forum

 It is currently Thu Jul 31, 2014 8:43 pm

 All times are UTC - 6 hours [ DST ]

 Page 1 of 1 [ 7 posts ]
 Print view Previous topic | Next topic
Author Message
 Post subject: Relation of Character Properties to AlgorithmsPosted: Fri Sep 21, 2012 7:16 pm

Joined: Sat Aug 06, 2011 9:02 am
Posts: 43
In 2.3 Relation of Character Properties to Algorithms one can read the following paragraph :

Quote:
When assigning character properties for use with a given algorithm, it may be tempting to assign somewhat arbitrary values to some characters, as long as the algorithm happens to produce the expected results.Proceeding in this way hides the nature of the character and limits the re-use of character properties by related processes. Therefore, instead of tweaking the properties to simply make a particular algorithm easier, the Unicode Standard pays careful attention to the essential underlying linguistic identity of the character. However, not all aspects of a characterâ€™s identity are relevant in all circumstances, and some characters can be used in many different ways, depending on context or circumstance. This means the formal character properties alone are not sufficient to describe the complete range of desirable or acceptable character behaviors.

I'm having a hard time trying to understand what's being said here. Could you clarify (perhaps with an example) ?

Top

 Post subject: Re: Relation of Character Properties to AlgorithmsPosted: Sat Sep 22, 2012 9:37 am
 Unicode Guru

Joined: Tue Dec 01, 2009 2:49 pm
Posts: 186
There are several statements here. Which one do you have trouble with, in particular?

Top

 Post subject: Re: Relation of Character Properties to AlgorithmsPosted: Sat Sep 22, 2012 10:04 am

Joined: Sat Aug 06, 2011 9:02 am
Posts: 43
Quote:
When assigning character properties for use with a given algorithm, it may be tempting to assign somewhat arbitrary values to some characters, as long as the algorithm happens to produce the expected results.

Are we talking about reassigning character properties here ? Why would anyone do this ? To produce expected results ? What for ? This is not clear to me.

Top

 Post subject: Re: Relation of Character Properties to AlgorithmsPosted: Sat Sep 22, 2012 11:55 am
 Unicode Guru

Joined: Tue Dec 01, 2009 2:49 pm
Posts: 186
We are talking about assigning properties in the first place.

When you create an algorithm you have a choice of how you want to create your rules. These rules act on properties. It's sometimes tempting to achieve the correct results by "arbitrary" properties, instead of of properties that group characters by some linguistic or typographic aspects of their identity first, and then write the rules to make things work.

The outcome under the algorithm is the same, but in the former case, it's likely that it will be much more difficult to reuse the properties for other algorithms (or more sophisticated variants of a similar algorithm).

Top

 Post subject: Re: Relation of Character Properties to AlgorithmsPosted: Sat Sep 22, 2012 3:04 pm

Joined: Sat Aug 06, 2011 9:02 am
Posts: 43
I'm sorry asmus, but I haven't got the idea yet. When you say properties, are you talking about the character properties in the Unicode Character DataBase (e.g. UnicodeData.txt) ? If so, aren't these properties already assigned ? What is the meaning of "assigning character properties for use with a given algorithm" ? As far as I can understand those properties are already assigned to each character in the UCD.

Top

 Post subject: Re: Relation of Character Properties to AlgorithmsPosted: Sat Sep 22, 2012 10:07 pm
 Unicode Guru

Joined: Tue Dec 01, 2009 2:49 pm
Posts: 186
This is an unnecessarily static view.

New characters get added, new algorithms get defined, leading to entirely new properties - there's a lot of change that doesn't involved "re-assignment" although that also happens.

The model presented in UTR#23 intends to be applicable not only to using the data from the standard, but also to the process of maintaining the standard. In some ways it could also be usefully applied, at least in part, to properties and algorithms defined by others.

Top

 Post subject: Re: Relation of Character Properties to AlgorithmsPosted: Sun Sep 23, 2012 5:56 am

Joined: Sat Aug 06, 2011 9:02 am
Posts: 43
Quote:
The model presented in UTR#23 intends to be applicable not only to using the data from the standard, but also to the process of maintaining the standard

It's all clear now. Thanks.

Top

 Display posts from previous: All posts1 day7 days2 weeks1 month3 months6 months1 year Sort by AuthorPost timeSubject AscendingDescending
 Page 1 of 1 [ 7 posts ]

 All times are UTC - 6 hours [ DST ]

Who is online

Users browsing this forum: No registered users and 2 guests

 Quick-mod tools: View logs You cannot post new topics in this forumYou cannot reply to topics in this forumYou cannot edit your posts in this forumYou cannot delete your posts in this forumYou cannot post attachments in this forum

 Jump to:  Select a forum ------------------ Announcements, News, FAQ    Official Announcements and News    FAQ - Frequently Asked Questions Unicode Standard and Related Specifications    Unicode General Discussion    How Do I...? The How-to Forum    Algorithms, Properties, and Security Issues    Arabic/Bidi/Hebrew Technical Issues    Encoding Scripts and Characters    Indic Scripts    Symbols    Unihan and CJKV Forum Unicode Locales (CLDR)    CLDR Users Forum Localization / Interoperability    ULI / ULI-TC Interest Forum Public Review Discussions    PRI 207 - UTR #50, Unicode Properties for Vertical Text Layout    Closed Public Review Issue Forums       PRI 185 - Extension of UBA for improved display of URL/IRIs       PRI 205 - AL MARK and LEVEL DIRECTION MARK Everything Else    Water Cooler    Forum Problems and Suggestions