L2/10-308
Date/Time: Sun Aug 8 18:05:12 CDT 2010
Contact: cewcathar@hotmail.com
Name: cew
Report Type: Public Review Issue
Opt Subject: PRI #174 - Proposed Draft UTR #49: Unicode Character Categories

Proposed Draft UTR #49: Unicode Character Categories
http://www.unicode.org/reports/tr49/tr49-1.html

* * * PROOFREADING * * *

Section 1
Last par, first sentence

"What is needed to address the general problem is an approach that focusses on the character category distinctions needed by such applications, without being entangled with the editorial requirements for the Unicode names list maintenance. This document presents such an approach, and documents the resulting data file that implementers can use for defining Unicode character categories."

{ COMMENT [minor]: Personally I prefer "focuses" to "focusses". Some writers do double the 's', in British English at least, whether or not it should be doubled; see also the discussion at:
http://forum.wordreference.com/showthread.php?t=3458
This is a minor point, and I have seen "focusses", so suit yourself! }

* * *

Section 2.2 par 2

"The program that is used to maintain annotations for the Unicode names list has been modified slightly, and is then used for an automated merger of categorial annotations file with particular versions of the UnicodeData.txt file, producing as output a structured data file containing categorial information about all Unicode characters, with an explicit listing for each separate character, including its code point and Unicode character name."

{ COMMENT [IMPORTANT]: "has been modified slightly" and "is then used" -- the second does not follow sequentially from the first. Choose either "is modified slightly and is then used", to indicate something that should be done, or "has been modified slightly and is [now] used", to indicate something that has been done; that is, the program has already been modified slightly and is now used . . . }

=>?
"The program that is used to maintain annotations for the unicode names list must be first modified slightly, and is then used for an automated merger . . . " * * * CONTENT * * * Section 1, par before last "The existing subheaders also often group characters which other applications might want to distinguish. For example, the header for the range U+2600..U+260D is "Weather and astrological symbols". But we can do much better, distinguishing more precisely those which are weather symbols, such as U+2602 UMBRELLA, those which are astrological symbols, such as U+260A ASCENDING NODE, and those which really are not either, such as U+2606 WHITE STAR." { COMMENT: are you allowing only one categry per character -- such as weather or astrological? Another note: the symbols for planets are astronomical as well as astrological symbols; see: see http://www.fourmilab.ch/yoursky/ where you can get a sky chart for anytime anyplace which shows the planets! Also while discussing astrological symbols, chiron is a semi-planet in astrology; Pluto is no longer a planet as far as most astronomers are concerned; but chiron of course is not a planet in astrology } Best, --C. E. Whitehead cewcathar@hotmail.com