L2/06-374 Source: Patrik FŠltstršm Date: Nov 1, 2006 10:42 PM Subject: IDNA Proposal Background To: Mark Davis [snip] The reason for not updating the draft (there is a -01, but that was only due to editorial things (like wrapping) is that we are now in the "silent period" just before the IETF. Very unfortunate. Let me explain how the tables are created, so you get data on what we would like to get input on. We first looked at the various classes defined in Unicode 5. For each class, we decided whether we thought that class would be included on a scale from 1-4 where 4 is absolutely included, 1 absolutely not included and 2 and 3 in between. The goal is of course to have a document with only values 1 and 4 in it. Question is, can we move 2 to 1, and 3 to 4? You can see the result of that review in section 2. (1) Please comment on the findings here. Second step was to look at the blocks of codepoints, and try to say whether a block is needed or not. This decision is only 50% based on the findings from the above exercise. Here we also have a scale 1-4, but also "notes" that try to say what the issues are. You can see the result of that review in section 3. (2) Please comment on the findings here. Third step was to take all codepoints between U+0000 and U+0FFF (as a "random sample") and apply the classification of classes in the first step above, and "see what happens". The codepoints get values 1-4 based on the value of the class the codepoint belong to. (3) Please comment on the findings here. As you can see above, the process is very much an algorithm, and our goal is to find a combination of blocks and classes that can be used as an inclusion based selection algorithm of codepoints for what codepoints can be used in IDNA. Patrik [snip]