L2/10-196 Date/Time: Mon May 10 10:11:50 CDT 2010 Contact: lunde@adobe.com Name: Ken Lunde Report Type: Public Review Issue Opt Subject: PRI 167 Comments PRI 167 Comments Dr. Ken Lunde, Adobe Systems Incorporated lunde@adobe.com Version: 04/23/2010 My analysis of PRI 167 indicated the following basic statistics: Total Glyphs: 4,214 Total Base Characters: 1,924 Total Variant Forms: 2,290 All of the glyphs that represent the default/standard forms for the 1,924 Base Characters map to Adobe-Japan1-6 CIDs. 1) I found two possible JIS X 0213:2004 prototypical glyph issues: U+4148 IA0532: 11th stroke curve/terminal U+7515 JTB5F8: The 17th stroke penetrates the 15th stroke 2) I established Adobe-Japan1-6 mappings for 636 of the 2,290 variant forms, but the following 38 have subtle enough glyph difference to cause the mappings to be dropped (I may have the stroe order wrong for a small number of these): U+50ED JTADADS vs CID+20078: 5th/7th and 10th/12th stroke connection U+5272 KS025050S vs CID+20086: 5th/9th stroke connection U+5272 JTAE24S vs CID+13684: 5th/9th stroke connection U+53DF HG1640 vs CID+14111: 8th/9th stroke connection U+5448 JTAEAF vs CID+13942: 6th stroke width U+5544 IB1535S vs CID+7730: 8th/9th stroke connection U+5858 FT1877S vs CID+7757: 10th/12th stroke connection U+5ABE FT1951S vs CID+7824: 8th/11th stroke connection U+5BB3 KS081420S vs CID+20111: 5th/9th stroke connection U+5E63 JTB0B8S vs CID+14010: 1st/6th stroke connection/fusion U+5EE3 JTB0E1S vs CID+14127: 7th/10th stroke connection U+61B2 KS127180 vs CID+20120: 5th/9th stroke connection U+659C KS151460 vs CID+13805: 2nd stroke curve/terminal U+6CBF JTB41A vs CID+13656: 4th/5th stroke connection U+6F54 KS205610 vs CID+13744: 5th stroke angle U+6F5B JTB4DC vs CID+20164: 6th/8th and 11th/13th stroke connection U+6F5B JTB4D4 vs CID+20165: 6th/8th and 11th/13th stroke connection U+7027 IB2291 vs CID+13911: 19th stroke connection U+7336 JTB586 vs CID+14072: 7th/11th stroke connection U+74B0 JTB5D5 vs CID+13689: 15th stroke corner U+750D JTB5EB vs CID+7842: 14th stroke curve U+7B51 KS291140 vs CID+13923: 10th/12th stroke connection U+7CD6 JTB7FES vs CID+13956: 13th/15th stroke connection U+7E2B JTB853 vs CID+14024: 14th stroke curve/terminal U+8056 JTC0CB vs CID+13869: 11th stroke width U+83DF JTB9B2S vs CID+7976: 8th/10th stroke connection U+83DF FT1875 vs CID+7755: 8th/10th stroke connection U+8A3B FT2927 vs CID+7740: 8th/10th stroke connection/fusion U+8B19 JTBB7FS vs CID+13753: 9th/15th stroke connection U+8E91 FT1737 vs CID+13605: 12th/14th stroke connection U+8F44 KS435180 vs CID+20227: 12th/16th stroke connection U+8FED JTBC62 vs CID+13947: 9th stroke curve/terminal U+8FFA FT2593 vs CID+14229: 6th/8th stroke connection U+9000 JTBC7B vs CID+13905: 10th stroke curve/terminal U+901F JTBC9C vs CID+13899: 11th stroke curve/terminal U+9050 FT2613 vs CID+14232: 12th stroke curve/terminal U+938C JTBE1DS vs CID+13686: 10th/16th stroke connection U+938B JB6909 vs CID+20239: 13th/17th stroke connection 3) The following six glyphs are very close matches for Adobe-Japan1-6 glyphs, and are not including among the mappings: U+52C7 JTAE37 vs CID+14070: 4th stroke terminal U+5339 KS030150S vs CID+13994: 1st/2nd stroke connection U+5951 KS067790 vs CID+20104: 2nd stroke angle U+7515 JTB5F8 vs CID+20267: The 17th stroke penetrates the 15th stroke U+908A JTBD61 vs CID+20234: 13th stroke terminal U+9A5F KS510550 vs CID+14267: 19th stroke Given that U+7515 JTB5F8 might be a JIS X 0213:2004 prototypical glyph issue as indicated in #1, it may be a genuine mapping to CID+20267, and thus an additional mapping. 4) Two of the glyphs may have a better or alternate choice for their Base Character: U+5DE2 JTB323: U+5DE3 U+9115 IP9115: U+90F7 U+93AD JTBE25: U+93AE 5) The following eight glyphs were found to be encoded, and thus should have a different Base Character: U+4E55 KS001760 -> U+200B0 (Extension B; CID+14209) U+50B3 JTAD7A -> U+2B74A (Extension D; "J" Source: JTAD7A) U+51DE JTB546 -> U+20611 (Extension B; CID+14294) U+7A3D FT1786S -> U+25874 (Extension B; CID+7670) U+7B0B JTB7A2 -> U+25B01 (Extension B) U+82E5 IB0861 -> U+20C25 (Extension B) U+8613 JTBA87 -> U+27068 (Extension B) U+9F21 JTC095 -> U+21FF3 (Extension B) 6) The following got my attention, and need to be checked more carefully (this does not mean that I disagree with the unification or with the Base Character selection, but merely means that I plan to look at them more closely, and report about my findings at a later date): U+4543 JTBA54 U+5377 JTAE76 U+5DFB JTB099S U+6852 JTB302 U+710F JTB51AS U+85EA JTBA91S U+9060 JTBD02 U+990A JTBF1F 7) There are no variants for the twelve CJK Unified Ideographs that are among the CJK Compatibility Ideographs (those that map to JIS X 0213:2004 have been asterisked): U+FA0E U+FA0F * U+FA11 * U+FA13 * U+FA14 * U+FA1F * U+FA21 * U+FA23 U+FA24 * U+FA27 U+FA28 U+FA29 8) Only one of the glyphs that correspond to the 75 CJK Compatibility Ideographs that map to JIS X 0213:2004 is covered: U+51DE JC8758: U+FA15 1-8758 CID+20307 The appropriate Base Characters for the following 15 CJK Compatibility Ideographs that map to JIS X 0213:2004 are covered, but their glyphs are not included: U+50E7 U+FA31 1-1441 CID+13360 U+514D U+FA32 1-1448 CID+13389 U+585A U+FA10 1-1555 CID+7746 U+6168 U+FA3E 1-8460 CID+13328 U+65E2 U+FA42 1-8511 CID+13334 U+6717 U+F929 1-8546 CID+20305 U+7422 U+FA4A 1-8805 CID+7732 U+7BC0 U+FA56 1-8968 CID+13358 U+8457 U+FA5F 1-9107 CID+13367 U+8612 U+FA20 2-8724 CID+21073 U+8B39 U+FA63 1-9216 CID+13339 U+9038 U+FA67 1-9257 CID+13320 U+9686 U+F9DC 1-9361 CID+13393 U+96E3 U+FA68 1-9367 CID+13374 U+97FF U+FA69 1-9386 CID+13337 The remaining 59 CJK Compatibility Ideographs that map to JIS X 0213:2004 are not covered: U+985E U+F9D0 1-9404 CID+13396 U+4FAE U+FA30 1-1424 CID+13382 U+52C9 U+FA33 1-1467 CID+13385 U+52E4 U+FA34 1-1472 CID+13338 U+5351 U+FA35 1-1478 CID+13378 U+559D U+FA36 1-1512 CID+7651 U+5606 U+FA37 1-1515 CID+13366 U+5668 U+FA38 1-1522 CID+13333 U+5840 U+FA39 1-1558 CID+13384 U+58A8 U+FA3A 1-1562 CID+13387 U+5C64 U+FA3B 1-4765 CID+13361 U+5C6E U+FA3C 1-4766 CID+16837 U+5ECA U+F928 1-8414 CID+20303 U+6094 U+FA3D 1-8448 CID+13326 U+618E U+FA3F 1-8462 CID+13363 U+61F2 U+FA40 1-8465 CID+21072 U+654F U+FA41 1-8508 CID+13381 U+6691 U+FA43 1-8535 CID+13352 U+6885 U+FA44 1-8569 CID+13375 U+6B04 U+F91D 1-8627 CID+13392 U+6BBA U+F970 1-8641 CID+13344 U+6D77 U+FA45 1-8673 CID+13327 U+6E1A U+FA46 1-8687 CID+7700 U+6F22 U+FA47 1-8705 CID+13332 U+716E U+FA48 1-8753 CID+13347 U+722B U+FA49 2-8009 CID+15398 U+732A U+FA16 1-8779 CID+8548 U+7891 U+FA4B 1-8907 CID+13379 U+793E U+FA4C 1-8919 CID+13348 U+7948 U+FA4E 1-8923 CID+13335 U+7949 U+FA4D 1-8920 CID+13345 U+7950 U+FA4F 1-8924 CID+13391 U+7956 U+FA50 1-8925 CID+13359 U+795D U+FA51 1-8927 CID+13351 U+795E U+FA19 1-8928 CID+8580 U+7965 U+FA1A 1-8929 CID+8581 U+798D U+FA52 1-8931 CID+13325 U+798E U+FA53 1-8932 CID+13371 U+798F U+FA1B 1-8933 CID+8583 U+7A40 U+FA54 1-8945 CID+13343 U+7A81 U+FA55 1-8949 CID+13373 U+7DF4 U+FA57 1-9014 CID+13399 U+7E09 U+FA58 2-8448 CID+18366 U+7E41 U+FA59 1-9019 CID+13376 U+7F72 U+FA5A 1-9026 CID+13353 U+8005 U+FA5B 1-9036 CID+13349 U+81ED U+FA5C 1-9056 CID+13350 U+8279 U+FA5D 2-8584 CID+14199 U+8279 U+FA5E 2-8585 CID+14198 U+865C U+F936 1-9147 CID+13394 U+8910 U+FA60 1-9179 CID+13331 U+8996 U+FA61 1-9189 CID+13346 U+8AF8 U+FA22 1-9214 CID+8622 U+8B01 U+FA62 1-9215 CID+13321 U+8CD3 U+FA64 1-9224 CID+13380 U+8D08 U+FA65 1-9229 CID+13364 U+8FB6 U+FA66 2-8973 CID+15403 U+90FD U+FA26 1-9274 CID+8636 U+983B U+FA6A 1-9391 CID+7788 The following three additional CJK Compatibility Ideographs are also not covered: U+6075 U+FA6B CID+13740 U+242EE U+FA6C CID+14281 U+8218 U+FA6D CID+13695 9) The only Base Characters that are covered are those with at least one variant form. This is different than what was done for the "Adobe-Japan1" IVD Collection in which all kanji were registered and have IVSes. That is all. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- --