| Subject: | CJK Ext-A fix | 
|---|---|
| Date: | Tue, 22 Jan 2013 20:41:04 +0000 | 
| From: | Michel Suignard | 
Please add this message to agenda for this UTC.
This could affect 3 G sources for Ext-A characters. This is 
		potentially a Unicode 6.3 issue. Clearly the G source for 3828 is 
		erroneous.
Michel
From: Michel Suignard 
				Sent: Monday, December 03, 2012 11:08 AM
				To: 'chen-zhuang'
				Cc: csluqin
				Subject: RE: Correction of 3 G sources
Dear Chen Zhuang,
This needs a WG2 document with some explanation. I spent sometime 
		this morning deciphering what went on here.
Originally the GHZ sources were added in Extension A w/o numeric 
		references. I verified that 10646-2003 had none (just said GHZ). They 
		were added later using data provided by the Unihan database. The only 
		GHZ that had no Unihan GHZ numeric value value was 3ABF which explains 
		why it was left as is and was dubious. Removing that value is probably 
		OK. Note that every Microsoft font supporting China have had that 
		character for a while, so to some degree it has become self-referenced. 
		Font vendors use the presence of an IRG source to add the corresponding 
		glyph in their local font so there is always a small regression risk in 
		removing existing source. So I am wondering whether we should add a new 
		G ‘virtual’ source to keep the G reference alive.
Concerning 3828, the issue is duplicate GHZ source between that 
		character and 21FE2. Both are RS 46.25, Kangxi 0323.161 and for now GHZ 
		10810.02. The glyphs are very different. Evidences seem to suggest that 
		original Unihan GHZ data concerning 3828 was in error and should have 
		been 10810.03 instead of 10810.02. From a WG2/10646 point of view I 
		think that the G source change for 3828 is not controversial (Unihan has 
		still to fix its kIRGHanyuDaZidian field).
Concerning 400B, the issue is duplicate GHZ source between that 
		character and 2A279. However they have different RS (108.16 versus 
		197.10) and Kangxi (0798.171 versus 1507.311). But the only visual 
		difference between the two glyphs is that the low ‘Dish’ component
		皿 is either spanning the whole cell or the second 
		half. First we have to make sure that the dis-unification reflected in 
		the two code points (400B and 2A279) is genuine. If it is, then the GHZ 
		source for 400B should be removed with still the pending issue whether 
		or not we should keep a virtual G source for the same reason as for 
		3ABF. If the dis-unification was not correct, we have to determine which 
		of the two is the genuine code point and deprecate the other one (using 
		the UCI notation describing characters with no identified source 
		reference).
Depending on the urgency this could be reflected in Amendment 2 
		(currently under DAM ballot) or the 4th edition (currently 
		under CD ballot). Using the 4th edition is easier. This means 
		that along with the WG2 document, China should consider making a comment 
		concerning this in either ballot.
Best regards,
Michel
From: chen-zhuang
		Sent: Monday, December 03, 2012 2:15 AM
		To: Michel Suignard
		Cc: csluqin
		Subject: Correction of 3 G sources
Dear Michel,
According to IRG Resolusion M39.2, I'm requesting you to correct 
			3 G source references in UCS.
Resolution IRG M39.2: Editorial issue on CJK Unified Ideographs 
			(IRGN1884, IRGN1896)
			Unanimous
			Action: China
			The IRG requests China to report to WG2 about the deletion and 
			modification of the G-source reference of characters agreed in 
			IRGN1896.
G source of U+03828 㠨 should be changed 
			from GHZ-10810.02 to GHZ-10810.03 according to Hanyu Da Zidian (漢語大字典)
			G sources of U+0400B 䀋 and U+03ABF
			㪿 should be deleted because the real G 
			sources are not found so far.
Should I prepapare a document to WG2?
Best regards,
Chen Zhuang