Re: CJK Ideograph Fragments

From: John H. Jenkins (jenkins@apple.com)
Date: Wed Apr 28 2010 - 14:36:53 CDT

Next message: Kenneth Whistler: "Re: [indic] Halant - can it be called a "Linguistic Zero" (Panini)?"

Previous message: Uriah Eisenstein: "CJK Ideograph Fragments"
In reply to: Uriah Eisenstein: "CJK Ideograph Fragments"
Next in thread: mpsuzuki@hiroshima-u.ac.jp: "Re: [unicode] CJK Ideograph Fragments"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

No. You could certainly write up a proposal and submit it to the UTC. Should the UTC feel the idea has merit, it would then move it on to WG2 and/or the IRG.

The main problem here is that there is a very strong desire to limit ideograph encoding to attested and documentable forms. Anything which does not exist in actual texts is not likely to be well-regarded. Similarly, the UTC has a strong preference not to encoding anything which isn't in actual use. Proposals to encode characters because they would be useful if encoded even though they aren't actually being used right now are generally looked on with disfavor.

在 Apr 28, 2010 12:03 PM 時， Uriah Eisenstein 寫到：

> Hello,
> My question is about common components of CJK Ideographs which are not encoded as independent Han characters (and perhaps indeed aren't). A good example is the right-hand part of the character 漢 itself: it is a distinct component appearing in multiple other characters, but is not encoded to the best of my knowledge. The same goes for the top part of 鳥 and 島, the surrounding part of 與 and 興 and several others. My question is whether there are any plans or discussions for encoding these fragments in Unicode.
>
> (I haven't found anything about this in mailing list archives; I did find statements that Unicode does not intend to provide any decomposition data of Han characters :) And for good reasons. However, such fragments may well be useful for third-party software dealing with 漢字 glyph generation, lookup by components etc.)
>
> Thanks,
> Uriah Eisenstein

Next message: Kenneth Whistler: "Re: [indic] Halant - can it be called a "Linguistic Zero" (Panini)?"
Previous message: Uriah Eisenstein: "CJK Ideograph Fragments"
In reply to: Uriah Eisenstein: "CJK Ideograph Fragments"
Next in thread: mpsuzuki@hiroshima-u.ac.jp: "Re: [unicode] CJK Ideograph Fragments"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
Mail actions: [ respond to this message ] [ mail a new topic ]

This archive was generated by hypermail 2.1.5 : Wed Apr 28 2010 - 14:38:20 CDT