Re: Unihan

From: samuel gilman (samuelgilman@gmail.com)
Date: Thu Jan 06 2011 - 17:39:40 CST

  • Next message: John H. Jenkins: "Re: Unihan"

    Thanks for the fast reply but it is still confusing for me.

    What I want to do is separate traditional from simplified.
    How could I do that?

    Sam

    On Thu, Jan 6, 2011 at 5:26 PM, Magnus Bodin <magnus@bodin.org> wrote:

    > 2011/1/7 samuel gilman <samuelgilman@gmail.com>:
    > > Hope there are some people still on this list!
    > > I'm trying to separate out the traditional and simplified Chinese
    > characters
    > > within the Unihan database.
    > > In the Unihan_Variants.txt it seems to show when the characters vary but
    > > it's unclear to me.
    > > U+3469 kTraditionalVariant U+5138
    > > U+346E kSimplifiedVariant U+2B748
    > > U+346F kSimplifiedVariant U+3454
    > > U+346F kTraditionalVariant U+3454
    > > U+3473 kSimplifiedVariant U+3447
    > > U+3473 kTraditionalVariant U+3447
    > > I took this straight out of Unihan_Varients.txt.
    > > Can someone explain what this means?
    > > All I need from this is to figure out which variant traditional and
    > which
    > > form is simplified.
    >
    > I'll quote an answer I got to a similar question from August 2008:
    >
    > "Please see the description for field kSimplifiedVariant in [1]:
    >
    > Note that a character can be *both* a traditional Chinese character in its
    > own right *and* the simplified variant for other characters (e.g.,
    > U+53F0).
    >
    > In such case, the character is listed as its own simplified variant and
    > one
    > of its own traditional variants. This distinguishes this from the case
    > where
    > the character is not the simplified form for any character (e.g., U+4E95).
    >
    > [1] http://www.unicode.org/reports/tr38/#AlphabeticalListing"
    >
    > -- magnus
    >



    This archive was generated by hypermail 2.1.5 : Thu Jan 06 2011 - 17:42:03 CST