Re: Most commonly used characters not in BMP

From: John H. Jenkins (
Date: Mon Jun 14 2010 - 11:10:28 CDT

  • Next message: Stephen Slevinski: "Re: Writing a proposal for an unusual script: SignWriting"

    Some characters in the SIP are more common in Chinese written in the HK SAR than any character in Extension A, either because they are Hong Kong toponyms (or the like), or are Cantonese-specific. (My own analysis of text on the Chinese Wikipediæ is that the most common are U+23D13, U+282E2, U+28B4E, and U+2A568, which occur seven times each.)

    I imagine that the best data would come from Google.

    And there are some Web sites out there in Deseret and Shavian, as well. (If nothing else, both Deseret and Shavian versions of xkcd are available. I'm not aware of any Linear B translations.)

    On 2010/6/14, at 上午8:48, Frédéric Grosshans wrote:

    > Is there any data on the most commonly used characters which are not in
    > BMP ?
    > I have the impression that SMP characters are mainly used scholars
    > (historic scripts and math symbols). However, I have no idea whether the
    > SIP characters are mainly historical, or if they include not-so rare
    > characters needed for name and/or chinese dialects.
    > Frédéric Grosshans

    This archive was generated by hypermail 2.1.5 : Mon Jun 14 2010 - 11:13:50 CDT