Re: Characters

From: Mark Davis ☕ (mark@macchiato.com)
Date: Sun Feb 13 2011 - 20:26:01 CST

  • Next message: William_J_G Overington: "Re: Characters"

    As people have said on this thread, it depends entirely on the data sample
    in use. As it turns out, when looking at HTML pages on the web (with a
    good-sized sample from work here at Google), SPACE is the most frequent
    character (by a huge margin). That is even true on Chinese pages, just
    because of the proportion of markup on pages.

    For those interested, the most frequent Alphabetic is 'e'.

    Mark

    *— Il meglio è l’inimico del bene —*

    On Sat, Feb 12, 2011 at 02:13, Charlie Ruland <ruland@luckymail.com> wrote:

    > U+0020 SPACE is by no means ‘the most used character’ universally. For
    > Chinese it is completely unnecessary, not only when writing from top to
    > bottom. The same is probably true for Japanese and ‘early forms’ of
    > influential W Eurasian languages such as Phoenician, Hebrew, Greek and
    > Latin. And further examples from other parts of the world won’t be hard to
    > find.
    > Charlie



    This archive was generated by hypermail 2.1.5 : Sun Feb 13 2011 - 20:31:19 CST