RE: Unicode conference papers

From: Richard Ishida
Date: Tue Nov 21 2006 - 05:04:39 CST

    Hi Mark,
    Thanks for making these available. Would it be possible to add PDF versions of the slides?
    I have a couple of questions about Unicode at Google:
    1. Could you explain slide 17 a little (Queries vs. pages)? What quantity does the y-axis represent in each case?
    2. What is doubly-encoded UTF-8?
    3. Slide 20 (Charset tagging trends) seems to indicate that around 72% of HTML pages now contain encoding declarations in the meta tag. Is that correct? (E.g., is the declaration for some pages in the XML declaration?) That seems like a high number (though I'm not complaining). I'm surprised that the figure for the HTTP header isn't at least as high, though, since I'd have thought that many servers are set up to serve a default encoding. Do you have any explanation for that result?
    4. It would be interesting to know what proportion of the character encoding and language declarations shown are considered to be incorrect (presumably the graphs alluded to in question 3 include those).

    Richard Ishida
    Internationalization Lead
    W3C (World Wide Web Consortium)



            From: Mark Davis
            Sent: 21 November 2006 02:44
            To: Unicode
            Subject: Unicode conference papers
            A few people asked about getting my slides from last week's conference. I posted them on my site:

            * Unicode at Google
            * Globalization News

