Re: Korean compression (was: Re: Ternary search trees for Unicode dictionaries)

From: Mark Davis (mark.davis@jtcsv.com)
Date: Tue Dec 02 2003 - 18:15:56 EST

  • Next message: Peter Kirk: "Re: MS Windows and Unicode 4.0 ?"

    Someone else originated that list.

    Mark
    __________________________________
    http://www.macchiato.com
    ► शिष्यादिच्छेत्पराजयम् ◄

    ----- Original Message -----
    From: "Frank Yung-Fong Tang" <ytang0648@aol.com>
    To: "Mark Davis" <mark.davis@jtcsv.com>
    Cc: "Doug Ewell" <dewell@adelphia.net>; "Unicode Mailing List"
    <unicode@unicode.org>; "Jungshik Shin" <jshin@mailaps.org>; "John Cowan"
    <jcowan@reutershealth.com>
    Sent: Tue, 2003 Dec 02 15:03
    Subject: Re: Korean compression (was: Re: Ternary search trees for Unicode
    dictionaries)

    Mark Davis wrote:

    > > >> UTF-16 6,634,430 bytes
    > > >> UTF-8 7,637,601 bytes
    > > >> SCSU 6,414,319 bytes
    > > >> BOCU-1 5,897,258 bytes
    > > >> Legacy encoding (*) 5,477,432 bytes
    > > >> (*) KS C 5601, KS X 1001, or EUC-KR)

    What is the size of gzip these? Just wonder
    gzip of UTF-16
    gzip of UTF-8
    gzip of SCSU
    gzip of BOCU-1
    gzip of Legacy encoding

    -- 
    --
    Frank Yung-Fong Tang
    Šýštém Årçhîtéçt, Iñtërnâtiônàl Dèvélôpmeñt, AOL Intèrâçtívë Sërviçes
    AIM:yungfongta   mailto:ytang0648@aol.com Tel:650-937-2913
    Yahoo! Msg: frankyungfongtan
    


    This archive was generated by hypermail 2.1.5 : Tue Dec 02 2003 - 19:07:20 EST