I tested several file compression programs like zip, gzip, arj, bzip2, jar etc for compressing big files.我試了幾個文件壓縮程序一樣,郵編, gzip , arj ,的bzip2 ,瓦罐等壓縮大文件。 The corpus constituted 5 POI generated Microsoft Excel documents totaling 298.8 MB.語料庫構成5 poi生成的Microsoft Excel文件,共計298.8 MB的容量。 And there is a clear winner!和有一個明顯的贏家!

About the data 有關數據
The file excel documents were standard corporate data of a very big corporation (read Fortune 500).該文件Excel文件都是標準的公司數據的一個很大的公司(閱讀財富500強) 。 There is nothing special about the data as such, regular text data in excel files.有沒有什麼特別的有關數據等,定期文本數據在Excel文件。 For obvious reasons I cannot share the data for independent verifications.原因很明顯,我不能共享數據,為獨立的核查。

Archive formats not tested 存檔格式尚未測試
I haven't tested two popular file formats - rar & 7zip as they aren't easily available on Linux.我沒有測試,兩種流行的文件格式-的R AR& 7 zip,因為他們是不會輕易可在L inux上。

Results 結果

Compression Algorithm壓縮算法 Compressed Size壓縮規模 % Compression %壓縮
tar.bz2 10.9 MB 10.9 MB的 96.35
tar.gz 52.5 MB 52.5 MB的 82.43
zip郵編 52.5 MB 52.5 MB的 82.43
arj 52.5 MB 52.5 MB的 82.43
jar瓦罐 52.5 MB 52.5 MB的 82.43

Test Notes 測試債券
The File Roller archive manager which ships with Gnome UI on Linux provides even better bzip2 compression than bzip2 -9!檔案輥存檔經理,其中船舶與GNOME的用戶界面在Linux上提供更Bzip2壓縮比的bzip2 -9 !
bzip2 -9 compressed to 12 MB.的bzip2 -9壓縮到了12 MB 。

I also tried .tar.zip which was the worst.我也嘗試。 tar.zip這是最壞的。

All the file formats took comparable times but then I tested them on a Core 2 Duo 6600 with 2 GB RAM and RAID 1 SATA drives所有的文件格式了可比倍,但後來我測試,他們就一Core 2 Duo的6600具有2 GB的RAM和RAID 1的SATA驅動器 : )
As such the results do not speculate about performance.作為這樣的結果並不猜測的表現。

All the compressed files were tested for accuracy of data.所有壓縮文件測試數據的準確性。

Winner 贏家
This shows all popular compression algorithms are on the same level with the sole exception of bzip2, which stands leagues ahead of the rest.這表明,所有流行的壓縮算法是在同一水平,唯一的例外的的bzip2 ,即聯賽先行。 The clear winner of compression algorithms is bzip2.明確的贏家壓縮算法的bzip2 。

Linux and Windows users can use bzip2 by directly running the bzip executable (downloadable from Linux和Windows用戶可以使用的bzip2 ,直接運行bzip可執行文件(下載 bzip2.org ). ) 。 The latest version of 7Zip and WinZip, both supports bzip2 format.最新版本的7zip和WinZip的,無論是支持的bzip2格式。

Linux users have a winner in Linux用戶有一個贏家, File Roller檔案輥 .