Analysis and Comparison of Data Compression Algorithms

 
 
 
  • Abstract


    The amount of data shared over the internet is growing exponentially. In this digital age, where even devices such as refrigerators are connected, data needs to be stored in compressed form. Compressed data must be recoverable without loss of information; otherwise the data is deemed corrupt. As 5G communication approaches, data needs to be transferred over the internet at a higher rate. This cannot be achieved with older compression algorithms, which have lower compression ratios and even lower compression and decompression speeds. This paper analyzes modern compression algorithms alongside some older ones, then implements and compares them. The comparison uses graphs plotted with MATLAB software. The compression algorithms compared are Deflate, bzip2, Zstandard, Brotli, LZ4, and LZO. The files used for compression were taken from the Canterbury and Silesia corpora.
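
    The three metrics named here (compression ratio, compression speed, decompression speed) can be illustrated with a minimal Python sketch. It is not the paper's benchmark code: it covers only Deflate and bzip2, because those two ship in the Python standard library (zlib and bz2), and it uses a synthetic repetitive string as a placeholder for a corpus file. It also assumes that compression ratio means uncompressed size divided by compressed size, and the helper name benchmark is hypothetical.

```python
import bz2
import time
import zlib


def benchmark(name, compress, decompress, data):
    """Return the ratio and throughput (MB/s) of one codec on one input."""
    start = time.perf_counter()
    packed = compress(data)
    compress_seconds = time.perf_counter() - start

    start = time.perf_counter()
    restored = decompress(packed)
    decompress_seconds = time.perf_counter() - start

    # Lossless check: the round trip must reproduce the input exactly.
    if restored != data:
        raise ValueError(f"{name}: round trip did not restore the input")

    megabytes = len(data) / 1e6
    return {
        "name": name,
        # Assumed definition: uncompressed size / compressed size.
        "ratio": len(data) / len(packed),
        # Guard against a zero timer reading on very fast runs.
        "compress_mb_s": megabytes / max(compress_seconds, 1e-9),
        "decompress_mb_s": megabytes / max(decompress_seconds, 1e-9),
    }


# Placeholder input (an assumption): repetitive text standing in for a corpus file.
sample = b"The quick brown fox jumps over the lazy dog. " * 20000

results = [
    # zlib.compress emits a Deflate stream inside the small zlib wrapper.
    benchmark("Deflate (zlib)", zlib.compress, zlib.decompress, sample),
    benchmark("bzip2", bz2.compress, bz2.decompress, sample),
]

for row in results:
    print(f"{row['name']:<16} ratio={row['ratio']:.1f} "
          f"compress={row['compress_mb_s']:.1f} MB/s "
          f"decompress={row['decompress_mb_s']:.1f} MB/s")
```

    Timings from a single run vary with the machine and with the input, so a real comparison would repeat each measurement and read actual files from the Canterbury and Silesia corpora. Zstandard, Brotli, LZ4, and LZO need third-party bindings and are left out of this sketch.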

     

     

     

  • Keywords


    Compression; Compression Ratio; MATLAB; Compression Speed; Decompression Speed; Canterbury Corpus; Silesia Corpus.



 


Article ID: 20350
 
DOI: 10.14419/ijet.v8i3.20350




Copyright © 2012-2015 Science Publishing Corporation Inc. All rights reserved.