Miscellany

TR1.C

 

tr1.c (included in tr1.zip) - is an implementation in C of a data source transformation method suitable for *binary* source (e.g., arithmetic) coding.

 

 

SIEVE.C

 

sieve93.c - is a C implementation of a prime number generation method (the "Sieve") due to the mathematician Eratosthenes (200 B.C.). It can be used to "see" if an odd number is prime or composite. Sieve93 is a very compact sieve though not optimal sieving. The size of the LZW string table employed in the *compress* program of Unix, for example, must be a prime number. 



LZUF file compressor 


Unary "folded" coding (UFC) in lzuf.c is actually a kind of Golomb-Rice coding. The put_vlcode() function in ucodes.c library can write the universal codes Elias-Gamma codes and Exponential-Golomb codes. 



Reduced Length LZ (RLLZ) 


A description of a reduced length LZ is here, which improves on LZ77/LZSS/LZW. This is the most compact LZ algorithm.



Large Text Compression Benchmark


A very good source for information theory and data compression algorithms is "Data Compression Explained," an online book by Matt Mahoney. As well as the "Large Text Compression Benchmark" page. 


"There is no such thing as universal compression, recursive compression, or compression of random data." 

   -- Matt Mahoney, Data Compression Explained



Download Matt Mahoney's enwik8 and enwik9 test files for the Large Text Compression Benchmark (LTCB). And the Calgary Corpus.



Compiled Executables


Provided compiled executables (EXEs) of some of the compression source codes featured in this site. The programs are in the archive compressgt-exes.zip and are compiled using the free Borland BCC32 compiler. The newer compressors in this site are compiled with MingGW's g++. Of course, you can opt to recompile using your best compiling optimization tricks. (9/2/2017)


TheDataCompressionGuideZIP - all ZIP files in one ZIP for your easy download!



My Blog: (Gerald R Tamayo blog). 


Facebook: You can find me on Facebook here


My Twitter:

https://www.twitter.com/geraldrtamayo1


GitHub Page:

https://github.com/grtamayo     (If  you have trouble downloading the ZIP files, download from GitHub instead. It is more up to date there.)
















Copyright (c) 2010-2024