• A framework for statistical compressors I've uploaded some baseline package, see here, i usually use to write statistical compressors, there's everything done you need: a rangecoder, some common stuff, fast IO, a driver ...
    Posted Jan 26, 2012 11:19 PM by Christopher M.
  • Interesting Papers? For those who are interested in data compression research: I've uploaded two of my papers (preprint versions) in the Documents section. One of the papers is about basic bit ...
    Posted Jan 13, 2012 8:04 AM by Christopher M.
  • Weight update for logistic mixing Following a recent thread  and having found a possibility to display formulas in Google sites, i decided to mess around with this again. First i want to point out that ...
    Posted Nov 4, 2010 11:00 AM by Christopher M.
  • CMM* Internals The series of CMM compressors was due to my interest in learning the basics of bitwise context mixing (CM). Basically all of these are PAQ clones with speed optimizations and ...
    Posted Oct 20, 2010 12:33 PM by Christopher M.
  • Bit history analysis On a request i recovered a partially reimplemented tool (due to hdd crash) for dumping bit histories and generating some stats. To build models one first needs some understanding of ...
    Posted Oct 17, 2010 7:42 AM by Christopher M.
Showing posts 1 - 5 of 7. View more »


Welcome to my site!

From time to time i will publish some of my data compression projects and experiments on this site. My work is mostly located in the area of statistical compression, context mixing (CM) in particular.

Data compression algorithms transform data streams, typically a sequence of symbols over a finite alphabet, into a less redundant representation. Statistical data compression breaks the compression process into two phases: modeling and coding. A statistical model generates probability estimations for upcoming symbols. Based on these estimations the coding step assigns unique encodings to the symbols. In contrast to classical methods, like Prediction by Partial Matching (PPM), CM combines multiple models prediction and often achieves outstanding compression performance. However it comes at its cost: astronomic CPU time and RAM usage.

I'm interested in defeating these shortcomings, i.e. creating efficient, universal statistical models and their automated fitting to training data. This approach mirrors engineering techniques for model development. The inefficiency which is related to wrong model parameters might mask out the superiority of a certain model structure and lead to wrong conclusions, i.e. parameter optimization can minimize this effect.

Christopher M. aka toffer
nickname86@gmx.net