What is this?
This site hosts the dataset and mining scripts for the technical report entitled "Mining programming competition archives from code forces". The paper is available for download.
Abstract
Programming competition sites (e.g., TopCoder, Codeforces) have attracted a large number of participants in recent years. In this paper, we present a dataset of programming competition archives from Codeforces, the largest public programming competition site. As an illustrative usage scenario, we use this dataset to reveal how high-skill participants modify their code compared to low-skill ones.
Dataset
You can download the dataset from the dataset download site.
Mining Scripts
The mining scripts can be found here. You can run or modify them to build your own dataset.