What is this?

This site is for dataset and mining scripts for a technical report entitled "Mining programming competition archives from code forces". You can download a paper

Abstract

Programming competition sites (e.g., TopCoder, Codeforces) have attracted a large number of programmers to participate in recent times. In this paper, we present a dataset of programming competition archives from Codeforces, the largest, public programming competition site. As an illustrative usage scenario, we demonstrate the use of this dataset to reveal how high-skill participants modify their code compared to low-skill ones.

Dataset

You can download dataset from a dataset download site

Mining Scripts

The mining scripts can be found here, you can run/modify them to get your own dataset.

>> Mining script for submission information data