Download The Dataset

The full CoSTAR Block Stacking Dataset is approximately 0.5 TB in size with 16k files. Individual stacking attempt files are typically between 30MB and 50MB, outliers can range from 15kb if there are no images to about 80MB. Text files (txt, csv, yaml, md) with documentation, subsets and configuration information are also included.

The dataset files can be viewed and downloaded individually at the Johns Hopkins CoSTAR Dataset on the Internet Archive. We thank the Internet Archive for graciously hosting our dataset and for their mission of universal knowledge access and preservation! If you have a dataset of your own please consider uploading it to them.

Download Successful Stacking Examples

This will most likely be the easiest way for you to get started by downloading the successful stacking examples in the dataset without distractors.

git clone https://github.com/ahundt/costar_dataset.git

cd costar_dataset/costar_dataset

mkdir -p ~/.keras/datasets/costar_block_stacking_dataset_v0.4

python download_success.py ~/.keras/datasets/costar_block_stacking_dataset_v0.4

Download directly from the Internet Archive

Install the internetarchive python download utility

pip install internetarchive

Download the complete dataset

mkdir -p ~/.keras/datasets/costar_block_stacking_dataset_v0.4

cd ~/.keras/datasets/costar_block_stacking_dataset_v0.4

ia download johns_hopkins_costar_dataset

Download a smaller subset

View other download options:

ia download --help

Download small files including documentation, human created labels, train/val/test subsets, and camera calibration:

mkdir -p ~/.keras/datasets/costar_block_stacking_dataset_v0.4

cd ~/.keras/datasets/costar_block_stacking_dataset_v0.4

ia download johns_hopkins_costar_dataset --glob="*.md"

ia download johns_hopkins_costar_dataset --glob="*.csv"

ia download johns_hopkins_costar_dataset --glob="*.txt"

ia download johns_hopkins_costar_dataset --glob="*.yml"

Files containing the substring 'success':

mkdir -p ~/.keras/datasets/costar_block_stacking_dataset_v0.4

cd ~/.keras/datasets/costar_block_stacking_dataset_v0.4

ia download johns_hopkins_costar_dataset --glob="*success*"

Download files Individually on the Web

The dataset files can be viewed and downloaded individually at the Johns Hopkins CoSTAR Dataset on the Internet Archive.

GitHub Code

Code for this dataset and neural network models is from our paper The CoSTAR Block Stacking Dataset: Learning with Workspace Constraints, with examples and instructions further down on this page:

  1. CoSTAR Dataset Loading Repository — PyTorch and TensorFlow dataset loader

  2. HyperTree Architecture Search Code — aka CoSTAR Hyper, requires TensorFlow

  3. rENAS: regression Efficient Neural Architecture Search Code — requires TensorFlow

  4. CoSTAR Objects - 3D Models of blocks, bin, and other objects

Code for CoSTAR itself is at:

Download with BitTorrent

johns_hopkins_costar_dataset_archive.torrent

This will most likely be the fastest way for you to download the dataset. BitTorrent is an efficient distributed data transfer protocol which you can use by installing a bittorrent client application. You then use the small bittorrent file provided by the Internet Archive which tells the client how to download the dataset.

Ubuntu 16.04 download instructions:

# Ubuntu

sudo apt-get update

sudo apt-get install aria2

Then, use the following commands to download the torrent file.

mkdir -p ~/.keras/datasets

cd ~/.keras/datasets


# single line command to perform the full dataset download

aria2c https://archive.org/download/johns_hopkins_costar_dataset/johns_hopkins_costar_dataset_archive.torrent -d ~/.keras/datasets/

After the files are downloaded use the following command to rename the folder.

cd ~/.keras/datasets/

mv johns_hopkins_costar_dataset costar_block_stacking_dataset_v0.4