Datasets

Contact us if you know of other datasets we could include here

ROSE-X (2021)

ROSE-X: an annotated data set for evaluation of 3D plant organ segmentation methods

Sorghum Biomass Prediction (2021)

CVPPPA @ ICCV 2021 Workshop Challenge Dataset. Given RGB stereo images of plots of Sorghum, estimate the end-of-season biomass. This dataset contains 277,327 images of 176 sorghum species with biomass yields from 349 plots.


GLOBAL WHEAT HEAD DETECTION DATASET (2021)

CVPPPA @ ICCV 2021 Workshop Challenge Dataset. This dataset contains 4,700 high-resolution RGB images and 190,000 labelled wheat heads collected from several countries around the world at different growth stages with a wide range of genotypes. All images share a common format of 1024Ă—1024 px.


AGRICULTURE-VISION DATABASE (2020)

CVPR 2020 Agri-Vision Workshop Challenge. This challenge dataset contains 21,061 aerial farmland images captured throughout 2019 across the US. Each image consists of four 512x512 color channels, which are RGB and Near Infra-red (NIR). Each image also has a boundary map and a mask. The boundary map indicates the region of the farmland, and the mask indicates valid pixels in the image.

The dataset contains six types of annotations: Cloud shadow, Double plant, Planter skip, Standing Water, Waterway and Weed cluster.

MinnieApple (2019)

A field-level dataset contain images and annotations for fruit detection, segmentation, and counting in orchard environments.


Mango Yolo (2017)

A field-level fruit localization dataset containing 1,700 annotated images of mango plants.


ACFR Orchard Fruit Dataset (2016)

A field-level fruit localization dataset containing 1120 Apple image, 1964 Mango images, and 620 Almond images. The apple and Almond images are 308x202 RGB images, while the mango images are 500x500 RGB images. Fruits are localized using either bounding boxes or bounding circles.


MSU-PID

This dataset contains multimodal images of arabidopsis bean plants. The represented modalities are fluorescence, infrared(IR), RGB color, and depth. There are 16 arabidopsis plants and 5 bean plants tracked across several days, with each image having a corresponding segmentation mask for each leaf.

For related papers, click here.

cvppp lsc + lcc

This dataset contains the ground truth data for the CVPPP leaf segmentation challenge and the leaf counting challenge. It contains several images of arabidopsis plants with associated masks for each leaf.

For related papers, click here.

KOMATSuna

This dataset is utilized for 3D phenotyping, instance segmentation, leaf tracking, and reconstruction. It contains temporal RGB-D and multi-view images of several Komatsuna plants across their lifespan.

For related papers, click here.

fruit flower detection

This dataset contains images of flowering pear, apple, and peach plants under different lighting conditions. The dataset contains instance segmentation masks for the flowers, and there are 190 images. The intention for this dataset is that the network is trained on 100 images of apple flowers and the network parameters are transferred to the task of instance segmentation for the flowers of other plant species.

For related papers, click here.

Leaf counting

This dataset contains 9372 RGB images of various species of weeds annotated with the number of leaves on the plants.

For related papers, click here.

Plant Seedlings


Plant image analysis


Oil Radish Growth Dataset


PLANT 3D

3D scans of shoots and roots for different plant species

VIVC DATABASE

Vitis International Variety Catalogue (VIVC) contains an inventory of Vitis species, varieties and other Vitis genotypes existing in the grapevine collections world-wide

GRASS CLOVER IMAGE DATASET

The dataset consists of synthetic images of grass and clover mixtures with pixelwise species labels in a hierarchy, collected images of grass and clover mixtures with biomass composition labels (grass, clover, weeds), and unlabeled images of grass and clover mixtures.


DRYAD TERRA-REF DATASET

An open-access reference dataset for the study of plant sensing.

Sensor data were generated by a field scanner sensing platform that captures color, thermal, hyperspectral, and active flourescence imagery as well as three dimensional structure and associated environmental measurements. This dataset is provided alongside data collected using traditional field methods.

Plant PATHOLOGY 2020

This dataset contains expert-annotated, manually-captured, high-quality, real-life symptom images of multiple apple foliar diseases, with variable illumination, angles, surfaces, and noise. It is a pilot dataset for apple scab, cedar apple rust, and healthy leaves, which was made available to the Kaggle community for 'Plant Pathology Challenge'; part of the Fine-Grained Visual Categorization (FGVC) workshop at CVPR 2020 (Computer Vision and Pattern Recognition).


Aberystwyth Leaf Evaluation


Awesome Satellite Imagery Datasets

List of aerial and satellite imagery datasets with annotations for computer vision and deep learning.

Synthetic plant generation tools

Several resources for synthetic plant generation in virtual scenes are listed in this section.

Procedural generation algorithms: A survey containing popular systems that implement procedural techniques to model plants and natural phenomena. Some of the resources mentioned are listed below.


Software Packages: A table comparing different software tools for plant generation can be found here. Examples of said tools are listed below.