Search this site
Embedded Files
Intelligent Systems for Date Fruits
  • Home
  • Date database
  • Variety classification
  • Maturity analysis
  • Harvesting decision system
  • Fruit detection & segmentation
  • Visual yield estimation
  • About
Intelligent Systems for Date Fruits
  • Home
  • Date database
  • Variety classification
  • Maturity analysis
  • Harvesting decision system
  • Fruit detection & segmentation
  • Visual yield estimation
  • About
  • More
    • Home
    • Date database
    • Variety classification
    • Maturity analysis
    • Harvesting decision system
    • Fruit detection & segmentation
    • Visual yield estimation
    • About

Date database

Variety classification

Maturity analysis

Harvesting decision

Fruit detection

Yield estimation

About

Date fruit dataset for intelligent harvesting

Importance

Research on automated date fruit harvesting is limited as there is no public dataset for date fruits to aid in this. Hence in this work, we built a comprehensive dataset for date fruits that can be used by the research community for multiple tasks including automated harvesting, visual yield estimation, and classification tasks. 

The dataset has been fully labeled, coded, and released with their associated files freely to the research community in the IEEE DataPort repository ( http://dx.doi.org/10.21227/x46j-sk98). 

Value of the data

  • Advanced agriculture automation such as robotic harvesting can significantly increase quality and yield and reduce production costs and delay. This dataset is a contribution to agriculture automation research.

  • The dataset advances the machine vision research of date fruit in pre-harvesting and harvesting stages. To the best of our knowledge, this is the only publicly available dataset for date fruit pre-harvesting and harvesting applications.

  • The date dataset consists of images, videos, and weight measurements, which can be used for various machine vision applications for date fruit in orchard environments, including automated harvesting, fruit detection and segmentation, classification, maturity analysis, and weight/yield estimation.

  • The dataset has multiple sets of variations that reflect the challenges in agriculture environments and date orchards, which is essential for building a reliable and robust machine vision.

Data Description

The date fruit dataset was created to address the requirements of many applications in the pre-harvesting and harvesting stages. The two most important applications are automatic harvesting and visual yield estimation. Since date applications require different types of data with different characteristics, we built two separate datasets for the benefits of researchers in different date applications. Fig. 1 gives a brief overview of the two datasets.

Dataset-1

The first dataset (dataset-1) contains 8079 images of date fruit bunches for five date varieties (Naboot Saif, Khalas, Barhi, Meneifi, and Sullaj), showing various pre-maturity and maturity stages. These images cover a large degree of variabilities in order to address the challenges in the date orchard environment, such as multi-scale images, variable illumination, different angles, and diverse bagging states. Dataset-1 can be used in many applications such as automatic harvesting tasks, fruit detection, fruit recognition, maturity analysis, etc.

Sample images of the dataset-1 showing large variation in scales, angles, and illumination.

Sample images of the five date varieties in dataset-1 showing different maturity stages. Some bunches are covered with bags for their protection. 


Sample images of a Sullaj date captured in the six imaging sessions and covered all date maturity stages (immature, Khalal, Rutab, and Tamar). 


Distribution of the images captured at each imaging session between the seven maturity classes. Images captured in one imaging session are distributed between many classes. For instance, the images captured in session-4 are distributed between four classes for Meneifi dates (Immature-2, Pre-Khalal, Khalal, and Khalal-with-Rutab) and Sullaj dates (Pre-Khalal, Khalal, Khalal-with-Rutab, and Pre-Tamar). 


The state of each palm during the six imaging sessions: harvested, partially harvested, or not harvested. 

Dataset-2

The second dataset (dataset-2) consists of 152 Barhi date bunches belonging to 13 palms. All bunches were weighted after harvesting, and their images were captured in front of a white graph paper, as shown in  Fig. 6. Subset of these bunches (120 bunches from nine palms) were provided with more inclusive data: we marked the whole bunches in the palm, captured their images from different angles before and during harvesting, recorded 360° video for each palm, and registered their characteristics (height, trunk circumference, total yield, and number of bunches).

1.  Date bunches were marked on trees . 

3.  The bunches were captured in front of a graph paper. 

2.  Images and videos were collected . 

4.  The weights of date bunches were recorded. 

In addition to the whole bunch data, other information was collected: images and weights of the bunch components (individual dates and bunch stalk).

64-individual Barhi dates at the four maturity stages (16 dates per stage): immature, Khalal, Rutab, and Tamar 


Samples of single date weights for the Barhi variety in three maturity stages: Khalal, Rutab, and Tamar. 

Dataset Coding

In this dataset, we arranged the data (dates, bunches, and palms and their related visual data, i.e., image/video files, or numerical data, e.g., weight), with a coding scheme to simplify referring, linking, and facilitating future extensions of the dataset. 

The coding scheme consists of four parts separated by dots, as shown in the following figure. The first part is the code for the palm tree, which consists of two parts: palm type (variety) and palm number. The palm number is omitted if the data is not associated with a specific palm (e.g. in the case of individual dates). The second part usage depends on the dataset. In dataset-2, this part consists of a character that refers to the maturity stage of the dates (I: Immature, K: Khalal, R: Rutab, and T: Tamar). In the dataset-1, this part refers to the imaging session code, which consists of a session symbol [S] and session number, e.g., S1 refers to the first imaging session. The imaging session was used in dataset-1 instead of referring to the maturity stage explicitly because this dataset was collected over several periods of time during the maturity development of the fruit. Therefore, in one period (imaging session) the dates may have multiple maturity stages (some dates ripen before others). The third part contains one or more symbols that refer to the type of data, as stated in the following figure. Multiple symbols in this part refer to multi-data, e.g., BW refers to a visual bunch data that also has a weight associated with it. The combination of these symbols should be in the same order as given in the following figure, starting with visual data symbols (B, S, T) followed by numerical data symbols (W, D). The fourth part consists of a numerical value that indicates the data sequence. In the single date images, where the image contains 16 dates, this part refers to the range sequence, e.g., 1-16. 

A description of the symbols used in the dataset coding. 

Data source location 

The datasets were recorded in an orchard in Al-Ammaria, located 25 km Northwest of Riyadh, Saudi Arabia. The orchard contained more than 800 date palms of different varieties and ages. 

Referred paper:

  • H. Altaheri, M. Alsulaiman, M. Faisal, and G. Muhammad, Date Fruit Dataset for Automated Harvesting and Visual Yield Estimation, IEEE DataPort, v1, 2019. doi: 10.21227/x46j-sk98. 

  • H. Altaheri, M. Alsulaiman, G. Muhammad, S. U. Amin, M. Bencherif, and M. Mekhtiche, “Date Fruit Dataset for Intelligent Harvesting", Data in Brief, vol. 26, p. 104514, Oct. 2019. doi: 10.1016/j.dib.2019.104514 


Dataset download links: (You have to login with an IEEE Account to download the files. IEEE Account is FREE)

  • Dataset-1

- DATASET-1: 8079 images of five date types captured in six imaging sessions.

- DATASET-1: A preview of the 8079 images. size: 224 X 224.

- DATASET-1: The annotation (labeling) files for type classification, maturity analysis, and harvesting decision applications.

  • Dataset-2

- DATASET-2: Images of 152 Barhi date bunches before and during harvesting.

- DATASET-2: Images of the 152 Barhi date bunches in front of graph paper.

- DATASET-2: The weight measurements of the 152 Barhi date bunches.

- DATASET-2: 360-degree videos of the nine Barhi date palms (120 date bunches).

- DATASET-2: Images of 11 Sullaj date bunches in front of graph paper with weight & dimensions measurements.

- DATASET-2: Images of individual dates and bunches stalks with weight & dimensions measurements.

Contact
For any questions, feel free to contact:Hamdi Altaheri: altaheri@ieee.org Prof. Mansour Alsulaiman: msuliman@ksu.edu.sa 
Intelligent Systems for Date Fruits
Google Sites
Report abuse
Google Sites
Report abuse