Welcome to the CornSeeds project! This is an ongoing research effort to provide agricultural researchers around the world with corn seed image data for training seeds classification/detection models. The data is available for free to researchers for non-commercial use.
Yes. For details, visit the link given below.
github.com/Naagar/Seeds_Classification
Sandeep Nagar, ML Lab, IIIT-Hyderabad
Prateek Pani, ML Lab, IIIT-Hyderabad
Raj Nair, AdTech Corp.
Prof. Girish Varma, CSTAR Lab and ML Lab, IIIT-Hyderabad
For students, advisors, and other contributors to the project, please see the list of publications below.
Nagar, S., Pani, P., Nair, R., Varma, G.,: Automated Seed Quality Testing System using GAN & Active Learning, In: PReMI 2021, ISI Kolkata
Dataset (folder_name)
Train
-broken
-discolored
-pure
-silkcut
Test
-broken
-discolored
-pure
-silkcut
We are grateful for support from Adtech and the International Institute of Information Technology (IIIT)-Hyderabad, which enabled this project.
This dataset is the images of corn seeds considering the top and bottom view independently (two images for one corn seed: top and bottom). There are four classes of the corn seed (Broken-B, Discolored-D, Silkcut-S, and Pure-P) 17802 images are labled by the experts at the AdTech Corp. and 26K images were unlablled out of which 9k images were labled using the Active Learning (BatchBALD)
We have created three different datasets: (1). Primary dataset: contains the 17802 images labeled by the experts. Top-view(8901) and Bottom-view(8901).
(2). Dataset with fake images: We generated fake images using Conditional GAN (BigGAN) as follows: broken-2937, discolored-5823, pure-2937, and silk cut-5823 instances and added them into the train set to balance the data set.
(3). Balanced dataset: In this case of adding newly captured images labeled using the Batch Active Learning method, new 9000 labeled images are added to the primary dataset. This new dataset contains 26,802 images split into train and validation set 80: 20, respectively. Contains the 17802 images and the 9K images labeled by the Active Learning (BatchBALD).
Machine vision for precision agriculture has attracted research interest in recent years. Plant health monitoring approaches are addressed, including weed, insect, and disease detection. With the success of DNNs, different methods have been proposed to tackle problems of corn seed classification. Fine-grained objects (seeds) are visually similar by a rough glimpse, and details can correctly recognize them in discriminative local regions. We hope that this dataset set focus of computer vision and machine learning researchers on the problems related to automation and agriculture.
Image pre-processing python's code click here