Towards precision agriculture: A dataset for early detection of corn leaf pests

IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES Data in Brief Pub Date : 2025-04-01 Epub Date: 2025-02-14 DOI:10.1016/j.dib.2025.111394
Thierry Tchokogoué , Auguste Vigny Noumsi , Marcellin Atemkeng , Louis Aimé Fono
{"title":"Towards precision agriculture: A dataset for early detection of corn leaf pests","authors":"Thierry Tchokogoué ,&nbsp;Auguste Vigny Noumsi ,&nbsp;Marcellin Atemkeng ,&nbsp;Louis Aimé Fono","doi":"10.1016/j.dib.2025.111394","DOIUrl":null,"url":null,"abstract":"<div><div>Corn (<em>Zea mays</em>), commonly referred to as Indian wheat, is a widely cultivated tropical annual herbaceous plant of the Poaceae family. It is primarily grown for its starch-rich grains and as a forage crop. In Cameroon, corn is the most consumed cereal, surpassing rice and sorghum, with an estimated production of 2.2 million tons annually. However, corn production is frequently threatened by insect infestations, which hinder crop development, reduce yields, and degrade its quality. Early detection of insect attacks is essential for farmers, as timely intervention can prevent widespread damage, reduce pesticide usage, and improve production yields. Insect infestations on corn manifest through various symptoms on leaves, stems, and seeds. Among these, foliar attacks are particularly detrimental, disrupting plant growth and significantly reducing yields. Symptoms of these attacks include leaf perforations, yellowing, and white spot deposits, ultimately altering the leaf texture. To address these challenges, machine learning models offer a promising solution for early detection of foliar attacks, enabling farmers to take timely and effective action. This paper introduces a dataset focused on three major pests: Spodoptera frugiperda (Fall Armyworm), <em>Helminthosporium</em> leaf blight, and Zonocerus variegatus (Variegated Grasshopper), which are among the most frequent and destructive agents affecting corn crops. The dataset comprises images of corn leaves captured in natural environments at various growth stages and field locations. Images were taken using smartphone cameras at different times of the day, providing diverse lighting conditions, and in various fields, which introduced several background contaminations, ensuring a realistic representation of field conditions. The dataset comprises eight directories: two containing healthy leaf images (1308 without augmentation and 11,772 with augmentation), two containing manually segmented backgrounds of healthy leaves (1308 without augmentation and 11,772 with augmentation), two containing healthy leaves with CNDVI algorithm-segmented backgrounds (1308 without augmentation and 11,772 with augmentation), one containing 848 infected images with manually segmented backgrounds and highlighted infected areas, and one containing 7632 augmented versions of the infected images. This dataset serves as a valuable resource for researchers and students, providing opportunities to develop machine learning and deep learning models for corn disease detection, classification, natural image segmentation, and model interpretability and explainability. By facilitating advancements in precision agriculture and automated pest detection, the dataset contributes to sustainable agricultural practices and the broader field of agroinformatics.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"59 ","pages":"Article 111394"},"PeriodicalIF":1.4000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S235234092500126X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/14 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Corn (Zea mays), commonly referred to as Indian wheat, is a widely cultivated tropical annual herbaceous plant of the Poaceae family. It is primarily grown for its starch-rich grains and as a forage crop. In Cameroon, corn is the most consumed cereal, surpassing rice and sorghum, with an estimated production of 2.2 million tons annually. However, corn production is frequently threatened by insect infestations, which hinder crop development, reduce yields, and degrade its quality. Early detection of insect attacks is essential for farmers, as timely intervention can prevent widespread damage, reduce pesticide usage, and improve production yields. Insect infestations on corn manifest through various symptoms on leaves, stems, and seeds. Among these, foliar attacks are particularly detrimental, disrupting plant growth and significantly reducing yields. Symptoms of these attacks include leaf perforations, yellowing, and white spot deposits, ultimately altering the leaf texture. To address these challenges, machine learning models offer a promising solution for early detection of foliar attacks, enabling farmers to take timely and effective action. This paper introduces a dataset focused on three major pests: Spodoptera frugiperda (Fall Armyworm), Helminthosporium leaf blight, and Zonocerus variegatus (Variegated Grasshopper), which are among the most frequent and destructive agents affecting corn crops. The dataset comprises images of corn leaves captured in natural environments at various growth stages and field locations. Images were taken using smartphone cameras at different times of the day, providing diverse lighting conditions, and in various fields, which introduced several background contaminations, ensuring a realistic representation of field conditions. The dataset comprises eight directories: two containing healthy leaf images (1308 without augmentation and 11,772 with augmentation), two containing manually segmented backgrounds of healthy leaves (1308 without augmentation and 11,772 with augmentation), two containing healthy leaves with CNDVI algorithm-segmented backgrounds (1308 without augmentation and 11,772 with augmentation), one containing 848 infected images with manually segmented backgrounds and highlighted infected areas, and one containing 7632 augmented versions of the infected images. This dataset serves as a valuable resource for researchers and students, providing opportunities to develop machine learning and deep learning models for corn disease detection, classification, natural image segmentation, and model interpretability and explainability. By facilitating advancements in precision agriculture and automated pest detection, the dataset contributes to sustainable agricultural practices and the broader field of agroinformatics.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
迈向精准农业:玉米叶片病虫害早期检测数据集
玉米(Zea mays),通常被称为印度小麦,是一种广泛种植的禾本科科热带一年生草本植物。种植它主要是为了它富含淀粉的谷物和作为饲料作物。在喀麦隆,玉米是消费量最大的谷物,超过大米和高粱,估计年产量为220万吨。然而,玉米生产经常受到虫害的威胁,这阻碍了作物的生长,降低了产量,降低了质量。早期发现虫害对农民至关重要,因为及时干预可以防止大面积损害,减少农药使用,提高产量。昆虫对玉米的侵害表现在叶片、茎和种子上的各种症状。其中,叶面攻击尤其有害,破坏植物生长并显著降低产量。这些疾病的症状包括叶片穿孔、发黄和白斑沉积,最终改变叶片质地。为了应对这些挑战,机器学习模型为早期发现叶面攻击提供了一个有希望的解决方案,使农民能够及时有效地采取行动。本文介绍了影响玉米作物最常见和最具破坏性的三种主要害虫:秋粘虫(Spodoptera frugiperda)、Helminthosporium叶枯病(Helminthosporium leaf blight)和斑纹蚱蜢(Zonocerus variegatus)。该数据集包括在不同生长阶段和田间位置的自然环境中拍摄的玉米叶片图像。使用智能手机相机在一天中的不同时间拍摄图像,提供不同的照明条件,并在不同的领域,这引入了一些背景污染,确保了现场条件的真实表现。该数据集包括八个目录:2张包含健康叶片图像(1308张未增强,11772张增强),2张包含手动分割的健康叶片背景(1308张未增强,11772张增强),2张包含CNDVI算法分割的健康叶片背景(1308张未增强,11772张增强),1张包含848张手动分割背景和突出显示感染区域的感染图像。另一个包含7632张被感染图像的增强版本。该数据集为研究人员和学生提供了宝贵的资源,为玉米病害检测、分类、自然图像分割以及模型可解释性和可解释性提供了开发机器学习和深度学习模型的机会。通过促进精准农业和害虫自动检测的进步,该数据集有助于可持续农业实践和更广泛的农业信息学领域。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Data in Brief
Data in Brief MULTIDISCIPLINARY SCIENCES-
CiteScore
3.10
自引率
0.00%
发文量
996
审稿时长
70 days
期刊介绍: Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.
期刊最新文献
Shotgun sequencing data and SSR mining data of aibika (Abelmoschus manihot, Malvaceae) Dataset on multiregional variations of Bangla language (BD-Dialect) InclusiveHAR: A smartphone-based dataset for human activity recognition across diverse physical abilities UzbekPOS: A multi-domain dataset for Uzbek part-of-speech tagging AvianAction101: A Dataset for the dancing behavior of rose-ringed parakeets(Psittacula krameri)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1