{"title":"为精准农业中的深度学习任务制定严格的数据集质量标准:案例研究探索","authors":"A. Carraro , G. Saurio , F. Marinello","doi":"10.1016/j.atech.2024.100721","DOIUrl":null,"url":null,"abstract":"<div><div>Deep Learning (DL) through Convolutional Neural Networks (CNNs) has emerged as a critical player in classifying plant diseases from images. This prominence has intensified the demand for a substantial volume of annotated training data. However, acquiring such data is costly and intricate, fraught with subtle challenges. In the domain of plants, where data collection can be even more complex, this study scrutinises how one dataset was gathered. Specifically, it delves into the nuances of collecting images of grapevine leaves in an open field for a binary classification task, discerning the presence or absence of Esca disease.</div><div>Adherence to rigorous dataset quality standards during image collection is paramount in precision agriculture. Errors made in this phase can have devastating repercussions on all subsequent work. For instance, collections of photos may exhibit a consistent disparity in background characteristics between images belonging to different classes. This persistent difference can lead a deep-learning algorithm to learn undesired correlations, even though the algorithm's performances are excellent because the train and test sets possess the same kind of disparity.</div></div>","PeriodicalId":74813,"journal":{"name":"Smart agricultural technology","volume":"10 ","pages":"Article 100721"},"PeriodicalIF":6.3000,"publicationDate":"2024-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards rigorous dataset quality standards for deep learning tasks in precision agriculture: A case study exploration\",\"authors\":\"A. Carraro , G. Saurio , F. Marinello\",\"doi\":\"10.1016/j.atech.2024.100721\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Deep Learning (DL) through Convolutional Neural Networks (CNNs) has emerged as a critical player in classifying plant diseases from images. This prominence has intensified the demand for a substantial volume of annotated training data. However, acquiring such data is costly and intricate, fraught with subtle challenges. In the domain of plants, where data collection can be even more complex, this study scrutinises how one dataset was gathered. Specifically, it delves into the nuances of collecting images of grapevine leaves in an open field for a binary classification task, discerning the presence or absence of Esca disease.</div><div>Adherence to rigorous dataset quality standards during image collection is paramount in precision agriculture. Errors made in this phase can have devastating repercussions on all subsequent work. For instance, collections of photos may exhibit a consistent disparity in background characteristics between images belonging to different classes. This persistent difference can lead a deep-learning algorithm to learn undesired correlations, even though the algorithm's performances are excellent because the train and test sets possess the same kind of disparity.</div></div>\",\"PeriodicalId\":74813,\"journal\":{\"name\":\"Smart agricultural technology\",\"volume\":\"10 \",\"pages\":\"Article 100721\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2024-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Smart agricultural technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772375524003253\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURAL ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart agricultural technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772375524003253","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
Towards rigorous dataset quality standards for deep learning tasks in precision agriculture: A case study exploration
Deep Learning (DL) through Convolutional Neural Networks (CNNs) has emerged as a critical player in classifying plant diseases from images. This prominence has intensified the demand for a substantial volume of annotated training data. However, acquiring such data is costly and intricate, fraught with subtle challenges. In the domain of plants, where data collection can be even more complex, this study scrutinises how one dataset was gathered. Specifically, it delves into the nuances of collecting images of grapevine leaves in an open field for a binary classification task, discerning the presence or absence of Esca disease.
Adherence to rigorous dataset quality standards during image collection is paramount in precision agriculture. Errors made in this phase can have devastating repercussions on all subsequent work. For instance, collections of photos may exhibit a consistent disparity in background characteristics between images belonging to different classes. This persistent difference can lead a deep-learning algorithm to learn undesired correlations, even though the algorithm's performances are excellent because the train and test sets possess the same kind of disparity.