Sonay Duman , Abdullah Elewi , Abdulsalam Hajhamed , Rasheed Khankan , Amina Souag , Asma Ahmed
{"title":"用于机器学习应用的带有环境背景的杏鲍菇图像注释新数据集","authors":"Sonay Duman , Abdullah Elewi , Abdulsalam Hajhamed , Rasheed Khankan , Amina Souag , Asma Ahmed","doi":"10.1016/j.dib.2024.111074","DOIUrl":null,"url":null,"abstract":"<div><div>State-of-the-art technologies such as computer vision and machine learning, are revolutionizing the smart mushroom industry by addressing diverse challenges in yield prediction, growth analysis, mushroom classification, disease and deformation detection, and digital twinning. However, mushrooms have long presented a challenge to automated systems due to their varied sizes, shapes, and surface characteristics, limiting the effectiveness of technologies aimed at mushroom classification and growth analysis. Clean and well-labelled datasets are therefore a cornerstone for developing efficient machine-learning models. Bridging this gap in oyster mushroom cultivation, we present a novel dataset comprising 555 high-quality camera raw images, from which approximately 16.000 manually annotated images were extracted. These images capture mushrooms in various shapes, maturity stages, and conditions, photographed in a greenhouse using two cameras for comprehensive coverage. Alongside the images, we recorded key environmental parameters within the mushroom greenhouse, such as temperature, relative humidity, moisture, and air quality, for a holistic analysis. This dataset is unique in providing both visual and environmental time-point data, organized into four storage folders: “Raw Images”; “Mushroom Labelled Images and Annotation Files”; “Maturity Labelled Images and Annotation Files”; and “Sensor Data”, which includes time-stamped sensor readings in Excel files. This dataset can enable researchers to develop high-quality prediction and classification machine learning models for the intelligent cultivation of oyster mushrooms. Beyond mushroom cultivation, this dataset also has the potential to be utilized in the fields of computer vision, artificial intelligence, robotics, precision agriculture, and fungal studies in general<em>.</em></div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"Article 111074"},"PeriodicalIF":1.0000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel dataset of annotated oyster mushroom images with environmental context for machine learning applications\",\"authors\":\"Sonay Duman , Abdullah Elewi , Abdulsalam Hajhamed , Rasheed Khankan , Amina Souag , Asma Ahmed\",\"doi\":\"10.1016/j.dib.2024.111074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>State-of-the-art technologies such as computer vision and machine learning, are revolutionizing the smart mushroom industry by addressing diverse challenges in yield prediction, growth analysis, mushroom classification, disease and deformation detection, and digital twinning. However, mushrooms have long presented a challenge to automated systems due to their varied sizes, shapes, and surface characteristics, limiting the effectiveness of technologies aimed at mushroom classification and growth analysis. Clean and well-labelled datasets are therefore a cornerstone for developing efficient machine-learning models. Bridging this gap in oyster mushroom cultivation, we present a novel dataset comprising 555 high-quality camera raw images, from which approximately 16.000 manually annotated images were extracted. These images capture mushrooms in various shapes, maturity stages, and conditions, photographed in a greenhouse using two cameras for comprehensive coverage. Alongside the images, we recorded key environmental parameters within the mushroom greenhouse, such as temperature, relative humidity, moisture, and air quality, for a holistic analysis. This dataset is unique in providing both visual and environmental time-point data, organized into four storage folders: “Raw Images”; “Mushroom Labelled Images and Annotation Files”; “Maturity Labelled Images and Annotation Files”; and “Sensor Data”, which includes time-stamped sensor readings in Excel files. This dataset can enable researchers to develop high-quality prediction and classification machine learning models for the intelligent cultivation of oyster mushrooms. Beyond mushroom cultivation, this dataset also has the potential to be utilized in the fields of computer vision, artificial intelligence, robotics, precision agriculture, and fungal studies in general<em>.</em></div></div>\",\"PeriodicalId\":10973,\"journal\":{\"name\":\"Data in Brief\",\"volume\":\"57 \",\"pages\":\"Article 111074\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2024-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data in Brief\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352340924010369\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340924010369","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
A novel dataset of annotated oyster mushroom images with environmental context for machine learning applications
State-of-the-art technologies such as computer vision and machine learning, are revolutionizing the smart mushroom industry by addressing diverse challenges in yield prediction, growth analysis, mushroom classification, disease and deformation detection, and digital twinning. However, mushrooms have long presented a challenge to automated systems due to their varied sizes, shapes, and surface characteristics, limiting the effectiveness of technologies aimed at mushroom classification and growth analysis. Clean and well-labelled datasets are therefore a cornerstone for developing efficient machine-learning models. Bridging this gap in oyster mushroom cultivation, we present a novel dataset comprising 555 high-quality camera raw images, from which approximately 16.000 manually annotated images were extracted. These images capture mushrooms in various shapes, maturity stages, and conditions, photographed in a greenhouse using two cameras for comprehensive coverage. Alongside the images, we recorded key environmental parameters within the mushroom greenhouse, such as temperature, relative humidity, moisture, and air quality, for a holistic analysis. This dataset is unique in providing both visual and environmental time-point data, organized into four storage folders: “Raw Images”; “Mushroom Labelled Images and Annotation Files”; “Maturity Labelled Images and Annotation Files”; and “Sensor Data”, which includes time-stamped sensor readings in Excel files. This dataset can enable researchers to develop high-quality prediction and classification machine learning models for the intelligent cultivation of oyster mushrooms. Beyond mushroom cultivation, this dataset also has the potential to be utilized in the fields of computer vision, artificial intelligence, robotics, precision agriculture, and fungal studies in general.
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.