利用新型开放数据集推进基于深度学习的漂浮垃圾检测

IF 2.6 Q2 WATER RESOURCES Frontiers in Water Pub Date : 2023-12-06 DOI:10.3389/frwa.2023.1298465

Tianlong Jia, Andre Jehan Vallendar, Rinze de Vries, Z. Kapelan, R. Taormina

{"title":"利用新型开放数据集推进基于深度学习的漂浮垃圾检测","authors":"Tianlong Jia, Andre Jehan Vallendar, Rinze de Vries, Z. Kapelan, R. Taormina","doi":"10.3389/frwa.2023.1298465","DOIUrl":null,"url":null,"abstract":"Supervised Deep Learning (DL) methods have shown promise in monitoring the floating litter in rivers and urban canals but further advancements are hard to obtain due to the limited availability of relevant labeled data. To address this challenge, researchers often utilize techniques such as transfer learning (TL) and data augmentation (DA). However, there is no study currently reporting a rigorous evaluation of the effectiveness of these approaches for floating litter detection and their effects on the models' generalization capability. To overcome the problem of limited data availability, this work introduces the “TU Delft—Green Village” dataset, a novel labeled dataset of 9,473 camera and phone images of floating macroplastic litter and other litter items, captured using experiments in a drainage canal of TU Delft. We use the new dataset to conduct a thorough evaluation of the detection performance of five DL architectures for multi-class image classification. We focus the analysis on a systematic evaluation of the benefits of TL and DA on model performances. Moreover, we evaluate the generalization capability of these models for unseen litter items and new device settings, such as increasing the cameras' height and tilting them to 45°. The results obtained show that, for the specific problem of floating litter detection, fine-tuning all layers is more effective than the common approach of fine-tuning the classifier alone. Among the tested DA techniques, we find that simple image flipping boosts model accuracy the most, while other methods have little impact on the performance. The SqueezeNet and DenseNet121 architectures perform the best, achieving an overall accuracy of 89.6 and 91.7%, respectively. We also observe that both models retain good generalization capability which drops significantly only for the most complex scenario tested, but the overall accuracy raises significantly to around 75% when adding a limited amount of images to training data, combined with flipping augmentation. The detailed analyses conducted here and the released open source dataset offer valuable insights and serve as a precious resource for future research.","PeriodicalId":33801,"journal":{"name":"Frontiers in Water","volume":"59 8","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2023-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Advancing deep learning-based detection of floating litter using a novel open dataset\",\"authors\":\"Tianlong Jia, Andre Jehan Vallendar, Rinze de Vries, Z. Kapelan, R. Taormina\",\"doi\":\"10.3389/frwa.2023.1298465\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Supervised Deep Learning (DL) methods have shown promise in monitoring the floating litter in rivers and urban canals but further advancements are hard to obtain due to the limited availability of relevant labeled data. To address this challenge, researchers often utilize techniques such as transfer learning (TL) and data augmentation (DA). However, there is no study currently reporting a rigorous evaluation of the effectiveness of these approaches for floating litter detection and their effects on the models' generalization capability. To overcome the problem of limited data availability, this work introduces the “TU Delft—Green Village” dataset, a novel labeled dataset of 9,473 camera and phone images of floating macroplastic litter and other litter items, captured using experiments in a drainage canal of TU Delft. We use the new dataset to conduct a thorough evaluation of the detection performance of five DL architectures for multi-class image classification. We focus the analysis on a systematic evaluation of the benefits of TL and DA on model performances. Moreover, we evaluate the generalization capability of these models for unseen litter items and new device settings, such as increasing the cameras' height and tilting them to 45°. The results obtained show that, for the specific problem of floating litter detection, fine-tuning all layers is more effective than the common approach of fine-tuning the classifier alone. Among the tested DA techniques, we find that simple image flipping boosts model accuracy the most, while other methods have little impact on the performance. The SqueezeNet and DenseNet121 architectures perform the best, achieving an overall accuracy of 89.6 and 91.7%, respectively. We also observe that both models retain good generalization capability which drops significantly only for the most complex scenario tested, but the overall accuracy raises significantly to around 75% when adding a limited amount of images to training data, combined with flipping augmentation. The detailed analyses conducted here and the released open source dataset offer valuable insights and serve as a precious resource for future research.\",\"PeriodicalId\":33801,\"journal\":{\"name\":\"Frontiers in Water\",\"volume\":\"59 8\",\"pages\":\"\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2023-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Water\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/frwa.2023.1298465\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"WATER RESOURCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Water","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frwa.2023.1298465","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"WATER RESOURCES","Score":null,"Total":0}

引用次数: 0

摘要

监督深度学习(DL)方法在监测河流和城市运河中的漂浮垃圾方面显示出了希望，但由于相关标记数据的可用性有限，很难取得进一步的进展。为了应对这一挑战，研究人员经常使用迁移学习(TL)和数据增强(DA)等技术。然而，目前还没有研究报告严格评估这些方法对漂浮垃圾检测的有效性及其对模型泛化能力的影响。为了克服数据可用性有限的问题，本工作引入了“代尔夫特理工大学-绿色村庄”数据集，这是一个新颖的标记数据集，包含9,473张漂浮的宏观塑料垃圾和其他垃圾的相机和手机图像，这些图像是通过在代尔夫特理工大学的排水沟中进行实验捕获的。我们使用新的数据集对五种用于多类图像分类的深度学习架构的检测性能进行了全面的评估。我们重点分析了TL和DA对模型性能的好处的系统评估。此外，我们评估了这些模型对于看不见的垃圾和新设备设置的泛化能力，例如增加相机的高度并将它们倾斜到45°。结果表明，对于漂浮垃圾检测这一特定问题，对所有层进行微调比对分类器单独进行微调的常见方法更有效。在经过测试的数据分析技术中，我们发现简单的图像翻转对模型精度的提高最大，而其他方法对性能的影响很小。SqueezeNet和DenseNet121架构表现最好，总体准确率分别达到89.6和91.7%。我们还观察到，这两种模型都保持了良好的泛化能力，只有在最复杂的场景测试中才会显著下降，但当向训练数据中添加有限数量的图像并结合翻转增强时，总体准确率显著提高到75%左右。这里进行的详细分析和发布的开源数据集提供了有价值的见解，并作为未来研究的宝贵资源。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Advancing deep learning-based detection of floating litter using a novel open dataset

Supervised Deep Learning (DL) methods have shown promise in monitoring the floating litter in rivers and urban canals but further advancements are hard to obtain due to the limited availability of relevant labeled data. To address this challenge, researchers often utilize techniques such as transfer learning (TL) and data augmentation (DA). However, there is no study currently reporting a rigorous evaluation of the effectiveness of these approaches for floating litter detection and their effects on the models' generalization capability. To overcome the problem of limited data availability, this work introduces the “TU Delft—Green Village” dataset, a novel labeled dataset of 9,473 camera and phone images of floating macroplastic litter and other litter items, captured using experiments in a drainage canal of TU Delft. We use the new dataset to conduct a thorough evaluation of the detection performance of five DL architectures for multi-class image classification. We focus the analysis on a systematic evaluation of the benefits of TL and DA on model performances. Moreover, we evaluate the generalization capability of these models for unseen litter items and new device settings, such as increasing the cameras' height and tilting them to 45°. The results obtained show that, for the specific problem of floating litter detection, fine-tuning all layers is more effective than the common approach of fine-tuning the classifier alone. Among the tested DA techniques, we find that simple image flipping boosts model accuracy the most, while other methods have little impact on the performance. The SqueezeNet and DenseNet121 architectures perform the best, achieving an overall accuracy of 89.6 and 91.7%, respectively. We also observe that both models retain good generalization capability which drops significantly only for the most complex scenario tested, but the overall accuracy raises significantly to around 75% when adding a limited amount of images to training data, combined with flipping augmentation. The detailed analyses conducted here and the released open source dataset offer valuable insights and serve as a precious resource for future research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Frontiers in Water WATER RESOURCES-

CiteScore

4.00

自引率

6.90%

发文量

224

审稿时长

13 weeks