训练卷积神经网络来检测火车车厢中的废物

Nathan Western, X. Kong, Mustafa Erden
{"title":"训练卷积神经网络来检测火车车厢中的废物","authors":"Nathan Western, X. Kong, Mustafa Erden","doi":"10.1109/ICSIPA52582.2021.9576771","DOIUrl":null,"url":null,"abstract":"This research constitutes a systematic investigation of the effect of image view on Convolutional Neural Networks (CNNs) when trained to detect waste in train carriages. Additionally, this research identifies neural network architecture and training conditions for use in an automated train cleaning robot. Specifically, we investigate the relationship between the size of the CNN training dataset, whether these images are taken from a view sympathetic to the CNN application, and the effectiveness of the trained networks. Three datasets were constructed specifically for this research; a large dataset of 58,300 studio images of waste in a variety of conditions, a smaller dataset of 4,515 images taken of actual waste items on trains, and a dataset of 7,290 images of actual waste on trains used to test the CNNs. The images taken on trains were captured from the perspective of a hypothetical cleaning robot that would use these networks. Additionally, we provide a comparison of MobileNetV2, ShuffleNet, and SqueezeNet CNNs based on their suitability for implementation in an automated train cleaning system, and the optimum conditions to do so. Training with a smaller dataset of images taken from a “robot-eye view” resulted in an average increase in classification accuracy of 10.5%, with the largest increase being 26%, when compared to training with a larger dataset of images of waste items in various poses. ShuffleNet was identified as the optimally performing CNN for waste detection, achieving an accuracy of 88.61% when trained with a small dataset of images sympathetic to the end use. MobileNetV2 was found to perform optimally with a larger dataset of training images, even if these are less specific to the application of the network.","PeriodicalId":326688,"journal":{"name":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Training Convolutional Neural Networks to Detect Waste in Train Carriages\",\"authors\":\"Nathan Western, X. Kong, Mustafa Erden\",\"doi\":\"10.1109/ICSIPA52582.2021.9576771\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This research constitutes a systematic investigation of the effect of image view on Convolutional Neural Networks (CNNs) when trained to detect waste in train carriages. Additionally, this research identifies neural network architecture and training conditions for use in an automated train cleaning robot. Specifically, we investigate the relationship between the size of the CNN training dataset, whether these images are taken from a view sympathetic to the CNN application, and the effectiveness of the trained networks. Three datasets were constructed specifically for this research; a large dataset of 58,300 studio images of waste in a variety of conditions, a smaller dataset of 4,515 images taken of actual waste items on trains, and a dataset of 7,290 images of actual waste on trains used to test the CNNs. The images taken on trains were captured from the perspective of a hypothetical cleaning robot that would use these networks. Additionally, we provide a comparison of MobileNetV2, ShuffleNet, and SqueezeNet CNNs based on their suitability for implementation in an automated train cleaning system, and the optimum conditions to do so. Training with a smaller dataset of images taken from a “robot-eye view” resulted in an average increase in classification accuracy of 10.5%, with the largest increase being 26%, when compared to training with a larger dataset of images of waste items in various poses. ShuffleNet was identified as the optimally performing CNN for waste detection, achieving an accuracy of 88.61% when trained with a small dataset of images sympathetic to the end use. MobileNetV2 was found to perform optimally with a larger dataset of training images, even if these are less specific to the application of the network.\",\"PeriodicalId\":326688,\"journal\":{\"name\":\"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSIPA52582.2021.9576771\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIPA52582.2021.9576771","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本研究系统地研究了卷积神经网络(cnn)在训练后检测火车车厢中的废物时,图像视图对其的影响。此外,本研究确定了用于自动列车清洁机器人的神经网络架构和训练条件。具体来说,我们研究了CNN训练数据集的大小、这些图像是否取自与CNN应用相一致的视图以及训练网络的有效性之间的关系。本研究专门构建了三个数据集;一个大型数据集包含58300张不同条件下的垃圾照片,一个较小的数据集包含4515张火车上的实际垃圾照片,一个数据集包含7290张火车上的实际垃圾照片,用于测试cnn。在火车上拍摄的图像是从一个假想的清洁机器人的角度拍摄的,这个机器人将使用这些网络。此外,我们根据MobileNetV2、ShuffleNet和SqueezeNet cnn在自动列车清洁系统中的适用性和最佳条件,对它们进行了比较。与使用各种姿势的废物图像的更大数据集进行训练相比,使用从“机器眼视图”拍摄的较小图像数据集进行训练导致分类准确率平均提高10.5%,最大增幅为26%。ShuffleNet被认为是用于垃圾检测的性能最佳的CNN,当使用与最终用途一致的小图像数据集进行训练时,准确率达到了88.61%。MobileNetV2被发现在更大的训练图像数据集上表现最佳,即使这些数据集对网络应用的特异性较低。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Training Convolutional Neural Networks to Detect Waste in Train Carriages
This research constitutes a systematic investigation of the effect of image view on Convolutional Neural Networks (CNNs) when trained to detect waste in train carriages. Additionally, this research identifies neural network architecture and training conditions for use in an automated train cleaning robot. Specifically, we investigate the relationship between the size of the CNN training dataset, whether these images are taken from a view sympathetic to the CNN application, and the effectiveness of the trained networks. Three datasets were constructed specifically for this research; a large dataset of 58,300 studio images of waste in a variety of conditions, a smaller dataset of 4,515 images taken of actual waste items on trains, and a dataset of 7,290 images of actual waste on trains used to test the CNNs. The images taken on trains were captured from the perspective of a hypothetical cleaning robot that would use these networks. Additionally, we provide a comparison of MobileNetV2, ShuffleNet, and SqueezeNet CNNs based on their suitability for implementation in an automated train cleaning system, and the optimum conditions to do so. Training with a smaller dataset of images taken from a “robot-eye view” resulted in an average increase in classification accuracy of 10.5%, with the largest increase being 26%, when compared to training with a larger dataset of images of waste items in various poses. ShuffleNet was identified as the optimally performing CNN for waste detection, achieving an accuracy of 88.61% when trained with a small dataset of images sympathetic to the end use. MobileNetV2 was found to perform optimally with a larger dataset of training images, even if these are less specific to the application of the network.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Personal Protective Equipment Detection with Live Camera A Fast and Unbiased Minimalistic Resampling Approach for the Particle Filter Sparse Checkerboard Corner Detection from Global Perspective Comparison of Dental Caries Level Images Classification Performance using KNN and SVM Methods An Insight Into the Rise Time of Exponential Smoothing for Speech Enhancement Methods
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1