Transferable Learning Architecture for Scalable Visual Quality Inspection

M. Talha, Sheikh Faisal Rashid, Zain Iftikhar, Muhammad Touseef Afzal, Liu Ying
{"title":"Transferable Learning Architecture for Scalable Visual Quality Inspection","authors":"M. Talha, Sheikh Faisal Rashid, Zain Iftikhar, Muhammad Touseef Afzal, Liu Ying","doi":"10.1109/ICAI55435.2022.9773637","DOIUrl":null,"url":null,"abstract":"In recent years, convolutional neural networks (CNNs) have become a de facto standard in computer vision for object detection and recognition. At present, CNNs have been used in many application areas including the automation of industrial manufacturing processes. But using CNN in a real-time environment to track defects on products has many shortcomings like long training time, large data requirements, slow inference time, dynamic environment, and hardware dependency. This paper evaluates the state-of-the-art CNN architectures for object detection to address the mentioned challenges and provide the best possible solution. A set of pre-trained models has been trained on just 781 annotated images by applying transfer learning. Experimental results showed that Faster RCNN with VGG-16 backbone outperforms the other models in case of accuracy and mAP. But RetinaNet with an FPN backbone has the fastest inference time on multi-scaled defects. Paper also presents the deployment pipeline for inference on mobile devices to use in a real-time environment without any special hardware. In addition, an improved dataset of submersible pump impellers, based on the existing Kaggle dataset is introduced.","PeriodicalId":146842,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence (ICAI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Artificial Intelligence (ICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAI55435.2022.9773637","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In recent years, convolutional neural networks (CNNs) have become a de facto standard in computer vision for object detection and recognition. At present, CNNs have been used in many application areas including the automation of industrial manufacturing processes. But using CNN in a real-time environment to track defects on products has many shortcomings like long training time, large data requirements, slow inference time, dynamic environment, and hardware dependency. This paper evaluates the state-of-the-art CNN architectures for object detection to address the mentioned challenges and provide the best possible solution. A set of pre-trained models has been trained on just 781 annotated images by applying transfer learning. Experimental results showed that Faster RCNN with VGG-16 backbone outperforms the other models in case of accuracy and mAP. But RetinaNet with an FPN backbone has the fastest inference time on multi-scaled defects. Paper also presents the deployment pipeline for inference on mobile devices to use in a real-time environment without any special hardware. In addition, an improved dataset of submersible pump impellers, based on the existing Kaggle dataset is introduced.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
可扩展视觉质量检测的可转移学习架构
近年来,卷积神经网络(cnn)已经成为计算机视觉对象检测和识别的事实上的标准。目前,cnn已经应用于包括工业制造过程自动化在内的许多应用领域。但在实时环境下使用CNN跟踪产品缺陷存在训练时间长、数据需求大、推理时间慢、环境动态性强、依赖硬件等缺点。本文评估了用于目标检测的最先进的CNN架构,以解决上述挑战并提供最佳解决方案。通过应用迁移学习,一组预训练模型仅在781张带注释的图像上进行了训练。实验结果表明,采用VGG-16骨干网的Faster RCNN在准确率和mAP方面都优于其他模型。而采用FPN骨干网的retanet对多尺度缺陷的推理速度最快。本文还介绍了在移动设备上进行推理的部署管道,以便在不需要任何特殊硬件的情况下在实时环境中使用。此外,在现有Kaggle数据集的基础上,介绍了一种改进的潜水泵叶轮数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
LSTM-based Model for Forecasting of COVID-19 Vaccines in Pakistan Adaptive Neural-Sliding Mode Control of a Quadrotor Vehicle with Uncertainties and Disturbances Compensation A Force Myography based HMI for Classification of Upper Extremity Gestures Evolving computationally efficient prediction model for Stock Volatility using CGPANN A Review on Different Approaches for Assessing Student Attentiveness in Classroom using Behavioural Elements
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1