Rail Surface Defects Detection Based on Yolo v5 Integrated with Transformer

Qian-mao Hu, Bo Tang, Lin Jiang, Faxun Zhu, Xiaoke Zhao
{"title":"Rail Surface Defects Detection Based on Yolo v5 Integrated with Transformer","authors":"Qian-mao Hu, Bo Tang, Lin Jiang, Faxun Zhu, Xiaoke Zhao","doi":"10.1109/icet55676.2022.9824255","DOIUrl":null,"url":null,"abstract":"The traditional machine vision detection method needs to manually design the characteristics of the target, the feature expression ability is insufficient and the generalization ability is not strong. Deep learning can automatically learn high-level feature information, improve the efficiency and accuracy of image recognition, and has better adaptability and universality. Transformer abandons the structure of CNN with deep neural network mainly based on self-attention mechanism, which can be processed in parallel and has global information. This paper combines CNN with Transformer and integrates transformer’s attention mechanism into Yolo V5 network structure to detect rail surface defects. The AP (average precision) of Type-I and Type-II rail defects reached 99.5% and 97.8% respectively, and FPS (frame per second) reaches 76.92 on RSDDs dataset.","PeriodicalId":166358,"journal":{"name":"2022 IEEE 5th International Conference on Electronics Technology (ICET)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 5th International Conference on Electronics Technology (ICET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icet55676.2022.9824255","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The traditional machine vision detection method needs to manually design the characteristics of the target, the feature expression ability is insufficient and the generalization ability is not strong. Deep learning can automatically learn high-level feature information, improve the efficiency and accuracy of image recognition, and has better adaptability and universality. Transformer abandons the structure of CNN with deep neural network mainly based on self-attention mechanism, which can be processed in parallel and has global information. This paper combines CNN with Transformer and integrates transformer’s attention mechanism into Yolo V5 network structure to detect rail surface defects. The AP (average precision) of Type-I and Type-II rail defects reached 99.5% and 97.8% respectively, and FPS (frame per second) reaches 76.92 on RSDDs dataset.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于Yolo v5集成变压器的钢轨表面缺陷检测
传统的机器视觉检测方法需要人工设计目标的特征,特征表达能力不足,泛化能力不强。深度学习可以自动学习高级特征信息,提高图像识别的效率和准确性,具有较好的适应性和通用性。变压器摒弃了CNN的结构,采用以自关注机制为主的深度神经网络,可以并行处理,具有全局信息。本文将CNN与变压器相结合,将变压器的注意机制集成到Yolo V5网络结构中,检测钢轨表面缺陷。在RSDDs数据集上,i型和ii型钢轨缺陷的平均精度AP分别达到99.5%和97.8%,FPS(帧/秒)达到76.92。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Research on Tanks Combat Automatic Decision Using Multi-agent A2C Algorithm Electrical and Thermal Analyses of RF-Power GaN HEMT Devices and Layout Optimization Recognition of Catenary Mast Number in Rail Transit A Novel Dual-Polarized Millimeter Wave Filtering Antenna for 5G Applications Text Matching Model with Multi-granularity Term Alignment
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1