Transmission Line Insulator Defect Detection Based on Swin Transformer and Context

IF 6.4 4区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Machine Intelligence Research Pub Date : 2023-09-15 DOI:10.1007/s11633-022-1355-y
Yu Xi, Ke Zhou, Ling-Wen Meng, Bo Chen, Hao-Min Chen, Jing-Yi Zhang
{"title":"Transmission Line Insulator Defect Detection Based on Swin Transformer and Context","authors":"Yu Xi, Ke Zhou, Ling-Wen Meng, Bo Chen, Hao-Min Chen, Jing-Yi Zhang","doi":"10.1007/s11633-022-1355-y","DOIUrl":null,"url":null,"abstract":"Insulators are important components of power transmission lines. Once a failure occurs, it may cause a large-scale blackout and other hidden dangers. Due to the large image size and complex background, detecting small defect objects is a challenge. We make improvements based on the two-stage network Faster R-convolutional neural networks (CNN). First, we use a hierarchical Swin Transformer with shifted windows as the feature extraction network, instead of ResNet, to extract more discriminative features, and then design the deformable receptive field block to encode global and local context information, which is utilized to capture key clues for detecting objects in complex backgrounds. Finally, the filling data augmentation method is proposed for the problem of insufficient defects and more images of insulator defects under different backgrounds are added to the training set to improve the robustness of the model. As a result, the recall increases from 89.5% to 92.1%, and the average precision increases from 81.0% to 87.1%. To further prove the superiority of the proposed algorithm, we also tested the model on the public data set Pascal visual object classes (VOC), which also yields outstanding results.","PeriodicalId":29727,"journal":{"name":"Machine Intelligence Research","volume":null,"pages":null},"PeriodicalIF":6.4000,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine Intelligence Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s11633-022-1355-y","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Insulators are important components of power transmission lines. Once a failure occurs, it may cause a large-scale blackout and other hidden dangers. Due to the large image size and complex background, detecting small defect objects is a challenge. We make improvements based on the two-stage network Faster R-convolutional neural networks (CNN). First, we use a hierarchical Swin Transformer with shifted windows as the feature extraction network, instead of ResNet, to extract more discriminative features, and then design the deformable receptive field block to encode global and local context information, which is utilized to capture key clues for detecting objects in complex backgrounds. Finally, the filling data augmentation method is proposed for the problem of insufficient defects and more images of insulator defects under different backgrounds are added to the training set to improve the robustness of the model. As a result, the recall increases from 89.5% to 92.1%, and the average precision increases from 81.0% to 87.1%. To further prove the superiority of the proposed algorithm, we also tested the model on the public data set Pascal visual object classes (VOC), which also yields outstanding results.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于Swin变压器和上下文的输电线路绝缘子缺陷检测
绝缘子是输电线路的重要组成部分。一旦发生故障,可能造成大面积停电等隐患。由于图像尺寸大,背景复杂,小缺陷物体的检测是一个挑战。我们在两阶段网络的基础上改进了更快的r -卷积神经网络。首先,我们使用带移位窗口的分层Swin Transformer作为特征提取网络,代替ResNet提取更多的判别特征,然后设计可变形的接受场块,对全局和局部上下文信息进行编码,用于捕获复杂背景下目标检测的关键线索。最后,针对缺陷不足的问题,提出了填充数据增强方法,并在训练集中加入更多不同背景下的绝缘子缺陷图像,提高了模型的鲁棒性。召回率从89.5%提高到92.1%,平均准确率从81.0%提高到87.1%。为了进一步证明该算法的优越性,我们还在公共数据集Pascal visual object classes (VOC)上对该模型进行了测试,同样取得了显著的效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.70
自引率
0.00%
发文量
0
期刊最新文献
Editorial for Special Issue on Multi-modal Representation Learning MOSS: An Open Conversational Large Language Model Autonomy Evaluation of Unmanned Systems Based on Task Models A Survey of Synthetic Data Augmentation Methods in Machine Vision Structural Dependence Learning Based on Self-attention for Face Alignment
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1