A Real-Time Dual-Task Defect Segmentation Network for Grinding Wheels with Coordinate Attentioned-ASP and Masked Autoencoder

Machines Pub Date : 2024-04-21 DOI:10.3390/machines12040276
Yifan Li, Chuanbao Li, Ping Zhang, Han Wang
{"title":"A Real-Time Dual-Task Defect Segmentation Network for Grinding Wheels with Coordinate Attentioned-ASP and Masked Autoencoder","authors":"Yifan Li, Chuanbao Li, Ping Zhang, Han Wang","doi":"10.3390/machines12040276","DOIUrl":null,"url":null,"abstract":"The current network for the dual-task grinding wheel defect semantic segmentation lacks high-precision lightweight designs, making it challenging to balance lightweighting and segmentation accuracy, thus severely limiting its practical application in grinding wheel production lines. Additionally, recent approaches for addressing the natural class imbalance in defect segmentation fail to leverage the inexhaustible unannotated raw data on the production line, posing huge data wastage. Targeting these two issues, firstly, by discovering the similarity between Coordinate Attention (CA) and ASPP, this study has introduced a novel lightweight CA-ASP module to the DeeplabV3+, which is 45.3% smaller in parameter size and 53.2% lower in FLOPs compared to the ASPP, while achieving better segmentation precision. Secondly, we have innovatively leveraged the Masked Autoencoder (MAE) to address imbalance. By developing a new Hybrid MAE and applying it to self-supervised pretraining on tremendous unannotated data, we have significantly uplifted the network’s semantic understanding on the minority classes, which leads to further rises in both the overall accuracy and accuracy of the minorities without additional computational growth. Lastly, transfer learning has been deployed to fully utilize the highly related dual tasks. Experimental results demonstrate that the proposed methods with a real-time latency of 9.512 ms obtain a superior segmentation accuracy on the mIoU score over the compared real-time state-of-the-art methods, excelling in managing the imbalance and ensuring stability on the complicated scenes across the dual tasks.","PeriodicalId":509264,"journal":{"name":"Machines","volume":"104 8","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machines","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/machines12040276","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The current network for the dual-task grinding wheel defect semantic segmentation lacks high-precision lightweight designs, making it challenging to balance lightweighting and segmentation accuracy, thus severely limiting its practical application in grinding wheel production lines. Additionally, recent approaches for addressing the natural class imbalance in defect segmentation fail to leverage the inexhaustible unannotated raw data on the production line, posing huge data wastage. Targeting these two issues, firstly, by discovering the similarity between Coordinate Attention (CA) and ASPP, this study has introduced a novel lightweight CA-ASP module to the DeeplabV3+, which is 45.3% smaller in parameter size and 53.2% lower in FLOPs compared to the ASPP, while achieving better segmentation precision. Secondly, we have innovatively leveraged the Masked Autoencoder (MAE) to address imbalance. By developing a new Hybrid MAE and applying it to self-supervised pretraining on tremendous unannotated data, we have significantly uplifted the network’s semantic understanding on the minority classes, which leads to further rises in both the overall accuracy and accuracy of the minorities without additional computational growth. Lastly, transfer learning has been deployed to fully utilize the highly related dual tasks. Experimental results demonstrate that the proposed methods with a real-time latency of 9.512 ms obtain a superior segmentation accuracy on the mIoU score over the compared real-time state-of-the-art methods, excelling in managing the imbalance and ensuring stability on the complicated scenes across the dual tasks.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用坐标附加-ASP 和掩码自动编码器的砂轮缺陷实时双任务分割网络
目前用于双任务砂轮缺陷语义分割的网络缺乏高精度的轻量化设计,使得平衡轻量化和分割精度成为挑战,从而严重限制了其在砂轮生产线上的实际应用。此外,近期解决缺陷分割中自然类不平衡的方法未能充分利用生产线上取之不尽、用之不竭的未标注原始数据,造成了巨大的数据浪费。针对这两个问题,首先,本研究通过发现坐标注意(CA)与 ASPP 之间的相似性,在 DeeplabV3+ 中引入了新型轻量级 CA-ASP 模块,与 ASPP 相比,参数大小减少了 45.3%,FLOPs 减少了 53.2%,同时实现了更好的分割精度。其次,我们创新性地利用了掩码自动编码器(MAE)来解决不平衡问题。通过开发一种新的混合 MAE 并将其应用于对大量未标注数据的自我监督预训练,我们显著提高了网络对少数群体类别的语义理解,从而在不增加额外计算量的情况下进一步提高了整体准确率和少数群体准确率。最后,我们还利用迁移学习来充分利用高度相关的双重任务。实验结果表明,所提出的实时延迟为 9.512 毫秒的方法在 mIoU 分数上的分割精度优于所比较的最先进的实时方法,在管理不平衡方面表现出色,并确保了复杂场景中双重任务的稳定性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Study on Micro-Pit Texture Parameter Optimization and Its Tribological Properties Determination of Energy Losses of the Crank Press Mechanism Brush Seal Performance with Ideal Gas Working Fluid under Static Rotor Condition The State of Health of Electrical Connectors Dual-Arm Obstacle Avoidance Motion Planning Based on Improved RRT Algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1