An enhanced network for extracting tunnel lining defects using transformer encoder and aggregate decoder

IF 7.5 1区 地球科学 Q1 Earth and Planetary Sciences International Journal of Applied Earth Observation and Geoinformation Pub Date : 2024-12-16 DOI:10.1016/j.jag.2024.104259
Bo Guo, Zhihai Huang, Haitao Luo, Perpetual Hope Akwensi, Ruisheng Wang, Bo Huang, Tsz Nam Chan
{"title":"An enhanced network for extracting tunnel lining defects using transformer encoder and aggregate decoder","authors":"Bo Guo, Zhihai Huang, Haitao Luo, Perpetual Hope Akwensi, Ruisheng Wang, Bo Huang, Tsz Nam Chan","doi":"10.1016/j.jag.2024.104259","DOIUrl":null,"url":null,"abstract":"The tunnel environment is characterized by insufficient ambient light, obstructed view, and complex inner lining construction conditions. These factors frequently result in limited anti-interference capability, reduced recognition accuracy, and suboptimal segmentation results for defect extraction. We propose a deep network model utilizing an encoder–decoder framework that integrates Transformer and convolution for comprehensive defect extraction. The proposed model utilizes an encoder that integrates a hierarchical Transformer backbone with an efficient attention mechanism to fully explore complete information at multi-scale granularities. In the decoder, multi-scale information is initially aggregated using a Multi-Layer Perceptron (MLP) module. Additionally, the Stacking Filters with Atrous Convolutions (SFAC) module are implemented to enhance the perception of the complete defect scope. Furthermore, a Boundary-aware Attention Module (BAM) is implemented to enhance edge information to improve the detection of defects. With this well-designed decoder, the multi-scale information from the encoder can be fully aggregated and exploited for complete defect detection. Experimental findings illustrate the effectiveness of our proposed approach in addressing tunnel lining defects within the image dataset. The outcomes reveal that our proposed network achieves an accuracy (Acc) of 94.4% and a mean intersection over union (mIoU) of 78.14%. Compared to state-of-the-art segmentation networks, our model improves the accuracy of tunnel lining defect extraction, showcasing enhanced extraction effectiveness and anti-interference capability, thus meeting the engineering requirements for defect detection in complex environments of tunnels.","PeriodicalId":50341,"journal":{"name":"International Journal of Applied Earth Observation and Geoinformation","volume":"1 1","pages":""},"PeriodicalIF":7.5000,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Applied Earth Observation and Geoinformation","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1016/j.jag.2024.104259","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
引用次数: 0

Abstract

The tunnel environment is characterized by insufficient ambient light, obstructed view, and complex inner lining construction conditions. These factors frequently result in limited anti-interference capability, reduced recognition accuracy, and suboptimal segmentation results for defect extraction. We propose a deep network model utilizing an encoder–decoder framework that integrates Transformer and convolution for comprehensive defect extraction. The proposed model utilizes an encoder that integrates a hierarchical Transformer backbone with an efficient attention mechanism to fully explore complete information at multi-scale granularities. In the decoder, multi-scale information is initially aggregated using a Multi-Layer Perceptron (MLP) module. Additionally, the Stacking Filters with Atrous Convolutions (SFAC) module are implemented to enhance the perception of the complete defect scope. Furthermore, a Boundary-aware Attention Module (BAM) is implemented to enhance edge information to improve the detection of defects. With this well-designed decoder, the multi-scale information from the encoder can be fully aggregated and exploited for complete defect detection. Experimental findings illustrate the effectiveness of our proposed approach in addressing tunnel lining defects within the image dataset. The outcomes reveal that our proposed network achieves an accuracy (Acc) of 94.4% and a mean intersection over union (mIoU) of 78.14%. Compared to state-of-the-art segmentation networks, our model improves the accuracy of tunnel lining defect extraction, showcasing enhanced extraction effectiveness and anti-interference capability, thus meeting the engineering requirements for defect detection in complex environments of tunnels.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于变压器编码器和聚合解码器的隧道衬砌缺陷提取网络
隧道环境具有环境光不足、视野遮挡、衬砌施工条件复杂等特点。这些因素经常导致有限的抗干扰能力,降低识别精度,以及对缺陷提取的次优分割结果。我们提出了一个深度网络模型,利用一个编码器-解码器框架,集成了变压器和卷积,以全面的缺陷提取。该模型利用一种集成了分层Transformer主干和高效关注机制的编码器来充分探索多尺度粒度的完整信息。在解码器中,使用多层感知器(MLP)模块初始聚合多尺度信息。此外,实现了带亚特罗斯卷积的堆叠滤波器(SFAC)模块,以增强对完整缺陷范围的感知。此外,采用边界感知注意模块(BAM)增强边缘信息,提高缺陷的检测效率。利用这种设计良好的解码器,可以将来自编码器的多尺度信息充分聚合并利用于完整的缺陷检测。实验结果证明了我们提出的方法在图像数据集中处理隧道衬砌缺陷的有效性。结果表明,我们提出的网络达到了94.4%的准确率(Acc)和78.14%的平均交联(mIoU)。与现有的分割网络相比,该模型提高了隧道衬砌缺陷提取的精度,增强了提取效果和抗干扰能力,满足了隧道复杂环境下缺陷检测的工程需求。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
10.20
自引率
8.00%
发文量
49
审稿时长
7.2 months
期刊介绍: The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.
期刊最新文献
Modeling the impact of pandemic on the urban thermal environment over megacities in China: Spatiotemporal analysis from the perspective of heat anomaly variations BSG-WSL: BackScatter-guided weakly supervised learning for water mapping in SAR images Identification of standing dead trees in Robinia pseudoacacia plantations across China’s Loess Plateau using multiple deep learning models Detecting glacial lake water quality indicators from RGB surveillance images via deep learning Synergistic mapping of urban tree canopy height using ICESat-2 data and GF-2 imagery
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1