Title: A pyramid auxiliary supervised U-Net model for road crack detection with dual-attention mechanism
Authors: Yingxiang Lu, Guangyuan Zhang, Shukai Duan, Feng Chen
Journal: Displays, volume 84, Article 102787 (impact factor 3.7; JCR Q1, Computer Science, Hardware & Architecture)
DOI: 10.1016/j.displa.2024.102787
Published: 2024-06-27
Publication type: Journal Article
URL: https://www.sciencedirect.com/science/article/pii/S0141938224001513
Citation count: 0
Abstract
Road crack detection plays a pivotal role in transportation infrastructure management. However, the diversity of crack morphologies within images and the complexity of background noise still pose significant challenges to automated detection. Deep learning models therefore need more precise feature extraction and greater resistance to noise interference. In this paper, we propose a pyramid auxiliary supervised U-Net model with a dual-attention mechanism. A pyramid auxiliary supervision module is integrated into the U-Net model to alleviate the information loss that pooling operations cause at the encoder, thereby enhancing the model's global perception capability. In addition, within the dual-attention module, the model learns crucial segmentation features at both the pixel and channel levels. Together, these modules enable the model to resist noise interference and achieve higher precision in crack pixel segmentation. To assess the performance and generalizability of the model, we conducted a comprehensive evaluation on public datasets. The experimental results indicate that the model surpasses current leading methods. Additionally, we performed ablation studies to confirm the efficacy of the proposed modules.
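The abstract does not give the internals of the dual-attention module, so the following is only a minimal sketch of the general idea of weighting features at the channel level and at the pixel level. It assumes a squeeze-and-excitation style channel gate followed by a per-pixel gate; the function names, the sequential ordering, the reduction ratio `r`, and the random weights are all hypothetical and are not taken from the paper.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feat, w1, w2):
    """Scale each channel of feat (C, H, W) by a learned weight in (0, 1)."""
    squeezed = feat.mean(axis=(1, 2))          # (C,) global average pool
    hidden = np.maximum(w1 @ squeezed, 0.0)    # (C // r,) ReLU bottleneck
    gate = sigmoid(w2 @ hidden)                # (C,) one weight per channel
    return feat * gate[:, None, None]

def pixel_attention(feat, w_pix):
    """Scale each spatial location of feat (C, H, W) by a weight in (0, 1)."""
    # A 1x1 projection over channels gives one logit per pixel.
    logits = np.tensordot(w_pix, feat, axes=([0], [0]))  # (H, W)
    gate = sigmoid(logits)
    return feat * gate[None, :, :]

def dual_attention(feat, w1, w2, w_pix):
    # Assumption: channel gate first, then pixel gate. The paper may
    # combine the two branches differently (for example, in parallel).
    return pixel_attention(channel_attention(feat, w1, w2), w_pix)

# Toy feature map and randomly initialised (untrained) weights.
rng = np.random.default_rng(0)
C, H, W, r = 8, 16, 16, 4
feat = rng.standard_normal((C, H, W))
w1 = rng.standard_normal((C // r, C)) * 0.1
w2 = rng.standard_normal((C, C // r)) * 0.1
w_pix = rng.standard_normal(C) * 0.1

out = dual_attention(feat, w1, w2, w_pix)
print(out.shape)
```

Because both gates lie in (0, 1), the module can only attenuate activations, which is one plausible way to suppress background noise while keeping responses on crack pixels. In a trained network the weights would be learned jointly with the U-Net rather than sampled at random.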
About the journal:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including the display-human interface.
Technical papers on practical developments in display technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals, intended for display technologists and human factors engineers new to the field, will also occasionally be featured.