Integrated pixel-level crack detection and quantification using an ensemble of advanced U-Net architectures

IF 7.9 Q1 ENGINEERING, MULTIDISCIPLINARY Results in Engineering Pub Date : 2025-03-01 Epub Date: 2024-12-11 DOI:10.1016/j.rineng.2024.103726
Rakshitha R , Srinath S , N Vinay Kumar , Rashmi S , Poornima B V
{"title":"Integrated pixel-level crack detection and quantification using an ensemble of advanced U-Net architectures","authors":"Rakshitha R ,&nbsp;Srinath S ,&nbsp;N Vinay Kumar ,&nbsp;Rashmi S ,&nbsp;Poornima B V","doi":"10.1016/j.rineng.2024.103726","DOIUrl":null,"url":null,"abstract":"<div><div>Automated pavement crack detection faces significant challenges due to the complex shapes of crack patterns, their similarity to non-crack textures, and varying environmental conditions such as lighting and noise. Traditional methods often struggle to adapt, leading to inconsistent and less accurate results in real-world scenarios. This study introduces a hybrid framework that combines convolutional and transformer-based architectures, leveraging their strengths to achieve reliable crack segmentation and pixel-level quantification. The framework incorporates state-of-the-art deep learning models, including U-Net, Attention U-Net, Residual Attention U-Net (RAUNet), TransUNet, and Swin-Unet. U-Net variants, enhanced with attention mechanisms and residual connections, improve feature extraction and gradient flow, enabling precise delineation of crack boundaries. Transformer-based models like TransUNet and Swin-Unet use self-attention mechanisms to capture both local and global spatial relationships, enhancing robustness across diverse crack patterns. A key contribution of this study is the evaluation of loss functions, including Binary Cross-Entropy (BCE) Loss, Dice Loss, and Binary Focal Loss. Binary Focal Loss proved particularly effective in addressing class imbalance across four benchmark datasets. To further improve segmentation performance, two ensemble strategies were applied: stochastic reordering using logical operations (AND, OR, and averaging) and a weighted average ensemble optimized through grid search. The weighted average ensemble demonstrated superior performance, achieving mean Intersection over Union (mIoU) scores of 0.73, 0.70, 0.78, and 0.86 on the CFD, AgileRN, Crack500, and DeepCrack datasets, respectively. In addition to segmentation, this study developed a method for accurately quantifying crack length and width. By using Euclidean distance along skeletal paths, the algorithm minimized error rates in length and width estimation. This framework provides a scalable and efficient solution for automated pavement crack analysis. It addresses critical challenges in accuracy, adaptability, and reliability under diverse operational conditions, marking significant progress in crack detection technology.</div></div>","PeriodicalId":36919,"journal":{"name":"Results in Engineering","volume":"25 ","pages":"Article 103726"},"PeriodicalIF":7.9000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Results in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590123024019698","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/11 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Automated pavement crack detection faces significant challenges due to the complex shapes of crack patterns, their similarity to non-crack textures, and varying environmental conditions such as lighting and noise. Traditional methods often struggle to adapt, leading to inconsistent and less accurate results in real-world scenarios. This study introduces a hybrid framework that combines convolutional and transformer-based architectures, leveraging their strengths to achieve reliable crack segmentation and pixel-level quantification. The framework incorporates state-of-the-art deep learning models, including U-Net, Attention U-Net, Residual Attention U-Net (RAUNet), TransUNet, and Swin-Unet. U-Net variants, enhanced with attention mechanisms and residual connections, improve feature extraction and gradient flow, enabling precise delineation of crack boundaries. Transformer-based models like TransUNet and Swin-Unet use self-attention mechanisms to capture both local and global spatial relationships, enhancing robustness across diverse crack patterns. A key contribution of this study is the evaluation of loss functions, including Binary Cross-Entropy (BCE) Loss, Dice Loss, and Binary Focal Loss. Binary Focal Loss proved particularly effective in addressing class imbalance across four benchmark datasets. To further improve segmentation performance, two ensemble strategies were applied: stochastic reordering using logical operations (AND, OR, and averaging) and a weighted average ensemble optimized through grid search. The weighted average ensemble demonstrated superior performance, achieving mean Intersection over Union (mIoU) scores of 0.73, 0.70, 0.78, and 0.86 on the CFD, AgileRN, Crack500, and DeepCrack datasets, respectively. In addition to segmentation, this study developed a method for accurately quantifying crack length and width. By using Euclidean distance along skeletal paths, the algorithm minimized error rates in length and width estimation. This framework provides a scalable and efficient solution for automated pavement crack analysis. It addresses critical challenges in accuracy, adaptability, and reliability under diverse operational conditions, marking significant progress in crack detection technology.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
集成使用先进的U-Net体系结构集成的像素级裂纹检测和量化
由于裂缝模式的复杂形状、与非裂缝纹理的相似性以及光照和噪声等环境条件的变化,自动路面裂缝检测面临着重大挑战。传统的方法往往难以适应,导致在现实场景中结果不一致、不准确。本研究引入了一个混合框架,结合了卷积和基于变压器的架构,利用它们的优势来实现可靠的裂缝分割和像素级量化。该框架结合了最先进的深度学习模型,包括U-Net、注意力U-Net、剩余注意力U-Net (RAUNet)、TransUNet和swan - unet。U-Net变体,增强了注意机制和残余连接,改进了特征提取和梯度流动,能够精确描绘裂缝边界。TransUNet和swing - unet等基于变压器的模型使用自关注机制来捕获局部和全局空间关系,增强了不同裂缝模式的鲁棒性。本研究的一个关键贡献是对损失函数的评估,包括二元交叉熵(BCE)损失、骰子损失和二元焦点损失。二元焦损被证明在解决四个基准数据集的类不平衡方面特别有效。为了进一步提高分割性能,采用了两种集成策略:使用逻辑运算(AND、OR和平均)进行随机重排序,以及通过网格搜索优化加权平均集成。加权平均集成表现出优异的性能,在CFD、AgileRN、Crack500和DeepCrack数据集上,平均mIoU得分分别为0.73、0.70、0.78和0.86。除了分割外,本研究还开发了一种精确量化裂缝长度和宽度的方法。通过在骨架路径上使用欧几里得距离,最小化了长度和宽度估计的错误率。该框架为路面裂缝自动分析提供了一个可扩展的、高效的解决方案。它解决了在不同操作条件下精度、适应性和可靠性方面的关键挑战,标志着裂缝检测技术的重大进步。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Results in Engineering
Results in Engineering Engineering-Engineering (all)
CiteScore
5.80
自引率
34.00%
发文量
441
审稿时长
47 days
期刊最新文献
Meshless Local Petrov–Galerkin Analysis of Hydro elastic Sloshing Frequency Tuning in Type-V Composite Tanks with CFRP Perforated Baffles Study on optimization of layout and timing of destress borehole in excavation roadways A deep learning based model for aluminum agglomeration in solid propellant Development and characterization of post-consumer diaper waste reinforced epoxy composite: A circular economy approach to municipal solid waste management YOLOv8n-3SE-PD: A lightweight model for small object detection in smart vehicle edge sensing
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1