Transfer learning-based encoder-decoder model with visual explanations for infrastructure crack segmentation: New open database and comprehensive evaluation

IF 8.2 1区工程技术 Q1 ENGINEERING, CIVIL Underground Space Pub Date : 2023-12-28 DOI:10.1016/j.undsp.2023.09.012

Fangyu Liu , Wenqi Ding , Yafei Qiao , Linbing Wang

{"title":"Transfer learning-based encoder-decoder model with visual explanations for infrastructure crack segmentation: New open database and comprehensive evaluation","authors":"Fangyu Liu , Wenqi Ding , Yafei Qiao , Linbing Wang","doi":"10.1016/j.undsp.2023.09.012","DOIUrl":null,"url":null,"abstract":"<div><p>Contemporary demands necessitate the swift and accurate detection of cracks in critical infrastructures, including tunnels and pavements. This study proposed a transfer learning-based encoder-decoder method with visual explanations for infrastructure crack segmentation. Firstly, a vast dataset containing 7089 images was developed, comprising diverse conditions—simple and complex crack patterns as well as clean and rough backgrounds. Secondly, leveraging transfer learning, an encoder-decoder model with visual explanations was formulated, utilizing varied pre-trained convolutional neural network (CNN) as the encoder. Visual explanations were achieved through gradient-weighted class activation mapping (Grad-CAM) to interpret the CNN segmentation model. Thirdly, accuracy, complexity (computation and model), and memory usage assessed CNN feasibility in practical engineering. Model performance was gauged via prediction and visual explanation. The investigation encompassed hyperparameters, data augmentation, deep learning from scratch vs. transfer learning, segmentation model architectures, segmentation model encoders, and encoder pre-training strategies. Results underscored transfer learning's potency in enhancing CNN accuracy for crack segmentation, surpassing deep learning from scratch. Notably, encoder classification accuracy bore no significant correlation with CNN segmentation accuracy. Among all tested models, UNet-EfficientNet_B7 excelled in crack segmentation, harmonizing accuracy, complexity, memory usage, prediction, and visual explanation.</p></div>","PeriodicalId":48505,"journal":{"name":"Underground Space","volume":"17 ","pages":"Pages 60-81"},"PeriodicalIF":8.2000,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2467967423001733/pdfft?md5=e84501be3daf3b7e67ca1c30b3bf6be5&pid=1-s2.0-S2467967423001733-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Underground Space","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2467967423001733","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}

引用次数: 0

Abstract

Contemporary demands necessitate the swift and accurate detection of cracks in critical infrastructures, including tunnels and pavements. This study proposed a transfer learning-based encoder-decoder method with visual explanations for infrastructure crack segmentation. Firstly, a vast dataset containing 7089 images was developed, comprising diverse conditions—simple and complex crack patterns as well as clean and rough backgrounds. Secondly, leveraging transfer learning, an encoder-decoder model with visual explanations was formulated, utilizing varied pre-trained convolutional neural network (CNN) as the encoder. Visual explanations were achieved through gradient-weighted class activation mapping (Grad-CAM) to interpret the CNN segmentation model. Thirdly, accuracy, complexity (computation and model), and memory usage assessed CNN feasibility in practical engineering. Model performance was gauged via prediction and visual explanation. The investigation encompassed hyperparameters, data augmentation, deep learning from scratch vs. transfer learning, segmentation model architectures, segmentation model encoders, and encoder pre-training strategies. Results underscored transfer learning's potency in enhancing CNN accuracy for crack segmentation, surpassing deep learning from scratch. Notably, encoder classification accuracy bore no significant correlation with CNN segmentation accuracy. Among all tested models, UNet-EfficientNet_B7 excelled in crack segmentation, harmonizing accuracy, complexity, memory usage, prediction, and visual explanation.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于迁移学习的编码器-解码器模型，用于基础设施裂缝分割的可视化解释：新的开放数据库和综合评估

现代需求要求对隧道和路面等重要基础设施的裂缝进行快速准确的检测。本研究提出了一种基于迁移学习的编码器-解码器方法，用于基础设施裂缝分割的可视化解释。首先，研究人员开发了一个包含 7089 幅图像的庞大数据集，其中包括各种不同的条件--简单和复杂的裂缝模式，以及干净和粗糙的背景。其次，利用迁移学习，制定了一个具有视觉解释的编码器-解码器模型，并利用各种预训练的卷积神经网络（CNN）作为编码器。通过梯度加权类激活映射（Grad-CAM）来解释 CNN 分割模型，从而实现视觉解释。第三，准确性、复杂性（计算和模型）和内存使用情况评估了 CNN 在实际工程中的可行性。模型性能通过预测和视觉解释来衡量。调查内容包括超参数、数据增强、从零开始的深度学习与迁移学习、分割模型架构、分割模型编码器和编码器预训练策略。结果凸显了迁移学习在提高裂缝分割 CNN 准确性方面的功效，超过了从头开始的深度学习。值得注意的是，编码器的分类准确性与 CNN 的分割准确性没有明显的相关性。在所有测试模型中，UNet-EfficientNet_B7 在裂缝分割方面表现出色，在准确性、复杂性、内存使用、预测和可视化解释方面都取得了协调的效果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Underground Space ENGINEERING, CIVIL-

CiteScore

10.20

自引率

14.10%

发文量

审稿时长

63 days

期刊介绍： Underground Space is an open access international journal without article processing charges (APC) committed to serving as a scientific forum for researchers and practitioners in the field of underground engineering. The journal welcomes manuscripts that deal with original theories, methods, technologies, and important applications throughout the life-cycle of underground projects, including planning, design, operation and maintenance, disaster prevention, and demolition. The journal is particularly interested in manuscripts related to the latest development of smart underground engineering from the perspectives of resilience, resources saving, environmental friendliness, humanity, and artificial intelligence. The manuscripts are expected to have significant innovation and potential impact in the field of underground engineering, and should have clear association with or application in underground projects.