Multi-modal deep fusion for bridge condition assessment

Mozhgan Momtaz , Tianshu Li , Devin K. Harris , David Lattanzi
{"title":"Multi-modal deep fusion for bridge condition assessment","authors":"Mozhgan Momtaz ,&nbsp;Tianshu Li ,&nbsp;Devin K. Harris ,&nbsp;David Lattanzi","doi":"10.1016/j.iintel.2023.100061","DOIUrl":null,"url":null,"abstract":"<div><p>Bridge condition rating is a challenging task as it largely depends on the experience-level of the manual inspection and therefore is prone to human errors. The inspection report often consists of a collection of images and sequences of sentences (text) explaining the condition of the considered bridge. In a routine manual bridge inspection, an inspector collects a set of images and textual descriptions of bridge components and assigns an overall condition rating (ranging between 0 and 9) based on the collected information. Unfortunately, this method of bridge inspection has been shown to yield inconsistent condition ratings that correlate with inspector experience. To improve the consistency among image-text inspection data and further predict the accordant condition ratings, this study first provides a collective image-text dataset, extracted from the collection of bridge inspection reports from the Virginia Department of Transportation. Using this dataset, we have developed novel deep learning-base methods for an automatic bridge condition rating prediction based on data fusion between the textual and visual data from the collected report sets.</p><p>Our proposed multi modal deep fusion approach constructs visual and textual representations for images and sentences separately using appropriate encoding functions, and then fuses representations of images and text to enhance the multi-modal prediction performance of the assigned condition ratings. Moreover, we study interpretations of the deployed deep models using saliency maps to identify parts of the image-text inputs that are essential in condition rating predictions. The findings of this study point to potential improvements by leveraging consistent image-text inspection data collection as well as leveraging the proposed deep fusion model to improve the bridge condition prediction rating from both visual and textual reports.</p></div>","PeriodicalId":100791,"journal":{"name":"Journal of Infrastructure Intelligence and Resilience","volume":"2 4","pages":"Article 100061"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Infrastructure Intelligence and Resilience","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772991523000361","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Bridge condition rating is a challenging task as it largely depends on the experience-level of the manual inspection and therefore is prone to human errors. The inspection report often consists of a collection of images and sequences of sentences (text) explaining the condition of the considered bridge. In a routine manual bridge inspection, an inspector collects a set of images and textual descriptions of bridge components and assigns an overall condition rating (ranging between 0 and 9) based on the collected information. Unfortunately, this method of bridge inspection has been shown to yield inconsistent condition ratings that correlate with inspector experience. To improve the consistency among image-text inspection data and further predict the accordant condition ratings, this study first provides a collective image-text dataset, extracted from the collection of bridge inspection reports from the Virginia Department of Transportation. Using this dataset, we have developed novel deep learning-base methods for an automatic bridge condition rating prediction based on data fusion between the textual and visual data from the collected report sets.

Our proposed multi modal deep fusion approach constructs visual and textual representations for images and sentences separately using appropriate encoding functions, and then fuses representations of images and text to enhance the multi-modal prediction performance of the assigned condition ratings. Moreover, we study interpretations of the deployed deep models using saliency maps to identify parts of the image-text inputs that are essential in condition rating predictions. The findings of this study point to potential improvements by leveraging consistent image-text inspection data collection as well as leveraging the proposed deep fusion model to improve the bridge condition prediction rating from both visual and textual reports.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于多模态深度融合的桥梁状态评估
桥梁状态评定是一项具有挑战性的任务,因为它在很大程度上取决于人工检查的经验水平,因此容易出现人为错误。检查报告通常由一组图像和一系列句子(文本)组成,说明所考虑的桥梁的状况。在常规的人工桥梁巡检中,检查员收集一组桥梁部件的图像和文字描述,并根据收集到的信息给出一个整体的状态等级(范围为0到9)。不幸的是,这种桥梁检查方法已被证明产生与检查员经验相关的不一致的状态评级。为了提高图像-文本检查数据之间的一致性,并进一步预测相应的状况评级,本研究首先提供了一个集体图像-文本数据集,该数据集提取自弗吉尼亚州交通部的桥梁检查报告集合。利用该数据集,我们开发了一种新颖的基于深度学习的方法,用于基于收集的报告集的文本数据和视觉数据之间的数据融合的桥梁状况自动预测。我们提出的多模态深度融合方法使用适当的编码函数分别构建图像和句子的视觉和文本表示,然后融合图像和文本的表示,以提高指定条件评级的多模态预测性能。此外,我们研究了使用显著性图对部署的深度模型的解释,以识别在状态评级预测中必不可少的图像-文本输入部分。这项研究的结果指出了利用一致的图像-文本检测数据收集以及利用所提出的深度融合模型来提高视觉和文本报告的桥梁状态预测评级的潜在改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.10
自引率
0.00%
发文量
0
期刊最新文献
Review on optimization strategies of probabilistic diagnostic imaging methods An integrated management system (IMS) approach to sustainable construction development and management Quantitative risk analysis of road transportation of hazardous materials in coastal areas Multimodal vortex-induced vibration mitigation and design approach of bistable nonlinear energy sink inerter on bridge structure Enhanced operational modal analysis and change point detection for vibration-based structural health monitoring of bridges
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1