基于特定领域和任务感知预调优方法的桥梁检测少镜头机器阅读理解

IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Engineering Applications of Artificial Intelligence Pub Date : 2025-05-01 Epub Date: 2025-02-22 DOI:10.1016/j.engappai.2025.110361
Ren Li , Luyi Zhang , Qiao Xiao , Jianxi Yang , Yu Chen , Shixin Jiang , Di Wang
{"title":"基于特定领域和任务感知预调优方法的桥梁检测少镜头机器阅读理解","authors":"Ren Li ,&nbsp;Luyi Zhang ,&nbsp;Qiao Xiao ,&nbsp;Jianxi Yang ,&nbsp;Yu Chen ,&nbsp;Shixin Jiang ,&nbsp;Di Wang","doi":"10.1016/j.engappai.2025.110361","DOIUrl":null,"url":null,"abstract":"<div><div>With the wide application of information technologies in the field of bridge engineering, many electronic bridge inspection reports have been generated. However, due to insufficient research on machine reading comprehension (MRC) in this field, a lot of bridge inspection information, e.g., structural basic data, inspected defects, and maintenance suggestions, has not been fully used. Especially, it is time-consuming and labor-intensive to pre-train a domain-specific language model from scratch or annotate large-scale question answering corpora, which also brings challenges to the MRC research in this field. To tackle the problems, this paper proposes a novel few-shot MRC approach for bridge inspection based on the idea of data augmentation. The proposed model uses a pre-trained model as backbone, along with introducing a pre-tuning stage to bridge the gaps between general-purpose pre-training and domain-specific MRC tasks. In order to reduce the workload of manual annotation, we present a novel pre-tuning data generation algorithm which is based on the domain-specific question classification and answer prediction neural models. After pre-tuning and fine-tuning, the proposed model achieves efficient bridge inspection MRC. The experimental results show that the proposed model outperforms the mainstream fine-tuning-based approaches and few-shot MRC baseline models in various settings. With 1024 fine-tuning samples, the F1 value and Exact Match (EM) value are 86.42%, 74.65%, respectively. Our research work can serve as a foundation for the construction of automatic question answering systems for intelligent bridge management and maintenance.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"147 ","pages":"Article 110361"},"PeriodicalIF":8.0000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Few-shot machine reading comprehension for bridge inspection via domain-specific and task-aware pre-tuning approach\",\"authors\":\"Ren Li ,&nbsp;Luyi Zhang ,&nbsp;Qiao Xiao ,&nbsp;Jianxi Yang ,&nbsp;Yu Chen ,&nbsp;Shixin Jiang ,&nbsp;Di Wang\",\"doi\":\"10.1016/j.engappai.2025.110361\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With the wide application of information technologies in the field of bridge engineering, many electronic bridge inspection reports have been generated. However, due to insufficient research on machine reading comprehension (MRC) in this field, a lot of bridge inspection information, e.g., structural basic data, inspected defects, and maintenance suggestions, has not been fully used. Especially, it is time-consuming and labor-intensive to pre-train a domain-specific language model from scratch or annotate large-scale question answering corpora, which also brings challenges to the MRC research in this field. To tackle the problems, this paper proposes a novel few-shot MRC approach for bridge inspection based on the idea of data augmentation. The proposed model uses a pre-trained model as backbone, along with introducing a pre-tuning stage to bridge the gaps between general-purpose pre-training and domain-specific MRC tasks. In order to reduce the workload of manual annotation, we present a novel pre-tuning data generation algorithm which is based on the domain-specific question classification and answer prediction neural models. After pre-tuning and fine-tuning, the proposed model achieves efficient bridge inspection MRC. The experimental results show that the proposed model outperforms the mainstream fine-tuning-based approaches and few-shot MRC baseline models in various settings. With 1024 fine-tuning samples, the F1 value and Exact Match (EM) value are 86.42%, 74.65%, respectively. Our research work can serve as a foundation for the construction of automatic question answering systems for intelligent bridge management and maintenance.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"147 \",\"pages\":\"Article 110361\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625003616\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/22 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625003616","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/22 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

随着信息技术在桥梁工程领域的广泛应用,产生了许多电子桥梁检测报告。然而,由于机器阅读理解(MRC)在这一领域的研究不足,许多桥梁检测信息,如结构基础数据、检测缺陷、维修建议等都没有得到充分利用。特别是,从头开始预训练特定领域的语言模型或对大规模问答语料库进行标注是费时费力的,这也给该领域的MRC研究带来了挑战。为了解决这些问题,本文提出了一种基于数据增强思想的桥梁检测少镜头MRC方法。提出的模型使用预训练模型作为主干,同时引入预调优阶段,以弥合通用预训练和特定领域的MRC任务之间的差距。为了减少人工标注的工作量,提出了一种基于特定领域问题分类和答案预测神经模型的预调优数据生成算法。该模型经过预整和微调,实现了高效的桥梁检测MRC。实验结果表明,该模型在各种情况下都优于主流的基于微调的方法和少镜头MRC基线模型。在1024个微调样本中,F1值为86.42%,EM值为74.65%。本文的研究工作可为智能桥梁管理与维修自动答疑系统的建设奠定基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Few-shot machine reading comprehension for bridge inspection via domain-specific and task-aware pre-tuning approach
With the wide application of information technologies in the field of bridge engineering, many electronic bridge inspection reports have been generated. However, due to insufficient research on machine reading comprehension (MRC) in this field, a lot of bridge inspection information, e.g., structural basic data, inspected defects, and maintenance suggestions, has not been fully used. Especially, it is time-consuming and labor-intensive to pre-train a domain-specific language model from scratch or annotate large-scale question answering corpora, which also brings challenges to the MRC research in this field. To tackle the problems, this paper proposes a novel few-shot MRC approach for bridge inspection based on the idea of data augmentation. The proposed model uses a pre-trained model as backbone, along with introducing a pre-tuning stage to bridge the gaps between general-purpose pre-training and domain-specific MRC tasks. In order to reduce the workload of manual annotation, we present a novel pre-tuning data generation algorithm which is based on the domain-specific question classification and answer prediction neural models. After pre-tuning and fine-tuning, the proposed model achieves efficient bridge inspection MRC. The experimental results show that the proposed model outperforms the mainstream fine-tuning-based approaches and few-shot MRC baseline models in various settings. With 1024 fine-tuning samples, the F1 value and Exact Match (EM) value are 86.42%, 74.65%, respectively. Our research work can serve as a foundation for the construction of automatic question answering systems for intelligent bridge management and maintenance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
期刊最新文献
Novel artificial intelligence driven model-based control framework for solar–electric vehicle home energy optimization Exploring semantic dependency for reasoning over temporal knowledge graph Enhancing the safety assessment of open-pit mine slopes with interpretable, data-driven stacking learning and three-dimensional stability analysis Dual-domain data enhancement and lightweight deep architecture for robust powder bed defect detection Modeling function-level relationships for vulnerability detection in graph neural networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1