{"title":"基于自然语言处理的深度迁移学习模型,适用于各种表格数据集,用于预测混凝土中复合杆件的粘结强度","authors":"Pei‐Fu Zhang, Daxu Zhang, Xiao‐Ling Zhao, Xuan Zhao, Mudassir Iqbal, Yiliyaer Tuerxunmaimaiti, Qi Zhao","doi":"10.1111/mice.13357","DOIUrl":null,"url":null,"abstract":"As conventional machine learning models often struggle with scarcity and structural variation of training data, this paper proposes a novel regression transfer learning framework called transferable tabular regressor (TransTabRegressor) to address this challenge. The TransTabRegressor integrates natural language processing (NLP) for feature encoding, transformer for enhanced feature representation, and deep learning (DL) for robust modeling, facilitating effective transfer learning across tabular datasets using reducing input parameters. By leveraging the NLP data processor, the framework embeds both parameter names and values, enabling it to recognize and adapt to different expressions of similar parameters. For instance, the bond strength of fiber‐reinforced polymer (FRP) bars embedded in ultra‐high‐performance concrete (UHPC) is critical for ensuring the integrity of FRP‐UHPC structures. While pullout tests are widely adopted for their simplicity to generate substantial data, beam tests provide a closer approximation to actual stress conditions but are more complex thus resulting in limited data size. As a verification, the framework is applied to predict the bond strength of FRP bars embedded in UHPC using limited beam test data. A pre‐trained model is first established using 479 pieces of pullout test data. Subsequently, two transfer learning models are developed by fine‐tuning on 115 pieces of beam test data, where 66 correspond to concrete splitting failure and 49 correspond to pullout failure. For comparative analysis, XGBoost and neural network models are directly trained on the beam test data. Evaluation results demonstrate that the transfer learning models achieve significantly improved prediction accuracy and generalization capability. This study significantly highlights the effectiveness of the proposed TransTabRegressor in handling data scarcity and variability in input parameters across various engineering applications.","PeriodicalId":156,"journal":{"name":"Computer-Aided Civil and Infrastructure Engineering","volume":"9 1","pages":""},"PeriodicalIF":8.5000,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Natural language processing‐based deep transfer learning model across diverse tabular datasets for bond strength prediction of composite bars in concrete\",\"authors\":\"Pei‐Fu Zhang, Daxu Zhang, Xiao‐Ling Zhao, Xuan Zhao, Mudassir Iqbal, Yiliyaer Tuerxunmaimaiti, Qi Zhao\",\"doi\":\"10.1111/mice.13357\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As conventional machine learning models often struggle with scarcity and structural variation of training data, this paper proposes a novel regression transfer learning framework called transferable tabular regressor (TransTabRegressor) to address this challenge. The TransTabRegressor integrates natural language processing (NLP) for feature encoding, transformer for enhanced feature representation, and deep learning (DL) for robust modeling, facilitating effective transfer learning across tabular datasets using reducing input parameters. By leveraging the NLP data processor, the framework embeds both parameter names and values, enabling it to recognize and adapt to different expressions of similar parameters. For instance, the bond strength of fiber‐reinforced polymer (FRP) bars embedded in ultra‐high‐performance concrete (UHPC) is critical for ensuring the integrity of FRP‐UHPC structures. While pullout tests are widely adopted for their simplicity to generate substantial data, beam tests provide a closer approximation to actual stress conditions but are more complex thus resulting in limited data size. As a verification, the framework is applied to predict the bond strength of FRP bars embedded in UHPC using limited beam test data. A pre‐trained model is first established using 479 pieces of pullout test data. Subsequently, two transfer learning models are developed by fine‐tuning on 115 pieces of beam test data, where 66 correspond to concrete splitting failure and 49 correspond to pullout failure. For comparative analysis, XGBoost and neural network models are directly trained on the beam test data. Evaluation results demonstrate that the transfer learning models achieve significantly improved prediction accuracy and generalization capability. This study significantly highlights the effectiveness of the proposed TransTabRegressor in handling data scarcity and variability in input parameters across various engineering applications.\",\"PeriodicalId\":156,\"journal\":{\"name\":\"Computer-Aided Civil and Infrastructure Engineering\",\"volume\":\"9 1\",\"pages\":\"\"},\"PeriodicalIF\":8.5000,\"publicationDate\":\"2024-10-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer-Aided Civil and Infrastructure Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1111/mice.13357\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer-Aided Civil and Infrastructure Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1111/mice.13357","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Natural language processing‐based deep transfer learning model across diverse tabular datasets for bond strength prediction of composite bars in concrete
As conventional machine learning models often struggle with scarcity and structural variation of training data, this paper proposes a novel regression transfer learning framework called transferable tabular regressor (TransTabRegressor) to address this challenge. The TransTabRegressor integrates natural language processing (NLP) for feature encoding, transformer for enhanced feature representation, and deep learning (DL) for robust modeling, facilitating effective transfer learning across tabular datasets using reducing input parameters. By leveraging the NLP data processor, the framework embeds both parameter names and values, enabling it to recognize and adapt to different expressions of similar parameters. For instance, the bond strength of fiber‐reinforced polymer (FRP) bars embedded in ultra‐high‐performance concrete (UHPC) is critical for ensuring the integrity of FRP‐UHPC structures. While pullout tests are widely adopted for their simplicity to generate substantial data, beam tests provide a closer approximation to actual stress conditions but are more complex thus resulting in limited data size. As a verification, the framework is applied to predict the bond strength of FRP bars embedded in UHPC using limited beam test data. A pre‐trained model is first established using 479 pieces of pullout test data. Subsequently, two transfer learning models are developed by fine‐tuning on 115 pieces of beam test data, where 66 correspond to concrete splitting failure and 49 correspond to pullout failure. For comparative analysis, XGBoost and neural network models are directly trained on the beam test data. Evaluation results demonstrate that the transfer learning models achieve significantly improved prediction accuracy and generalization capability. This study significantly highlights the effectiveness of the proposed TransTabRegressor in handling data scarcity and variability in input parameters across various engineering applications.
期刊介绍:
Computer-Aided Civil and Infrastructure Engineering stands as a scholarly, peer-reviewed archival journal, serving as a vital link between advancements in computer technology and civil and infrastructure engineering. The journal serves as a distinctive platform for the publication of original articles, spotlighting novel computational techniques and inventive applications of computers. Specifically, it concentrates on recent progress in computer and information technologies, fostering the development and application of emerging computing paradigms.
Encompassing a broad scope, the journal addresses bridge, construction, environmental, highway, geotechnical, structural, transportation, and water resources engineering. It extends its reach to the management of infrastructure systems, covering domains such as highways, bridges, pavements, airports, and utilities. The journal delves into areas like artificial intelligence, cognitive modeling, concurrent engineering, database management, distributed computing, evolutionary computing, fuzzy logic, genetic algorithms, geometric modeling, internet-based technologies, knowledge discovery and engineering, machine learning, mobile computing, multimedia technologies, networking, neural network computing, optimization and search, parallel processing, robotics, smart structures, software engineering, virtual reality, and visualization techniques.