Yupeng Cao, Wei Luo, Yadong Xue, Weiren Lin, Feng Zhang
Title: Model-based offline reinforcement learning framework for optimizing tunnel boring machine operation
Journal: Underground Space, Volume 19, Pages 47-71 (Journal Article, published 2024-05-18)
DOI: 10.1016/j.undsp.2024.01.008
URL: https://www.sciencedirect.com/science/article/pii/S2467967424000540
Impact factor: 8.2; JCR: Q1 (ENGINEERING, CIVIL)
Research on the automation and intelligent operation of tunnel boring machines (TBMs) is receiving increasing attention, benefiting from the growing volume of construction data. However, most models for optimizing TBM operation have been trained on labels derived from human drivers' decisions, which are subjective and stochastic. As a result, the control parameters suggested by these models can hardly surpass the performance of a human driver, and may even reproduce subjectively incorrect decisions. Considering that the geomechanical feedback to the TBM under the drivers' actions is objective, this paper proposes a transformer-based model, called the geological response for tunnel boring machine (GRTBM), to learn the relationship between operational adjustments and changes in TBM monitoring data. Building on this model, the paper uses model-based offline reinforcement learning to provide a novel approach to optimizing TBM excavation operations. The decision processes recorded in the Yin-song TBM project, a waterway tunnel in Jilin Province, China, were used to validate the model. By adopting an implicit perception of geological conditions in the GRTBM model, the suggested method reached the desired state within a single action, greatly outperforming the adjustments made in practice, which took 500 s. This indicates that the proposed model has the potential to surpass the capability of human drivers.
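The general idea of model-based offline decision making described in the abstract can be illustrated with a deliberately small sketch. It is not the authors' method: a one-variable linear least-squares fit stands in for the transformer-based GRTBM, the logged data are synthetic, and all names (`fit_response_model`, `suggest_action`, `make_logged_data`) are hypothetical. The sketch only shows the pattern of learning a response model from logged operations and then choosing, within the range of actions seen in the log, the single action whose predicted outcome is closest to a target state.

```python
import random


def fit_response_model(transitions):
    """Least-squares fit of (next_state - state) ~ gain * action + bias."""
    xs = [a for (_, a, _) in transitions]
    ys = [s_next - s for (s, _, s_next) in transitions]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    var_x = sum((x - mean_x) ** 2 for x in xs)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    gain = cov_xy / var_x
    bias = mean_y - gain * mean_x
    return gain, bias


def predict_next(state, action, model):
    """Predict the next state with the fitted (stand-in) response model."""
    gain, bias = model
    return state + gain * action + bias


def suggest_action(state, target, model, transitions, n_candidates=201):
    """Pick the candidate action whose predicted next state is closest to target.

    Candidates are restricted to the action range present in the logged data,
    a simple way to stay where the offline model has support.
    """
    lo = min(a for (_, a, _) in transitions)
    hi = max(a for (_, a, _) in transitions)
    step = (hi - lo) / (n_candidates - 1)
    candidates = [lo + i * step for i in range(n_candidates)]
    return min(candidates, key=lambda a: abs(predict_next(state, a, model) - target))


def make_logged_data(n=200, seed=0):
    """Synthetic log (assumption): state changes by 2.0 * action + 0.1 plus noise."""
    rng = random.Random(seed)
    data = []
    state = 10.0
    for _ in range(n):
        action = rng.uniform(-1.0, 1.0)
        next_state = state + 2.0 * action + 0.1 + rng.gauss(0.0, 0.01)
        data.append((state, action, next_state))
        state = next_state
    return data


logged = make_logged_data()
model = fit_response_model(logged)
action = suggest_action(state=10.0, target=11.0, model=model, transitions=logged)
predicted = predict_next(10.0, action, model)
```

Restricting the search to logged action ranges is one common precaution in offline settings, since a learned model is unreliable for actions it has never observed; whether the paper applies this particular constraint is not stated in the abstract.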
About the journal:
Underground Space is an open access international journal without article processing charges (APC), committed to serving as a scientific forum for researchers and practitioners in the field of underground engineering. The journal welcomes manuscripts that deal with original theories, methods, technologies, and important applications throughout the life cycle of underground projects, including planning, design, operation and maintenance, disaster prevention, and demolition. The journal is particularly interested in manuscripts related to the latest developments in smart underground engineering from the perspectives of resilience, resource saving, environmental friendliness, humanity, and artificial intelligence. Manuscripts are expected to offer significant innovation and potential impact in the field of underground engineering, and should have a clear association with or application in underground projects.