Diffusion-Based Deep Reinforcement Learning for Resource Management in Connected Construction Equipment Networks: A Hierarchical Framework

IF 10.7 1区 计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Wireless Communications Pub Date : 2025-01-13 DOI:10.1109/TWC.2024.3525410
Pengfei Ning;Hongwei Wang;Tao Tang;Jie Zhang;Hongyang Du;Dusit Niyato;F. Richard Yu
{"title":"Diffusion-Based Deep Reinforcement Learning for Resource Management in Connected Construction Equipment Networks: A Hierarchical Framework","authors":"Pengfei Ning;Hongwei Wang;Tao Tang;Jie Zhang;Hongyang Du;Dusit Niyato;F. Richard Yu","doi":"10.1109/TWC.2024.3525410","DOIUrl":null,"url":null,"abstract":"With the extensive adoption of information technology, tunnel construction is experiencing a rapid digital transformation. Integrating powerful direct communication among construction equipment (CE) facilitates real-time data exchange, promoting collaborative operations among CE. Concurrent execution of multiple construction procedures leads to a significant rise in the amount of CE and communication links, resulting in resource competition. However, this competition is aimed at enhancing collaboration. To address this inherently contradictory issue, we propose a hierarchical resource management framework and align communication quality of service (QoS) to construction efficiency using construction procedure coherence degree (CPCD) based on age of information (AoI). By formulating resource management as a stochastic optimization problem, a suitable online two-level deep reinforcement learning algorithm referred to as diffusion based soft actor critic (DSAC)-QMIX is designed to derive the radio resource allocation strategies. DSAC is responsible for orchestrating spectrum inter-fleets at the high-level, and QMIX makes the resource management and power control decision for each CE at the low-level. Simulation results validate the effectiveness of the DSAC-QMIX algorithm with comparable transmission rate, and show superior performance in terms of CPCD satisfaction compared with other benchmarks.","PeriodicalId":13431,"journal":{"name":"IEEE Transactions on Wireless Communications","volume":"24 4","pages":"2847-2861"},"PeriodicalIF":10.7000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Wireless Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10838290/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

With the extensive adoption of information technology, tunnel construction is experiencing a rapid digital transformation. Integrating powerful direct communication among construction equipment (CE) facilitates real-time data exchange, promoting collaborative operations among CE. Concurrent execution of multiple construction procedures leads to a significant rise in the amount of CE and communication links, resulting in resource competition. However, this competition is aimed at enhancing collaboration. To address this inherently contradictory issue, we propose a hierarchical resource management framework and align communication quality of service (QoS) to construction efficiency using construction procedure coherence degree (CPCD) based on age of information (AoI). By formulating resource management as a stochastic optimization problem, a suitable online two-level deep reinforcement learning algorithm referred to as diffusion based soft actor critic (DSAC)-QMIX is designed to derive the radio resource allocation strategies. DSAC is responsible for orchestrating spectrum inter-fleets at the high-level, and QMIX makes the resource management and power control decision for each CE at the low-level. Simulation results validate the effectiveness of the DSAC-QMIX algorithm with comparable transmission rate, and show superior performance in terms of CPCD satisfaction compared with other benchmarks.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于扩散的深度强化学习,用于互联建筑设备网络中的资源管理:分层框架
随着信息技术的广泛应用,隧道建设正在经历快速的数字化转型。集成强大的施工设备间直接通信,方便实时数据交换,促进施工设备间协同作业。多个施工工序同时执行,导致CE和通信环节数量显著增加,造成资源竞争。然而,这次比赛的目的是加强合作。为了解决这一内在矛盾的问题,我们提出了一个分层资源管理框架,并使用基于信息时代(AoI)的构建过程相干度(CPCD)将通信服务质量(QoS)与构建效率保持一致。通过将资源管理描述为一个随机优化问题,设计了一种适合的在线两级深度强化学习算法——基于扩散的软行为者评价(DSAC)-QMIX,以导出无线电资源分配策略。DSAC在高层负责编排频谱之间的机群,QMIX在低层为每个CE做出资源管理和电源控制决策。仿真结果验证了DSAC-QMIX算法在传输速率相当的情况下的有效性,与其他基准测试相比,在CPCD满意度方面表现出优越的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
18.60
自引率
10.60%
发文量
708
审稿时长
5.6 months
期刊介绍: The IEEE Transactions on Wireless Communications is a prestigious publication that showcases cutting-edge advancements in wireless communications. It welcomes both theoretical and practical contributions in various areas. The scope of the Transactions encompasses a wide range of topics, including modulation and coding, detection and estimation, propagation and channel characterization, and diversity techniques. The journal also emphasizes the physical and link layer communication aspects of network architectures and protocols. The journal is open to papers on specific topics or non-traditional topics related to specific application areas. This includes simulation tools and methodologies, orthogonal frequency division multiplexing, MIMO systems, and wireless over optical technologies. Overall, the IEEE Transactions on Wireless Communications serves as a platform for high-quality manuscripts that push the boundaries of wireless communications and contribute to advancements in the field.
期刊最新文献
Deterministic Statistical QoS Guarantee over FBL-AMC-HARQ based Cell-Free mMIMO Resource Allocation for Heterogeneous Services in Satellite-Terrestrial IoT Networks With Multi-Access Edge Computing Dual Time-Scale Resource Allocation for Hybrid VR and Haptic Services With Diffusion-Based DRL Joint Energy-Efficient and Throughput Optimization in Large-Scale Mobile Networks via Safe Hierarchical MARL A Fingerprint Database Generation Method for RIS-Assisted Indoor Positioning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1