首页 > 最新文献

IEEE Transactions on Mobile Computing最新文献

英文 中文
MAIR: Model Agnostic Instance Reweighing for Heterogeneous Federated Learning maair:异构联邦学习的模型不可知实例重加权
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-10-22 DOI: 10.1109/TMC.2025.3624064
Dongping Liao;Xitong Gao;Chengzhong Xu
Federated learning (FL) enables collaborative training on decentralized data while preserving the data owners’ privacy, under the orchestration of a central server. FL has seen tremendous growth and advancements in recent years. Despite its progress, FL faces a significant challenge raised by data heterogeneity, leading to a slower convergence rate and a larger performance gap compared to centralized training. In this work, we empirically reveal that direct applying empirical risk minimizing (ERM) on skewed client training data causes the client model suffers from biased predictions towards majority classes. To address this problem, we propose a model agnostic instance reweighing method (MAIR). At a coarse-grained level, MAIR adjusts the logits predictions for each class to counteract the data heterogeneity. At a fine-grained level, it dynamically reweighs the importance of individual training samples with a predictive meta network. As a results, MAIR prevents client models from over-fitting on heterogeneous data and therefore substantially reduces client drift. Theoretically, we justify its non-convex convergence property. Extensive experiments demonstrate that MAIR reliably speeds up convergence and improves the quality of global models, outperforming its best competitor by a clear margin. It notably delivers $8.3%$ improvements on ImageNet subset and achieves $67.6%$ energy footprint reduction on CIFAR-100 over the FedAvg baseline. Our findings also suggest that improving the performance of FL-trained models necessitates rethinking clients’ local optimization objectives, and ERM should thus no longer be viewed as a de facto standard in FL under data heterogeneity.
联邦学习(FL)支持在分散数据上进行协作训练,同时在中央服务器的编排下保护数据所有者的隐私。近年来,FL取得了巨大的发展和进步。尽管取得了进展,但与集中式训练相比,FL面临着数据异质性带来的重大挑战,导致其收敛速度较慢,性能差距较大。在这项工作中,我们实证地揭示了在倾斜的客户培训数据上直接应用经验风险最小化(ERM)会导致客户模型对大多数类别的预测存在偏见。为了解决这一问题,我们提出了一种与模型无关的实例重加权方法(MAIR)。在粗粒度级别上,MAIR调整每个类的logits预测以抵消数据异构性。在细粒度层面上,它动态地用一个预测元网络重新加权单个训练样本的重要性。因此,MAIR可以防止客户端模型过度拟合异构数据,从而大大减少客户端漂移。从理论上证明了它的非凸收敛性。大量的实验表明,MAIR可靠地加快了收敛速度,提高了全局模型的质量,明显优于其最佳竞争对手。它显著地在ImageNet子集上提供了8.3%的改进,并在fedag基线上实现了CIFAR-100上67.6%的能源足迹减少。我们的研究结果还表明,提高FL训练模型的性能需要重新考虑客户的局部优化目标,因此,在数据异构的情况下,ERM不应再被视为FL的事实上的标准。
{"title":"MAIR: Model Agnostic Instance Reweighing for Heterogeneous Federated Learning","authors":"Dongping Liao;Xitong Gao;Chengzhong Xu","doi":"10.1109/TMC.2025.3624064","DOIUrl":"https://doi.org/10.1109/TMC.2025.3624064","url":null,"abstract":"Federated learning (FL) enables collaborative training on decentralized data while preserving the data owners’ privacy, under the orchestration of a central server. FL has seen tremendous growth and advancements in recent years. Despite its progress, FL faces a significant challenge raised by data heterogeneity, leading to a slower convergence rate and a larger performance gap compared to centralized training. In this work, we empirically reveal that direct applying empirical risk minimizing (ERM) on skewed client training data causes the client model suffers from biased predictions towards majority classes. To address this problem, we propose a model agnostic instance reweighing method (MAIR). At a coarse-grained level, MAIR adjusts the logits predictions for each class to counteract the data heterogeneity. At a fine-grained level, it dynamically reweighs the importance of individual training samples with a predictive meta network. As a results, MAIR prevents client models from over-fitting on heterogeneous data and therefore substantially reduces client drift. Theoretically, we justify its non-convex convergence property. Extensive experiments demonstrate that MAIR reliably speeds up convergence and improves the quality of global models, outperforming its best competitor by a clear margin. It notably delivers <inline-formula><tex-math>$8.3%$</tex-math></inline-formula> improvements on ImageNet subset and achieves <inline-formula><tex-math>$67.6%$</tex-math></inline-formula> energy footprint reduction on CIFAR-100 over the FedAvg baseline. Our findings also suggest that improving the performance of FL-trained models necessitates rethinking clients’ local optimization objectives, and ERM should thus no longer be viewed as a de facto standard in FL under data heterogeneity.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 3","pages":"4241-4252"},"PeriodicalIF":9.2,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TeraTex: Contactless Textile Tactile Sensing Using Terahertz Signal TeraTex:使用太赫兹信号的非接触式纺织品触觉传感
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-10-22 DOI: 10.1109/TMC.2025.3624475
Zhan Zhang;Denghui Song;Anfu Zhou;Huadong Ma
Tactile sensing is an important capability for intelligent machines to interact with the external physical world. Current tactile sensing methods are mainly based on contact, which mostly perceive the object’s surface features, limiting for deeper features such as internal structure. In this paper, we propose terahertz textile tactile sensing, i.e., TeraTex, which utilizes the effect of textile material and structure on terahertz reflected signals to perceive its multidimensional tactile properties. To realize TeraTex, the key challenges lie in that the textile tactile properties influenced by material and structure are subtle and intricately entangled within the reflected signals, and different reflective surfaces can interfere with feature extraction. To address these challenges, we custom-design a biologically inspired network to extract different dimensions of tactile properties from the terahertz reflected signals, and eliminate the influence of different reflective surfaces. We prototype and validate TeraTex using a terahertz time-domain spectrometer on 30 types of textiles and 7 different reflective surfaces. TeraTex achieves an average accuracy of 92.42% across 11 tactile property dimensions. Furthermore, when the reflective surfaces are extended to unknown surfaces, TeraTex still achieves an average accuracy of 88.3% .
触觉感知是智能机器与外部物理世界进行交互的重要能力。目前的触觉传感方法主要是基于接触的,大多是感知物体的表面特征,局限于内部结构等更深层次的特征。在本文中,我们提出了太赫兹纺织品触觉传感,即TeraTex,它利用纺织品材料和结构对太赫兹反射信号的影响来感知其多维触觉特性。实现TeraTex的关键挑战在于,受材料和结构影响的纺织品触觉特性在反射信号中是微妙而复杂的纠缠,不同的反射表面会干扰特征提取。为了解决这些挑战,我们定制设计了一个受生物学启发的网络,从太赫兹反射信号中提取不同维度的触觉特性,并消除不同反射表面的影响。我们使用太赫兹时域光谱仪在30种纺织品和7种不同的反射表面上对TeraTex进行了原型和验证。TeraTex在11个触觉属性维度上实现了92.42%的平均精度。此外,当反射面扩展到未知表面时,TeraTex的平均精度仍然达到88.3%。
{"title":"TeraTex: Contactless Textile Tactile Sensing Using Terahertz Signal","authors":"Zhan Zhang;Denghui Song;Anfu Zhou;Huadong Ma","doi":"10.1109/TMC.2025.3624475","DOIUrl":"https://doi.org/10.1109/TMC.2025.3624475","url":null,"abstract":"Tactile sensing is an important capability for intelligent machines to interact with the external physical world. Current tactile sensing methods are mainly based on contact, which mostly perceive the object’s surface features, limiting for deeper features such as internal structure. In this paper, we propose terahertz textile tactile sensing, i.e., <italic>TeraTex</i>, which utilizes the effect of textile material and structure on terahertz reflected signals to perceive its multidimensional tactile properties. To realize <italic>TeraTex</i>, the key challenges lie in that the textile tactile properties influenced by material and structure are subtle and intricately entangled within the reflected signals, and different reflective surfaces can interfere with feature extraction. To address these challenges, we custom-design a biologically inspired network to extract different dimensions of tactile properties from the terahertz reflected signals, and eliminate the influence of different reflective surfaces. We prototype and validate <italic>TeraTex</i> using a terahertz time-domain spectrometer on 30 types of textiles and 7 different reflective surfaces. <italic>TeraTex</i> achieves an average accuracy of 92.42% across 11 tactile property dimensions. Furthermore, when the reflective surfaces are extended to unknown surfaces, <italic>TeraTex</i> still achieves an average accuracy of 88.3% .","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 3","pages":"4269-4285"},"PeriodicalIF":9.2,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lightweight Adaptive Quantization Algorithms for Federated Learning With Heterogeneous Clients 基于异构客户端的联邦学习轻量级自适应量化算法
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-10-22 DOI: 10.1109/TMC.2025.3623984
Hengrui Cui;Zhihao Qu;Baoliu Ye;Bin Tang;Tao Zhuang;Xinyu Wang;Yue Zeng
Quantization is a common method to improve communication efficiency in federated learning (FL) by compressing the gradients that clients upload. Currently, most application scenarios involve cloud-edge collaboration, where edge clients exhibit significant heterogeneity, making previous methods with uniform quantization levels unsuitable. To address these issues, we introduce a novel algorithm named Lightweight Adaptive Quantization for Heterogeneous Clients (LAQ-HC), which enables each client to adaptively choose its quantization level based on its data quality and communication capabilities, without increasing computation costs. The core idea is that clients with lower communication capabilities should use higher quantization levels, whereas those with higher capabilities should use lower levels. This ensures that clients complete their uploads in a similar time. Furthermore, LAQ-HC models the relationship between quantization levels and the impact on client quality, which remains consistent between clients and adjacent training rounds. This allows for a lightweight estimation of the impact of quantization levels on training convergence, as demonstrated in our theoretical analysis. Under the constraints of limited wireless mobile communication bandwidth, LAQ-HC achieves faster convergence and higher accuracy compared to the latest adaptive quantization algorithms, while using only 56.74% of the computation time, 80.57% of the overall runtime, and 94.04% of the communication overhead.
量化是联邦学习(FL)中常用的一种通过压缩客户端上传的梯度来提高通信效率的方法。目前,大多数应用场景涉及云边缘协作,其中边缘客户端表现出明显的异构性,使得以前具有统一量化级别的方法不适合。为了解决这些问题,我们引入了一种名为异构客户端轻量级自适应量化(LAQ-HC)的新算法,该算法使每个客户端能够根据其数据质量和通信能力自适应选择量化级别,而不会增加计算成本。核心思想是通信能力较低的客户端应该使用较高的量化级别,而通信能力较高的客户端应该使用较低的量化级别。这确保了客户端在相同的时间内完成他们的上传。此外,LAQ-HC建立了量化水平与客户质量影响之间的关系模型,该模型在客户和相邻培训轮次之间保持一致。这允许对量化水平对训练收敛的影响进行轻量级估计,正如我们的理论分析所证明的那样。在有限的无线移动通信带宽约束下,与最新的自适应量化算法相比,LAQ-HC实现了更快的收敛速度和更高的精度,而计算时间仅为56.74%,总运行时间为80.57%,通信开销为94.04%。
{"title":"Lightweight Adaptive Quantization Algorithms for Federated Learning With Heterogeneous Clients","authors":"Hengrui Cui;Zhihao Qu;Baoliu Ye;Bin Tang;Tao Zhuang;Xinyu Wang;Yue Zeng","doi":"10.1109/TMC.2025.3623984","DOIUrl":"https://doi.org/10.1109/TMC.2025.3623984","url":null,"abstract":"Quantization is a common method to improve communication efficiency in federated learning (FL) by compressing the gradients that clients upload. Currently, most application scenarios involve cloud-edge collaboration, where edge clients exhibit significant heterogeneity, making previous methods with uniform quantization levels unsuitable. To address these issues, we introduce a novel algorithm named Lightweight Adaptive Quantization for Heterogeneous Clients (LAQ-HC), which enables each client to adaptively choose its quantization level based on its data quality and communication capabilities, without increasing computation costs. The core idea is that clients with lower communication capabilities should use higher quantization levels, whereas those with higher capabilities should use lower levels. This ensures that clients complete their uploads in a similar time. Furthermore, LAQ-HC models the relationship between quantization levels and the impact on client quality, which remains consistent between clients and adjacent training rounds. This allows for a lightweight estimation of the impact of quantization levels on training convergence, as demonstrated in our theoretical analysis. Under the constraints of limited wireless mobile communication bandwidth, LAQ-HC achieves faster convergence and higher accuracy compared to the latest adaptive quantization algorithms, while using only 56.74% of the computation time, 80.57% of the overall runtime, and 94.04% of the communication overhead.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 3","pages":"4409-4424"},"PeriodicalIF":9.2,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RoLEX: A LoRa-Based Rotation Speed Measurement System for Ubiquitous Long-Distance Monitoring Applications 劳力士:基于lora的转速测量系统,用于无处不在的远程监控应用
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-10-20 DOI: 10.1109/TMC.2025.3623385
Keran Li;Haipeng Dai;Wei Wang;Junlong Chen;Jiliang Wang;Shuai Tong;Meng Li;Lei Wang;Haoran Wang;Guihai Chen
Rotation is a fundamental form of motion and rotation speed measurement holds paramount importance for assessing the health and performance of machinery with rotating components. However, existing measurement systems often face challenges such as limited measurement distance, low accuracy, and complex installation or maintenance processes. In this paper, we propose RoLEX, a LoRa-based rotation speed measurement system for long-distance and contactless monitoring of rotating machinery in ubiquitous scenarios. RoLEX employs a novel Signal Selection method to eliminate chirp interference and adapt to varying rotation speeds, along with a Boost Sensing method to enhance sampling rates and an advanced feature processing algorithm for precise rotation speed estimation and tracking. Comprehensive experiments validate that RoLEX achieves a measurement distance of 50 m, approximately 17 times farther than the latest wireless rotation speed measurement systems. Moreover, RoLEX is robust to interference and obstructions (including through-wall scenarios) and achieves an average measurement error less than 0.69% across different rotation speeds (100–5100 Revolutions Per Minute). For tracking performance, RoLEX achieves a relative error less than 2.8% in 90% of cases. We also present a case study to highlight RoLEX’s practical applicability in real-world scenarios.
旋转是运动的基本形式,转速测量对于评估带有旋转部件的机械的健康和性能至关重要。然而,现有的测量系统经常面临测量距离有限、精度低、安装或维护过程复杂等挑战。在本文中,我们提出了RoLEX,一个基于lora的转速测量系统,用于在无处不在的场景中对旋转机械进行远距离和非接触式监测。劳力士采用新颖的信号选择方法来消除啁啾干扰,适应不同的旋转速度,以及一个Boost传感方法,以提高采样率和先进的特征处理算法,精确的转速估计和跟踪。综合实验证实,RoLEX实现了50米的测量距离,比最新的无线转速测量系统远约17倍。此外,劳力士对干扰和障碍物(包括穿墙场景)具有鲁棒性,在不同转速(100-5100转/分钟)下的平均测量误差小于0.69%。对于跟踪性能,劳力士在90%的情况下实现了相对误差小于2.8%。我们还提出了一个案例研究,以突出劳力士的实际适用性在现实世界的场景。
{"title":"RoLEX: A LoRa-Based Rotation Speed Measurement System for Ubiquitous Long-Distance Monitoring Applications","authors":"Keran Li;Haipeng Dai;Wei Wang;Junlong Chen;Jiliang Wang;Shuai Tong;Meng Li;Lei Wang;Haoran Wang;Guihai Chen","doi":"10.1109/TMC.2025.3623385","DOIUrl":"https://doi.org/10.1109/TMC.2025.3623385","url":null,"abstract":"Rotation is a fundamental form of motion and rotation speed measurement holds paramount importance for assessing the health and performance of machinery with rotating components. However, existing measurement systems often face challenges such as limited measurement distance, low accuracy, and complex installation or maintenance processes. In this paper, we propose RoLEX, a LoRa-based rotation speed measurement system for long-distance and contactless monitoring of rotating machinery in ubiquitous scenarios. RoLEX employs a novel Signal Selection method to eliminate chirp interference and adapt to varying rotation speeds, along with a Boost Sensing method to enhance sampling rates and an advanced feature processing algorithm for precise rotation speed estimation and tracking. Comprehensive experiments validate that RoLEX achieves a measurement distance of 50 m, approximately 17 times farther than the latest wireless rotation speed measurement systems. Moreover, RoLEX is robust to interference and obstructions (including through-wall scenarios) and achieves an average measurement error less than 0.69% across different rotation speeds (100–5100 Revolutions Per Minute). For tracking performance, RoLEX achieves a relative error less than 2.8% in 90% of cases. We also present a case study to highlight RoLEX’s practical applicability in real-world scenarios.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 3","pages":"4170-4189"},"PeriodicalIF":9.2,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Traffic Digital Twin-Enabled Orchestration and Scheduling in O-RAN: A Multi-Timescale Joint Optimization Approach O-RAN交通数字双启用编排与调度:一种多时间尺度联合优化方法
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-10-17 DOI: 10.1109/TMC.2025.3622895
Yinlin Ren;Longyu Zhou;Shaoyong Guo;Xuesong Qiu;Tony Q. S. Quek
Open Radio Access Network (O-RAN) supports heterogeneous service coexistence through functional splitting and open interfaces, enabling traffic steering via functional orchestration and resource scheduling. However, existing studies focus on known traffic patterns and lack the ability to anticipate dynamic service demands in advance. Isolated optimization of orchestration and scheduling fails to ensure End-to-End (E2E) latency. The varying time scales and vast solution space further complicate the joint optimization. To address this, we propose a traffic twin-enabled orchestration and scheduling multi-timescale joint optimization scheme. Explicitly, we design a spatiotemporal attention-assisted Time Series Generative Adversarial Network (TimeGAN) traffic twin model (STAG-TD) to capture unknown traffic patterns. Based on twin results, we formulate a joint optimization problem and design a dual-timescale algorithm framework, including propose a Task Decomposed Dueling Double Deep Q-Network (TD3QN) algorithm to handle large-timescale orchestration, and use a Penalty-based Particle Swarm Optimization (PPSO) algorithm to manage small-timescale scheduling. Our scheme achieves a predictive joint optimization to reduce the transmission latency of services. Extensive results show our scheme outperforms state-of-the-art methods, reducing E2E latency by over 39% and increasing throughput by over 14.9%. The highly consistent results between real and twin data also demonstrate the effectiveness of the traffic twin model.
开放无线接入网(O-RAN)通过功能拆分和开放接口支持异构服务共存,通过功能编排和资源调度实现流量引导。然而,现有的研究主要集中在已知的交通模式上,缺乏对动态服务需求提前预测的能力。单独的业务流程和调度优化无法保证端到端时延。多变的时间尺度和巨大的求解空间使联合优化变得更加复杂。为了解决这个问题,我们提出了一种流量双启用编排和调度多时间尺度联合优化方案。明确地,我们设计了一个时空注意力辅助时间序列生成对抗网络(TimeGAN)交通孪生模型(STAG-TD)来捕获未知的交通模式。在此基础上,提出了一个联合优化问题,并设计了一个双时间尺度的算法框架,包括提出了一种任务分解Dueling双深度Q-Network (TD3QN)算法来处理大时间尺度的编排,以及一种基于惩罚的粒子群优化(PPSO)算法来管理小时间尺度的调度。我们的方案实现了一种预测性联合优化,以减少业务的传输延迟。广泛的结果表明,我们的方案优于最先进的方法,将端到端延迟减少了39%以上,并将吞吐量提高了14.9%以上。真实数据与孪生数据高度一致的结果也证明了流量孪生模型的有效性。
{"title":"Traffic Digital Twin-Enabled Orchestration and Scheduling in O-RAN: A Multi-Timescale Joint Optimization Approach","authors":"Yinlin Ren;Longyu Zhou;Shaoyong Guo;Xuesong Qiu;Tony Q. S. Quek","doi":"10.1109/TMC.2025.3622895","DOIUrl":"https://doi.org/10.1109/TMC.2025.3622895","url":null,"abstract":"Open Radio Access Network (O-RAN) supports heterogeneous service coexistence through functional splitting and open interfaces, enabling traffic steering via functional orchestration and resource scheduling. However, existing studies focus on known traffic patterns and lack the ability to anticipate dynamic service demands in advance. Isolated optimization of orchestration and scheduling fails to ensure End-to-End (E2E) latency. The varying time scales and vast solution space further complicate the joint optimization. To address this, we propose a traffic twin-enabled orchestration and scheduling multi-timescale joint optimization scheme. Explicitly, we design a spatiotemporal attention-assisted Time Series Generative Adversarial Network (TimeGAN) traffic twin model (STAG-TD) to capture unknown traffic patterns. Based on twin results, we formulate a joint optimization problem and design a dual-timescale algorithm framework, including propose a Task Decomposed Dueling Double Deep Q-Network (TD3QN) algorithm to handle large-timescale orchestration, and use a Penalty-based Particle Swarm Optimization (PPSO) algorithm to manage small-timescale scheduling. Our scheme achieves a predictive joint optimization to reduce the transmission latency of services. Extensive results show our scheme outperforms state-of-the-art methods, reducing E2E latency by over 39% and increasing throughput by over 14.9%. The highly consistent results between real and twin data also demonstrate the effectiveness of the traffic twin model.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 3","pages":"4223-4240"},"PeriodicalIF":9.2,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling Nonsaturated IEEE 802.11ax Networks With the Coexistence of UORA and UONRA in Imperfect Channels 不完全信道下UORA和UONRA共存的非饱和IEEE 802.11ax网络建模
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-10-17 DOI: 10.1109/TMC.2025.3622596
Jin Meng;Chao Deng;Penghui Song;Minghao Jin;Yingzhuang Liu
This study introduces an analytical model for a nonsaturated IEEE 802.11ax network, designed to capture the coexistence characteristics of uplink orthogonal frequency division multiple access (OFDMA)-based random access (UORA) and non-random access (UONRA) mechanisms under imperfect channels. Existing models fail to assess this realistic network due to three issues: 1. When accounting for frame aggregation, imperfect channels, and queue buffer, the network exhibits an overwhelming number of states, which renders the evaluation of its performance infeasible. 2. Existing bulk-service queue models fail to evaluate these queue characteristics influenced by imperfect channels. 3. The coexistence characteristics of the two mechanisms remain unexplored due to their complex interactions. To address Issue 1, we propose two designs: device updates and hardware implementation, which reduce the massive states and facilitate the network evaluation. To address Issue 2, we have developed a feedback-driven bulk-service queue model that captures the joint effects of imperfect channels and queue characteristics. To address Issue 3, we comprehensively analyzed the joint impact of the two mechanisms from a probabilistic perspective. Extensive simulations demonstrate that the proposed model accurately captures network performance (throughput, delay, and collision probability) and reduces the mean estimation error by a factor of 30 compared to existing studies.
本研究引入了一个非饱和IEEE 802.11ax网络的分析模型,旨在捕捉不完全信道下基于上行正交频分多址(OFDMA)的随机接入(UORA)和非随机接入(UONRA)机制的共存特性。由于三个问题,现有模型无法评估这一现实网络。当考虑到帧聚合、不完美通道和队列缓冲时,网络表现出压倒性的状态,这使得对其性能的评估变得不可行的。2. 现有的大容量服务队列模型无法评估这些受不完善通道影响的队列特征。3. 由于这两种机制之间复杂的相互作用,它们的共存特征尚未得到探索。为了解决问题1,我们提出了两种设计:设备更新和硬件实现,以减少大量状态,方便网络评估。为了解决问题2,我们开发了一个反馈驱动的批量服务队列模型,该模型捕捉了不完美通道和队列特征的共同影响。为了解决问题3,我们从概率的角度全面分析了两种机制的共同影响。大量的仿真表明,与现有研究相比,所提出的模型准确地捕获了网络性能(吞吐量、延迟和碰撞概率),并将平均估计误差降低了30倍。
{"title":"Modeling Nonsaturated IEEE 802.11ax Networks With the Coexistence of UORA and UONRA in Imperfect Channels","authors":"Jin Meng;Chao Deng;Penghui Song;Minghao Jin;Yingzhuang Liu","doi":"10.1109/TMC.2025.3622596","DOIUrl":"https://doi.org/10.1109/TMC.2025.3622596","url":null,"abstract":"This study introduces an analytical model for a nonsaturated IEEE 802.11ax network, designed to capture the coexistence characteristics of uplink orthogonal frequency division multiple access (OFDMA)-based random access (UORA) and non-random access (UONRA) mechanisms under imperfect channels. Existing models fail to assess this realistic network due to three issues: 1. When accounting for frame aggregation, imperfect channels, and queue buffer, the network exhibits an overwhelming number of states, which renders the evaluation of its performance infeasible. 2. Existing bulk-service queue models fail to evaluate these queue characteristics influenced by imperfect channels. 3. The coexistence characteristics of the two mechanisms remain unexplored due to their complex interactions. To address Issue 1, we propose two designs: device updates and hardware implementation, which reduce the massive states and facilitate the network evaluation. To address Issue 2, we have developed a feedback-driven bulk-service queue model that captures the joint effects of imperfect channels and queue characteristics. To address Issue 3, we comprehensively analyzed the joint impact of the two mechanisms from a probabilistic perspective. Extensive simulations demonstrate that the proposed model accurately captures network performance (throughput, delay, and collision probability) and reduces the mean estimation error by a factor of 30 compared to existing studies.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 3","pages":"4190-4205"},"PeriodicalIF":9.2,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cola: Cross-Processor Operator Parallelism for Asynchronous Deep Learning Inference 跨处理器操作并行异步深度学习推理
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-10-17 DOI: 10.1109/TMC.2025.3622698
Changyao Lin;Zhenming Chen;Ziyang Zhang;Jie Liu
Multi-task inference, as a prevalent inference paradigm nowadays, requires deploying multiple deep learning models on the hardware platform to concurrently process inference tasks. Modern platforms are typically equipped with various heterogeneous processors, such as CPU-GPU platform. To reduce resource contention and improve quality of service in the multi-task scenario, existing work has studied cross-processor inference at the fine-grained operator-level. However, it lacks specific optimizations for asynchronous multi-task inference systems. In such systems, tasks arrive dynamically, leading to diverse inference progress for each model. This renders offline optimization strategies based solely on the original computation graph suboptimal or even ineffective. Therefore, we propose a novel framework, Cola, to address the cross-processor operator scheduling for asynchronous tasks. Cola introduces intermediate representation to abstract and simplify such dynamic scheduling problem, considering the impact of task arrival patterns on the inference progress, and employs an efficient two-phase search algorithm. We implemented and validated Cola on a real-world case of intelligent steel structure manufacturing. Cola outperforms the state-of-the-art cross-processor operator scheduling framework in both throughput and resource utilization with highly acceptable runtime overhead.
多任务推理作为当今流行的一种推理范式,需要在硬件平台上部署多个深度学习模型来并行处理推理任务。现代平台通常配备各种异构处理器,如CPU-GPU平台。为了减少多任务场景下的资源争用,提高服务质量,已有的工作主要是在细粒度操作符级别研究跨处理器推理。然而,它缺乏针对异步多任务推理系统的特定优化。在这样的系统中,任务是动态到达的,导致每个模型的推理进度不同。这使得仅基于原始计算图的离线优化策略不是最优的,甚至无效。因此,我们提出了一个新的框架Cola来解决异步任务的跨处理器操作员调度问题。考虑到任务到达模式对推理进度的影响,Cola引入中间表示对动态调度问题进行了抽象和简化,并采用了高效的两阶段搜索算法。我们在智能钢结构制造的实际案例中实施并验证了Cola。Cola在吞吐量和资源利用率方面都优于最先进的跨处理器操作员调度框架,并且运行时开销非常可接受。
{"title":"Cola: Cross-Processor Operator Parallelism for Asynchronous Deep Learning Inference","authors":"Changyao Lin;Zhenming Chen;Ziyang Zhang;Jie Liu","doi":"10.1109/TMC.2025.3622698","DOIUrl":"https://doi.org/10.1109/TMC.2025.3622698","url":null,"abstract":"Multi-task inference, as a prevalent inference paradigm nowadays, requires deploying multiple deep learning models on the hardware platform to concurrently process inference tasks. Modern platforms are typically equipped with various heterogeneous processors, such as CPU-GPU platform. To reduce resource contention and improve quality of service in the multi-task scenario, existing work has studied cross-processor inference at the fine-grained operator-level. However, it lacks specific optimizations for asynchronous multi-task inference systems. In such systems, tasks arrive dynamically, leading to diverse inference progress for each model. This renders offline optimization strategies based solely on the original computation graph suboptimal or even ineffective. Therefore, we propose a novel framework, Cola, to address the cross-processor operator scheduling for asynchronous tasks. Cola introduces intermediate representation to abstract and simplify such dynamic scheduling problem, considering the impact of task arrival patterns on the inference progress, and employs an efficient two-phase search algorithm. We implemented and validated Cola on a real-world case of intelligent steel structure manufacturing. Cola outperforms the state-of-the-art cross-processor operator scheduling framework in both throughput and resource utilization with highly acceptable runtime overhead.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 3","pages":"4346-4362"},"PeriodicalIF":9.2,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CoDS: Enhancing Collaborative Perception in Heterogeneous Scenarios via Domain Separation CoDS:通过领域分离增强异构场景中的协同感知
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-10-17 DOI: 10.1109/TMC.2025.3622937
Yushan Han;Hui Zhang;Honglei Zhang;Chuntao Ding;Yuanzhouhan Cao;Yidong Li
Collaborative perception has been proven to improve individual perception in autonomous driving through multi-agent interaction. Nevertheless, most methods often assume identical encoders for all agents, which does not hold true when these models are deployed in real-world applications. To realize collaborative perception in actual heterogeneous scenarios, existing methods usually align neighbor features to those of the ego vehicle, which is vulnerable to noise from domain gaps and thus fails to address feature discrepancies effectively. Moreover, they adopt transformer-based modules for domain adaptation, which causes the model inference inefficiency on mobile devices. To tackle these issues, we propose CoDS, a Collaborative perception method that leverages Domain Separation to address feature discrepancies in heterogeneous scenarios. The CoDS employs two feature alignment modules, i.e., Lightweight Spatial-Channel Resizer (LSCR) and Distribution Alignment via Domain Separation (DADS). Besides, it utilizes the Domain Alignment Mutual Information (DAMI) loss to ensure effective feature alignment. Specifically, the LSCR aligns the neighbor feature across spatial and channel dimensions using a lightweight convolutional layer. Subsequently, the DADS mitigates feature distribution discrepancy with encoder-specific and encoder-agnostic domain separation modules. The former removes domain-dependent information and the latter captures task-related information. During training, the DAMI loss maximizes the mutual information between aligned heterogeneous features to enhance the domain separation process. The CoDS employs a fully convolutional architecture, which ensures high inference efficiency. Extensive experiments demonstrate that the CoDS effectively mitigates feature discrepancies in heterogeneous scenarios and achieves a trade-off between detection accuracy and inference efficiency.
协作感知已被证明可以通过多智能体交互来改善自动驾驶中的个体感知。然而,大多数方法通常为所有代理假设相同的编码器,当这些模型部署在实际应用程序中时,这是不成立的。为了实现实际异构场景下的协同感知,现有方法通常将相邻特征与自我车辆的特征对齐,容易受到域间隙噪声的影响,无法有效地解决特征差异问题。此外,它们采用基于变压器的模块进行领域自适应,导致在移动设备上的模型推理效率低下。为了解决这些问题,我们提出了CoDS,一种利用领域分离来解决异构场景中特征差异的协作感知方法。CoDS采用两个特征对齐模块,即轻量级空间通道调整器(LSCR)和通过域分离的分布对齐(DADS)。利用域对齐互信息(Domain Alignment Mutual Information, DAMI)损失来保证有效的特征对齐。具体来说,LSCR使用轻量级卷积层跨空间和通道维度对齐邻居特征。随后,DADS通过特定于编码器和与编码器无关的域分离模块减轻了特征分布差异。前者删除与领域相关的信息,后者捕获与任务相关的信息。在训练过程中,DAMI损失最大化了对齐异构特征之间的相互信息,增强了域分离过程。CoDS采用全卷积架构,保证了较高的推理效率。大量的实验表明,CoDS有效地缓解了异构场景下的特征差异,实现了检测精度和推理效率之间的平衡。
{"title":"CoDS: Enhancing Collaborative Perception in Heterogeneous Scenarios via Domain Separation","authors":"Yushan Han;Hui Zhang;Honglei Zhang;Chuntao Ding;Yuanzhouhan Cao;Yidong Li","doi":"10.1109/TMC.2025.3622937","DOIUrl":"https://doi.org/10.1109/TMC.2025.3622937","url":null,"abstract":"Collaborative perception has been proven to improve individual perception in autonomous driving through multi-agent interaction. Nevertheless, most methods often assume identical encoders for all agents, which does not hold true when these models are deployed in real-world applications. To realize collaborative perception in actual heterogeneous scenarios, existing methods usually align neighbor features to those of the ego vehicle, which is vulnerable to noise from domain gaps and thus fails to address feature discrepancies effectively. Moreover, they adopt transformer-based modules for domain adaptation, which causes the model inference inefficiency on mobile devices. To tackle these issues, we propose CoDS, a <underline>Co</u>llaborative perception method that leverages <underline>D</u>omain <underline>S</u>eparation to address feature discrepancies in heterogeneous scenarios. The CoDS employs two feature alignment modules, i.e., Lightweight Spatial-Channel Resizer (LSCR) and Distribution Alignment via Domain Separation (DADS). Besides, it utilizes the Domain Alignment Mutual Information (DAMI) loss to ensure effective feature alignment. Specifically, the LSCR aligns the neighbor feature across spatial and channel dimensions using a lightweight convolutional layer. Subsequently, the DADS mitigates feature distribution discrepancy with encoder-specific and encoder-agnostic domain separation modules. The former removes domain-dependent information and the latter captures task-related information. During training, the DAMI loss maximizes the mutual information between aligned heterogeneous features to enhance the domain separation process. The CoDS employs a fully convolutional architecture, which ensures high inference efficiency. Extensive experiments demonstrate that the CoDS effectively mitigates feature discrepancies in heterogeneous scenarios and achieves a trade-off between detection accuracy and inference efficiency.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 3","pages":"4286-4299"},"PeriodicalIF":9.2,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PRFL: Personalized and Robust Federated Learning for Non-IID Data With Malicious Participants PRFL:具有恶意参与者的非iid数据的个性化和鲁棒联邦学习
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-10-17 DOI: 10.1109/TMC.2025.3623072
Lixiang Yuan;Jiapeng Zhang;Mingxing Duan;Guoqing Xiao;Zhuo Tang;Kenli Li
Federated learning (FL) enables collaborative training of a global model while preserving participants’ local data privacy, making it ideal for data-sensitive fields like Industrial Internet of Things (IIoT), finance, and healthcare. However, Non-IID data among participants and the presence of malicious participants pose significant challenges to the model’s performance and convergence. The global model is difficult to achieve consistent performance across all participants. Therefore, this paper proposes personalized and robust federated learning (PRFL) to handle non-independently and identically distributed (Non-IID) data with malicious participants. First, to enhance the robustness and convergence, a model similarity-based division mechanism is employed. It groups participants with similar data and removes both independent and colluding malicious participants. Second, we propose a three-stage knowledge sharing personalized federated learning framework. Each participant undergoes inner-loop knowledge sharing, outer-loop knowledge sharing, and personalized knowledge distillation, incorporating performance-driven dynamic weighted sharing mechanism. Moreover, extensive experiments demonstrate that PRFL outperforms other advanced personalized federated learning methods across various benchmark datasets, particularly in scenarios with Non-IID data and malicious participants.
联邦学习(FL)支持全球模型的协作训练,同时保护参与者的本地数据隐私,使其成为工业物联网(IIoT)、金融和医疗保健等数据敏感领域的理想选择。然而,参与者之间的非iid数据和恶意参与者的存在对模型的性能和收敛性提出了重大挑战。全球模型很难在所有参与者之间实现一致的绩效。因此,本文提出了个性化和鲁棒性联邦学习(PRFL)来处理具有恶意参与者的非独立和同分布(Non-IID)数据。首先,为了增强算法的鲁棒性和收敛性,采用了基于模型相似性的分割机制。它将具有相似数据的参与者分组,并删除独立的和串通的恶意参与者。其次,提出了一个分三阶段的知识共享个性化联邦学习框架。每个参与者经历了内环知识共享、外环知识共享和个性化知识提炼,结合了绩效驱动的动态加权共享机制。此外,大量实验表明,PRFL在各种基准数据集上优于其他高级个性化联邦学习方法,特别是在具有非iid数据和恶意参与者的场景中。
{"title":"PRFL: Personalized and Robust Federated Learning for Non-IID Data With Malicious Participants","authors":"Lixiang Yuan;Jiapeng Zhang;Mingxing Duan;Guoqing Xiao;Zhuo Tang;Kenli Li","doi":"10.1109/TMC.2025.3623072","DOIUrl":"https://doi.org/10.1109/TMC.2025.3623072","url":null,"abstract":"Federated learning (FL) enables collaborative training of a global model while preserving participants’ local data privacy, making it ideal for data-sensitive fields like Industrial Internet of Things (IIoT), finance, and healthcare. However, Non-IID data among participants and the presence of malicious participants pose significant challenges to the model’s performance and convergence. The global model is difficult to achieve consistent performance across all participants. Therefore, this paper proposes personalized and robust federated learning (PRFL) to handle non-independently and identically distributed (Non-IID) data with malicious participants. First, to enhance the robustness and convergence, a model similarity-based division mechanism is employed. It groups participants with similar data and removes both independent and colluding malicious participants. Second, we propose a three-stage knowledge sharing personalized federated learning framework. Each participant undergoes inner-loop knowledge sharing, outer-loop knowledge sharing, and personalized knowledge distillation, incorporating performance-driven dynamic weighted sharing mechanism. Moreover, extensive experiments demonstrate that PRFL outperforms other advanced personalized federated learning methods across various benchmark datasets, particularly in scenarios with Non-IID data and malicious participants.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 3","pages":"4331-4345"},"PeriodicalIF":9.2,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Layer Scheduling in Gig Platforms Using a Generative Diffusion Model With Duality Guidance 基于二元导向的生成扩散模型的Gig平台多层调度
IF 9.2 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-09-23 DOI: 10.1109/TMC.2025.3613450
Xinyu Lu;Zhanbo Feng;Jiong Lou;Chentao Wu;Guangtao Xue;Wei Zhao;Jie Li
In recent years, gig platforms have emerged as a new paradigm, seamlessly connecting workers and tasks while leveraging workers’ collective intelligence, participation, and shared resources. Traditionally, platforms have operated under the assumption of worker homogeneity, where service capabilities and associated service costs are similar. However, in mobile computing scenarios, such as mobile crowdsensing, the diversity in worker capabilities and costs renders the supply and demand matching into a complex problem characterized by multiple layers of workers possessing distinct attributes. The dynamic nature of incoming task requests requires the continual reallocation of these workers, thereby introducing a time-dependent overhead. In this paper, we introduce a framework, called the Generative Diffusion Model with Duality Guidance, termed Guid, to address the intricate multi-layer scheduling problem. We formalize a time-slotted long-term optimization problem that captures the spatiotemporal dynamics of task requests and worker services, as well as the intricate time-coupled overhead. Our framework employs a generative diffusion model to explore the complex solution space of the problem and generate superior solutions. To effectively manage time coupling, we utilize dual optimization theory to generate time slot-aware information, guiding the generative diffusion model towards solutions that assure long-term performance. We provide a rigorous theoretical analysis demonstrating that our guidance solution ensures a parameterized competitive ratio guarantee relative to the theoretically optimal solution. Our comprehensive experiments further illustrate that the proposed method outperforms benchmark techniques, achieving reduced overhead compared to seven baseline methods.
近年来,零工平台已经成为一种新的范例,它将工人和任务无缝连接起来,同时利用工人的集体智慧、参与和共享资源。传统上,平台在工人同质性的假设下运行,其中服务能力和相关服务成本是相似的。然而,在移动众测等移动计算场景中,工人能力和成本的多样性使得供需匹配成为一个复杂的问题,其特征是具有不同属性的多层工人。传入任务请求的动态特性要求不断地重新分配这些工作者,从而引入了与时间相关的开销。在本文中,我们引入了一个框架,称为生成扩散模型与对偶制导,简称Guid,以解决复杂的多层调度问题。我们形式化了一个时间间隔的长期优化问题,该问题捕获了任务请求和工人服务的时空动态,以及复杂的时间耦合开销。我们的框架采用生成扩散模型来探索问题的复杂解空间并生成优解。为了有效地管理时间耦合,我们利用双重优化理论来生成时隙感知信息,引导生成扩散模型走向确保长期性能的解决方案。我们提供了一个严格的理论分析,证明我们的指导方案确保了相对于理论最优方案的参数化竞争比保证。我们的综合实验进一步表明,所提出的方法优于基准技术,与七个基线方法相比,实现了更低的开销。
{"title":"Multi-Layer Scheduling in Gig Platforms Using a Generative Diffusion Model With Duality Guidance","authors":"Xinyu Lu;Zhanbo Feng;Jiong Lou;Chentao Wu;Guangtao Xue;Wei Zhao;Jie Li","doi":"10.1109/TMC.2025.3613450","DOIUrl":"https://doi.org/10.1109/TMC.2025.3613450","url":null,"abstract":"In recent years, gig platforms have emerged as a new paradigm, seamlessly connecting workers and tasks while leveraging workers’ collective intelligence, participation, and shared resources. Traditionally, platforms have operated under the assumption of worker homogeneity, where service capabilities and associated service costs are similar. However, in mobile computing scenarios, such as mobile crowdsensing, the diversity in worker capabilities and costs renders the supply and demand matching into a complex problem characterized by multiple layers of workers possessing distinct attributes. The dynamic nature of incoming task requests requires the continual reallocation of these workers, thereby introducing a time-dependent overhead. In this paper, we introduce a framework, called the <italic><u>G</u>enerative Diffusion Model with Duality G<u>uid</u>ance</i>, termed <italic>Guid</i>, to address the intricate multi-layer scheduling problem. We formalize a time-slotted long-term optimization problem that captures the spatiotemporal dynamics of task requests and worker services, as well as the intricate time-coupled overhead. Our framework employs a generative diffusion model to explore the complex solution space of the problem and generate superior solutions. To effectively manage time coupling, we utilize dual optimization theory to generate time slot-aware information, guiding the generative diffusion model towards solutions that assure long-term performance. We provide a rigorous theoretical analysis demonstrating that our guidance solution ensures a parameterized competitive ratio guarantee relative to the theoretically optimal solution. Our comprehensive experiments further illustrate that the proposed method outperforms benchmark techniques, achieving reduced overhead compared to seven baseline methods.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"25 2","pages":"2927-2940"},"PeriodicalIF":9.2,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Mobile Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1