首页 > 最新文献

IEEE Transactions on Sustainable Computing最新文献

英文 中文
2025 Reviewers List* 2025审稿人名单*
IF 3.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2026-02-04 DOI: 10.1109/TSUSC.2026.3653582
{"title":"2025 Reviewers List*","authors":"","doi":"10.1109/TSUSC.2026.3653582","DOIUrl":"https://doi.org/10.1109/TSUSC.2026.3653582","url":null,"abstract":"","PeriodicalId":13268,"journal":{"name":"IEEE Transactions on Sustainable Computing","volume":"11 1","pages":"72-75"},"PeriodicalIF":3.9,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11372599","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Formal Modeling and Analysis of Small-Scale Data Centers Integrating Renewable Energy Using Timed Automata 利用时间自动机集成可再生能源的小型数据中心形式化建模与分析
IF 3.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-12-16 DOI: 10.1109/TSUSC.2025.3645150
Ismael Samaye;Gilles Sassatelli;Abdoulaye Gamatié
Integrating renewable energy into data centers is essential for reducing reliance on fossil fuels and minimize the environmental impact of digital infrastructures. However, the variability and unpredictability of renewable sources come with significant design and operational challenges. This paper introduces a formal modeling framework for solar-powered small-scale data centers, using stochastic timed automata and statistical model checking for mathematical analysis. The solution supports efficient resource sizing, reduces grid energy consumption through optimized workload scheduling and server renewal strategies. It enables robustness evaluation under component failure scenarios. A case study demonstrates the applicability, flexibility, and scalability of the framework for distributed system topologies and energy-aware design exploration.
将可再生能源整合到数据中心对于减少对化石燃料的依赖和最大限度地减少数字基础设施对环境的影响至关重要。然而,可再生能源的可变性和不可预测性带来了重大的设计和运营挑战。本文介绍了太阳能小型数据中心的形式化建模框架,采用随机时间自动机和统计模型检验进行数学分析。该解决方案支持高效的资源调整,通过优化工作负载调度和服务器更新策略减少网格能耗。它支持在组件故障场景下进行鲁棒性评估。案例研究展示了该框架在分布式系统拓扑和能源感知设计探索方面的适用性、灵活性和可扩展性。
{"title":"Formal Modeling and Analysis of Small-Scale Data Centers Integrating Renewable Energy Using Timed Automata","authors":"Ismael Samaye;Gilles Sassatelli;Abdoulaye Gamatié","doi":"10.1109/TSUSC.2025.3645150","DOIUrl":"https://doi.org/10.1109/TSUSC.2025.3645150","url":null,"abstract":"Integrating renewable energy into data centers is essential for reducing reliance on fossil fuels and minimize the environmental impact of digital infrastructures. However, the variability and unpredictability of renewable sources come with significant design and operational challenges. This paper introduces a formal modeling framework for solar-powered small-scale data centers, using stochastic timed automata and statistical model checking for mathematical analysis. The solution supports efficient resource sizing, reduces grid energy consumption through optimized workload scheduling and server renewal strategies. It enables robustness evaluation under component failure scenarios. A case study demonstrates the applicability, flexibility, and scalability of the framework for distributed system topologies and energy-aware design exploration.","PeriodicalId":13268,"journal":{"name":"IEEE Transactions on Sustainable Computing","volume":"11 1","pages":"57-71"},"PeriodicalIF":3.9,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Load Balancing in Vehicular Edge Computing Using Deep Reinforcement Learning and Model Compression 基于深度强化学习和模型压缩的车辆边缘计算自适应负载平衡
IF 3.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-12-11 DOI: 10.1109/TSUSC.2025.3643431
Liang Zhao;Jiating Xu;Ammar Hawbani;Zhi Liu;Keping Yu;Yuanguo Bi
In Vehicular Edge Computing (VEC), load imbalances among edge servers, driven by varying traffic densities and computational demands across geographic areas, can lead to significant delays, decreased efficiency, and potential service disruptions, adversely affecting both user experience and system reliability. This study proposes an innovative adaptive load balancing method that integrates deep reinforcement learning with predictive analytics to optimize resource allocation in VEC. The framework comprises a predictive model called ST-ChebNet, enhanced with Chebyshev polynomials in graph convolutional networks for accurate workload forecasting, and an adaptive model compression strategy utilizing knowledge distillation to dynamically adjust compression ratios based on anticipated workloads. Additionally, the integration of this predictive model with the Soft Actor-Critic (SAC) algorithm, termed GC-SAC, effectively combines graph-based predictive insights with reinforcement learning techniques to tailor resource distribution, minimizing computational delays and enhancing system responsiveness. The simulation results show that the GC-SAC algorithm can significantly reduce the average delay and average energy consumption of vehicular tasks, as well as the workload rate of edge servers.
在车辆边缘计算(VEC)中,由不同地理区域的流量密度和计算需求驱动的边缘服务器之间的负载不平衡可能导致严重的延迟、效率降低和潜在的服务中断,从而对用户体验和系统可靠性产生不利影响。本研究提出了一种创新的自适应负载平衡方法,该方法将深度强化学习与预测分析相结合,以优化VEC中的资源分配。该框架包括一个名为ST-ChebNet的预测模型,该模型利用图卷积网络中的Chebyshev多项式进行精确的工作量预测,以及一个利用知识蒸馏的自适应模型压缩策略,根据预期的工作量动态调整压缩比。此外,该预测模型与软行为者-评论家(SAC)算法(称为GC-SAC)的集成,有效地将基于图的预测见解与强化学习技术相结合,以定制资源分配,最大限度地减少计算延迟并提高系统响应能力。仿真结果表明,GC-SAC算法能显著降低车辆任务的平均时延和平均能耗,并能显著降低边缘服务器的负载率。
{"title":"Adaptive Load Balancing in Vehicular Edge Computing Using Deep Reinforcement Learning and Model Compression","authors":"Liang Zhao;Jiating Xu;Ammar Hawbani;Zhi Liu;Keping Yu;Yuanguo Bi","doi":"10.1109/TSUSC.2025.3643431","DOIUrl":"https://doi.org/10.1109/TSUSC.2025.3643431","url":null,"abstract":"In Vehicular Edge Computing (VEC), load imbalances among edge servers, driven by varying traffic densities and computational demands across geographic areas, can lead to significant delays, decreased efficiency, and potential service disruptions, adversely affecting both user experience and system reliability. This study proposes an innovative adaptive load balancing method that integrates deep reinforcement learning with predictive analytics to optimize resource allocation in VEC. The framework comprises a predictive model called ST-ChebNet, enhanced with Chebyshev polynomials in graph convolutional networks for accurate workload forecasting, and an adaptive model compression strategy utilizing knowledge distillation to dynamically adjust compression ratios based on anticipated workloads. Additionally, the integration of this predictive model with the Soft Actor-Critic (SAC) algorithm, termed GC-SAC, effectively combines graph-based predictive insights with reinforcement learning techniques to tailor resource distribution, minimizing computational delays and enhancing system responsiveness. The simulation results show that the GC-SAC algorithm can significantly reduce the average delay and average energy consumption of vehicular tasks, as well as the workload rate of edge servers.","PeriodicalId":13268,"journal":{"name":"IEEE Transactions on Sustainable Computing","volume":"11 1","pages":"42-56"},"PeriodicalIF":3.9,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Overlapping Coalition Formation-Enabled Noncooperative Game-Combined Multi-Agent DRL for UAV-Assisted Resource Allocation 基于重叠联盟的非合作博弈组合多智能体DRL无人机辅助资源分配
IF 3.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-12-10 DOI: 10.1109/TSUSC.2025.3642616
Bing Ai;Guodong Ye;Zijun Wu;Yu Sun
Coalition Formation (CF) game emerges as a pioneering framework for resource allocation in uncrewed aerial vehicles (UAVs) equipped with various types of complementary resources. However, both the overlapping-enabled collaborative CF and inter-coalition competitive behaviors significantly impact the system performance in complex multi-UAV scenarios. In this paper, we propose a Multiple Overlapping Coalitions (MOC) noncooperative game. Specifically, we first establish an optimization model encompassing coupled resource constraints. Subsequently, a task-priority-based incentive mechanism is designed to better motivate participation. To achieve the Nash equilibrium, a two-step solution technique incorporating relaxation and fine-tuning of resource granularity is designed. We propose a MOC noncooperative game-combined Multi-agent Proximal Policy Optimization (MAOPPPO). The simulation results substantiate that our approach outperforms the other five state-of-the-art learning countermeasures in terms of average reward with a gain of up to 4.59% after 800 training episodes. In terms of throughput, the proposed MOC noncooperative game increases by 66.67%, 93.68%, and 11.76% compared with that of CF noncooperative game, non-CF noncooperative game, and consensus-based algorithm, respectively. For total resource contribution, the improvements are 62.99%, 94.59%, and 23.16%, respectively. The energy efficiency enhances by 6.82%, 23.68%, and 4.78% compared to the other three baselines, respectively.
在具有多种互补资源的无人机中,联盟编队博弈作为一种开创性的资源配置框架而出现。然而,在复杂的多无人机场景下,重叠协同CF和联盟间竞争行为都会显著影响系统性能。本文提出了一种多重叠联盟(MOC)非合作对策。具体来说,我们首先建立了一个包含耦合资源约束的优化模型。随后,设计了基于任务优先级的激励机制,以更好地激励参与。为了达到纳什均衡,设计了一种结合资源粒度松弛和微调的两步求解技术。提出了一种基于MOC非合作博弈的多智能体近端策略优化方法。仿真结果表明,在800个训练集后,我们的方法在平均奖励方面优于其他五种最先进的学习对策,增益高达4.59%。在吞吐量方面,所提出的MOC非合作对策比CF非合作对策、非CF非合作对策和基于共识的算法分别提高了66.67%、93.68%和11.76%。总资源贡献率分别提高了62.99%、94.59%和23.16%。与其他三个基准相比,能效分别提高了6.82%、23.68%和4.78%。
{"title":"Overlapping Coalition Formation-Enabled Noncooperative Game-Combined Multi-Agent DRL for UAV-Assisted Resource Allocation","authors":"Bing Ai;Guodong Ye;Zijun Wu;Yu Sun","doi":"10.1109/TSUSC.2025.3642616","DOIUrl":"https://doi.org/10.1109/TSUSC.2025.3642616","url":null,"abstract":"Coalition Formation (CF) game emerges as a pioneering framework for resource allocation in uncrewed aerial vehicles (UAVs) equipped with various types of complementary resources. However, both the overlapping-enabled collaborative CF and inter-coalition competitive behaviors significantly impact the system performance in complex multi-UAV scenarios. In this paper, we propose a Multiple Overlapping Coalitions (MOC) noncooperative game. Specifically, we first establish an optimization model encompassing coupled resource constraints. Subsequently, a task-priority-based incentive mechanism is designed to better motivate participation. To achieve the Nash equilibrium, a two-step solution technique incorporating relaxation and fine-tuning of resource granularity is designed. We propose a MOC noncooperative game-combined Multi-agent Proximal Policy Optimization (MAOPPPO). The simulation results substantiate that our approach outperforms the other five state-of-the-art learning countermeasures in terms of average reward with a gain of up to 4.59% after 800 training episodes. In terms of throughput, the proposed MOC noncooperative game increases by 66.67%, 93.68%, and 11.76% compared with that of CF noncooperative game, non-CF noncooperative game, and consensus-based algorithm, respectively. For total resource contribution, the improvements are 62.99%, 94.59%, and 23.16%, respectively. The energy efficiency enhances by 6.82%, 23.68%, and 4.78% compared to the other three baselines, respectively.","PeriodicalId":13268,"journal":{"name":"IEEE Transactions on Sustainable Computing","volume":"11 1","pages":"29-41"},"PeriodicalIF":3.9,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Energy-Efficient Joint Deployment and Routing for Delay-Sensitive Microservices in Edge Computing 边缘计算中延迟敏感微服务的节能联合部署和路由
IF 3.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-11-17 DOI: 10.1109/TSUSC.2025.3633312
Kai Peng;Hao Wen;Zhiyong Guo;Hanfang Ge;Chao Cai;Bo Zhou;Menglan Hu
Microservice as a promising architecture has been widely employed in edge computing to support sensitive-latency online applications. Unfortunately, the deployment of numerous microservices creates complex invocations and requires frequent communications, which brings significant challenges to service deployment and request routing. Moreover, the strict requirements for low energy consumption and low latency in edge computing further exacerbate these difficulties. In this case, it is crucial to optimize the joint microservices deployment and request routing using a meticulous and energy-efficient approach. However, existing studies often overlook their interdependence and treat them as separate problems. Therefore, we propose a fine-grained approach in this paper to jointly optimize the deployment and request routing of microservices within edge computing scenarios. First, we utilize queuing networks to conduct detailed modeling and mathematical analysis that study the complex invocation relationships, microservice instance sharing, and communication latency. Second, we propose an energy-efficient microservice orchestration algorithm, referred to as Cluster-Processing-based Adaptive Memory Procedure. This algorithm maintains a memory storing elite solution elements, and it iteratively picks up suitable elements from the memory to construct superior solutions. Finally, extensive simulation experiments demonstrate that the proposed algorithm outperforms baseline algorithms significantly in terms of response latency and energy consumption.
微服务作为一种很有前途的体系结构已被广泛应用于边缘计算,以支持敏感延迟的在线应用。不幸的是,大量微服务的部署创建了复杂的调用,并且需要频繁的通信,这给服务部署和请求路由带来了重大挑战。此外,边缘计算对低能耗和低延迟的严格要求进一步加剧了这些困难。在这种情况下,使用细致且节能的方法来优化联合微服务部署和请求路由是至关重要的。然而,现有的研究往往忽视了它们的相互依赖性,将它们视为单独的问题。因此,我们在本文中提出了一种细粒度的方法来共同优化边缘计算场景下微服务的部署和请求路由。首先,我们利用排队网络进行了详细的建模和数学分析,研究了复杂的调用关系、微服务实例共享和通信延迟。其次,我们提出了一种高效的微服务编排算法,称为基于集群处理的自适应内存过程。该算法保持一个存储精英解元素的内存,并迭代地从内存中选取合适的元素来构造优解。最后,大量的仿真实验表明,该算法在响应延迟和能耗方面明显优于基准算法。
{"title":"Energy-Efficient Joint Deployment and Routing for Delay-Sensitive Microservices in Edge Computing","authors":"Kai Peng;Hao Wen;Zhiyong Guo;Hanfang Ge;Chao Cai;Bo Zhou;Menglan Hu","doi":"10.1109/TSUSC.2025.3633312","DOIUrl":"https://doi.org/10.1109/TSUSC.2025.3633312","url":null,"abstract":"Microservice as a promising architecture has been widely employed in edge computing to support sensitive-latency online applications. Unfortunately, the deployment of numerous microservices creates complex invocations and requires frequent communications, which brings significant challenges to service deployment and request routing. Moreover, the strict requirements for low energy consumption and low latency in edge computing further exacerbate these difficulties. In this case, it is crucial to optimize the joint microservices deployment and request routing using a meticulous and energy-efficient approach. However, existing studies often overlook their interdependence and treat them as separate problems. Therefore, we propose a fine-grained approach in this paper to jointly optimize the deployment and request routing of microservices within edge computing scenarios. First, we utilize queuing networks to conduct detailed modeling and mathematical analysis that study the complex invocation relationships, microservice instance sharing, and communication latency. Second, we propose an energy-efficient microservice orchestration algorithm, referred to as Cluster-Processing-based Adaptive Memory Procedure. This algorithm maintains a memory storing elite solution elements, and it iteratively picks up suitable elements from the memory to construct superior solutions. Finally, extensive simulation experiments demonstrate that the proposed algorithm outperforms baseline algorithms significantly in terms of response latency and energy consumption.","PeriodicalId":13268,"journal":{"name":"IEEE Transactions on Sustainable Computing","volume":"11 1","pages":"1-14"},"PeriodicalIF":3.9,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantifying Robustness and Sustainability Trade-Off in Federated Adversarial Learning for Cyber-Physical Systems 量化网络物理系统联邦对抗学习的鲁棒性和可持续性权衡
IF 3.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-11-17 DOI: 10.1109/TSUSC.2025.3633995
Syed Mhamudul Hasan;Ahmed Imteaj;Abdur R. Shahid
Cyber-Physical Systems (CPS) are increasingly leveraging Federated Learning (FL) to enable decentralized intelligence while preserving privacy across distributed devices. Federated adversarial learning (FAL) leverages FL and adversarial training to enhance model robustness against adversarial attacks while maintaining data privacy across decentralized, heterogeneous devices. While FAL strengthens CPS resilience against adversarial threats, variations in energy constraints, carbon emissions, computational capabilities, and latency requirements introduce additional complexity. These variations impact energy consumption, carbon emissions, and power source efficiency, creating a complex trade-off between sustainability and robustness. This underscores the critical need for standardized metrics to systematically evaluate and balance these competing factors in FAL-enabled CPS. In this paper, we propose three novel robustness metrics designed to quantify the interplay between energy efficiency, sustainability dimensions, and adversarial robustness in FAL setups for CPS. The proposed methodology accounts for diverse CPS scenarios, addressing factors such as emissions, energy consumption, latency, renewable energy, and low-energy devices with critical latency needs. We validate our approach through simulations in two setups, including a single-device environment to isolate device variability and a heterogeneous multi-device environment to evaluate architectural impacts. The results demonstrate the effectiveness of our proposed metrics in systemically quantifying the trade-off between sustainability and robustness in FAL-based CPS.
网络物理系统(CPS)越来越多地利用联邦学习(FL)来实现分散的智能,同时保护分布式设备之间的隐私。联邦对抗性学习(FAL)利用FL和对抗性训练来增强模型对对抗性攻击的鲁棒性,同时在分散的异构设备上维护数据隐私。虽然FAL增强了CPS抵御敌对威胁的弹性,但能源限制、碳排放、计算能力和延迟需求的变化带来了额外的复杂性。这些变化会影响能源消耗、碳排放和电源效率,在可持续性和稳健性之间产生复杂的权衡。这强调了对标准化指标的迫切需要,以便系统地评估和平衡fal支持的CPS中的这些竞争因素。在本文中,我们提出了三个新的鲁棒性指标,旨在量化CPS FAL设置中能源效率,可持续性维度和对抗鲁棒性之间的相互作用。提出的方法考虑了不同的CPS场景,解决了诸如排放、能耗、延迟、可再生能源和具有关键延迟需求的低能耗设备等因素。我们通过两种设置中的模拟来验证我们的方法,包括单设备环境以隔离设备可变性和异构多设备环境以评估架构影响。结果表明,我们提出的指标在系统量化基于fal的CPS的可持续性和稳健性之间的权衡方面是有效的。
{"title":"Quantifying Robustness and Sustainability Trade-Off in Federated Adversarial Learning for Cyber-Physical Systems","authors":"Syed Mhamudul Hasan;Ahmed Imteaj;Abdur R. Shahid","doi":"10.1109/TSUSC.2025.3633995","DOIUrl":"https://doi.org/10.1109/TSUSC.2025.3633995","url":null,"abstract":"Cyber-Physical Systems (CPS) are increasingly leveraging Federated Learning (FL) to enable decentralized intelligence while preserving privacy across distributed devices. Federated adversarial learning (FAL) leverages FL and adversarial training to enhance model robustness against adversarial attacks while maintaining data privacy across decentralized, heterogeneous devices. While FAL strengthens CPS resilience against adversarial threats, variations in energy constraints, carbon emissions, computational capabilities, and latency requirements introduce additional complexity. These variations impact energy consumption, carbon emissions, and power source efficiency, creating a complex trade-off between sustainability and robustness. This underscores the critical need for standardized metrics to systematically evaluate and balance these competing factors in FAL-enabled CPS. In this paper, we propose three novel robustness metrics designed to quantify the interplay between energy efficiency, sustainability dimensions, and adversarial robustness in FAL setups for CPS. The proposed methodology accounts for diverse CPS scenarios, addressing factors such as emissions, energy consumption, latency, renewable energy, and low-energy devices with critical latency needs. We validate our approach through simulations in two setups, including a single-device environment to isolate device variability and a heterogeneous multi-device environment to evaluate architectural impacts. The results demonstrate the effectiveness of our proposed metrics in systemically quantifying the trade-off between sustainability and robustness in FAL-based CPS.","PeriodicalId":13268,"journal":{"name":"IEEE Transactions on Sustainable Computing","volume":"11 1","pages":"15-28"},"PeriodicalIF":3.9,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146116923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Energy Efficiency of Graph Processing on Shared-Memory Systems 提高共享内存系统中图形处理的能效
IF 3.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-11-13 DOI: 10.1109/TSUSC.2025.3632842
Tao Jiang;Le Luo;Chao Li;Jinyang Guo;Sheng Xu
With the number of cores increasing in shared-memory systems, the energy consumption of parallel computing on them is becoming increasingly prominent. Currently, researchers concern with the performance optimization, while ignoring the energy efficiency of graph processing. Meanwhile, existing works that optimize energy efficiency involve mainly the general benchmarks by using dynamic voltage and frequency scaling and thread throttling methods. However, these methods cannot be directly transplanted to graph processing, because most graph algorithms converge in fewer iterations and traditional energy efficiency optimization methods are not applicable to them and will produce much overhead, resulting in the fact that the loss outweighs the gain. And some energy-saving methods estimate the subsequent CPU frequency based on the run-time system state, which leads to an inaccurate prediction of the optimal energy-saving CPU frequency. In view of the above issues, we propose a pre-allocated thread throttling method and a static frequency scaling method. The former achieves thread throttling by establishing a pre-allocated scheduling method, which calculates the optimal energy-saving number of threads promptly when the graph is loaded; On this basis, in order to reduce the cost of dynamic frequency setting at runtime and improve the energy efficiency further, the latter introduces the static frequency scaling method to reduce the execution speed of some tasks by relaxing thread execution time. The experimental results show that the pre-allocated thread throttling method improves the energy efficiency by about 10% compared to the original framework, and the static frequency scaling method further improves it by about 20% with trivial performance loss.
随着共享内存系统核数的不断增加,其上并行计算的能耗问题日益突出。目前,研究人员关注的是性能优化,而忽视了图处理的能量效率。同时,现有的能效优化工作主要是通过动态电压、频率缩放和线程节流等方法进行一般基准测试。然而,这些方法不能直接移植到图处理中,因为大多数图算法迭代次数较少,传统的能效优化方法不适用于图处理,并且会产生很大的开销,导致得不偿失。一些节能方法根据系统运行时的状态来估计后续的CPU频率,导致对最优节能CPU频率的预测不准确。针对上述问题,我们提出了一种预分配的线程节流方法和一种静态频率缩放方法。前者通过建立预分配调度方法实现线程节流,在加载图时及时计算出最优的节能线程数;在此基础上,为了降低运行时动态频率设置的成本,进一步提高能效,后者引入了静态频率缩放方法,通过放松线程执行时间来降低部分任务的执行速度。实验结果表明,预分配线程节流方法比原框架的能量效率提高了约10%,静态频率缩放方法在性能损失不大的情况下进一步提高了约20%。
{"title":"Improving Energy Efficiency of Graph Processing on Shared-Memory Systems","authors":"Tao Jiang;Le Luo;Chao Li;Jinyang Guo;Sheng Xu","doi":"10.1109/TSUSC.2025.3632842","DOIUrl":"https://doi.org/10.1109/TSUSC.2025.3632842","url":null,"abstract":"With the number of cores increasing in shared-memory systems, the energy consumption of parallel computing on them is becoming increasingly prominent. Currently, researchers concern with the performance optimization, while ignoring the energy efficiency of graph processing. Meanwhile, existing works that optimize energy efficiency involve mainly the general benchmarks by using dynamic voltage and frequency scaling and thread throttling methods. However, these methods cannot be directly transplanted to graph processing, because most graph algorithms converge in fewer iterations and traditional energy efficiency optimization methods are not applicable to them and will produce much overhead, resulting in the fact that the loss outweighs the gain. And some energy-saving methods estimate the subsequent CPU frequency based on the run-time system state, which leads to an inaccurate prediction of the optimal energy-saving CPU frequency. In view of the above issues, we propose a pre-allocated thread throttling method and a static frequency scaling method. The former achieves thread throttling by establishing a pre-allocated scheduling method, which calculates the optimal energy-saving number of threads promptly when the graph is loaded; On this basis, in order to reduce the cost of dynamic frequency setting at runtime and improve the energy efficiency further, the latter introduces the static frequency scaling method to reduce the execution speed of some tasks by relaxing thread execution time. The experimental results show that the pre-allocated thread throttling method improves the energy efficiency by about 10% compared to the original framework, and the static frequency scaling method further improves it by about 20% with trivial performance loss.","PeriodicalId":13268,"journal":{"name":"IEEE Transactions on Sustainable Computing","volume":"10 6","pages":"1449-1460"},"PeriodicalIF":3.9,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145712560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FedFusionQuant (FFQ): Federated Learning With Feature Fusion and Model Quantisation for Human Activity Recognition Using CSI FedFusionQuant (FFQ):基于CSI的人类活动识别特征融合和模型量化的联邦学习
IF 3.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-10-31 DOI: 10.1109/TSUSC.2025.3627484
Ahsan Raza Khan;Rao Naveed Bin Rais;Sarmad Sohaib;Sajjad Hussain;Ahmed Zoha
Human Activity Recognition (HAR) using Channel State Information (CSI) enables energy-efficient and non-invasive healthcare monitoring. However, conventional HAR systems rely on centralised model training, which requires the sharing of raw data, leading to privacy risks, excessive bandwidth usage, and high communication latency that limit scalability. This paper proposes FedFusionQuant (FFQ), a federated learning (FL) framework that jointly performs feature fusion, adaptive aggregation, and quantisation-aware compression during training. A novel federated distance (FedDist) mechanism dynamically adjusts parameter updates using neuron dissimilarity metrics, enhancing generalisation across heterogeneous clients. Meanwhile, quantisation-aware training (QAT) reduces model size and transmission cost while preserving accuracy. Extensive experiments on real CSI data from 30 participants demonstrate that FFQ improves multi-class HAR accuracy by 4.29% and binary fall detection by 5.55% compared to raw fusion models. Furthermore, model compression with QAT achieves a 47% reduction in communication overhead while maintaining accuracy comparable to state-of-the-art methods.
使用通道状态信息(CSI)的人类活动识别(HAR)可实现节能且无创的医疗保健监控。然而,传统的HAR系统依赖于集中的模型训练,这需要共享原始数据,从而导致隐私风险、过度的带宽使用和限制可扩展性的高通信延迟。本文提出了FedFusionQuant (FFQ),这是一个联邦学习(FL)框架,在训练过程中联合执行特征融合、自适应聚合和量化感知压缩。一种新的联邦距离(FedDist)机制使用神经元不相似度指标动态调整参数更新,增强了跨异构客户端的泛化。同时,定量感知训练(QAT)在保持准确性的同时,减少了模型尺寸和传输成本。在30个参与者的真实CSI数据上进行的大量实验表明,与原始融合模型相比,FFQ将多类别HAR的准确率提高了4.29%,将二值跌落检测提高了5.55%。此外,使用QAT的模型压缩可以减少47%的通信开销,同时保持与最先进方法相当的准确性。
{"title":"FedFusionQuant (FFQ): Federated Learning With Feature Fusion and Model Quantisation for Human Activity Recognition Using CSI","authors":"Ahsan Raza Khan;Rao Naveed Bin Rais;Sarmad Sohaib;Sajjad Hussain;Ahmed Zoha","doi":"10.1109/TSUSC.2025.3627484","DOIUrl":"https://doi.org/10.1109/TSUSC.2025.3627484","url":null,"abstract":"Human Activity Recognition (HAR) using Channel State Information (CSI) enables energy-efficient and non-invasive healthcare monitoring. However, conventional HAR systems rely on centralised model training, which requires the sharing of raw data, leading to privacy risks, excessive bandwidth usage, and high communication latency that limit scalability. This paper proposes <bold>FedFusionQuant (FFQ)</b>, a federated learning (FL) framework that jointly performs feature fusion, adaptive aggregation, and quantisation-aware compression during training. A novel <bold>federated distance (FedDist)</b> mechanism dynamically adjusts parameter updates using neuron dissimilarity metrics, enhancing generalisation across heterogeneous clients. Meanwhile, <bold>quantisation-aware training (QAT)</b> reduces model size and transmission cost while preserving accuracy. Extensive experiments on real CSI data from 30 participants demonstrate that FFQ improves multi-class HAR accuracy by <bold>4.29%</b> and binary fall detection by <bold>5.55%</b> compared to raw fusion models. Furthermore, model compression with QAT achieves a <bold>47% reduction in communication overhead</b> while maintaining accuracy comparable to state-of-the-art methods.","PeriodicalId":13268,"journal":{"name":"IEEE Transactions on Sustainable Computing","volume":"10 6","pages":"1421-1434"},"PeriodicalIF":3.9,"publicationDate":"2025-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145712547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OSPDP: One-Sided Personalized Differential Privacy OSPDP:单边个性化差异隐私
IF 3.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-10-31 DOI: 10.1109/TSUSC.2025.3626773
Jiajun Chen;Chunqiang Hu;Huijun Zhuang;Ruifeng Zhao;Jiguo Yu
Differential privacy has received considerable attention as a privacy concept for releasing statistical information from datasets. While differential privacy provides strict statistical guarantees, it is equally crucial to investigate how these guarantees interact with individual privacy preferences and privacy policies. Existing solutions, such as one-sided differential privacy, treat all sensitive records equally in terms of privacy protection, although datasets can be classified based on predetermined privacy policies that differentiate between sensitive and insensitive records. In this paper, we present a novel concept of privacy termed One-sided Personalized Differential Privacy (OSPDP), offering verifiable privacy assurances at the user level for sensitive records derived from privacy policies. Specifically, OSPDP enables data owners to articulate their privacy needs more flexibly, avoiding a one-size-fits-all approach to privacy protection and potentially establishing a dichotomous privacy policy regarding the sensitivity of records. Furthermore, the truthful release or legitimate disclosure of non-sensitive records reduces unnecessary privacy consumption and can be utilized to significantly enhance data utility. Additionally, we present several well-performing mechanisms for achieving OSPDP. Finally, we evaluate and analyze the trade-off between privacy and utility of the proposed mechanisms through extensive experiments.
差分隐私作为一种从数据集中发布统计信息的隐私概念受到了相当大的关注。虽然差异隐私提供了严格的统计保证,但研究这些保证如何与个人隐私偏好和隐私政策相互作用同样至关重要。现有的解决方案,如片面差分隐私,在隐私保护方面平等对待所有敏感记录,尽管数据集可以根据预先确定的隐私策略进行分类,以区分敏感记录和不敏感记录。在本文中,我们提出了一种新的隐私概念,称为片面个性化差异隐私(OSPDP),为来自隐私策略的敏感记录在用户层面提供可验证的隐私保证。特别是,OSPDP使数据所有者能够更灵活地表达他们的隐私需求,避免了一刀切的隐私保护方法,并可能建立关于记录敏感性的二分隐私策略。此外,非敏感记录的真实发布或合法披露减少了不必要的隐私消耗,并可用于显着提高数据效用。此外,我们提出了几个实现OSPDP的良好机制。最后,我们通过广泛的实验来评估和分析所提出机制的隐私性和实用性之间的权衡。
{"title":"OSPDP: One-Sided Personalized Differential Privacy","authors":"Jiajun Chen;Chunqiang Hu;Huijun Zhuang;Ruifeng Zhao;Jiguo Yu","doi":"10.1109/TSUSC.2025.3626773","DOIUrl":"https://doi.org/10.1109/TSUSC.2025.3626773","url":null,"abstract":"Differential privacy has received considerable attention as a privacy concept for releasing statistical information from datasets. While differential privacy provides strict statistical guarantees, it is equally crucial to investigate how these guarantees interact with individual privacy preferences and privacy policies. Existing solutions, such as one-sided differential privacy, treat all sensitive records equally in terms of privacy protection, although datasets can be classified based on predetermined privacy policies that differentiate between sensitive and insensitive records. In this paper, we present a novel concept of privacy termed One-sided Personalized Differential Privacy (OSPDP), offering verifiable privacy assurances at the user level for sensitive records derived from privacy policies. Specifically, OSPDP enables data owners to articulate their privacy needs more flexibly, avoiding a one-size-fits-all approach to privacy protection and potentially establishing a dichotomous privacy policy regarding the sensitivity of records. Furthermore, the truthful release or legitimate disclosure of non-sensitive records reduces unnecessary privacy consumption and can be utilized to significantly enhance data utility. Additionally, we present several well-performing mechanisms for achieving OSPDP. Finally, we evaluate and analyze the trade-off between privacy and utility of the proposed mechanisms through extensive experiments.","PeriodicalId":13268,"journal":{"name":"IEEE Transactions on Sustainable Computing","volume":"10 6","pages":"1435-1448"},"PeriodicalIF":3.9,"publicationDate":"2025-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145712554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FedUP: Federated Unlearning With Prototypes 厌倦:通过原型联合学习
IF 3.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-19 DOI: 10.1109/TSUSC.2025.3612138
Yuhong Huang;Xue Li;Songle Chen;Siguang Chen
As an extension of machine unlearning in distributed scenarios, federated unlearning gains significant attention. However, federated unlearning remains challenging, as many studies require additional resources, such as auxiliary dataset or storage, to achieve high-quality models. These requirements incur extra costs and are often difficult to satisfy in practical applications. To address these issues, we propose a flexible client-level federated unlearning algorithm with prototypes, called FedUP. Specifically, our algorithm consists of two components: prototype-based unlearning and model recovering. First, we design a prototype-based unlearning strategy that uses prototypes of the erased client to guide the unlearning process, and maximizes the prototype loss between the remaining and erased clients to unlearn the information. It does not rely on historical storage updates or additional standard datasets, making the unlearning process more streamlined. To mitigate performance degradation from the unlearning process, we develop a brief model recovering approach guided by global prototypes to swiftly and efficiently restore models’ accuracy on the remaining datasets. Unlike other unlearning algorithms, our approach exchanges prototypes instead of model parameters, significantly reducing communication overhead. Finally, we empirically evaluate the proposed algorithm from multiple perspectives on two datasets, demonstrating that our algorithm can achieve high-quality unlearned models with minimal communication cost.
作为机器学习在分布式场景下的延伸,联合学习得到了广泛的关注。然而,联合学习仍然具有挑战性,因为许多研究需要额外的资源,如辅助数据集或存储,以获得高质量的模型。这些要求会产生额外的成本,并且在实际应用中往往难以满足。为了解决这些问题,我们提出了一种灵活的带有原型的客户端级联合学习算法,称为FedUP。具体来说,我们的算法包括两个部分:基于原型的学习和模型恢复。首先,我们设计了一种基于原型的遗忘策略,该策略使用被擦除客户端的原型来指导遗忘过程,并最大化剩余和被擦除客户端之间的原型损失来遗忘信息。它不依赖于历史存储更新或额外的标准数据集,使遗忘过程更加简化。为了减轻由于学习过程导致的性能下降,我们开发了一种由全局原型指导的简单模型恢复方法,以快速有效地恢复模型在剩余数据集上的准确性。与其他的学习算法不同,我们的方法交换原型而不是模型参数,显著减少了通信开销。最后,我们在两个数据集上从多个角度对所提出的算法进行了实证评估,表明我们的算法可以以最小的通信成本获得高质量的非学习模型。
{"title":"FedUP: Federated Unlearning With Prototypes","authors":"Yuhong Huang;Xue Li;Songle Chen;Siguang Chen","doi":"10.1109/TSUSC.2025.3612138","DOIUrl":"https://doi.org/10.1109/TSUSC.2025.3612138","url":null,"abstract":"As an extension of machine unlearning in distributed scenarios, federated unlearning gains significant attention. However, federated unlearning remains challenging, as many studies require additional resources, such as auxiliary dataset or storage, to achieve high-quality models. These requirements incur extra costs and are often difficult to satisfy in practical applications. To address these issues, we propose a flexible client-level federated unlearning algorithm with prototypes, called FedUP. Specifically, our algorithm consists of two components: prototype-based unlearning and model recovering. First, we design a prototype-based unlearning strategy that uses prototypes of the erased client to guide the unlearning process, and maximizes the prototype loss between the remaining and erased clients to unlearn the information. It does not rely on historical storage updates or additional standard datasets, making the unlearning process more streamlined. To mitigate performance degradation from the unlearning process, we develop a brief model recovering approach guided by global prototypes to swiftly and efficiently restore models’ accuracy on the remaining datasets. Unlike other unlearning algorithms, our approach exchanges prototypes instead of model parameters, significantly reducing communication overhead. Finally, we empirically evaluate the proposed algorithm from multiple perspectives on two datasets, demonstrating that our algorithm can achieve high-quality unlearned models with minimal communication cost.","PeriodicalId":13268,"journal":{"name":"IEEE Transactions on Sustainable Computing","volume":"10 6","pages":"1461-1467"},"PeriodicalIF":3.9,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145712546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Sustainable Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1