首页 > 最新文献

Journal of Systems Architecture最新文献

英文 中文
Fault tolerance DDS communications in Time-Sensitive Networking using IEEE 802.1CB redundancy mechanisms 基于IEEE 802.1CB冗余机制的时间敏感网络中的容错DDS通信
IF 4.1 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2026-06-01 Epub Date: 2026-03-10 DOI: 10.1016/j.sysarc.2026.103756
Binqi Li, Yuan Zhu, Yingqi Zhao, Ke Cui, Xu Zhong, Ke Lu
The integration of the Data Distribution Service (DDS) with Time-Sensitive Networking (TSN) enables deterministic communication for distributed cyber–physical systems. However, existing studies mainly address real-time performance, with limited attention to reliability. This paper presents an Software-Defined Networking (SDN)-based DDS–TSN integration framework incorporating the IEEE 802.1CB Frame Replication and Elimination for Reliability (FRER) mechanism. The framework automatically extracts redundancy requirements from DDS publishers and applies a multicast transformation mechanism for CB flow identification. A heuristic redundancy-aware routing and scheduling (Ra-RaS) algorithm is developed to jointly optimize redundant path selection and time-aware scheduling, reducing path length deviation and improving load balance. Simulation results show that the proposed Ra-RaS algorithm maintains high schedulability and scalability even under high traffic load conditions. Physical experiments further demonstrate that the integrated DDS–TSN framework ensures deterministic and fault-tolerant communication by leveraging the joint operation of FRER and the Time-Aware Shaper (TAS), significantly enhancing the reliability of DDS under link failure and packet loss.
数据分发服务(DDS)与时间敏感网络(TSN)的集成为分布式网络物理系统提供了确定性通信。然而,现有的研究主要关注实时性能,对可靠性的关注有限。本文提出了一种基于软件定义网络(SDN)的DDS-TSN集成框架,该框架结合了IEEE 802.1CB帧复制和消除可靠性(FRER)机制。该框架自动从DDS发布者中提取冗余需求,并应用组播转换机制进行CB流识别。提出了一种启发式冗余感知路由调度(Ra-RaS)算法,用于联合优化冗余路径选择和时间感知调度,减少路径长度偏差,提高负载均衡性。仿真结果表明,即使在高流量负载条件下,Ra-RaS算法也能保持较高的可调度性和可扩展性。物理实验进一步证明,集成DDS - tsn框架利用FRER和时间感知整形器(Time-Aware Shaper, TAS)的联合操作,保证了通信的确定性和容错,显著提高了DDS在链路故障和丢包情况下的可靠性。
{"title":"Fault tolerance DDS communications in Time-Sensitive Networking using IEEE 802.1CB redundancy mechanisms","authors":"Binqi Li,&nbsp;Yuan Zhu,&nbsp;Yingqi Zhao,&nbsp;Ke Cui,&nbsp;Xu Zhong,&nbsp;Ke Lu","doi":"10.1016/j.sysarc.2026.103756","DOIUrl":"10.1016/j.sysarc.2026.103756","url":null,"abstract":"<div><div>The integration of the Data Distribution Service (DDS) with Time-Sensitive Networking (TSN) enables deterministic communication for distributed cyber–physical systems. However, existing studies mainly address real-time performance, with limited attention to reliability. This paper presents an Software-Defined Networking (SDN)-based DDS–TSN integration framework incorporating the IEEE 802.1CB Frame Replication and Elimination for Reliability (FRER) mechanism. The framework automatically extracts redundancy requirements from DDS publishers and applies a multicast transformation mechanism for CB flow identification. A heuristic redundancy-aware routing and scheduling (Ra-RaS) algorithm is developed to jointly optimize redundant path selection and time-aware scheduling, reducing path length deviation and improving load balance. Simulation results show that the proposed Ra-RaS algorithm maintains high schedulability and scalability even under high traffic load conditions. Physical experiments further demonstrate that the integrated DDS–TSN framework ensures deterministic and fault-tolerant communication by leveraging the joint operation of FRER and the Time-Aware Shaper (TAS), significantly enhancing the reliability of DDS under link failure and packet loss.</div></div>","PeriodicalId":50027,"journal":{"name":"Journal of Systems Architecture","volume":"175 ","pages":"Article 103756"},"PeriodicalIF":4.1,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147386522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating an open-source soft-GPU overlay with RISC-V control and high-bandwidth memory 集成开源软gpu覆盖与RISC-V控制和高带宽内存
IF 4.1 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2026-06-01 Epub Date: 2026-02-23 DOI: 10.1016/j.sysarc.2026.103736
Hector Gerardo Muñoz Hernandez , Mahdi Taheri , Muhammad Ali , Keyvan Shahin , Alireza Syavashi , Diana Göhringer , Marc Reichenbach , Christian Herglotz , Michael Hübner
Image and signal processing workloads are widely deployed on Graphics Processing Units (GPUs) for high throughput and on Field-Programmable Gate Arrays (FPGAs) for hardware specialization and energy efficiency. Soft GPU overlays on FPGAs aim to combine these advantages, yet existing solutions often depend on fixed hard processors or impose platform constraints that limit portability. This work extends a popular open-source soft GPGPU overlay to integrate a soft RISC-V control plane and enable compatibility with High-Bandwidth Memory (HBM2). The resulting system can be instantiated on FPGA boards without a hard ARM processor, improving portability, simplifying system integration, and broadening deployability. Across representative image and signal processing kernels, the soft GPGPU achieves geometric-mean speedups of 114.60× over a scalar soft RISC-V core and 19.72× over a hard ARM core, demonstrating substantial performance benefits while retaining FPGA reconfigurability. HBM2 integration further benefits bandwidth-sensitive workloads by increasing sustained throughput and reducing the performance bottlenecks associated with off-chip memory access. Collectively, these results indicate that GPU-like programmability and performance can be delivered on reconfigurable platforms without reliance on hard CPU subsystems, providing a portable and scalable foundation for embedded vision and DSP acceleration.
图像和信号处理工作负载广泛部署在图形处理单元(gpu)上,以实现高吞吐量,并部署在现场可编程门阵列(fpga)上,以实现硬件专业化和能源效率。fpga上的软GPU覆盖旨在结合这些优势,但现有的解决方案通常依赖于固定的硬处理器或施加限制可移植性的平台约束。这项工作扩展了一个流行的开源软GPGPU覆盖层,以集成软RISC-V控制平面,并使其与高带宽内存(HBM2)兼容。由此产生的系统可以在没有硬ARM处理器的FPGA板上实例化,提高了可移植性,简化了系统集成,扩大了可部署性。在代表性的图像和信号处理内核中,软GPGPU比标量软RISC-V内核实现了114.60倍的几何平均速度,比硬ARM内核实现了19.72倍的几何平均速度,在保留FPGA可重构性的同时展示了可观的性能优势。HBM2集成通过增加持续吞吐量和减少与片外内存访问相关的性能瓶颈,进一步有利于带宽敏感的工作负载。总之,这些结果表明,类似gpu的可编程性和性能可以在可重构平台上实现,而不依赖于硬CPU子系统,为嵌入式视觉和DSP加速提供了可移植和可扩展的基础。
{"title":"Integrating an open-source soft-GPU overlay with RISC-V control and high-bandwidth memory","authors":"Hector Gerardo Muñoz Hernandez ,&nbsp;Mahdi Taheri ,&nbsp;Muhammad Ali ,&nbsp;Keyvan Shahin ,&nbsp;Alireza Syavashi ,&nbsp;Diana Göhringer ,&nbsp;Marc Reichenbach ,&nbsp;Christian Herglotz ,&nbsp;Michael Hübner","doi":"10.1016/j.sysarc.2026.103736","DOIUrl":"10.1016/j.sysarc.2026.103736","url":null,"abstract":"<div><div>Image and signal processing workloads are widely deployed on Graphics Processing Units (GPUs) for high throughput and on Field-Programmable Gate Arrays (FPGAs) for hardware specialization and energy efficiency. Soft GPU overlays on FPGAs aim to combine these advantages, yet existing solutions often depend on fixed hard processors or impose platform constraints that limit portability. This work extends a popular open-source soft GPGPU overlay to integrate a soft RISC-V control plane and enable compatibility with High-Bandwidth Memory (HBM2). The resulting system can be instantiated on FPGA boards without a hard ARM processor, improving portability, simplifying system integration, and broadening deployability. Across representative image and signal processing kernels, the soft GPGPU achieves geometric-mean speedups of 114.60<span><math><mo>×</mo></math></span> over a scalar soft RISC-V core and 19.72<span><math><mo>×</mo></math></span> over a hard ARM core, demonstrating substantial performance benefits while retaining FPGA reconfigurability. HBM2 integration further benefits bandwidth-sensitive workloads by increasing sustained throughput and reducing the performance bottlenecks associated with off-chip memory access. Collectively, these results indicate that GPU-like programmability and performance can be delivered on reconfigurable platforms without reliance on hard CPU subsystems, providing a portable and scalable foundation for embedded vision and DSP acceleration.</div></div>","PeriodicalId":50027,"journal":{"name":"Journal of Systems Architecture","volume":"175 ","pages":"Article 103736"},"PeriodicalIF":4.1,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147386650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A two-layer system architecture for Unmanned Aerial Vehicle (UAV)-assisted offloading: Game-theoretic allocation and CTDE-MADDPG control 无人机辅助卸载的两层系统架构:博弈论分配和CTDE-MADDPG控制
IF 4.1 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2026-05-01 Epub Date: 2026-02-11 DOI: 10.1016/j.sysarc.2026.103734
Yonglin Zhou, Shi Wang, Qiu Fang, Yaonan Wang
UAV-assisted offloading presents an effective solution for areas with limited infrastructure or temporary congestion. Yet jointly optimising task allocation and trajectory control is difficult due to heterogeneous GU demands and UAV energy/kinematic limits. We propose a two-layer architecture that decouples discrete allocation from continuous control. In the allocation layer, a responder-only external incentive gates accept/reject decisions in an auction–predation–trade scheme, without altering the potential, learning targets, or metrics. This mechanism induces monotone ascent of the potential and terminates in finitely many steps at a pairwise-stable (Nash) allocation. The control layer adopts centralised training with decentralised execution (MADDPG) with a centralised critic and task-specific actor heads to generate 3D trajectories subject to flight and safety constraints. This partition confines the combinatorial complexity to the outer allocation, resulting in a lighter on-board inference than a monolithic policy that jointly handles both allocation and planning. In a 3-UAV/7-GU simulator with obstacles and heterogeneous task sizes, the proposed method achieves higher global reward (2.02 vs 1.91) and shorter completion time (92 s vs 109 s) than a reproduced game-theoretic baseline. Compared with end-to-end MADDPG without prior allocation, it also converges faster and more stably.
无人机辅助卸载为基础设施有限或暂时拥堵的地区提供了有效的解决方案。然而,由于GU需求的异质性和无人机能量/运动学的限制,联合优化任务分配和轨迹控制是困难的。我们提出了一个两层架构,将离散分配与连续控制解耦。在分配层,只有应答者的外部激励门接受/拒绝拍卖-掠夺-交易方案中的决策,而不改变潜力、学习目标或指标。这种机制导致电位单调上升,并在一对稳定(纳什)分配的有限多个步骤中终止。控制层采用集中训练与分散执行(madpg),具有集中的评论家和特定任务的演员头部,以生成受飞行和安全约束的3D轨迹。该分区将组合复杂性限制在外部分配,与联合处理分配和规划的整体策略相比,产生了更轻的板上推理。在具有障碍和异构任务大小的3-UAV/7-GU模拟器中,与复制的博弈论基线相比,该方法获得了更高的全局奖励(2.02 vs 1.91)和更短的完成时间(92 s vs 109 s)。与没有预先分配的端到端MADDPG相比,它的收敛速度更快、更稳定。
{"title":"A two-layer system architecture for Unmanned Aerial Vehicle (UAV)-assisted offloading: Game-theoretic allocation and CTDE-MADDPG control","authors":"Yonglin Zhou,&nbsp;Shi Wang,&nbsp;Qiu Fang,&nbsp;Yaonan Wang","doi":"10.1016/j.sysarc.2026.103734","DOIUrl":"10.1016/j.sysarc.2026.103734","url":null,"abstract":"<div><div>UAV-assisted offloading presents an effective solution for areas with limited infrastructure or temporary congestion. Yet jointly optimising task allocation and trajectory control is difficult due to heterogeneous GU demands and UAV energy/kinematic limits. We propose a two-layer architecture that decouples discrete allocation from continuous control. In the allocation layer, a responder-only external incentive gates accept/reject decisions in an auction–predation–trade scheme, without altering the potential, learning targets, or metrics. This mechanism induces monotone ascent of the potential and terminates in finitely many steps at a pairwise-stable (Nash) allocation. The control layer adopts centralised training with decentralised execution (MADDPG) with a centralised critic and task-specific actor heads to generate 3D trajectories subject to flight and safety constraints. This partition confines the combinatorial complexity to the outer allocation, resulting in a lighter on-board inference than a monolithic policy that jointly handles both allocation and planning. In a 3-UAV/7-GU simulator with obstacles and heterogeneous task sizes, the proposed method achieves higher global reward (2.02 vs 1.91) and shorter completion time (92 s vs 109 s) than a reproduced game-theoretic baseline. Compared with end-to-end MADDPG without prior allocation, it also converges faster and more stably.</div></div>","PeriodicalId":50027,"journal":{"name":"Journal of Systems Architecture","volume":"174 ","pages":"Article 103734"},"PeriodicalIF":4.1,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning for distracted driving recognition with multisource data: A comprehensive review 基于多源数据的分心驾驶识别的深度学习:综述
IF 4.1 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2026-05-01 Epub Date: 2026-02-05 DOI: 10.1016/j.sysarc.2026.103730
Wenbin He , Shifeng Wang , Zhaohui Liu , Zhenying Li , Zhijun Fu , Dengfeng Zhao , Junjian Hou , Wuyi Ming
Distracted driving remains a leading cause of traffic fatalities worldwide, yet existing detection methods face significant challenges in real-time performance and complex driving scenarios. This review presents the first comprehensive analysis of deep learning approaches for distracted driving recognition, systematically addressing the fragmented research landscape. The study evaluates state-of-the-art architectures across three critical dimensions: (1) neural network designs ranging from CNN to Transformer, analyzing their effectiveness in capturing driver behavior patterns; (2) diverse data modalities including visual, physiological, and vehicle dynamics, with empirical comparisons of their discriminative power; and (3) fusion strategies that integrate heterogeneous information sources to enhance detection accuracy. Extensive empirical analysis reveals that hybrid approaches combining attention mechanisms with multimodal fusion achieve remarkable accuracy rates up to 99.8%. Nevertheless, critical challenges persist, from inherent failure modes across modalities to deployment hurdles involving efficiency, energy, and privacy. Through a comprehensive examination of model-based versus deep learning approaches, various neural architectures, data modalities, fusion strategies, failure modes, and deployment challenges this survey identifies current methodological limitations, uncovers underexplored research opportunities, and delineates future directions for advancing distracted driving recognition systems toward real-world deployment.
分心驾驶仍然是全球交通事故死亡的主要原因,但现有的检测方法在实时性能和复杂的驾驶场景中面临重大挑战。这篇综述首次全面分析了用于分心驾驶识别的深度学习方法,系统地解决了碎片化的研究领域。该研究从三个关键维度评估了最先进的架构:(1)从CNN到Transformer的神经网络设计,分析了它们在捕获驾驶员行为模式方面的有效性;(2)包括视觉、生理和车辆动力学在内的多种数据模式,并对其判别能力进行实证比较;(3)融合异构信息源,提高检测精度的融合策略。大量的实证分析表明,将注意机制与多模态融合相结合的混合方法的准确率高达99.8%。然而,关键的挑战仍然存在,从各种模式的固有故障模式到涉及效率、能源和隐私的部署障碍。通过对基于模型的与深度学习方法、各种神经架构、数据模式、融合策略、故障模式和部署挑战的全面检查,本调查确定了当前方法的局限性,揭示了未被开发的研究机会,并描绘了将分心驾驶识别系统推进到现实世界部署的未来方向。
{"title":"Deep learning for distracted driving recognition with multisource data: A comprehensive review","authors":"Wenbin He ,&nbsp;Shifeng Wang ,&nbsp;Zhaohui Liu ,&nbsp;Zhenying Li ,&nbsp;Zhijun Fu ,&nbsp;Dengfeng Zhao ,&nbsp;Junjian Hou ,&nbsp;Wuyi Ming","doi":"10.1016/j.sysarc.2026.103730","DOIUrl":"10.1016/j.sysarc.2026.103730","url":null,"abstract":"<div><div>Distracted driving remains a leading cause of traffic fatalities worldwide, yet existing detection methods face significant challenges in real-time performance and complex driving scenarios. This review presents the first comprehensive analysis of deep learning approaches for distracted driving recognition, systematically addressing the fragmented research landscape. The study evaluates state-of-the-art architectures across three critical dimensions: (1) neural network designs ranging from CNN to Transformer, analyzing their effectiveness in capturing driver behavior patterns; (2) diverse data modalities including visual, physiological, and vehicle dynamics, with empirical comparisons of their discriminative power; and (3) fusion strategies that integrate heterogeneous information sources to enhance detection accuracy. Extensive empirical analysis reveals that hybrid approaches combining attention mechanisms with multimodal fusion achieve remarkable accuracy rates up to 99.8%. Nevertheless, critical challenges persist, from inherent failure modes across modalities to deployment hurdles involving efficiency, energy, and privacy. Through a comprehensive examination of model-based versus deep learning approaches, various neural architectures, data modalities, fusion strategies, failure modes, and deployment challenges this survey identifies current methodological limitations, uncovers underexplored research opportunities, and delineates future directions for advancing distracted driving recognition systems toward real-world deployment.</div></div>","PeriodicalId":50027,"journal":{"name":"Journal of Systems Architecture","volume":"174 ","pages":"Article 103730"},"PeriodicalIF":4.1,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ZJC: Constructing fully local repair in erasure codes for distributed cloud storage 在分布式云存储的擦除代码中构建完全本地修复
IF 4.1 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2026-05-01 Epub Date: 2026-02-07 DOI: 10.1016/j.sysarc.2026.103735
Zijian Zhou , Xiaoheng Deng , Xinjun Pei , Yunlong Zhao , Yurong Qian , Shaohua Wan , Kaiping Xue
Achieving efficient repair with minimal redundancy remains a long-standing challenge for distributed cloud storage. Existing erasure codes such as Reed–Solomon (RS) and Local Reconstruction Codes (LRC) rely on global repair, which causes heavy repair bandwidth consumption and high node construction overhead. This paper presents ZJ Codes (ZJC), a novel erasure coding scheme that achieves fully local repair while maintaining strong availability. ZJC introduces an interleaved local-group structure in which each data block participates in two local parity groups, thus providing two independent recovery paths without any global parity. A new storage overhead model is further proposed, incorporating node construction cost into redundancy evaluation. We analytically prove that ZJC achieves lower storage overhead and constant repair bandwidth with only two helper blocks per repair. Both RS-based and XOR-based implementations are developed and experimentally deployed in a distributed system. Results show that ZJC improves encoding efficiency by up to 87.5% and recovery throughput by 96.5% compared with Reed–Solomon codes at equivalent redundancy. Compared to LRC, ZJC achieves encoding improvements of 62.3% and 86.5%, respectively. RS-based and Xor-based ZJC show recovery improvements of 53.9% and 96.5% compare with RS, respectively. Additionally, Xor-based ZJC outperforms LRC and ZJC RS-based by 77.5% and 90.8%, respectively. These results demonstrate that ZJC effectively reconciles the long-standing trade-off between low redundancy and high repair efficiency in distributed cloud storage.
以最小的冗余实现高效修复仍然是分布式云存储的一个长期挑战。现有的擦除码如RS (Reed-Solomon)和LRC (Local Reconstruction codes)依赖全局修复,这导致修复带宽消耗大,节点构建开销高。本文提出了一种新的擦除编码方案ZJ编码(ZJC),它在保持强可用性的同时实现了完全的局部修复。ZJC引入了交错的本地组结构,其中每个数据块参与两个本地奇偶组,从而提供了两个独立的恢复路径,而不需要任何全局奇偶。进一步提出了一种新的存储开销模型,将节点构建成本纳入冗余评估。通过分析证明,每次修复只有两个helper块,ZJC实现了较低的存储开销和恒定的修复带宽。基于rs和xor的实现都被开发并实验性地部署在分布式系统中。结果表明,与同等冗余的Reed-Solomon码相比,ZJC编码效率提高了87.5%,恢复吞吐量提高了96.5%。与LRC相比,ZJC分别实现了62.3%和86.5%的编码改进。基于RS和xor的ZJC与RS相比,回收率分别提高了53.9%和96.5%。此外,基于xor的ZJC比LRC和基于ZJC rs的ZJC分别高出77.5%和90.8%。这些结果表明,ZJC有效地协调了分布式云存储中长期存在的低冗余和高修复效率之间的权衡。
{"title":"ZJC: Constructing fully local repair in erasure codes for distributed cloud storage","authors":"Zijian Zhou ,&nbsp;Xiaoheng Deng ,&nbsp;Xinjun Pei ,&nbsp;Yunlong Zhao ,&nbsp;Yurong Qian ,&nbsp;Shaohua Wan ,&nbsp;Kaiping Xue","doi":"10.1016/j.sysarc.2026.103735","DOIUrl":"10.1016/j.sysarc.2026.103735","url":null,"abstract":"<div><div>Achieving efficient repair with minimal redundancy remains a long-standing challenge for distributed cloud storage. Existing erasure codes such as Reed–Solomon (RS) and Local Reconstruction Codes (LRC) rely on global repair, which causes heavy repair bandwidth consumption and high node construction overhead. This paper presents ZJ Codes (ZJC), a novel erasure coding scheme that achieves fully local repair while maintaining strong availability. ZJC introduces an interleaved local-group structure in which each data block participates in two local parity groups, thus providing two independent recovery paths without any global parity. A new storage overhead model is further proposed, incorporating node construction cost into redundancy evaluation. We analytically prove that ZJC achieves lower storage overhead and constant repair bandwidth with only two helper blocks per repair. Both RS-based and XOR-based implementations are developed and experimentally deployed in a distributed system. Results show that ZJC improves encoding efficiency by up to 87.5% and recovery throughput by 96.5% compared with Reed–Solomon codes at equivalent redundancy. Compared to LRC, ZJC achieves encoding improvements of 62.3% and 86.5%, respectively. RS-based and Xor-based ZJC show recovery improvements of 53.9% and 96.5% compare with RS, respectively. Additionally, Xor-based ZJC outperforms LRC and ZJC RS-based by 77.5% and 90.8%, respectively. These results demonstrate that ZJC effectively reconciles the long-standing trade-off between low redundancy and high repair efficiency in distributed cloud storage.</div></div>","PeriodicalId":50027,"journal":{"name":"Journal of Systems Architecture","volume":"174 ","pages":"Article 103735"},"PeriodicalIF":4.1,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SPARCAM: Sparse matrix multiplication accelerator using multi-port dynamic CAM SPARCAM:使用多端口动态CAM的稀疏矩阵乘法加速器
IF 4.1 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2026-05-01 Epub Date: 2026-02-04 DOI: 10.1016/j.sysarc.2026.103726
Esteban Garzón , Benjamin Zambrano , David Sheinenzon , Marco Lanuzza , Adam Teman , Leonid Yavits
Sparse General matrix multiplication (SpGEMM) is a fundamental kernel in many scientific and engineering fields, including Artificial Intelligence (AI). However, its intrinsic computation complexity presents substantial challenges, making efficient hardware implementation particularly difficult. This paper proposes SPARCAM, a novel SpGEMM accelerator, developed and optimized for very energy-efficient AI edge applications. SPARCAM is designed using low-power dense Gain Cell embedded DRAM (GC-eDRAM) technology, a processing near memory paradigm, and a modified outer product matrix multiplication algorithm. Despite its quite limited peak theoretical performance, SPARCAM achieves very high energy efficiency due to its low-power architecture and almost 100% utilization of its computing resources. Designed in a commercial 28 nm FDSOI technology, SPARCAM achieves 13.9× speedup over a high-performance embedded CPU when processing large-scale sparse matrices. When multiplying limited-size sparse matrices, SPARCAM obtains 193× speedup over high-performance GPU. SPARCAM reaches about 4.3 orders-of-magnitude, on average, higher energy benefits, and 1892×, 181×, 2×, and 3471×, higher energy efficiency (over CPU) compared with state-of-the-art SpGEMM accelerators SpArch, OuterSPACE, MatRaptor, and high-performance GPU, respectively.
稀疏一般矩阵乘法(SpGEMM)是包括人工智能(AI)在内的许多科学和工程领域的基本核心。然而,其固有的计算复杂性带来了巨大的挑战,使得高效的硬件实现变得尤为困难。本文提出了SPARCAM,一种新型的SpGEMM加速器,为非常节能的人工智能边缘应用而开发和优化。SPARCAM采用低功耗密集增益单元嵌入式DRAM (GC-eDRAM)技术、近内存处理范例和改进的外部乘积矩阵乘法算法设计。尽管SPARCAM的峰值理论性能相当有限,但由于其低功耗架构和几乎100%的计算资源利用率,SPARCAM实现了非常高的能源效率。SPARCAM采用商用28纳米FDSOI技术设计,在处理大规模稀疏矩阵时,在高性能嵌入式CPU上实现了13.9倍的加速。当将有限大小的稀疏矩阵相乘时,SPARCAM在高性能GPU上获得了193倍的加速。与最先进的SpGEMM加速器SpArch、OuterSPACE、MatRaptor和高性能GPU相比,SPARCAM的能量效益平均约为4.3个数量级,能效(CPU)分别为1892倍、181x倍、2x倍和3471倍。
{"title":"SPARCAM: Sparse matrix multiplication accelerator using multi-port dynamic CAM","authors":"Esteban Garzón ,&nbsp;Benjamin Zambrano ,&nbsp;David Sheinenzon ,&nbsp;Marco Lanuzza ,&nbsp;Adam Teman ,&nbsp;Leonid Yavits","doi":"10.1016/j.sysarc.2026.103726","DOIUrl":"10.1016/j.sysarc.2026.103726","url":null,"abstract":"<div><div>Sparse General matrix multiplication (SpGEMM) is a fundamental kernel in many scientific and engineering fields, including Artificial Intelligence (AI). However, its intrinsic computation complexity presents substantial challenges, making efficient hardware implementation particularly difficult. This paper proposes SPARCAM, a novel SpGEMM accelerator, developed and optimized for very energy-efficient AI edge applications. SPARCAM is designed using low-power dense Gain Cell embedded DRAM (GC-eDRAM) technology, a processing near memory paradigm, and a modified outer product matrix multiplication algorithm. Despite its quite limited peak theoretical performance, SPARCAM achieves very high energy efficiency due to its low-power architecture and almost 100% utilization of its computing resources. Designed in a commercial 28<!--> <!-->nm FDSOI technology, SPARCAM achieves <span><math><mrow><mn>13</mn><mo>.</mo><mn>9</mn><mo>×</mo></mrow></math></span> speedup over a high-performance embedded CPU when processing large-scale sparse matrices. When multiplying limited-size sparse matrices, SPARCAM obtains <span><math><mrow><mn>193</mn><mo>×</mo></mrow></math></span> speedup over high-performance GPU. SPARCAM reaches about 4.3 orders-of-magnitude, on average, higher energy benefits, and <span><math><mrow><mn>1892</mn><mo>×</mo></mrow></math></span>, <span><math><mrow><mn>181</mn><mo>×</mo></mrow></math></span>, <span><math><mrow><mn>2</mn><mo>×</mo></mrow></math></span>, and <span><math><mrow><mn>3471</mn><mo>×</mo></mrow></math></span>, higher energy efficiency (over CPU) compared with state-of-the-art SpGEMM accelerators SpArch, OuterSPACE, MatRaptor, and high-performance GPU, respectively.</div></div>","PeriodicalId":50027,"journal":{"name":"Journal of Systems Architecture","volume":"174 ","pages":"Article 103726"},"PeriodicalIF":4.1,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Post quantum secure and efficient data sharing scheme from attribute based encryption for VANETs over lattices 基于属性加密的栅格VANETs后量子安全高效数据共享方案
IF 4.1 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2026-05-01 Epub Date: 2026-01-29 DOI: 10.1016/j.sysarc.2026.103718
Fenghe Wang, Meijiao Wang, Junquan Wang, Mengqi Gu
Data sharing plays a significant role for C2V communication in VANETs. To achieve the data confidentiality, access control, data authentication and efficiency in data sharing simultaneously, this paper proposes a post quantum secure and efficient data sharing scheme by designing an attribute-based encryption (ABE) over lattices for VANETs. The proposed scheme is secure under the selective attribute and chosen-plaintexts attack (IND-sAtt-CPA) and the security is proven under the hardness of the Learning With Errors (LWE) problem in the random oracle model. Thanks to the characters of the ABE concept which is the design foundation of the proposed scheme, we also achieve the fine-grained access control of the sharing data in this paper. Furthermore, the signature of the shared ciphertext is also generated before the data is shared. Hence the proposed scheme can efficiently resist several normal network attacks such as replay attack and impersonation attack etc. The space and computation analysis of the proposed scheme show that the proposed data sharing scheme is with shorter ciphertext length and lower encryption/decryption costs which have been checked by using Java language on a personal computer with Intel (R) Core (TM) i9-14900HX processor (2.20 GHz) and 32GB RAM. At last, using NS3 software, a network simulation is given to verify the communication effects. Simulation results show that the response latency of the proposed scheme is lower than that of several known ABE schemes in VANETs. And the average message loss rate of the proposed scheme is also controlled within an acceptable interval (much less than 5%) even in high-intensity communication scenarios.
在VANETs中,数据共享对C2V通信起着重要的作用。为了在数据共享中同时实现数据保密性、访问控制、数据认证和效率,本文提出了一种后量子安全、高效的数据共享方案,通过对VANETs设计基于格的属性加密(ABE)。该方案在选择性属性和选择明文攻击(ind - sat - cpa)下是安全的,并在随机oracle模型中的有错误学习(LWE)问题的困难下证明了该方案的安全性。本文还利用ABE概念的特点,实现了对共享数据的细粒度访问控制。ABE概念是本方案的设计基础。此外,在数据共享之前,还会生成共享密文的签名。因此,该方案能够有效抵御几种常见的网络攻击,如重放攻击和冒充攻击等。对所提方案的空间和计算分析表明,所提数据共享方案具有较短的密文长度和较低的加解密成本,并在Intel (R) Core (TM) i9-14900HX处理器(2.20 GHz)和32GB RAM的个人计算机上使用Java语言进行了验证。最后,利用NS3软件进行了网络仿真,验证了通信效果。仿真结果表明,该方案的响应延迟低于几种已知的VANETs中ABE方案的响应延迟。即使在高强度通信场景下,所提方案的平均消息丢失率也被控制在可接受的间隔内(远小于5%)。
{"title":"Post quantum secure and efficient data sharing scheme from attribute based encryption for VANETs over lattices","authors":"Fenghe Wang,&nbsp;Meijiao Wang,&nbsp;Junquan Wang,&nbsp;Mengqi Gu","doi":"10.1016/j.sysarc.2026.103718","DOIUrl":"10.1016/j.sysarc.2026.103718","url":null,"abstract":"<div><div>Data sharing plays a significant role for C2V communication in VANETs. To achieve the data confidentiality, access control, data authentication and efficiency in data sharing simultaneously, this paper proposes a post quantum secure and efficient data sharing scheme by designing an attribute-based encryption (ABE) over lattices for VANETs. The proposed scheme is secure under the selective attribute and chosen-plaintexts attack (IND-sAtt-CPA) and the security is proven under the hardness of the Learning With Errors (LWE) problem in the random oracle model. Thanks to the characters of the ABE concept which is the design foundation of the proposed scheme, we also achieve the fine-grained access control of the sharing data in this paper. Furthermore, the signature of the shared ciphertext is also generated before the data is shared. Hence the proposed scheme can efficiently resist several normal network attacks such as replay attack and impersonation attack etc. The space and computation analysis of the proposed scheme show that the proposed data sharing scheme is with shorter ciphertext length and lower encryption/decryption costs which have been checked by using Java language on a personal computer with Intel (R) Core (TM) i9-14900HX processor (2.20 GHz) and 32GB RAM. At last, using NS3 software, a network simulation is given to verify the communication effects. Simulation results show that the response latency of the proposed scheme is lower than that of several known ABE schemes in VANETs. And the average message loss rate of the proposed scheme is also controlled within an acceptable interval (much less than 5%) even in high-intensity communication scenarios.</div></div>","PeriodicalId":50027,"journal":{"name":"Journal of Systems Architecture","volume":"174 ","pages":"Article 103718"},"PeriodicalIF":4.1,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146098574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Series–parallel-loop decompositions of control-flow graphs 控制流图的串联-并行-循环分解
IF 4.1 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2026-05-01 Epub Date: 2026-02-03 DOI: 10.1016/j.sysarc.2026.103732
Xuran Cai , Amir Kafshdar Goharshady , S. Hitarth , Chun Kit Lam
Control-flow graphs (CFGs) of structured programs are well known to exhibit strong sparsity properties. Traditionally, this sparsity has been modeled using graph parameters such as treewidth and pathwidth, enabling the development of faster parameterized algorithms for tasks in compiler optimization, model checking, and program analysis. However, these parameters only approximate the structural constraints of CFGs: although every structured CFG has treewidth at most 7, many graphs with treewidth at most 7 cannot arise as CFGs. As a result, existing parameterized techniques are optimized for a substantially broader class of graphs than those encountered in practice.
In this work, we introduce a new grammar-based decomposition framework that characterizes exactly the class of control-flow graphs generated by structured programs. Our decomposition is intuitive, mirrors the syntactic structure of programs, and remains fully compatible with the dynamic-programming paradigm of treewidth-based methods. Using this framework, we design improved algorithms for two classical compiler optimization problems: Register Allocation and Lifetime-Optimal Speculative Partial Redundancy Elimination (LOSPRE). Extensive experimental evaluation demonstrates significant performance improvements over previous state-of-the-art approaches, highlighting the benefits of using decompositions tailored specifically to CFGs.
众所周知,结构化程序的控制流图(CFGs)具有很强的稀疏性。传统上,这种稀疏性是使用图参数(如treewidth和pathwidth)建模的,从而可以为编译器优化、模型检查和程序分析中的任务开发更快的参数化算法。然而,这些参数只是近似CFG的结构约束:尽管每个结构化CFG的树宽最多为7,但许多树宽最多为7的图不能作为CFG出现。因此,现有的参数化技术比在实践中遇到的更广泛的图类进行了优化。在这项工作中,我们引入了一个新的基于语法的分解框架,该框架精确地描述了结构化程序生成的控制流图的类别。我们的分解是直观的,反映了程序的语法结构,并且与基于树宽度的方法的动态规划范例保持完全兼容。利用该框架,我们设计了两个经典编译器优化问题的改进算法:寄存器分配和生命周期最优推测部分冗余消除(LOSPRE)。大量的实验评估表明,与以前最先进的方法相比,性能有了显著提高,突出了使用专门针对CFGs的分解方法的好处。
{"title":"Series–parallel-loop decompositions of control-flow graphs","authors":"Xuran Cai ,&nbsp;Amir Kafshdar Goharshady ,&nbsp;S. Hitarth ,&nbsp;Chun Kit Lam","doi":"10.1016/j.sysarc.2026.103732","DOIUrl":"10.1016/j.sysarc.2026.103732","url":null,"abstract":"<div><div>Control-flow graphs (CFGs) of structured programs are well known to exhibit strong sparsity properties. Traditionally, this sparsity has been modeled using graph parameters such as treewidth and pathwidth, enabling the development of faster parameterized algorithms for tasks in compiler optimization, model checking, and program analysis. However, these parameters only approximate the structural constraints of CFGs: although every structured CFG has treewidth at most 7, many graphs with treewidth at most 7 cannot arise as CFGs. As a result, existing parameterized techniques are optimized for a substantially broader class of graphs than those encountered in practice.</div><div>In this work, we introduce a new grammar-based decomposition framework that characterizes <em>exactly</em> the class of control-flow graphs generated by structured programs. Our decomposition is intuitive, mirrors the syntactic structure of programs, and remains fully compatible with the dynamic-programming paradigm of treewidth-based methods. Using this framework, we design improved algorithms for two classical compiler optimization problems: <em>Register Allocation</em> and <em>Lifetime-Optimal Speculative Partial Redundancy Elimination (LOSPRE)</em>. Extensive experimental evaluation demonstrates significant performance improvements over previous state-of-the-art approaches, highlighting the benefits of using decompositions tailored specifically to CFGs.</div></div>","PeriodicalId":50027,"journal":{"name":"Journal of Systems Architecture","volume":"174 ","pages":"Article 103732"},"PeriodicalIF":4.1,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Crash consistency in an NVM-enabled hybrid storage system: Problems, solutions, and verification 支持nvm的混合存储系统中的崩溃一致性:问题、解决方案和验证
IF 4.1 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2026-05-01 Epub Date: 2026-02-07 DOI: 10.1016/j.sysarc.2026.103705
Guoyu Wang , Juncheng Hu , Chenju Pei , Tengfei Li , Kedi Lyu , Xilong Che
Non-volatile memory (NVM), as a byte-addressable persistent device, offers capacity, performance, and cost characteristics that lie between DRAM and disk. Its emergence adds a new tier to the computer memory architecture and introduces new design challenges. Among various hybrid storage designs, we find that the Sync-triggered Selective Absorption (SA) model is both high-performing and broadly applicable when aiming to fully leverage NVM’s characteristics in conjunction with DRAM and disk. However, this model introduces three key crash consistency challenges: the Write Ordering, Write Granularity, and Parallel Write problems.
In this paper, we define a consistency model based on synchronization semantics and conduct an in-depth analysis of the three consistency challenges faced by SA systems. We then present targeted solutions for each problem and formally verify our design using model checking. We hope our exploration and resolution of these issues will offer guidance for the design of efficient and robust hybrid storage systems.
非易失性存储器(NVM)作为一种字节可寻址的持久设备,提供介于DRAM和磁盘之间的容量、性能和成本特征。它的出现为计算机内存体系结构增加了一个新的层次,并引入了新的设计挑战。在各种混合存储设计中,我们发现同步触发选择性吸收(SA)模型既高性能又广泛适用于充分利用NVM与DRAM和磁盘结合的特性。然而,这个模型引入了三个关键的崩溃一致性挑战:写顺序、写粒度和并行写问题。在本文中,我们定义了一个基于同步语义的一致性模型,并对SA系统面临的三个一致性挑战进行了深入的分析。然后,我们针对每个问题提出有针对性的解决方案,并使用模型检查正式验证我们的设计。我们希望我们对这些问题的探索和解决能为高效、稳健的混合存储系统的设计提供指导。
{"title":"Crash consistency in an NVM-enabled hybrid storage system: Problems, solutions, and verification","authors":"Guoyu Wang ,&nbsp;Juncheng Hu ,&nbsp;Chenju Pei ,&nbsp;Tengfei Li ,&nbsp;Kedi Lyu ,&nbsp;Xilong Che","doi":"10.1016/j.sysarc.2026.103705","DOIUrl":"10.1016/j.sysarc.2026.103705","url":null,"abstract":"<div><div>Non-volatile memory (NVM), as a byte-addressable persistent device, offers capacity, performance, and cost characteristics that lie between DRAM and disk. Its emergence adds a new tier to the computer memory architecture and introduces new design challenges. Among various hybrid storage designs, we find that the Sync-triggered Selective Absorption (SA) model is both high-performing and broadly applicable when aiming to fully leverage NVM’s characteristics in conjunction with DRAM and disk. However, this model introduces three key crash consistency challenges: the Write Ordering, Write Granularity, and Parallel Write problems.</div><div>In this paper, we define a consistency model based on synchronization semantics and conduct an in-depth analysis of the three consistency challenges faced by SA systems. We then present targeted solutions for each problem and formally verify our design using model checking. We hope our exploration and resolution of these issues will offer guidance for the design of efficient and robust hybrid storage systems.</div></div>","PeriodicalId":50027,"journal":{"name":"Journal of Systems Architecture","volume":"174 ","pages":"Article 103705"},"PeriodicalIF":4.1,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A visual–tactile fusion system for terrain perception under varying illumination conditions 不同光照条件下地形感知的视触觉融合系统
IF 4.1 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2026-05-01 Epub Date: 2026-01-24 DOI: 10.1016/j.sysarc.2026.103698
Rui Wang , Shichun Yang , Yuyi Chen , Zhuoyang Li , Jiayi Lu , Zexiang Tong , Jianyi Xu , Bin Sun , Xinjie Feng , Yaoguang Cao
Road terrain conditions are vital for ensuring the driving safety of autonomous vehicles (AVs). However, traditional sensors like cameras and LiDARs are sensitive to changes in lighting and weather, posing challenges for real-time road condition perception. In this paper, we propose an illumination-aware visual–tactile fusion system (IVTF) for terrain perception, integrating visual and tactile data while optimizing the fusion process based on illumination characteristics. The system employs a camera and an intelligent tire to capture visual and tactile data across various lighting conditions and vehicle speeds. Additionally, we also design a visual–tactile fusion module that dynamically adjusts the weights of different modalities according to illumination features. Comparative results with single-modality perception methods demonstrate the superior ability of visual–tactile fusion to accurately perceive road terrains under diverse lighting conditions. This approach significantly advances the robustness and reliability of terrain perception in AVs, contributing to enhanced driving safety.
道路地形条件对于确保自动驾驶汽车(AVs)的驾驶安全至关重要。然而,摄像头和激光雷达等传统传感器对光线和天气的变化很敏感,这对实时路况感知构成了挑战。本文提出了一种用于地形感知的光照感知视觉触觉融合系统(IVTF),该系统融合了视觉和触觉数据,并基于光照特征对融合过程进行了优化。该系统采用一个摄像头和一个智能轮胎来捕捉各种照明条件和车速下的视觉和触觉数据。此外,我们还设计了一个视触觉融合模块,根据光照特征动态调整不同模态的权重。与单模态感知方法的对比结果表明,视觉-触觉融合在不同光照条件下准确感知道路地形的能力更强。该方法显著提高了自动驾驶汽车地形感知的鲁棒性和可靠性,有助于提高驾驶安全性。
{"title":"A visual–tactile fusion system for terrain perception under varying illumination conditions","authors":"Rui Wang ,&nbsp;Shichun Yang ,&nbsp;Yuyi Chen ,&nbsp;Zhuoyang Li ,&nbsp;Jiayi Lu ,&nbsp;Zexiang Tong ,&nbsp;Jianyi Xu ,&nbsp;Bin Sun ,&nbsp;Xinjie Feng ,&nbsp;Yaoguang Cao","doi":"10.1016/j.sysarc.2026.103698","DOIUrl":"10.1016/j.sysarc.2026.103698","url":null,"abstract":"<div><div>Road terrain conditions are vital for ensuring the driving safety of autonomous vehicles (AVs). However, traditional sensors like cameras and LiDARs are sensitive to changes in lighting and weather, posing challenges for real-time road condition perception. In this paper, we propose an illumination-aware visual–tactile fusion system (IVTF) for terrain perception, integrating visual and tactile data while optimizing the fusion process based on illumination characteristics. The system employs a camera and an intelligent tire to capture visual and tactile data across various lighting conditions and vehicle speeds. Additionally, we also design a visual–tactile fusion module that dynamically adjusts the weights of different modalities according to illumination features. Comparative results with single-modality perception methods demonstrate the superior ability of visual–tactile fusion to accurately perceive road terrains under diverse lighting conditions. This approach significantly advances the robustness and reliability of terrain perception in AVs, contributing to enhanced driving safety.</div></div>","PeriodicalId":50027,"journal":{"name":"Journal of Systems Architecture","volume":"174 ","pages":"Article 103698"},"PeriodicalIF":4.1,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Systems Architecture
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1