首页 > 最新文献

IEEE Embedded Systems Letters最新文献

英文 中文
IoT-Driven Smart Embedded System for Efficient Quality Food Production in Controlled Environments 物联网驱动的智能嵌入式系统,用于受控环境下的高效优质食品生产
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-21 DOI: 10.1109/LES.2025.3572046
Maria C. Moreno;Fabian E. Casares;Juana Martinez;Emily R. Ellis;Oscar J. Suarez
Latin American agriculture encounters significant challenges, such as inefficient land use and vulnerability to climate change, with smallholder farmers being particularly affected. This letter proposes a modular Internet of Things (IoT)-based system that integrates embedded systems, environmental sensors, and artificial intelligence (AI) for real-time monitoring and adaptive control. The system utilizes a Raspberry Pi coupled with advanced sensors measuring temperature, humidity, potential of hydrogen (pH), liquid level, light intensity, air quality, and electrical current. Actuators regulate nutrient delivery in response to pH levels, while intelligent lighting systems enhance plant growth conditions. Remote monitoring facilitated by a vision-equipped camera system allows the detection of anomalies and the tracking of plant growth. Experimental results demonstrated temperature stability of ±1°C, humidity variation within ±5%, 92% accuracy in anomaly detection, and response times of less than 10 s.
拉丁美洲农业面临重大挑战,如土地利用效率低下和易受气候变化影响,小农受到的影响尤其严重。该信函提出了一种基于物联网(IoT)的模块化系统,该系统集成了嵌入式系统、环境传感器和人工智能(AI),用于实时监控和自适应控制。该系统利用树莓派和先进的传感器来测量温度、湿度、氢电位(pH)、液位、光照强度、空气质量和电流。执行器根据pH值调节养分输送,而智能照明系统则改善植物的生长条件。通过配备视觉的摄像系统进行远程监控,可以检测异常情况并跟踪植物生长。实验结果表明,温度稳定性为±1℃,湿度变化在±5%以内,异常检测准确率为92%,响应时间小于10 s。
{"title":"IoT-Driven Smart Embedded System for Efficient Quality Food Production in Controlled Environments","authors":"Maria C. Moreno;Fabian E. Casares;Juana Martinez;Emily R. Ellis;Oscar J. Suarez","doi":"10.1109/LES.2025.3572046","DOIUrl":"https://doi.org/10.1109/LES.2025.3572046","url":null,"abstract":"Latin American agriculture encounters significant challenges, such as inefficient land use and vulnerability to climate change, with smallholder farmers being particularly affected. This letter proposes a modular Internet of Things (IoT)-based system that integrates embedded systems, environmental sensors, and artificial intelligence (AI) for real-time monitoring and adaptive control. The system utilizes a Raspberry Pi coupled with advanced sensors measuring temperature, humidity, potential of hydrogen (pH), liquid level, light intensity, air quality, and electrical current. Actuators regulate nutrient delivery in response to pH levels, while intelligent lighting systems enhance plant growth conditions. Remote monitoring facilitated by a vision-equipped camera system allows the detection of anomalies and the tracking of plant growth. Experimental results demonstrated temperature stability of ±1°C, humidity variation within ±5%, 92% accuracy in anomaly detection, and response times of less than 10 s.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"411-414"},"PeriodicalIF":2.0,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exception Coverage on Automotive Processors 汽车处理器的异常覆盖率
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-19 DOI: 10.1109/LES.2025.3571040
Sourav Roy;Nikhil Jain;Neha Srivastava;Ravinder Dasila;Prokash Ghosh;Ajay Sharma;Afshan Anjum;Jeff Scott
With the advent of modular and independent zonal processing in Software-Defined-Vehicles (SDV), the processor in an automotive system is expected to handle complex exception scenarios elegantly. This letter addresses the need for comprehensive exception coverage during the verification of the automotive processor. Unlike prior research focusing on only two simultaneous exceptions, our work extends to all combinations of simultaneous exceptions. The proposed methodology involves the systematic grouping of exceptions into subcategories based on similarity while maintaining priority, followed by targeted coverage assessment for each group. A two-pass simulation technique is used to drive multiple external events like interrupts in the second pass to create high-activity time windows with simultaneous exceptions. This hierarchical approach allows for the creation of a manageable and effective exception list, facilitating thorough verification without overwhelming computational resources. Applied to a 32-bit RISC-V processor, our methodology shows a large improvement of 7X in efficiency compared to traditional methods, highlighting its advantage in enhancing safety and trustworthiness of automotive systems. This work takes a major step toward the verification of all combinations of simultaneous exceptions in processors.
随着软件定义车辆(SDV)中模块化和独立区域处理的出现,汽车系统中的处理器被期望能够优雅地处理复杂的异常场景。这封信说明了在汽车处理器验证期间需要全面的异常覆盖。与之前的研究只关注两个同时发生的异常不同,我们的工作扩展到同时发生的异常的所有组合。所建议的方法包括在保持优先级的同时,根据相似性将异常系统地分组到子类别中,然后对每个组进行有针对性的覆盖评估。两步模拟技术用于驱动多个外部事件,如第二步中的中断,以创建具有同步异常的高活动时间窗口。这种分层方法允许创建一个可管理且有效的异常列表,在不占用大量计算资源的情况下促进彻底的验证。应用于32位RISC-V处理器,与传统方法相比,我们的方法效率提高了7倍,突出了其在提高汽车系统安全性和可信度方面的优势。这项工作向验证处理器中所有同步异常的组合迈出了重要的一步。
{"title":"Exception Coverage on Automotive Processors","authors":"Sourav Roy;Nikhil Jain;Neha Srivastava;Ravinder Dasila;Prokash Ghosh;Ajay Sharma;Afshan Anjum;Jeff Scott","doi":"10.1109/LES.2025.3571040","DOIUrl":"https://doi.org/10.1109/LES.2025.3571040","url":null,"abstract":"With the advent of modular and independent zonal processing in Software-Defined-Vehicles (SDV), the processor in an automotive system is expected to handle complex exception scenarios elegantly. This letter addresses the need for comprehensive exception coverage during the verification of the automotive processor. Unlike prior research focusing on only two simultaneous exceptions, our work extends to all combinations of simultaneous exceptions. The proposed methodology involves the systematic grouping of exceptions into subcategories based on similarity while maintaining priority, followed by targeted coverage assessment for each group. A two-pass simulation technique is used to drive multiple external events like interrupts in the second pass to create high-activity time windows with simultaneous exceptions. This hierarchical approach allows for the creation of a manageable and effective exception list, facilitating thorough verification without overwhelming computational resources. Applied to a 32-bit RISC-V processor, our methodology shows a large improvement of 7X in efficiency compared to traditional methods, highlighting its advantage in enhancing safety and trustworthiness of automotive systems. This work takes a major step toward the verification of all combinations of simultaneous exceptions in processors.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"27-30"},"PeriodicalIF":2.0,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance of Some AES-Based AE and MAC Schemes Using Fixslicing on STM32 Microcontrollers 一些基于aes的AE和MAC方案在STM32单片机上的固定切片性能
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-19 DOI: 10.1109/LES.2025.3571323
Luis Alejandro Pérez-Sarmiento;Cuauhtemoc Mancillas-López
Message authentication codes (MACs) and authentication encryption algorithms (AEs) are widely used cryptographic primitives. Offset codebook (OCB), OCB random access (OCBRA), PMACRA, and EliMAC are cryptographic schemes based on AES and reduced-round AES. This letter compares the schemes mentioned in a 32-bit ARM microcontroller. For efficiency, we implemented the AES round using a fixslicing technique, which processes two blocks in parallel. Such characteristics are essential when the schemes allow parallelization between the AES calls. OCB and OCBRA were compared for authentication and encryption. OCBRA and EliMAC obtain similar performance for authentication, while OCB needs more clock cycles due to its initialization process. For encryption, OCBRA and OCB were practically identical.
消息认证码(MACs)和认证加密算法(AEs)是广泛使用的密码原语。偏移码本(OCB)、OCB随机存取(OCBRA)、PMACRA和EliMAC是基于AES和缩减轮询AES的加密方案。这封信比较了32位ARM微控制器中提到的方案。为了提高效率,我们使用固定切片技术实现AES轮询,该技术并行处理两个块。当方案允许AES调用之间并行时,这些特性是必不可少的。比较OCB和OCBRA的认证和加密。OCBRA和EliMAC的认证性能相似,而OCB由于其初始化过程,需要更多的时钟周期。对于加密,OCBRA和OCB实际上是相同的。
{"title":"Performance of Some AES-Based AE and MAC Schemes Using Fixslicing on STM32 Microcontrollers","authors":"Luis Alejandro Pérez-Sarmiento;Cuauhtemoc Mancillas-López","doi":"10.1109/LES.2025.3571323","DOIUrl":"https://doi.org/10.1109/LES.2025.3571323","url":null,"abstract":"Message authentication codes (MACs) and authentication encryption algorithms (AEs) are widely used cryptographic primitives. Offset codebook (OCB), OCB random access (OCBRA), PMACRA, and EliMAC are cryptographic schemes based on AES and reduced-round AES. This letter compares the schemes mentioned in a 32-bit ARM microcontroller. For efficiency, we implemented the AES round using a fixslicing technique, which processes two blocks in parallel. Such characteristics are essential when the schemes allow parallelization between the AES calls. OCB and OCBRA were compared for authentication and encryption. OCBRA and EliMAC obtain similar performance for authentication, while OCB needs more clock cycles due to its initialization process. For encryption, OCBRA and OCB were practically identical.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"394-397"},"PeriodicalIF":2.0,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11006872","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing Gaze Estimation With a DLA-Based Calibration Module on NVIDIA Jetson Platforms NVIDIA Jetson平台上基于dla的校准模块优化凝视估计
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-17 DOI: 10.1109/LES.2025.3551734
Jangwon Lee;Jiwon Yoo;Dami Ko;Gyeonghwan Kim
Gaze estimation is a commonly used task in human-computer interaction, with its applications ranging from driver monitoring systems to human assistive technologies. However, achieving robust, real-time performance on edge devices remains a challenge due to limited computational resources and the need for high energy efficiency. In this letter, we present a novel gaze estimation model optimized for NVIDIA Jetson platforms, leveraging the unique capabilities of deep learning accelerators (DLAs) for enhanced efficiency. Our model includes a calibration module designed for DLA execution, incorporating convolutional spatial and channel attention to improve robustness after face detection stage. By strategically distributing computational tasks across GPU and DLA, our approach achieves up to an 88% increase in energy efficiency compared to a system without calibration module, making it well suited for real-time, edge-based applications in resource-constrained environments.
注视估计是人机交互中常用的一项任务,其应用范围从驾驶员监控系统到人类辅助技术。然而,由于有限的计算资源和对高能效的需求,在边缘设备上实现强大的实时性能仍然是一个挑战。在这封信中,我们提出了一种针对NVIDIA Jetson平台优化的新型凝视估计模型,利用深度学习加速器(dla)的独特功能来提高效率。我们的模型包括一个为DLA执行设计的校准模块,结合卷积空间和通道注意来提高人脸检测阶段后的鲁棒性。通过在GPU和DLA之间战略性地分配计算任务,与没有校准模块的系统相比,我们的方法实现了高达88%的能源效率提高,使其非常适合资源受限环境下的实时、基于边缘的应用。
{"title":"Optimizing Gaze Estimation With a DLA-Based Calibration Module on NVIDIA Jetson Platforms","authors":"Jangwon Lee;Jiwon Yoo;Dami Ko;Gyeonghwan Kim","doi":"10.1109/LES.2025.3551734","DOIUrl":"https://doi.org/10.1109/LES.2025.3551734","url":null,"abstract":"Gaze estimation is a commonly used task in human-computer interaction, with its applications ranging from driver monitoring systems to human assistive technologies. However, achieving robust, real-time performance on edge devices remains a challenge due to limited computational resources and the need for high energy efficiency. In this letter, we present a novel gaze estimation model optimized for NVIDIA Jetson platforms, leveraging the unique capabilities of deep learning accelerators (DLAs) for enhanced efficiency. Our model includes a calibration module designed for DLA execution, incorporating convolutional spatial and channel attention to improve robustness after face detection stage. By strategically distributing computational tasks across GPU and DLA, our approach achieves up to an 88% increase in energy efficiency compared to a system without calibration module, making it well suited for real-time, edge-based applications in resource-constrained environments.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"56-59"},"PeriodicalIF":2.0,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Navigating the Future: A Kalman Filter-Based Global-Local-Global (GLG) Position Estimation for Autonomous Vehicles in GPS Denied Environments 导航未来:基于卡尔曼滤波的GPS拒绝环境下自动驾驶汽车全局-局部-全局(GLG)位置估计
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-11 DOI: 10.1109/LES.2025.3550171
S. Shakeera;N. Sreyas;V. B. N. Jyothi;H. Venkataraman
Accurate navigation in global positioning system (GPS) denied environments, such as tunnels, rural areas and underwater regions is a critical challenge for autonomous vehicles (AVs). Traditional position estimation methods face challenges due to their computational complexity and inability to consider external factors like sensor drift and human input errors. This letter introduces a global-local–global (GLG) position estimation method that integrates advanced inertial sensors with navigation sensors, utilizing kalman filter (KF) and extended KF (EKF) to enhance trajectory accuracy. The proposed mechanism is validated through analysis and real-time tests using custom-built hardware in terrestrial and underwater environments. It demonstrates effectiveness during GPS outages lasting 5–60 s, achieving position drifts of 2.84 m for GLG, 1.35 m for GLG + KF, and 1.27 m for GLG + EKF over a total distance travelled of 1.1 km. This method is particularly suited for short-range applications, including aqua farming, tunnel navigation, and dense areas, providing a robust solution for accurate position estimation in challenging scenarios.
在隧道、农村地区和水下区域等全球定位系统(GPS)拒绝环境中进行精确导航是自动驾驶汽车(av)面临的关键挑战。传统的位置估计方法由于计算量大、不能考虑传感器漂移和人为输入误差等外部因素而面临挑战。本文介绍了一种全局-局部-全局(GLG)位置估计方法,该方法集成了先进的惯性传感器和导航传感器,利用卡尔曼滤波(KF)和扩展KF (EKF)来提高轨迹精度。利用定制硬件在陆地和水下环境中进行分析和实时测试,验证了所提出的机制。在持续5-60秒的GPS中断期间,它显示了有效性,在1.1公里的总行程中,GLG实现了2.84米的位置漂移,GLG + KF实现了1.35米的位置漂移,GLG + EKF实现了1.27米的位置漂移。该方法特别适用于短程应用,包括水产养殖、隧道导航和密集区域,为在具有挑战性的情况下进行准确的位置估计提供了强大的解决方案。
{"title":"Navigating the Future: A Kalman Filter-Based Global-Local-Global (GLG) Position Estimation for Autonomous Vehicles in GPS Denied Environments","authors":"S. Shakeera;N. Sreyas;V. B. N. Jyothi;H. Venkataraman","doi":"10.1109/LES.2025.3550171","DOIUrl":"https://doi.org/10.1109/LES.2025.3550171","url":null,"abstract":"Accurate navigation in global positioning system (GPS) denied environments, such as tunnels, rural areas and underwater regions is a critical challenge for autonomous vehicles (AVs). Traditional position estimation methods face challenges due to their computational complexity and inability to consider external factors like sensor drift and human input errors. This letter introduces a global-local–global (GLG) position estimation method that integrates advanced inertial sensors with navigation sensors, utilizing kalman filter (KF) and extended KF (EKF) to enhance trajectory accuracy. The proposed mechanism is validated through analysis and real-time tests using custom-built hardware in terrestrial and underwater environments. It demonstrates effectiveness during GPS outages lasting 5–60 s, achieving position drifts of 2.84 m for GLG, 1.35 m for GLG + KF, and 1.27 m for GLG + EKF over a total distance travelled of 1.1 km. This method is particularly suited for short-range applications, including aqua farming, tunnel navigation, and dense areas, providing a robust solution for accurate position estimation in challenging scenarios.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"77-80"},"PeriodicalIF":2.0,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Embedded Auto-Calibrated Offset Current Compensation Technique for PPG/fNIRS System PPG/fNIRS系统的嵌入式自校准偏置电流补偿技术
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-11 DOI: 10.1109/LES.2025.3550393
Sadan Saquib Khan;Pratyush Kumar Padhi;Shufali Ashraf Wani;Meraj Ahmad;Laxmeesha Somappa;Shahid Malik
Optical sensors play a critical role in biomedical monitoring systems, enabling noninvasive measurement of physiological parameters. However, these systems face a persistent challenge: the small amplitude of physiological signals is often overshadowed by a dominant DC offset current. This offset combines the dark current, ambient light, and reflections from fat and skull. This limits the amplification of signals and degrades the sensitivity of PPG/fNIRS systems. This letter presents a mixed-signal auto-calibrated offset compensation technique using dual discrete loops for PPG and fNIRS systems. Two specialized prototypes—a dedicated PPG system and an fNIRS system—were validated using a 3.3 V supply. The results show that the proposed system can effectively compensate for the offset currents within a range of 1 to $100~mu $ A. We have recorded PPG and fNIRS signals and performed breathing experiments to validate the results. The results show that the proposed system is able to record the signal accurately.
光学传感器在生物医学监测系统中起着至关重要的作用,使生理参数的无创测量成为可能。然而,这些系统面临着一个持久的挑战:小幅度的生理信号往往被占主导地位的直流偏置电流所掩盖。这种偏移结合了暗电流、环境光以及脂肪和头骨的反射。这限制了信号的放大,降低了PPG/fNIRS系统的灵敏度。本文介绍了一种混合信号自动校准偏移补偿技术,该技术使用双离散环路用于PPG和fNIRS系统。两个专门的原型——一个专用的PPG系统和一个fNIRS系统——使用3.3 V电源进行了验证。结果表明,该系统能有效补偿1 ~ 100~mu $ a范围内的偏置电流,并记录了PPG和fNIRS信号,并进行了呼吸实验验证。实验结果表明,该系统能够准确地记录信号。
{"title":"An Embedded Auto-Calibrated Offset Current Compensation Technique for PPG/fNIRS System","authors":"Sadan Saquib Khan;Pratyush Kumar Padhi;Shufali Ashraf Wani;Meraj Ahmad;Laxmeesha Somappa;Shahid Malik","doi":"10.1109/LES.2025.3550393","DOIUrl":"https://doi.org/10.1109/LES.2025.3550393","url":null,"abstract":"Optical sensors play a critical role in biomedical monitoring systems, enabling noninvasive measurement of physiological parameters. However, these systems face a persistent challenge: the small amplitude of physiological signals is often overshadowed by a dominant DC offset current. This offset combines the dark current, ambient light, and reflections from fat and skull. This limits the amplification of signals and degrades the sensitivity of PPG/fNIRS systems. This letter presents a mixed-signal auto-calibrated offset compensation technique using dual discrete loops for PPG and fNIRS systems. Two specialized prototypes—a dedicated PPG system and an fNIRS system—were validated using a 3.3 V supply. The results show that the proposed system can effectively compensate for the offset currents within a range of 1 to <inline-formula> <tex-math>$100~mu $ </tex-math></inline-formula> A. We have recorded PPG and fNIRS signals and performed breathing experiments to validate the results. The results show that the proposed system is able to record the signal accurately.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"60-63"},"PeriodicalIF":2.0,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ArKANe: Accelerating Kolmogorov–Arnold Networks on Reconfigurable Spatial Architectures ArKANe:加速可重构空间架构上的Kolmogorov-Arnold网络
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-07 DOI: 10.1109/LES.2025.3567918
Yanze Wu;Md Tanvir Arafin
Kolmogorov–Arnold networks (KANs) are promising for explainable machine learning (ML) on embedded devices. Although KANs require fewer model parameters and are smaller than traditional deep neural networks, their applicability is limited by the complexity of the basis-spline (B-spline) functions used at every neuron. Hence, this letter presents acceleration techniques for KANs on dataflow-processing-based accelerators implemented in AMD-Xilinx Versal adaptive system on chips. Our experiments on the VCK-190 devices demonstrate that for large batch sizes, the average B-spline computation on systolic and wavefront architectures outperforms standard CPU implementation by $11times $ and $60times $ . On the other hand, wavefront accelerators demonstrate more than $3times $ improvement over GPUs (RTX 3090) in terms of energy consumption. Thus, this letter opens new opportunities for spatial accelerators in nonconventional ML algorithms on embedded systems. This letter’s code and experimental artifacts are available at https://github.com/SPIRE-GMU/SPIRE-ARKANE.
Kolmogorov-Arnold网络(KANs)有望用于嵌入式设备上的可解释机器学习(ML)。尽管与传统的深度神经网络相比,KANs需要更少的模型参数并且体积更小,但其适用性受到每个神经元使用的基样条(b样条)函数的复杂性的限制。因此,本文介绍了在AMD-Xilinx通用芯片自适应系统中实现的基于数据流处理的加速器上的KANs加速技术。我们在VCK-190设备上的实验表明,对于大批量,收缩和波前架构上的平均b样条计算比标准CPU实现的平均b样条计算分别高出11倍和60倍。另一方面,波前加速器在能耗方面比gpu (RTX 3090)改进了3倍以上。因此,这封信为嵌入式系统上非传统ML算法中的空间加速器开辟了新的机会。这封信的代码和实验工件可在https://github.com/SPIRE-GMU/SPIRE-ARKANE上获得。
{"title":"ArKANe: Accelerating Kolmogorov–Arnold Networks on Reconfigurable Spatial Architectures","authors":"Yanze Wu;Md Tanvir Arafin","doi":"10.1109/LES.2025.3567918","DOIUrl":"https://doi.org/10.1109/LES.2025.3567918","url":null,"abstract":"Kolmogorov–Arnold networks (KANs) are promising for explainable machine learning (ML) on embedded devices. Although KANs require fewer model parameters and are smaller than traditional deep neural networks, their applicability is limited by the complexity of the basis-spline (B-spline) functions used at every neuron. Hence, this letter presents acceleration techniques for KANs on dataflow-processing-based accelerators implemented in AMD-Xilinx Versal adaptive system on chips. Our experiments on the VCK-190 devices demonstrate that for large batch sizes, the average B-spline computation on systolic and wavefront architectures outperforms standard CPU implementation by <inline-formula> <tex-math>$11times $ </tex-math></inline-formula> and <inline-formula> <tex-math>$60times $ </tex-math></inline-formula>. On the other hand, wavefront accelerators demonstrate more than <inline-formula> <tex-math>$3times $ </tex-math></inline-formula> improvement over GPUs (RTX 3090) in terms of energy consumption. Thus, this letter opens new opportunities for spatial accelerators in nonconventional ML algorithms on embedded systems. This letter’s code and experimental artifacts are available at <uri>https://github.com/SPIRE-GMU/SPIRE-ARKANE</uri>.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"44-47"},"PeriodicalIF":2.0,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hyper-Embedded Computing: An Optimized System Architecture for Area-Constrained Applications 超嵌入式计算:区域约束应用的优化系统架构
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-07 DOI: 10.1109/LES.2025.3548955
Boisy Gene Pitre;Martin Margala;Kevin Phillipson;Michael Rywalt
The widespread adoption of 32-bit and 64-bit processors and ample memory resources in modern embedded systems has led to designs that prioritize performance and functionality over die area. This abundance has resulted in solutions that may not be optimal for applications where power and resource conservation are critical. To address this oversight, we introduce hyper-embedded computing, a paradigm that reinvigorates minimalistic embedded system design by integrating sophisticated multiprocessing capabilities into ultracompact embedded architectures with fixed silicon configurations. This approach is particularly well-suited for mission-critical, area-constrained applications that demand long-term reliability and sustainability. We build on this concept with the Turbo9, a 16-bit microprocessor optimized for minimal logic area, and TurbOS, a real-time embedded operating system that delivers UNIX-like features within a constrained memory area. TurbOS features a modular architecture, a preemptive multitasking kernel, a unified input/output architecture, and efficient memory utilization. Compared to currently used microprocessors and real-time operating systems, our approach achieves similar functionality while reducing memory area by approximately 75%, making it an ideal choice for sustainable and high-reliability embedded applications.
在现代嵌入式系统中,32位和64位处理器的广泛采用以及充足的内存资源导致设计优先考虑性能和功能而不是芯片面积。这种丰富导致的解决方案可能不是最优的应用程序,在电力和资源保护是至关重要的。为了解决这种疏忽,我们引入了超嵌入式计算,这是一种范式,通过将复杂的多处理能力集成到具有固定硅配置的超紧凑嵌入式体系结构中,重新激活了简约的嵌入式系统设计。这种方法特别适合于需要长期可靠性和可持续性的关键任务、区域限制应用程序。基于这一概念,我们推出了Turbo9和TurbOS。Turbo9是一款针对最小逻辑区域进行优化的16位微处理器,而TurbOS是一款实时嵌入式操作系统,可在受限的内存区域内提供类似unix的功能。TurbOS具有模块化架构、抢占式多任务内核、统一的输入/输出架构和高效的内存利用率。与目前使用的微处理器和实时操作系统相比,我们的方法实现了类似的功能,同时减少了大约75%的内存面积,使其成为可持续和高可靠性嵌入式应用的理想选择。
{"title":"Hyper-Embedded Computing: An Optimized System Architecture for Area-Constrained Applications","authors":"Boisy Gene Pitre;Martin Margala;Kevin Phillipson;Michael Rywalt","doi":"10.1109/LES.2025.3548955","DOIUrl":"https://doi.org/10.1109/LES.2025.3548955","url":null,"abstract":"The widespread adoption of 32-bit and 64-bit processors and ample memory resources in modern embedded systems has led to designs that prioritize performance and functionality over die area. This abundance has resulted in solutions that may not be optimal for applications where power and resource conservation are critical. To address this oversight, we introduce hyper-embedded computing, a paradigm that reinvigorates minimalistic embedded system design by integrating sophisticated multiprocessing capabilities into ultracompact embedded architectures with fixed silicon configurations. This approach is particularly well-suited for mission-critical, area-constrained applications that demand long-term reliability and sustainability. We build on this concept with the Turbo9, a 16-bit microprocessor optimized for minimal logic area, and TurbOS, a real-time embedded operating system that delivers UNIX-like features within a constrained memory area. TurbOS features a modular architecture, a preemptive multitasking kernel, a unified input/output architecture, and efficient memory utilization. Compared to currently used microprocessors and real-time operating systems, our approach achieves similar functionality while reducing memory area by approximately 75%, making it an ideal choice for sustainable and high-reliability embedded applications.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"435-438"},"PeriodicalIF":2.0,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-Speed True Random Number Generator With Multiple Entropy Sources: Ring Oscillator Jitter and Random Telegraph Noise 具有多熵源的高速真随机数发生器:环形振荡器抖动和随机电报噪声
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-07 DOI: 10.1109/LES.2025.3549110
Yizhi Liu;Yifan Wu;Bo Chen;Pengpeng Sang;Jixuan Wu;Xiangye Wei;Xuepeng Zhan;Jiezhi Chen
True random number generator (TRNG) is widely used in the field of hardware security as a key component that can enhance system security. In this letter, a TRNG with multiple entropy sources is proposed based on random telegraph noise (RTN) and ring oscillator (RO) jitter. The proposed TRNG utilizes a direct digital synthesizer (DDS) to realize the phase control and frequency control for generating random sequences, which is implemented on field programmable gate array (FPGA). By using the NIST randomness test suite, the resilience and randomness characteristics are verified with the compact cell design of four RO and one RTN source. Moreover, the proposed TRNG achieves random sequence output with a maximum frequency of 200 MHz and low energy consumption of 5 pJ/bit. Our findings may provide great potential for developing high-performance and security hardware devices and systems.
真随机数发生器(TRNG)作为增强系统安全性的关键部件,广泛应用于硬件安全领域。本文提出了一种基于随机电报噪声(RTN)和环形振荡器(RO)抖动的多熵源TRNG。该TRNG利用直接数字合成器(DDS)实现随机序列的相位控制和频率控制,并在现场可编程门阵列(FPGA)上实现。利用NIST随机测试套件,通过四个RO和一个RTN源的紧凑电池设计,验证了弹性和随机性特性。此外,所提出的TRNG实现了随机序列输出,最大频率为200 MHz,能耗低至5 pJ/bit。我们的发现可能为开发高性能和安全的硬件设备和系统提供巨大的潜力。
{"title":"High-Speed True Random Number Generator With Multiple Entropy Sources: Ring Oscillator Jitter and Random Telegraph Noise","authors":"Yizhi Liu;Yifan Wu;Bo Chen;Pengpeng Sang;Jixuan Wu;Xiangye Wei;Xuepeng Zhan;Jiezhi Chen","doi":"10.1109/LES.2025.3549110","DOIUrl":"https://doi.org/10.1109/LES.2025.3549110","url":null,"abstract":"True random number generator (TRNG) is widely used in the field of hardware security as a key component that can enhance system security. In this letter, a TRNG with multiple entropy sources is proposed based on random telegraph noise (RTN) and ring oscillator (RO) jitter. The proposed TRNG utilizes a direct digital synthesizer (DDS) to realize the phase control and frequency control for generating random sequences, which is implemented on field programmable gate array (FPGA). By using the NIST randomness test suite, the resilience and randomness characteristics are verified with the compact cell design of four RO and one RTN source. Moreover, the proposed TRNG achieves random sequence output with a maximum frequency of 200 MHz and low energy consumption of 5 pJ/bit. Our findings may provide great potential for developing high-performance and security hardware devices and systems.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"48-51"},"PeriodicalIF":2.0,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Securing RISC-V SoC With Random Clock Self Complementary Countermeasure 利用随机时钟自互补对策保护RISC-V SoC
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-06 DOI: 10.1109/LES.2025.3567317
Titu Mary Ignatius;Roy Paily Palathinkal
The growth of IoT across industries, has made secure communication crucial, especially in resource-limited, power-constrained SoCs. The major contributions of this work are the design of an area and power efficient cryptographic RISC-V SoC by integrating AES hardware accelerator IP with RISC-V, and the development of a secure cryptographic RISC-V SoC using the random clock self complementary (RCSC) countermeasure against power analysis attack (PAA). RCSC enhances security by introducing amplitude and timing variability through complementary operations on random numbers and utilizing variable number of clock cycles per encryption. The design is tested and power traces for PAA were generated on both ASIC and FPGA platforms. The design achieves minimal power overhead of 0.407%and hardware costs of 7.33%, with strong resilience to PAA. It is evidenced by an MTD greater than one million, SNR below 0.5, MI in milli-range, and TVLA within +/-4.5, allowing the SoC to withstand attacks with over one million power traces.
物联网跨行业的发展使得安全通信至关重要,特别是在资源有限、功率受限的soc中。本工作的主要贡献是通过将AES硬件加速器IP与RISC-V集成,设计了一种面积和功耗效率高的加密RISC-V SoC,以及使用随机时钟自互补(RCSC)对抗功率分析攻击(PAA)的安全加密RISC-V SoC。RCSC通过对随机数的互补操作和每次加密使用可变时钟周期数来引入幅度和时间可变性,从而增强了安全性。对该设计进行了测试,并在ASIC和FPGA平台上生成了PAA的功率走线。该设计实现了最小功耗0.407%,硬件成本7.33%,具有较强的抗PAA能力。MTD大于100万,信噪比低于0.5,MI在毫米范围内,TVLA在+/-4.5范围内,这证明了这一点,使SoC能够承受超过100万条电源走线的攻击。
{"title":"Securing RISC-V SoC With Random Clock Self Complementary Countermeasure","authors":"Titu Mary Ignatius;Roy Paily Palathinkal","doi":"10.1109/LES.2025.3567317","DOIUrl":"https://doi.org/10.1109/LES.2025.3567317","url":null,"abstract":"The growth of IoT across industries, has made secure communication crucial, especially in resource-limited, power-constrained SoCs. The major contributions of this work are the design of an area and power efficient cryptographic RISC-V SoC by integrating AES hardware accelerator IP with RISC-V, and the development of a secure cryptographic RISC-V SoC using the random clock self complementary (RCSC) countermeasure against power analysis attack (PAA). RCSC enhances security by introducing amplitude and timing variability through complementary operations on random numbers and utilizing variable number of clock cycles per encryption. The design is tested and power traces for PAA were generated on both ASIC and FPGA platforms. The design achieves minimal power overhead of 0.407%and hardware costs of 7.33%, with strong resilience to PAA. It is evidenced by an MTD greater than one million, SNR below 0.5, MI in milli-range, and TVLA within +/-4.5, allowing the SoC to withstand attacks with over one million power traces.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"52-55"},"PeriodicalIF":2.0,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Embedded Systems Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1