首页 > 最新文献

IEEE Embedded Systems Letters最新文献

英文 中文
Two Novel Approximate Radix-4 Booth Encoders for Efficient Signed Approximate Booth Multipliers 两种新的近似基数-4展位编码器用于有效的有符号近似展位乘法器
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-06 DOI: 10.1109/LES.2025.3567482
Xiqin Tang;Yang Li;Weijia Liu;Delong Shang
In this article, two approximate Radix-4 Booth encoders Approximate Radix-4 Booth Encoder1, 2 (AR4BE1 and AR4BE2) are. proposed to achieve great accuracy-cost tradeoff, and a series of signed approximation multipliers (SAMs) are designed based on a hybrid coding approach, where the most significant bits (MSBs) use exact radix-4 encoder, and the least significant bits (LSBs) use the proposed AR4BE1 and AR4BE2. And a simple error compensation circuit is used to reduce the accuracy loss in partial product array reduction. The proposed SAMs can be reconfigured for different applications with different accuracy-cost tradeoff requirement. We comprehensively evaluate the SAMs with respect to both hardware implementation and error analysis. Compared to the exact multiplier, the highest accuracy multiplier H10M6L0, has 22.8%, 32.1%, and 18.4% reduction in area, power consumption, and delay, respectively. The case study for image edge detection also proves the validity of the proposed SAMs in error-tolerant application.
在本文中,两个近似Radix-4展位编码器近似Radix-4展位编码器1,2 (AR4BE1和AR4BE2)是。基于混合编码方法设计了一系列有符号近似乘法器(sam),其中最高有效位(msb)使用精确的基数4编码器,最低有效位(LSBs)使用建议的AR4BE1和AR4BE2。采用简单的误差补偿电路,降低了部分积阵列约简的精度损失。所提出的地对空导弹可以针对不同的应用进行重新配置,以满足不同的精度-成本权衡要求。我们从硬件实现和误差分析两方面全面评估了地对空导弹。与精确乘法器相比,最高精度乘法器H10M6L0的面积、功耗和延迟分别减少22.8%、32.1%和18.4%。通过对图像边缘检测的实例研究,验证了该方法在容错应用中的有效性。
{"title":"Two Novel Approximate Radix-4 Booth Encoders for Efficient Signed Approximate Booth Multipliers","authors":"Xiqin Tang;Yang Li;Weijia Liu;Delong Shang","doi":"10.1109/LES.2025.3567482","DOIUrl":"https://doi.org/10.1109/LES.2025.3567482","url":null,"abstract":"In this article, two approximate Radix-4 Booth encoders Approximate Radix-4 Booth Encoder1, 2 (AR4BE1 and AR4BE2) are. proposed to achieve great accuracy-cost tradeoff, and a series of signed approximation multipliers (SAMs) are designed based on a hybrid coding approach, where the most significant bits (MSBs) use exact radix-4 encoder, and the least significant bits (LSBs) use the proposed AR4BE1 and AR4BE2. And a simple error compensation circuit is used to reduce the accuracy loss in partial product array reduction. The proposed SAMs can be reconfigured for different applications with different accuracy-cost tradeoff requirement. We comprehensively evaluate the SAMs with respect to both hardware implementation and error analysis. Compared to the exact multiplier, the highest accuracy multiplier H10M6L0, has 22.8%, 32.1%, and 18.4% reduction in area, power consumption, and delay, respectively. The case study for image edge detection also proves the validity of the proposed SAMs in error-tolerant application.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"73-76"},"PeriodicalIF":2.0,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep-Learning-Based Visual Aid for Low Vision 基于深度学习的低视力视觉辅助
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-06 DOI: 10.1109/LES.2025.3548959
Rodolfo Bonnin;Claudio Delrieux;María Fabiana Piccoli
Low vision significantly impacts daily navigation, affecting an estimated 295 million people globally (Bourne et al., 2021). While Head-Mounted Displays (hereafter HMDs) have been used throughout the latest decades to mitigate these limitations, to our knowledge, no low-cost existing system utilizes real-time object detection within an HMD for navigation assistance specifically designed for the visually impaired. This letter presents the development and evaluation of a real-time navigation assistance device with automatic detection and highlighting of visual Points of Interest over the visual field, deployed on a resource-constrained embedded platform. We leverage state-of-the-art YOLO models (v8, v9, and v10), optimized for execution on a Raspberry Pi 5. Using optimized embedded systems inference engines, we investigate the tradeoffs between accuracy, speed, and power consumption. Using a custom-built dataset and publicly available benchmark files, our experimental results demonstrate the feasibility of achieving real-time performance with acceptable accuracy on a low-cost, portable device, enabling practical semantics-aware assistive technology for the visually impaired.
低视力严重影响日常导航,全球约有2.95亿人受到影响(Bourne et al., 2021)。虽然头戴式显示器(以下简称HMD)在过去的几十年里一直在使用,以减轻这些限制,但据我们所知,目前还没有低成本的现有系统在头戴式显示器中利用实时物体检测来帮助视力受损的人进行导航。这封信介绍了一种实时导航辅助设备的开发和评估,该设备具有自动检测和突出显示视野上的视觉兴趣点,部署在资源受限的嵌入式平台上。我们利用最先进的YOLO模型(v8、v9和v10),针对Raspberry Pi 5进行了优化。使用优化的嵌入式系统推理引擎,我们研究了精度、速度和功耗之间的权衡。使用定制的数据集和公开可用的基准文件,我们的实验结果证明了在低成本的便携式设备上以可接受的精度实现实时性能的可行性,从而为视障人士提供实用的语义感知辅助技术。
{"title":"Deep-Learning-Based Visual Aid for Low Vision","authors":"Rodolfo Bonnin;Claudio Delrieux;María Fabiana Piccoli","doi":"10.1109/LES.2025.3548959","DOIUrl":"https://doi.org/10.1109/LES.2025.3548959","url":null,"abstract":"Low vision significantly impacts daily navigation, affecting an estimated 295 million people globally (Bourne et al., 2021). While Head-Mounted Displays (hereafter HMDs) have been used throughout the latest decades to mitigate these limitations, to our knowledge, no low-cost existing system utilizes real-time object detection within an HMD for navigation assistance specifically designed for the visually impaired. This letter presents the development and evaluation of a real-time navigation assistance device with automatic detection and highlighting of visual Points of Interest over the visual field, deployed on a resource-constrained embedded platform. We leverage state-of-the-art YOLO models (v8, v9, and v10), optimized for execution on a Raspberry Pi 5. Using optimized embedded systems inference engines, we investigate the tradeoffs between accuracy, speed, and power consumption. Using a custom-built dataset and publicly available benchmark files, our experimental results demonstrate the feasibility of achieving real-time performance with acceptable accuracy on a low-cost, portable device, enabling practical semantics-aware assistive technology for the visually impaired.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"398-401"},"PeriodicalIF":2.0,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TVI PUF: A Temperature- and Voltage-Independent PUF With High Resistance to Modeling Attacks TVI PUF:一种与温度和电压无关的PUF,具有很高的抗建模攻击能力
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-06 DOI: 10.1109/LES.2025.3548895
Abolfazl Rajaiyan;Seyed Mojtaba Atarodi
Key generation in cryptography is very important, as it helps secure digital communication. physically unclonable functions (PUFs) are among the circuits that ensure security at the hardware level by generating unique keys. However, PUFs are often affected by changes in temperature and voltage, which can impact their reliability. Additionally, PUFs are vulnerable to electromagnetic and modeling attacks, making it essential to use circuits that can resist such threats. In this letter, a temperature- and voltage-independent PUF (TVI PUF) is proposed. This circuit uses a novel two-step (pipeline) structure. This circuit incorporates pipelining and challenge obfuscation. This leads to enhanced reliability and improved resistance to modeling attacks. The proposed circuit is implemented on a Xilinx Zynq7 FPGA and tested under various conditions. The proposed circuit is resilient against both modeling and electromagnetic attacks and operates reliably across a temperature range of $- 40~^{circ }$ C to $150~^{circ }$ C. Measurement results show that the proposed circuit achieves a bit error rate (BER) of $1.21times 10{^{-}11 }$ , uniformity of 50.11%, and uniqueness of 49.89%. The proposed TVI PUF shows a vulnerability rate of 50.47% to modeling attacks. The high reliability (approximately zero BER) and low vulnerability to modeling attacks highlight the advantages of this circuit.
密钥生成在密码学中非常重要,因为它有助于数字通信的安全。物理不可克隆功能(puf)是通过生成唯一密钥来确保硬件级别安全性的电路之一。然而,puf经常受到温度和电压变化的影响,从而影响其可靠性。此外,puf容易受到电磁和建模攻击,因此必须使用能够抵御此类威胁的电路。在这封信中,提出了一个温度和电压无关的PUF (TVI PUF)。该电路采用了一种新颖的两步(管道)结构。该电路结合了流水线和挑战混淆。这将增强可靠性并提高对建模攻击的抵抗力。该电路在Xilinx Zynq7 FPGA上实现,并在各种条件下进行了测试。该电路具有良好的抗建模和抗电磁攻击能力,工作温度范围为$- 40~^{circ}$ C至$150~^{circ}$ C。测试结果表明,该电路的误码率(BER)为$1.21 × 10{^{-}11}$,均匀性为50.11%,唯一性为49.89%。提出的TVI PUF对建模攻击的脆弱性为50.47%。该电路具有高可靠性(近零误码率)和低建模攻击脆弱性的优点。
{"title":"TVI PUF: A Temperature- and Voltage-Independent PUF With High Resistance to Modeling Attacks","authors":"Abolfazl Rajaiyan;Seyed Mojtaba Atarodi","doi":"10.1109/LES.2025.3548895","DOIUrl":"https://doi.org/10.1109/LES.2025.3548895","url":null,"abstract":"Key generation in cryptography is very important, as it helps secure digital communication. physically unclonable functions (PUFs) are among the circuits that ensure security at the hardware level by generating unique keys. However, PUFs are often affected by changes in temperature and voltage, which can impact their reliability. Additionally, PUFs are vulnerable to electromagnetic and modeling attacks, making it essential to use circuits that can resist such threats. In this letter, a temperature- and voltage-independent PUF (TVI PUF) is proposed. This circuit uses a novel two-step (pipeline) structure. This circuit incorporates pipelining and challenge obfuscation. This leads to enhanced reliability and improved resistance to modeling attacks. The proposed circuit is implemented on a Xilinx Zynq7 FPGA and tested under various conditions. The proposed circuit is resilient against both modeling and electromagnetic attacks and operates reliably across a temperature range of <inline-formula> <tex-math>$- 40~^{circ }$ </tex-math></inline-formula>C to <inline-formula> <tex-math>$150~^{circ }$ </tex-math></inline-formula>C. Measurement results show that the proposed circuit achieves a bit error rate (BER) of <inline-formula> <tex-math>$1.21times 10{^{-}11 }$ </tex-math></inline-formula>, uniformity of 50.11%, and uniqueness of 49.89%. The proposed TVI PUF shows a vulnerability rate of 50.47% to modeling attacks. The high reliability (approximately zero BER) and low vulnerability to modeling attacks highlight the advantages of this circuit.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"443-446"},"PeriodicalIF":2.0,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid Storage Class Memory-Enhanced Machine Learning Inference in Embedded Edge Devices 嵌入式边缘设备中混合存储类内存增强的机器学习推理
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-05 DOI: 10.1109/LES.2025.3566669
Hao Sun;Xiaoran Hao;Mande Xie
With the rapid proliferation of machine learning inference (MLI) tasks on Internet of Things (IoT) devices, the demand for enhanced memory system performance in these devices has become increasingly critical. This letter proposes a hybrid storage class memory (HSCM) optimization method to improve the efficiency of MLI on IoT devices. We conducted an in-depth analysis of memory access characteristics in MLI scenarios and found that the diversity and frequency of data access patterns significantly affect storage performance. Based on these characteristics, we propose a hybrid memory migration management strategy using the Markov model, which can dynamically adjust data migration between different storage levels to adapt to changing access patterns. We designed an acceleration method based on HSCM that improves inference speed and energy efficiency by optimizing data access paths and reducing latency. Experiments demonstrate that the proposed method outperforms existing memory systems, providing a new solution for efficient MLI on embedded IoT devices.
随着物联网(IoT)设备上机器学习推理(MLI)任务的快速增长,这些设备对增强存储系统性能的需求变得越来越重要。本文提出了一种混合存储类内存(HSCM)优化方法,以提高IoT设备上MLI的效率。我们对MLI场景中的内存访问特性进行了深入分析,发现数据访问模式的多样性和频率显著影响存储性能。基于这些特点,我们提出了一种基于马尔可夫模型的混合内存迁移管理策略,该策略可以动态调整不同存储级别之间的数据迁移,以适应不断变化的访问模式。我们设计了一种基于HSCM的加速方法,通过优化数据访问路径和减少延迟来提高推理速度和能源效率。实验表明,该方法优于现有的存储系统,为嵌入式物联网设备上的高效MLI提供了一种新的解决方案。
{"title":"Hybrid Storage Class Memory-Enhanced Machine Learning Inference in Embedded Edge Devices","authors":"Hao Sun;Xiaoran Hao;Mande Xie","doi":"10.1109/LES.2025.3566669","DOIUrl":"https://doi.org/10.1109/LES.2025.3566669","url":null,"abstract":"With the rapid proliferation of machine learning inference (MLI) tasks on Internet of Things (IoT) devices, the demand for enhanced memory system performance in these devices has become increasingly critical. This letter proposes a hybrid storage class memory (HSCM) optimization method to improve the efficiency of MLI on IoT devices. We conducted an in-depth analysis of memory access characteristics in MLI scenarios and found that the diversity and frequency of data access patterns significantly affect storage performance. Based on these characteristics, we propose a hybrid memory migration management strategy using the Markov model, which can dynamically adjust data migration between different storage levels to adapt to changing access patterns. We designed an acceleration method based on HSCM that improves inference speed and energy efficiency by optimizing data access paths and reducing latency. Experiments demonstrate that the proposed method outperforms existing memory systems, providing a new solution for efficient MLI on embedded IoT devices.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"15-18"},"PeriodicalIF":2.0,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-Level Synthesis-Based Forensic Watermarking of Hardware IPs Using IP Vendor’s DNA Signature 基于IP厂商DNA签名的硬件IP高级综合取证水印
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-05 DOI: 10.1109/LES.2025.3566674
Anirban Sengupta;Nabendu Bhui;Aditya Anshul
The threats of hardware intellectual property (IP) piracy and fraud IP ownership claim pose a grave concern for the hardware security community. This letter proposes a novel high-level synthesis (HLS) methodology that exploits IP vendor’s deoxyribonucleic acid (DNA) signature to form a forensic hardware watermark for covertly embedding into allocation phase of the design synthesis process. The forensic watermark signature is generated by extracting unique DNA components, such as microsatellites and fragments. The proposed approach, when compared with the existing HLS watermarking techniques, revealed enhancement in security in terms of probability of coincidence and tamper tolerance, at nominal design overhead.
硬件知识产权(IP)盗版和IP所有权欺诈的威胁引起了硬件安全界的严重关注。本文提出了一种新的高级合成(HLS)方法,该方法利用IP供应商的脱氧核糖核酸(DNA)签名来形成一个法医硬件水印,以隐蔽地嵌入到设计合成过程的分配阶段。法医水印签名是通过提取独特的DNA成分,如微卫星和碎片生成的。与现有的HLS水印技术相比,所提出的方法在名义设计开销下,在巧合概率和篡改容忍度方面增强了安全性。
{"title":"High-Level Synthesis-Based Forensic Watermarking of Hardware IPs Using IP Vendor’s DNA Signature","authors":"Anirban Sengupta;Nabendu Bhui;Aditya Anshul","doi":"10.1109/LES.2025.3566674","DOIUrl":"https://doi.org/10.1109/LES.2025.3566674","url":null,"abstract":"The threats of hardware intellectual property (IP) piracy and fraud IP ownership claim pose a grave concern for the hardware security community. This letter proposes a novel high-level synthesis (HLS) methodology that exploits IP vendor’s deoxyribonucleic acid (DNA) signature to form a forensic hardware watermark for covertly embedding into allocation phase of the design synthesis process. The forensic watermark signature is generated by extracting unique DNA components, such as microsatellites and fragments. The proposed approach, when compared with the existing HLS watermarking techniques, revealed enhancement in security in terms of probability of coincidence and tamper tolerance, at nominal design overhead.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"19-22"},"PeriodicalIF":2.0,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Power Efficient Multiplier Design for Error Resilient Edge Applications 用于纠错边缘应用的功率高效乘法器设计
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-05 DOI: 10.1109/LES.2025.3566818
Manav Rathod;Anakhi Hazarika;Ayush Agrawal;Neeraj Kumar;Arunachaleashwer S.
Edge devices require low-power and efficient hardware to support data-intensive computations in real-time applications. Traditional multipliers are computationally demanding and consume considerable power, however, approximate circuits are used in applications requiring low-power consumption and high-performance. The approximate multiplier is the key arithmetic function in many error-tolerant applications, such as signal processing, image processing, etc. This letter presents a power-efficient approximate multiplier design tailored for error-resilient applications on edge devices. The proposed design employs an approximate 4:2 compressor based on input reordering. Input reordering is used to reduce the number of combinations which leads to low power consumption. The experimental analysis demonstrates the efficacy of the proposed approximate multiplier that achieves power and area savings. The results highlight the viability of DNN workloads in resource-constrained environments and the potential for extending edge device lifespans without compromising the accuracy required for real-time, error-tolerant tasks.
边缘设备需要低功耗和高效的硬件来支持实时应用中的数据密集型计算。传统的乘法器在计算上要求很高,并且消耗相当大的功率,然而,近似电路用于要求低功耗和高性能的应用中。近似乘法器是信号处理、图像处理等容错应用中的关键算法。本文介绍了一种为边缘设备上的容错应用量身定制的节能近似乘法器设计。提出的设计采用基于输入重排序的近似4:2压缩器。输入重排序用于减少组合的数量,从而降低功耗。实验分析证明了所提出的近似乘法器在节省功率和面积方面的有效性。研究结果强调了DNN工作负载在资源受限环境中的可行性,以及在不影响实时、容错任务所需的准确性的情况下延长边缘设备寿命的潜力。
{"title":"Power Efficient Multiplier Design for Error Resilient Edge Applications","authors":"Manav Rathod;Anakhi Hazarika;Ayush Agrawal;Neeraj Kumar;Arunachaleashwer S.","doi":"10.1109/LES.2025.3566818","DOIUrl":"https://doi.org/10.1109/LES.2025.3566818","url":null,"abstract":"Edge devices require low-power and efficient hardware to support data-intensive computations in real-time applications. Traditional multipliers are computationally demanding and consume considerable power, however, approximate circuits are used in applications requiring low-power consumption and high-performance. The approximate multiplier is the key arithmetic function in many error-tolerant applications, such as signal processing, image processing, etc. This letter presents a power-efficient approximate multiplier design tailored for error-resilient applications on edge devices. The proposed design employs an approximate 4:2 compressor based on input reordering. Input reordering is used to reduce the number of combinations which leads to low power consumption. The experimental analysis demonstrates the efficacy of the proposed approximate multiplier that achieves power and area savings. The results highlight the viability of DNN workloads in resource-constrained environments and the potential for extending edge device lifespans without compromising the accuracy required for real-time, error-tolerant tasks.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"23-26"},"PeriodicalIF":2.0,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Sample-Based, Multistage Machine Learning Pipeline for Scalable IoT Threat Detection 用于可扩展物联网威胁检测的基于样本的多阶段机器学习管道
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-05 DOI: 10.1109/LES.2025.3567025
Marcelo V. C. Aragão;Tiago de M. Pereira;Felipe A. P. de Figueiredo;Samuel B. Mafra
The rapid growth of IoT devices demands scalable and efficient threat detection solutions. This article introduces a sample-based, multistage machine learning (ML) pipeline for IoT threat detection using the CICIoT2023 dataset, integrating feature selection, data balancing, and hyperparameter optimization to improve detection accuracy while reducing the computational overhead associated with training. We evaluate LightGBM, XGBoost, and XGBoostRF across binary, multiclass, and fine-grained tasks, showing that XGBoost with 10% sampling achieves the best tradeoff between accuracy and efficiency. Compared to prior methods, our approach eliminates GPU dependence, maintains low latency, and preserves state-of-the-art performance while enabling scalable training for high generalization capacity. Additionally, we provide model selection guidelines based on dataset complexity and computational constraints. The results show that training with a sample-based approach enables effective threat detection on large datasets, producing models that generalize well to diverse IoT attack scenarios, thus ensuring practical applicability in real-world deployments.
物联网设备的快速增长需要可扩展和高效的威胁检测解决方案。本文介绍了一种基于样本的多阶段机器学习(ML)管道,用于使用CICIoT2023数据集进行物联网威胁检测,集成了特征选择、数据平衡和超参数优化,以提高检测精度,同时减少与训练相关的计算开销。我们在二进制、多类和细粒度任务中评估了LightGBM、XGBoost和XGBoostRF,结果表明,10%采样的XGBoost在精度和效率之间达到了最佳平衡。与之前的方法相比,我们的方法消除了对GPU的依赖,保持了低延迟,并保留了最先进的性能,同时实现了高泛化能力的可扩展训练。此外,我们还提供了基于数据集复杂性和计算约束的模型选择指南。结果表明,使用基于样本的方法进行训练可以在大型数据集上进行有效的威胁检测,生成的模型可以很好地概括各种物联网攻击场景,从而确保在实际部署中的实际适用性。
{"title":"A Sample-Based, Multistage Machine Learning Pipeline for Scalable IoT Threat Detection","authors":"Marcelo V. C. Aragão;Tiago de M. Pereira;Felipe A. P. de Figueiredo;Samuel B. Mafra","doi":"10.1109/LES.2025.3567025","DOIUrl":"https://doi.org/10.1109/LES.2025.3567025","url":null,"abstract":"The rapid growth of IoT devices demands scalable and efficient threat detection solutions. This article introduces a sample-based, multistage machine learning (ML) pipeline for IoT threat detection using the CICIoT2023 dataset, integrating feature selection, data balancing, and hyperparameter optimization to improve detection accuracy while reducing the computational overhead associated with training. We evaluate LightGBM, XGBoost, and XGBoostRF across binary, multiclass, and fine-grained tasks, showing that XGBoost with 10% sampling achieves the best tradeoff between accuracy and efficiency. Compared to prior methods, our approach eliminates GPU dependence, maintains low latency, and preserves state-of-the-art performance while enabling scalable training for high generalization capacity. Additionally, we provide model selection guidelines based on dataset complexity and computational constraints. The results show that training with a sample-based approach enables effective threat detection on large datasets, producing models that generalize well to diverse IoT attack scenarios, thus ensuring practical applicability in real-world deployments.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"36-39"},"PeriodicalIF":2.0,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10985870","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Training and Energy Saving Using Ternary Hardware Acceleration for PCG Classification 基于三元硬件加速的PCG分类高效训练与节能
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-04 DOI: 10.1109/LES.2025.3547961
Nima Eslami;Mohammad Hossein Moaiyeri
Cardiovascular diseases (CVDs) necessitate continuous heart sound monitoring to distinguish normal from abnormal signals for early diagnosis. While wearable devices using phonocardiogram (PCG) technology show promise, energy constraints limit their effectiveness. This letter presents a binary PCG classification method utilizing a ternary neural network with nonvolatile ternary logic gates. Based on negative capacitance carbon nanotube field-effect transistor technology, the design incorporates ternary full adders and multipliers for efficient MAC operations, reducing transistor count and enhancing energy efficiency. Results demonstrate significant improvements in energy dissipation, extending device lifespan and enhancing long-term disease detection capabilities.
心血管疾病(cvd)需要持续的心音监测来区分正常和异常信号,以便早期诊断。虽然使用心音图(PCG)技术的可穿戴设备显示出前景,但能量限制限制了它们的有效性。本文提出了一种利用具有非易失性三元逻辑门的三元神经网络的二元PCG分类方法。该设计基于负电容碳纳米管场效应晶体管技术,采用三元全加法器和乘法器,实现高效的MAC操作,减少晶体管数量,提高能源效率。结果表明,能量耗散显著改善,延长设备寿命,增强长期疾病检测能力。
{"title":"Efficient Training and Energy Saving Using Ternary Hardware Acceleration for PCG Classification","authors":"Nima Eslami;Mohammad Hossein Moaiyeri","doi":"10.1109/LES.2025.3547961","DOIUrl":"https://doi.org/10.1109/LES.2025.3547961","url":null,"abstract":"Cardiovascular diseases (CVDs) necessitate continuous heart sound monitoring to distinguish normal from abnormal signals for early diagnosis. While wearable devices using phonocardiogram (PCG) technology show promise, energy constraints limit their effectiveness. This letter presents a binary PCG classification method utilizing a ternary neural network with nonvolatile ternary logic gates. Based on negative capacitance carbon nanotube field-effect transistor technology, the design incorporates ternary full adders and multipliers for efficient MAC operations, reducing transistor count and enhancing energy efficiency. Results demonstrate significant improvements in energy dissipation, extending device lifespan and enhancing long-term disease detection capabilities.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"447-450"},"PeriodicalIF":2.0,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Control System for Real-Time Driving of LCoS SLM Based on FPGA 基于FPGA的LCoS SLM实时驱动控制系统
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-03 DOI: 10.1109/LES.2025.3547021
Yuwei Jiang;Yingqi Feng;Tao Ren;Wenyu Yang;Li Tian;Hui Wang;Yongxin Zhu;Zunkai Huang
Liquid Crystal on Silicon (LCoS) technology utilizing phase-only modulation offers a wide range of applications. Efficient management of high-speed data flow and complex logic control is crucial for driving LCoS chips. This work presents an FPGA-based control system based for a $2460times 1620$ , 120 Hz LCoS phase modulator. We propose a bandwidth efficient data mapping method by multiplexing the HDMI transmission protocol. The system supports full-resolution display and facilitates real-time projection. Compared to previous implementation schemes, the proposed system achieves a lower power consumption per pixel, as low as 5.70 nW. Additionally, the driven LCoS chip achieves an improved flipping rate of 1.22 kHz. This work demonstrates the effectiveness and feasibility of the proposed system for LCoS applications, providing a robust platform that can be adapted to various implementations.
采用纯相位调制的硅基液晶(LCoS)技术具有广泛的应用前景。高速数据流的高效管理和复杂的逻辑控制是驱动LCoS芯片的关键。本文提出了一种基于fpga的控制系统,该系统基于$2460 × 1620$, 120 Hz LCoS相位调制器。我们提出了一种带宽高效的数据映射方法,通过复用HDMI传输协议。系统支持全分辨率显示,便于实时投影。与以前的实现方案相比,该系统实现了更低的每像素功耗,低至5.70 nW。此外,驱动LCoS芯片实现了1.22 kHz的改进翻转速率。这项工作证明了所提出的lco应用系统的有效性和可行性,提供了一个可以适应各种实现的强大平台。
{"title":"A Control System for Real-Time Driving of LCoS SLM Based on FPGA","authors":"Yuwei Jiang;Yingqi Feng;Tao Ren;Wenyu Yang;Li Tian;Hui Wang;Yongxin Zhu;Zunkai Huang","doi":"10.1109/LES.2025.3547021","DOIUrl":"https://doi.org/10.1109/LES.2025.3547021","url":null,"abstract":"Liquid Crystal on Silicon (LCoS) technology utilizing phase-only modulation offers a wide range of applications. Efficient management of high-speed data flow and complex logic control is crucial for driving LCoS chips. This work presents an FPGA-based control system based for a <inline-formula> <tex-math>$2460times 1620$ </tex-math></inline-formula>, 120 Hz LCoS phase modulator. We propose a bandwidth efficient data mapping method by multiplexing the HDMI transmission protocol. The system supports full-resolution display and facilitates real-time projection. Compared to previous implementation schemes, the proposed system achieves a lower power consumption per pixel, as low as 5.70 nW. Additionally, the driven LCoS chip achieves an improved flipping rate of 1.22 kHz. This work demonstrates the effectiveness and feasibility of the proposed system for LCoS applications, providing a robust platform that can be adapted to various implementations.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"431-434"},"PeriodicalIF":2.0,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A High-Speed ASIC Design of Reed-Solomon Erasure Code (RS-EC) Decoders for Fast Data Recovery in Storage 用于存储中快速数据恢复的Reed-Solomon Erasure Code (RS-EC)解码器的高速ASIC设计
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-02 DOI: 10.1109/LES.2025.3566686
Jiaqi Wang;Hongyang Zhao;Jiafeng Cheng;Wenrui Liu;Nengyuan Sun;Heng Sha;Zhiyuan Pan;Ming Jin;Jinghe Wang;Zhaoyi Niu;Jianghong Li;Kai Shi;Jiawei Zhang;Linhan Wang;Kangning Song;Ke Li;Selcuk Kose;Weize Yu
In this letter, a high-speed Reed-Solomon erasure code (RS-EC) decoder circuit is proposed for accelerating the data reconstruction in storage. By utilizing a check matrix instead of a regular Cauchy matrix for encoding the original data, the size of the matrix for decoding the damaged data can be reduced significantly. Moreover, in order to accelerate the matrix inversion related to the RS-EC decoder, the large matrix is partitioned into several small matrices and these small matrices execute the inversion operations individually. Subsequently, an equation method instead of the Gaussian elimination method is proposed for further speeding up the inversions of the small matrices. Eventually, a novel algorithm is proposed for reducing the overall data stream of the RS-EC decoder. The results show that the proposed RS (14, 10) decoder is able to achieve a 6800 MBps throughput, under the synthesis of TSMC 130 nm process design kits (PDK). The corresponding data stream of the decoder is reduced to 55% if only one data block is required to reconstruct.
本文提出了一种高速里德-所罗门擦除码(RS-EC)解码器电路,以加速存储中的数据重构。利用校验矩阵代替常规柯西矩阵对原始数据进行编码,可以显著减小解码损坏数据时所用矩阵的大小。此外,为了加快与RS-EC解码器相关的矩阵反演,将大矩阵划分为几个小矩阵,这些小矩阵分别执行反演操作。随后,提出了一种方程法代替高斯消去法,进一步加快了小矩阵的反演速度。最后,提出了一种减少RS-EC解码器整体数据流的新算法。结果表明,在台积电130 nm制程设计套件(PDK)的合成下,所提出的RS(14,10)解码器可以达到6800 MBps的吞吐量。如果只需要重建一个数据块,则解码器的相应数据流减少到55%。
{"title":"A High-Speed ASIC Design of Reed-Solomon Erasure Code (RS-EC) Decoders for Fast Data Recovery in Storage","authors":"Jiaqi Wang;Hongyang Zhao;Jiafeng Cheng;Wenrui Liu;Nengyuan Sun;Heng Sha;Zhiyuan Pan;Ming Jin;Jinghe Wang;Zhaoyi Niu;Jianghong Li;Kai Shi;Jiawei Zhang;Linhan Wang;Kangning Song;Ke Li;Selcuk Kose;Weize Yu","doi":"10.1109/LES.2025.3566686","DOIUrl":"https://doi.org/10.1109/LES.2025.3566686","url":null,"abstract":"In this letter, a high-speed Reed-Solomon erasure code (RS-EC) decoder circuit is proposed for accelerating the data reconstruction in storage. By utilizing a check matrix instead of a regular Cauchy matrix for encoding the original data, the size of the matrix for decoding the damaged data can be reduced significantly. Moreover, in order to accelerate the matrix inversion related to the RS-EC decoder, the large matrix is partitioned into several small matrices and these small matrices execute the inversion operations individually. Subsequently, an equation method instead of the Gaussian elimination method is proposed for further speeding up the inversions of the small matrices. Eventually, a novel algorithm is proposed for reducing the overall data stream of the RS-EC decoder. The results show that the proposed RS (14, 10) decoder is able to achieve a 6800 MBps throughput, under the synthesis of TSMC 130 nm process design kits (PDK). The corresponding data stream of the decoder is reduced to 55% if only one data block is required to reconstruct.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"11-14"},"PeriodicalIF":2.0,"publicationDate":"2025-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Embedded Systems Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1