首页 > 最新文献

IEEE Embedded Systems Letters最新文献

英文 中文
Two Novel Approximate Radix-4 Booth Encoders for Efficient Signed Approximate Booth Multipliers 两种新的近似基数-4展位编码器用于有效的有符号近似展位乘法器
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-06 DOI: 10.1109/LES.2025.3567482
Xiqin Tang;Yang Li;Weijia Liu;Delong Shang
In this article, two approximate Radix-4 Booth encoders Approximate Radix-4 Booth Encoder1, 2 (AR4BE1 and AR4BE2) are. proposed to achieve great accuracy-cost tradeoff, and a series of signed approximation multipliers (SAMs) are designed based on a hybrid coding approach, where the most significant bits (MSBs) use exact radix-4 encoder, and the least significant bits (LSBs) use the proposed AR4BE1 and AR4BE2. And a simple error compensation circuit is used to reduce the accuracy loss in partial product array reduction. The proposed SAMs can be reconfigured for different applications with different accuracy-cost tradeoff requirement. We comprehensively evaluate the SAMs with respect to both hardware implementation and error analysis. Compared to the exact multiplier, the highest accuracy multiplier H10M6L0, has 22.8%, 32.1%, and 18.4% reduction in area, power consumption, and delay, respectively. The case study for image edge detection also proves the validity of the proposed SAMs in error-tolerant application.
在本文中,两个近似Radix-4展位编码器近似Radix-4展位编码器1,2 (AR4BE1和AR4BE2)是。基于混合编码方法设计了一系列有符号近似乘法器(sam),其中最高有效位(msb)使用精确的基数4编码器,最低有效位(LSBs)使用建议的AR4BE1和AR4BE2。采用简单的误差补偿电路,降低了部分积阵列约简的精度损失。所提出的地对空导弹可以针对不同的应用进行重新配置,以满足不同的精度-成本权衡要求。我们从硬件实现和误差分析两方面全面评估了地对空导弹。与精确乘法器相比,最高精度乘法器H10M6L0的面积、功耗和延迟分别减少22.8%、32.1%和18.4%。通过对图像边缘检测的实例研究,验证了该方法在容错应用中的有效性。
{"title":"Two Novel Approximate Radix-4 Booth Encoders for Efficient Signed Approximate Booth Multipliers","authors":"Xiqin Tang;Yang Li;Weijia Liu;Delong Shang","doi":"10.1109/LES.2025.3567482","DOIUrl":"https://doi.org/10.1109/LES.2025.3567482","url":null,"abstract":"In this article, two approximate Radix-4 Booth encoders Approximate Radix-4 Booth Encoder1, 2 (AR4BE1 and AR4BE2) are. proposed to achieve great accuracy-cost tradeoff, and a series of signed approximation multipliers (SAMs) are designed based on a hybrid coding approach, where the most significant bits (MSBs) use exact radix-4 encoder, and the least significant bits (LSBs) use the proposed AR4BE1 and AR4BE2. And a simple error compensation circuit is used to reduce the accuracy loss in partial product array reduction. The proposed SAMs can be reconfigured for different applications with different accuracy-cost tradeoff requirement. We comprehensively evaluate the SAMs with respect to both hardware implementation and error analysis. Compared to the exact multiplier, the highest accuracy multiplier H10M6L0, has 22.8%, 32.1%, and 18.4% reduction in area, power consumption, and delay, respectively. The case study for image edge detection also proves the validity of the proposed SAMs in error-tolerant application.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"73-76"},"PeriodicalIF":2.0,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep-Learning-Based Visual Aid for Low Vision 基于深度学习的低视力视觉辅助
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-06 DOI: 10.1109/LES.2025.3548959
Rodolfo Bonnin;Claudio Delrieux;María Fabiana Piccoli
Low vision significantly impacts daily navigation, affecting an estimated 295 million people globally (Bourne et al., 2021). While Head-Mounted Displays (hereafter HMDs) have been used throughout the latest decades to mitigate these limitations, to our knowledge, no low-cost existing system utilizes real-time object detection within an HMD for navigation assistance specifically designed for the visually impaired. This letter presents the development and evaluation of a real-time navigation assistance device with automatic detection and highlighting of visual Points of Interest over the visual field, deployed on a resource-constrained embedded platform. We leverage state-of-the-art YOLO models (v8, v9, and v10), optimized for execution on a Raspberry Pi 5. Using optimized embedded systems inference engines, we investigate the tradeoffs between accuracy, speed, and power consumption. Using a custom-built dataset and publicly available benchmark files, our experimental results demonstrate the feasibility of achieving real-time performance with acceptable accuracy on a low-cost, portable device, enabling practical semantics-aware assistive technology for the visually impaired.
低视力严重影响日常导航,全球约有2.95亿人受到影响(Bourne et al., 2021)。虽然头戴式显示器(以下简称HMD)在过去的几十年里一直在使用,以减轻这些限制,但据我们所知,目前还没有低成本的现有系统在头戴式显示器中利用实时物体检测来帮助视力受损的人进行导航。这封信介绍了一种实时导航辅助设备的开发和评估,该设备具有自动检测和突出显示视野上的视觉兴趣点,部署在资源受限的嵌入式平台上。我们利用最先进的YOLO模型(v8、v9和v10),针对Raspberry Pi 5进行了优化。使用优化的嵌入式系统推理引擎,我们研究了精度、速度和功耗之间的权衡。使用定制的数据集和公开可用的基准文件,我们的实验结果证明了在低成本的便携式设备上以可接受的精度实现实时性能的可行性,从而为视障人士提供实用的语义感知辅助技术。
{"title":"Deep-Learning-Based Visual Aid for Low Vision","authors":"Rodolfo Bonnin;Claudio Delrieux;María Fabiana Piccoli","doi":"10.1109/LES.2025.3548959","DOIUrl":"https://doi.org/10.1109/LES.2025.3548959","url":null,"abstract":"Low vision significantly impacts daily navigation, affecting an estimated 295 million people globally (Bourne et al., 2021). While Head-Mounted Displays (hereafter HMDs) have been used throughout the latest decades to mitigate these limitations, to our knowledge, no low-cost existing system utilizes real-time object detection within an HMD for navigation assistance specifically designed for the visually impaired. This letter presents the development and evaluation of a real-time navigation assistance device with automatic detection and highlighting of visual Points of Interest over the visual field, deployed on a resource-constrained embedded platform. We leverage state-of-the-art YOLO models (v8, v9, and v10), optimized for execution on a Raspberry Pi 5. Using optimized embedded systems inference engines, we investigate the tradeoffs between accuracy, speed, and power consumption. Using a custom-built dataset and publicly available benchmark files, our experimental results demonstrate the feasibility of achieving real-time performance with acceptable accuracy on a low-cost, portable device, enabling practical semantics-aware assistive technology for the visually impaired.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"398-401"},"PeriodicalIF":2.0,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TVI PUF: A Temperature- and Voltage-Independent PUF With High Resistance to Modeling Attacks TVI PUF:一种与温度和电压无关的PUF,具有很高的抗建模攻击能力
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-06 DOI: 10.1109/LES.2025.3548895
Abolfazl Rajaiyan;Seyed Mojtaba Atarodi
Key generation in cryptography is very important, as it helps secure digital communication. physically unclonable functions (PUFs) are among the circuits that ensure security at the hardware level by generating unique keys. However, PUFs are often affected by changes in temperature and voltage, which can impact their reliability. Additionally, PUFs are vulnerable to electromagnetic and modeling attacks, making it essential to use circuits that can resist such threats. In this letter, a temperature- and voltage-independent PUF (TVI PUF) is proposed. This circuit uses a novel two-step (pipeline) structure. This circuit incorporates pipelining and challenge obfuscation. This leads to enhanced reliability and improved resistance to modeling attacks. The proposed circuit is implemented on a Xilinx Zynq7 FPGA and tested under various conditions. The proposed circuit is resilient against both modeling and electromagnetic attacks and operates reliably across a temperature range of $- 40~^{circ }$ C to $150~^{circ }$ C. Measurement results show that the proposed circuit achieves a bit error rate (BER) of $1.21times 10{^{-}11 }$ , uniformity of 50.11%, and uniqueness of 49.89%. The proposed TVI PUF shows a vulnerability rate of 50.47% to modeling attacks. The high reliability (approximately zero BER) and low vulnerability to modeling attacks highlight the advantages of this circuit.
密钥生成在密码学中非常重要,因为它有助于数字通信的安全。物理不可克隆功能(puf)是通过生成唯一密钥来确保硬件级别安全性的电路之一。然而,puf经常受到温度和电压变化的影响,从而影响其可靠性。此外,puf容易受到电磁和建模攻击,因此必须使用能够抵御此类威胁的电路。在这封信中,提出了一个温度和电压无关的PUF (TVI PUF)。该电路采用了一种新颖的两步(管道)结构。该电路结合了流水线和挑战混淆。这将增强可靠性并提高对建模攻击的抵抗力。该电路在Xilinx Zynq7 FPGA上实现,并在各种条件下进行了测试。该电路具有良好的抗建模和抗电磁攻击能力,工作温度范围为$- 40~^{circ}$ C至$150~^{circ}$ C。测试结果表明,该电路的误码率(BER)为$1.21 × 10{^{-}11}$,均匀性为50.11%,唯一性为49.89%。提出的TVI PUF对建模攻击的脆弱性为50.47%。该电路具有高可靠性(近零误码率)和低建模攻击脆弱性的优点。
{"title":"TVI PUF: A Temperature- and Voltage-Independent PUF With High Resistance to Modeling Attacks","authors":"Abolfazl Rajaiyan;Seyed Mojtaba Atarodi","doi":"10.1109/LES.2025.3548895","DOIUrl":"https://doi.org/10.1109/LES.2025.3548895","url":null,"abstract":"Key generation in cryptography is very important, as it helps secure digital communication. physically unclonable functions (PUFs) are among the circuits that ensure security at the hardware level by generating unique keys. However, PUFs are often affected by changes in temperature and voltage, which can impact their reliability. Additionally, PUFs are vulnerable to electromagnetic and modeling attacks, making it essential to use circuits that can resist such threats. In this letter, a temperature- and voltage-independent PUF (TVI PUF) is proposed. This circuit uses a novel two-step (pipeline) structure. This circuit incorporates pipelining and challenge obfuscation. This leads to enhanced reliability and improved resistance to modeling attacks. The proposed circuit is implemented on a Xilinx Zynq7 FPGA and tested under various conditions. The proposed circuit is resilient against both modeling and electromagnetic attacks and operates reliably across a temperature range of <inline-formula> <tex-math>$- 40~^{circ }$ </tex-math></inline-formula>C to <inline-formula> <tex-math>$150~^{circ }$ </tex-math></inline-formula>C. Measurement results show that the proposed circuit achieves a bit error rate (BER) of <inline-formula> <tex-math>$1.21times 10{^{-}11 }$ </tex-math></inline-formula>, uniformity of 50.11%, and uniqueness of 49.89%. The proposed TVI PUF shows a vulnerability rate of 50.47% to modeling attacks. The high reliability (approximately zero BER) and low vulnerability to modeling attacks highlight the advantages of this circuit.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"443-446"},"PeriodicalIF":2.0,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid Storage Class Memory-Enhanced Machine Learning Inference in Embedded Edge Devices 嵌入式边缘设备中混合存储类内存增强的机器学习推理
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-05 DOI: 10.1109/LES.2025.3566669
Hao Sun;Xiaoran Hao;Mande Xie
With the rapid proliferation of machine learning inference (MLI) tasks on Internet of Things (IoT) devices, the demand for enhanced memory system performance in these devices has become increasingly critical. This letter proposes a hybrid storage class memory (HSCM) optimization method to improve the efficiency of MLI on IoT devices. We conducted an in-depth analysis of memory access characteristics in MLI scenarios and found that the diversity and frequency of data access patterns significantly affect storage performance. Based on these characteristics, we propose a hybrid memory migration management strategy using the Markov model, which can dynamically adjust data migration between different storage levels to adapt to changing access patterns. We designed an acceleration method based on HSCM that improves inference speed and energy efficiency by optimizing data access paths and reducing latency. Experiments demonstrate that the proposed method outperforms existing memory systems, providing a new solution for efficient MLI on embedded IoT devices.
随着物联网(IoT)设备上机器学习推理(MLI)任务的快速增长,这些设备对增强存储系统性能的需求变得越来越重要。本文提出了一种混合存储类内存(HSCM)优化方法,以提高IoT设备上MLI的效率。我们对MLI场景中的内存访问特性进行了深入分析,发现数据访问模式的多样性和频率显著影响存储性能。基于这些特点,我们提出了一种基于马尔可夫模型的混合内存迁移管理策略,该策略可以动态调整不同存储级别之间的数据迁移,以适应不断变化的访问模式。我们设计了一种基于HSCM的加速方法,通过优化数据访问路径和减少延迟来提高推理速度和能源效率。实验表明,该方法优于现有的存储系统,为嵌入式物联网设备上的高效MLI提供了一种新的解决方案。
{"title":"Hybrid Storage Class Memory-Enhanced Machine Learning Inference in Embedded Edge Devices","authors":"Hao Sun;Xiaoran Hao;Mande Xie","doi":"10.1109/LES.2025.3566669","DOIUrl":"https://doi.org/10.1109/LES.2025.3566669","url":null,"abstract":"With the rapid proliferation of machine learning inference (MLI) tasks on Internet of Things (IoT) devices, the demand for enhanced memory system performance in these devices has become increasingly critical. This letter proposes a hybrid storage class memory (HSCM) optimization method to improve the efficiency of MLI on IoT devices. We conducted an in-depth analysis of memory access characteristics in MLI scenarios and found that the diversity and frequency of data access patterns significantly affect storage performance. Based on these characteristics, we propose a hybrid memory migration management strategy using the Markov model, which can dynamically adjust data migration between different storage levels to adapt to changing access patterns. We designed an acceleration method based on HSCM that improves inference speed and energy efficiency by optimizing data access paths and reducing latency. Experiments demonstrate that the proposed method outperforms existing memory systems, providing a new solution for efficient MLI on embedded IoT devices.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"15-18"},"PeriodicalIF":2.0,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-Level Synthesis-Based Forensic Watermarking of Hardware IPs Using IP Vendor’s DNA Signature 基于IP厂商DNA签名的硬件IP高级综合取证水印
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-05 DOI: 10.1109/LES.2025.3566674
Anirban Sengupta;Nabendu Bhui;Aditya Anshul
The threats of hardware intellectual property (IP) piracy and fraud IP ownership claim pose a grave concern for the hardware security community. This letter proposes a novel high-level synthesis (HLS) methodology that exploits IP vendor’s deoxyribonucleic acid (DNA) signature to form a forensic hardware watermark for covertly embedding into allocation phase of the design synthesis process. The forensic watermark signature is generated by extracting unique DNA components, such as microsatellites and fragments. The proposed approach, when compared with the existing HLS watermarking techniques, revealed enhancement in security in terms of probability of coincidence and tamper tolerance, at nominal design overhead.
硬件知识产权(IP)盗版和IP所有权欺诈的威胁引起了硬件安全界的严重关注。本文提出了一种新的高级合成(HLS)方法,该方法利用IP供应商的脱氧核糖核酸(DNA)签名来形成一个法医硬件水印,以隐蔽地嵌入到设计合成过程的分配阶段。法医水印签名是通过提取独特的DNA成分,如微卫星和碎片生成的。与现有的HLS水印技术相比,所提出的方法在名义设计开销下,在巧合概率和篡改容忍度方面增强了安全性。
{"title":"High-Level Synthesis-Based Forensic Watermarking of Hardware IPs Using IP Vendor’s DNA Signature","authors":"Anirban Sengupta;Nabendu Bhui;Aditya Anshul","doi":"10.1109/LES.2025.3566674","DOIUrl":"https://doi.org/10.1109/LES.2025.3566674","url":null,"abstract":"The threats of hardware intellectual property (IP) piracy and fraud IP ownership claim pose a grave concern for the hardware security community. This letter proposes a novel high-level synthesis (HLS) methodology that exploits IP vendor’s deoxyribonucleic acid (DNA) signature to form a forensic hardware watermark for covertly embedding into allocation phase of the design synthesis process. The forensic watermark signature is generated by extracting unique DNA components, such as microsatellites and fragments. The proposed approach, when compared with the existing HLS watermarking techniques, revealed enhancement in security in terms of probability of coincidence and tamper tolerance, at nominal design overhead.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"19-22"},"PeriodicalIF":2.0,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Sample-Based, Multistage Machine Learning Pipeline for Scalable IoT Threat Detection 用于可扩展物联网威胁检测的基于样本的多阶段机器学习管道
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-05 DOI: 10.1109/LES.2025.3567025
Marcelo V. C. Aragão;Tiago de M. Pereira;Felipe A. P. de Figueiredo;Samuel B. Mafra
The rapid growth of IoT devices demands scalable and efficient threat detection solutions. This article introduces a sample-based, multistage machine learning (ML) pipeline for IoT threat detection using the CICIoT2023 dataset, integrating feature selection, data balancing, and hyperparameter optimization to improve detection accuracy while reducing the computational overhead associated with training. We evaluate LightGBM, XGBoost, and XGBoostRF across binary, multiclass, and fine-grained tasks, showing that XGBoost with 10% sampling achieves the best tradeoff between accuracy and efficiency. Compared to prior methods, our approach eliminates GPU dependence, maintains low latency, and preserves state-of-the-art performance while enabling scalable training for high generalization capacity. Additionally, we provide model selection guidelines based on dataset complexity and computational constraints. The results show that training with a sample-based approach enables effective threat detection on large datasets, producing models that generalize well to diverse IoT attack scenarios, thus ensuring practical applicability in real-world deployments.
物联网设备的快速增长需要可扩展和高效的威胁检测解决方案。本文介绍了一种基于样本的多阶段机器学习(ML)管道,用于使用CICIoT2023数据集进行物联网威胁检测,集成了特征选择、数据平衡和超参数优化,以提高检测精度,同时减少与训练相关的计算开销。我们在二进制、多类和细粒度任务中评估了LightGBM、XGBoost和XGBoostRF,结果表明,10%采样的XGBoost在精度和效率之间达到了最佳平衡。与之前的方法相比,我们的方法消除了对GPU的依赖,保持了低延迟,并保留了最先进的性能,同时实现了高泛化能力的可扩展训练。此外,我们还提供了基于数据集复杂性和计算约束的模型选择指南。结果表明,使用基于样本的方法进行训练可以在大型数据集上进行有效的威胁检测,生成的模型可以很好地概括各种物联网攻击场景,从而确保在实际部署中的实际适用性。
{"title":"A Sample-Based, Multistage Machine Learning Pipeline for Scalable IoT Threat Detection","authors":"Marcelo V. C. Aragão;Tiago de M. Pereira;Felipe A. P. de Figueiredo;Samuel B. Mafra","doi":"10.1109/LES.2025.3567025","DOIUrl":"https://doi.org/10.1109/LES.2025.3567025","url":null,"abstract":"The rapid growth of IoT devices demands scalable and efficient threat detection solutions. This article introduces a sample-based, multistage machine learning (ML) pipeline for IoT threat detection using the CICIoT2023 dataset, integrating feature selection, data balancing, and hyperparameter optimization to improve detection accuracy while reducing the computational overhead associated with training. We evaluate LightGBM, XGBoost, and XGBoostRF across binary, multiclass, and fine-grained tasks, showing that XGBoost with 10% sampling achieves the best tradeoff between accuracy and efficiency. Compared to prior methods, our approach eliminates GPU dependence, maintains low latency, and preserves state-of-the-art performance while enabling scalable training for high generalization capacity. Additionally, we provide model selection guidelines based on dataset complexity and computational constraints. The results show that training with a sample-based approach enables effective threat detection on large datasets, producing models that generalize well to diverse IoT attack scenarios, thus ensuring practical applicability in real-world deployments.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"36-39"},"PeriodicalIF":2.0,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10985870","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Training and Energy Saving Using Ternary Hardware Acceleration for PCG Classification 基于三元硬件加速的PCG分类高效训练与节能
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-04 DOI: 10.1109/LES.2025.3547961
Nima Eslami;Mohammad Hossein Moaiyeri
Cardiovascular diseases (CVDs) necessitate continuous heart sound monitoring to distinguish normal from abnormal signals for early diagnosis. While wearable devices using phonocardiogram (PCG) technology show promise, energy constraints limit their effectiveness. This letter presents a binary PCG classification method utilizing a ternary neural network with nonvolatile ternary logic gates. Based on negative capacitance carbon nanotube field-effect transistor technology, the design incorporates ternary full adders and multipliers for efficient MAC operations, reducing transistor count and enhancing energy efficiency. Results demonstrate significant improvements in energy dissipation, extending device lifespan and enhancing long-term disease detection capabilities.
心血管疾病(cvd)需要持续的心音监测来区分正常和异常信号,以便早期诊断。虽然使用心音图(PCG)技术的可穿戴设备显示出前景,但能量限制限制了它们的有效性。本文提出了一种利用具有非易失性三元逻辑门的三元神经网络的二元PCG分类方法。该设计基于负电容碳纳米管场效应晶体管技术,采用三元全加法器和乘法器,实现高效的MAC操作,减少晶体管数量,提高能源效率。结果表明,能量耗散显著改善,延长设备寿命,增强长期疾病检测能力。
{"title":"Efficient Training and Energy Saving Using Ternary Hardware Acceleration for PCG Classification","authors":"Nima Eslami;Mohammad Hossein Moaiyeri","doi":"10.1109/LES.2025.3547961","DOIUrl":"https://doi.org/10.1109/LES.2025.3547961","url":null,"abstract":"Cardiovascular diseases (CVDs) necessitate continuous heart sound monitoring to distinguish normal from abnormal signals for early diagnosis. While wearable devices using phonocardiogram (PCG) technology show promise, energy constraints limit their effectiveness. This letter presents a binary PCG classification method utilizing a ternary neural network with nonvolatile ternary logic gates. Based on negative capacitance carbon nanotube field-effect transistor technology, the design incorporates ternary full adders and multipliers for efficient MAC operations, reducing transistor count and enhancing energy efficiency. Results demonstrate significant improvements in energy dissipation, extending device lifespan and enhancing long-term disease detection capabilities.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"447-450"},"PeriodicalIF":2.0,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Control System for Real-Time Driving of LCoS SLM Based on FPGA 基于FPGA的LCoS SLM实时驱动控制系统
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-03 DOI: 10.1109/LES.2025.3547021
Yuwei Jiang;Yingqi Feng;Tao Ren;Wenyu Yang;Li Tian;Hui Wang;Yongxin Zhu;Zunkai Huang
Liquid Crystal on Silicon (LCoS) technology utilizing phase-only modulation offers a wide range of applications. Efficient management of high-speed data flow and complex logic control is crucial for driving LCoS chips. This work presents an FPGA-based control system based for a $2460times 1620$ , 120 Hz LCoS phase modulator. We propose a bandwidth efficient data mapping method by multiplexing the HDMI transmission protocol. The system supports full-resolution display and facilitates real-time projection. Compared to previous implementation schemes, the proposed system achieves a lower power consumption per pixel, as low as 5.70 nW. Additionally, the driven LCoS chip achieves an improved flipping rate of 1.22 kHz. This work demonstrates the effectiveness and feasibility of the proposed system for LCoS applications, providing a robust platform that can be adapted to various implementations.
采用纯相位调制的硅基液晶(LCoS)技术具有广泛的应用前景。高速数据流的高效管理和复杂的逻辑控制是驱动LCoS芯片的关键。本文提出了一种基于fpga的控制系统,该系统基于$2460 × 1620$, 120 Hz LCoS相位调制器。我们提出了一种带宽高效的数据映射方法,通过复用HDMI传输协议。系统支持全分辨率显示,便于实时投影。与以前的实现方案相比,该系统实现了更低的每像素功耗,低至5.70 nW。此外,驱动LCoS芯片实现了1.22 kHz的改进翻转速率。这项工作证明了所提出的lco应用系统的有效性和可行性,提供了一个可以适应各种实现的强大平台。
{"title":"A Control System for Real-Time Driving of LCoS SLM Based on FPGA","authors":"Yuwei Jiang;Yingqi Feng;Tao Ren;Wenyu Yang;Li Tian;Hui Wang;Yongxin Zhu;Zunkai Huang","doi":"10.1109/LES.2025.3547021","DOIUrl":"https://doi.org/10.1109/LES.2025.3547021","url":null,"abstract":"Liquid Crystal on Silicon (LCoS) technology utilizing phase-only modulation offers a wide range of applications. Efficient management of high-speed data flow and complex logic control is crucial for driving LCoS chips. This work presents an FPGA-based control system based for a <inline-formula> <tex-math>$2460times 1620$ </tex-math></inline-formula>, 120 Hz LCoS phase modulator. We propose a bandwidth efficient data mapping method by multiplexing the HDMI transmission protocol. The system supports full-resolution display and facilitates real-time projection. Compared to previous implementation schemes, the proposed system achieves a lower power consumption per pixel, as low as 5.70 nW. Additionally, the driven LCoS chip achieves an improved flipping rate of 1.22 kHz. This work demonstrates the effectiveness and feasibility of the proposed system for LCoS applications, providing a robust platform that can be adapted to various implementations.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"431-434"},"PeriodicalIF":2.0,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A High-Speed ASIC Design of Reed-Solomon Erasure Code (RS-EC) Decoders for Fast Data Recovery in Storage 用于存储中快速数据恢复的Reed-Solomon Erasure Code (RS-EC)解码器的高速ASIC设计
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-03-02 DOI: 10.1109/LES.2025.3566686
Jiaqi Wang;Hongyang Zhao;Jiafeng Cheng;Wenrui Liu;Nengyuan Sun;Heng Sha;Zhiyuan Pan;Ming Jin;Jinghe Wang;Zhaoyi Niu;Jianghong Li;Kai Shi;Jiawei Zhang;Linhan Wang;Kangning Song;Ke Li;Selcuk Kose;Weize Yu
In this letter, a high-speed Reed-Solomon erasure code (RS-EC) decoder circuit is proposed for accelerating the data reconstruction in storage. By utilizing a check matrix instead of a regular Cauchy matrix for encoding the original data, the size of the matrix for decoding the damaged data can be reduced significantly. Moreover, in order to accelerate the matrix inversion related to the RS-EC decoder, the large matrix is partitioned into several small matrices and these small matrices execute the inversion operations individually. Subsequently, an equation method instead of the Gaussian elimination method is proposed for further speeding up the inversions of the small matrices. Eventually, a novel algorithm is proposed for reducing the overall data stream of the RS-EC decoder. The results show that the proposed RS (14, 10) decoder is able to achieve a 6800 MBps throughput, under the synthesis of TSMC 130 nm process design kits (PDK). The corresponding data stream of the decoder is reduced to 55% if only one data block is required to reconstruct.
本文提出了一种高速里德-所罗门擦除码(RS-EC)解码器电路,以加速存储中的数据重构。利用校验矩阵代替常规柯西矩阵对原始数据进行编码,可以显著减小解码损坏数据时所用矩阵的大小。此外,为了加快与RS-EC解码器相关的矩阵反演,将大矩阵划分为几个小矩阵,这些小矩阵分别执行反演操作。随后,提出了一种方程法代替高斯消去法,进一步加快了小矩阵的反演速度。最后,提出了一种减少RS-EC解码器整体数据流的新算法。结果表明,在台积电130 nm制程设计套件(PDK)的合成下,所提出的RS(14,10)解码器可以达到6800 MBps的吞吐量。如果只需要重建一个数据块,则解码器的相应数据流减少到55%。
{"title":"A High-Speed ASIC Design of Reed-Solomon Erasure Code (RS-EC) Decoders for Fast Data Recovery in Storage","authors":"Jiaqi Wang;Hongyang Zhao;Jiafeng Cheng;Wenrui Liu;Nengyuan Sun;Heng Sha;Zhiyuan Pan;Ming Jin;Jinghe Wang;Zhaoyi Niu;Jianghong Li;Kai Shi;Jiawei Zhang;Linhan Wang;Kangning Song;Ke Li;Selcuk Kose;Weize Yu","doi":"10.1109/LES.2025.3566686","DOIUrl":"https://doi.org/10.1109/LES.2025.3566686","url":null,"abstract":"In this letter, a high-speed Reed-Solomon erasure code (RS-EC) decoder circuit is proposed for accelerating the data reconstruction in storage. By utilizing a check matrix instead of a regular Cauchy matrix for encoding the original data, the size of the matrix for decoding the damaged data can be reduced significantly. Moreover, in order to accelerate the matrix inversion related to the RS-EC decoder, the large matrix is partitioned into several small matrices and these small matrices execute the inversion operations individually. Subsequently, an equation method instead of the Gaussian elimination method is proposed for further speeding up the inversions of the small matrices. Eventually, a novel algorithm is proposed for reducing the overall data stream of the RS-EC decoder. The results show that the proposed RS (14, 10) decoder is able to achieve a 6800 MBps throughput, under the synthesis of TSMC 130 nm process design kits (PDK). The corresponding data stream of the decoder is reduced to 55% if only one data block is required to reconstruct.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"18 1","pages":"11-14"},"PeriodicalIF":2.0,"publicationDate":"2025-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146162213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fault Analysis in Induction Motors Through Signal Acquisition With Hilbert Transform and FPGA 基于希尔伯特变换和FPGA信号采集的异步电动机故障分析
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-02-27 DOI: 10.1109/LES.2025.3546847
I. A. Juarez-Trujillo;Susana De Le贸n-Aldaco;J. G. Velasquez-Aguilar;J. Aguayo-Alquicira
This study presents an innovative methodology for the detection of faults in electric motors, specifically in single-phase induction motors, using the Hilbert Transform combined with spectral density analysis. The main innovation lies in the integration of field-programmable gate array (FPGA) technology with the Hilbert Transform for real-time data acquisition and accurate signal analysis, allowing early detection of faults such as short circuits in the coils. The system uses an analog-to-digital converter to capture current and voltage signals, and a first in, first out memory buffer to ensure continuous acquisition without data loss. The Hilbert transform allows the decomposition of the signals and extraction of analytical frequencies, which facilitates the identification of fault-generated harmonics. The results show a significant improvement in fault detection, with the identification of high-frequency harmonics indicating internal problems such as short circuits. This approach, by integrating FPGA for fast signal acquisition and processing, optimizes the monitoring of electric motors, enabling more effective predictive maintenance and reducing downtime in industrial applications. The innovation of this system improves the accuracy and efficiency of fault diagnosis, contributing to the advancement of real-time monitoring technology.
本研究提出了一种利用希尔伯特变换与谱密度分析相结合的方法来检测电动机故障,特别是单相感应电动机故障。主要创新在于将现场可编程门阵列(FPGA)技术与希尔伯特变换(Hilbert Transform)集成在一起,用于实时数据采集和准确的信号分析,从而可以早期检测线圈中的短路等故障。该系统使用模数转换器捕获电流和电压信号,并使用先进先出存储器缓冲,以确保连续采集而不会丢失数据。希尔伯特变换允许对信号进行分解并提取分析频率,这有助于识别故障产生的谐波。结果表明,在故障检测方面有了显着的改进,可以识别出高频谐波,表明内部问题,如短路。这种方法通过集成FPGA进行快速信号采集和处理,优化了对电动机的监控,实现了更有效的预测性维护并减少了工业应用中的停机时间。该系统的创新提高了故障诊断的准确性和效率,促进了实时监测技术的发展。
{"title":"Fault Analysis in Induction Motors Through Signal Acquisition With Hilbert Transform and FPGA","authors":"I. A. Juarez-Trujillo;Susana De Le贸n-Aldaco;J. G. Velasquez-Aguilar;J. Aguayo-Alquicira","doi":"10.1109/LES.2025.3546847","DOIUrl":"https://doi.org/10.1109/LES.2025.3546847","url":null,"abstract":"This study presents an innovative methodology for the detection of faults in electric motors, specifically in single-phase induction motors, using the Hilbert Transform combined with spectral density analysis. The main innovation lies in the integration of field-programmable gate array (FPGA) technology with the Hilbert Transform for real-time data acquisition and accurate signal analysis, allowing early detection of faults such as short circuits in the coils. The system uses an analog-to-digital converter to capture current and voltage signals, and a first in, first out memory buffer to ensure continuous acquisition without data loss. The Hilbert transform allows the decomposition of the signals and extraction of analytical frequencies, which facilitates the identification of fault-generated harmonics. The results show a significant improvement in fault detection, with the identification of high-frequency harmonics indicating internal problems such as short circuits. This approach, by integrating FPGA for fast signal acquisition and processing, optimizes the monitoring of electric motors, enabling more effective predictive maintenance and reducing downtime in industrial applications. The innovation of this system improves the accuracy and efficiency of fault diagnosis, contributing to the advancement of real-time monitoring technology.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"390-393"},"PeriodicalIF":2.0,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Embedded Systems Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1