首页 > 最新文献

IEEE Embedded Systems Letters最新文献

英文 中文
Low Consumption Monitoring and Estimation of the State of Charge System for a Hybrid Electric Vehicle 混合动力电动汽车充电状态系统的低消耗监控与估算
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-03-08 DOI: 10.1109/LES.2024.3398594
M. A. Sandoval-Chileño;N. Lozada-Castillo;R. Cortez;J. Vazquez-Arenas;A. Luviano-Juárez
Electric vehicles need continuous monitoring of the energy supply all the time to provide the user with accurate information about the available energy. However, an increase in the power consumed by the monitoring system may cause a significant decrease in the vehicle’s autonomy. This letter presents a monitoring and estimation system for an electric car with a hybrid energy supply based on supercapacitors and batteries based on an embedded system capable of monitoring the general current, the current from the battery system, the voltage in the battery system, the voltage in the supercapacitor system, estimating the value of the state of charge from batteries, and showing the measurements to the user with an OLED display. The proposed system performs similarly in the estimation but decreases the energy consumption by 99.9% concerning the laboratory systems.
电动汽车需要一直对能源供应进行持续监控,以便为用户提供有关可用能源的准确信息。然而,监控系统耗电量的增加可能会导致车辆自主性的显著下降。本文介绍了一种基于超级电容器和电池的混合能源供应电动汽车监控和估算系统,该系统基于嵌入式系统,能够监控总电流、电池系统电流、电池系统电压、超级电容器系统电压,估算电池的充电状态值,并通过 OLED 显示屏向用户显示测量结果。与实验室系统相比,建议的系统在估算方面表现相似,但能耗降低了 99.9%。
{"title":"Low Consumption Monitoring and Estimation of the State of Charge System for a Hybrid Electric Vehicle","authors":"M. A. Sandoval-Chileño;N. Lozada-Castillo;R. Cortez;J. Vazquez-Arenas;A. Luviano-Juárez","doi":"10.1109/LES.2024.3398594","DOIUrl":"https://doi.org/10.1109/LES.2024.3398594","url":null,"abstract":"Electric vehicles need continuous monitoring of the energy supply all the time to provide the user with accurate information about the available energy. However, an increase in the power consumed by the monitoring system may cause a significant decrease in the vehicle’s autonomy. This letter presents a monitoring and estimation system for an electric car with a hybrid energy supply based on supercapacitors and batteries based on an embedded system capable of monitoring the general current, the current from the battery system, the voltage in the battery system, the voltage in the supercapacitor system, estimating the value of the state of charge from batteries, and showing the measurements to the user with an OLED display. The proposed system performs similarly in the estimation but decreases the energy consumption by 99.9% concerning the laboratory systems.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"16 3","pages":"311-314"},"PeriodicalIF":1.7,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142090707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RDMA-Based Sampling Port of ARINC-653 基于 RDMA 的 ARINC-653 采样端口
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-03-07 DOI: 10.1109/LES.2024.3373200
Jong-Bin Lee;Sang-Jae Kim;Wook-Hee Kim;Hyun-Wook Jin
ARINC-653, a standard of avionics software platform, defines the sampling communication port that provides only the latest message while discarding old messages. This sampling port is particularly efficient at transmitting sensing data that reflects the actual state of target system without message queueing delay. In this letter, we implement the ARINC-653 sampling port by exploiting remote direct memory access (RDMA) over Ethernet that can directly move data to/from remote memory without CPU intervention on the remote node. We propose two different designs to utilize two-sided RDMA and one-sided RDMA operations, respectively. Performance measurement results show that the sampling port over one-sided RDMA provides lower-communication latency, stronger temporal partitioning, and better-message timeliness than two-sided RDMA and Berkeley sockets-based implementations.
arinc653是一种航空电子软件平台标准,定义了只提供最新消息而丢弃旧消息的采样通信端口。该采样端口在传输反映目标系统实际状态的传感数据时效率特别高,没有消息队列延迟。在这封信中,我们通过利用以太网上的远程直接内存访问(RDMA)来实现arinc653采样端口,该端口可以直接将数据从远程内存移至/移出,而无需远程节点上的CPU干预。我们提出了两种不同的设计,分别利用双边RDMA和单边RDMA操作。性能测量结果表明,与双面RDMA和基于伯克利套接字的实现相比,单侧RDMA上的采样端口提供了更低的通信延迟、更强的时间分区和更好的消息时效性。
{"title":"RDMA-Based Sampling Port of ARINC-653","authors":"Jong-Bin Lee;Sang-Jae Kim;Wook-Hee Kim;Hyun-Wook Jin","doi":"10.1109/LES.2024.3373200","DOIUrl":"10.1109/LES.2024.3373200","url":null,"abstract":"ARINC-653, a standard of avionics software platform, defines the sampling communication port that provides only the latest message while discarding old messages. This sampling port is particularly efficient at transmitting sensing data that reflects the actual state of target system without message queueing delay. In this letter, we implement the ARINC-653 sampling port by exploiting remote direct memory access (RDMA) over Ethernet that can directly move data to/from remote memory without CPU intervention on the remote node. We propose two different designs to utilize two-sided RDMA and one-sided RDMA operations, respectively. Performance measurement results show that the sampling port over one-sided RDMA provides lower-communication latency, stronger temporal partitioning, and better-message timeliness than two-sided RDMA and Berkeley sockets-based implementations.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"16 4","pages":"437-440"},"PeriodicalIF":1.7,"publicationDate":"2024-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10459217","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140075555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TAFT: Thermal-Aware Hybrid Fault-Tolerant Technique for Multicore Embedded Systems TAFT:面向多核嵌入式系统的热感知混合容错技术
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-03-02 DOI: 10.1109/LES.2024.3396058
Amir Hossein Ansari;Mohsen Ansari;Alireza Ejlali
To achieve high reliability, fault-tolerance techniques are exploited, but they may increase power consumption and temperature beyond safe limits. Therefore, power-aware fault-tolerance techniques should be used to manage power and temperature issues. We tolerate both permanent and transient faults through hybrid fault-tolerance techniques. In this letter, at first, we investigate how much power and temperature are increased when a hybrid fault-tolerance technique is applied to multicore embedded systems. Then, we propose a peak-power-aware hybrid fault-tolerant technique to meet the temperature constraint. Transient-temperature-based safe power (T-TSP) is a new power budgeting technique whose calculation is based on the current temperature of the processing core. Assigning dynamic budgets through T-TSP to processing cores allows us to effectively reach the full performance of processing cores. Experiments show that our proposed method reduces peak power and energy consumption on average by 13.5% (up to 50.7%) and 41.8% (up to 67.4%), respectively and improves the schedulability on average by 6.8% (up to 22.4%) compared to state-of-the-art methods while meeting the system reliability target.
为了实现高可靠性,需要利用容错技术,但它们可能会使功耗和温度超出安全限制。因此,应该使用电源感知容错技术来管理电源和温度问题。我们通过混合容错技术容忍永久和短暂的故障。在这封信中,首先,我们研究了当混合容错技术应用于多核嵌入式系统时,功率和温度会增加多少。然后,我们提出了一种峰值功率感知混合容错技术来满足温度约束。基于瞬态温度的安全功率(T-TSP)是一种基于加工核心当前温度计算的新型功率预算技术。通过T-TSP为处理核心分配动态预算使我们能够有效地达到处理核心的全部性能。实验表明,该方法在满足系统可靠性目标的前提下,将峰值功率和能耗分别平均降低13.5%(最高50.7%)和41.8%(最高67.4%),可调度性平均提高6.8%(最高22.4%)。
{"title":"TAFT: Thermal-Aware Hybrid Fault-Tolerant Technique for Multicore Embedded Systems","authors":"Amir Hossein Ansari;Mohsen Ansari;Alireza Ejlali","doi":"10.1109/LES.2024.3396058","DOIUrl":"10.1109/LES.2024.3396058","url":null,"abstract":"To achieve high reliability, fault-tolerance techniques are exploited, but they may increase power consumption and temperature beyond safe limits. Therefore, power-aware fault-tolerance techniques should be used to manage power and temperature issues. We tolerate both permanent and transient faults through hybrid fault-tolerance techniques. In this letter, at first, we investigate how much power and temperature are increased when a hybrid fault-tolerance technique is applied to multicore embedded systems. Then, we propose a peak-power-aware hybrid fault-tolerant technique to meet the temperature constraint. Transient-temperature-based safe power (T-TSP) is a new power budgeting technique whose calculation is based on the current temperature of the processing core. Assigning dynamic budgets through T-TSP to processing cores allows us to effectively reach the full performance of processing cores. Experiments show that our proposed method reduces peak power and energy consumption on average by 13.5% (up to 50.7%) and 41.8% (up to 67.4%), respectively and improves the schedulability on average by 6.8% (up to 22.4%) compared to state-of-the-art methods while meeting the system reliability target.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"16 4","pages":"477-480"},"PeriodicalIF":1.7,"publicationDate":"2024-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140829504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multiplier-Free Discrete Cosine Transform Architecture Using Approximate Full Adder and Subtractor 使用近似全加法器和减法器的无乘法器离散余弦变换架构
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-03-01 DOI: 10.1109/LES.2024.3395900
Elham Esmaeili;Nabiollah Shiri;Mahmood Rafiee;Ayoub Sadeghi
A new approximate full adder (FA) and a new approximate subtractor are presented, both of them have 8 transistors, and their areas are 0.1944 and $0.1689~mu $ m2, respectively. The FA experiences three errors, while the subtractor shows two errors. In both circuits, to improve the speed, output swing, and drivability, the gate diffusion input (GDI) and dynamic threshold (DT) techniques are implemented by carbon nanotube field effect transistor (CNTFET) technology. The FA and subtractor in order are embedded in an 8-bit ripple carry adder (RCA) and an 8-bit subtractor, then they make a new approximate multiplier-free discrete cosine transform (DCT). The 8-point approximate DCT manipulation requires only addition and no multiplication. So, computational complexity is brought down. The DCT shows power delay product (PDP), peak signal-to-noise ratio (PSNR), and a figure of merit (FoM) of 63.61 fJ, 34.96 dB, and 2.39, respectively. The features of the presented approximate DCT confirm its application for image compression and noise removal in medical images.
提出了一种新的近似全加法器(FA)和一种新的近似减法器,它们都有8个晶体管,面积分别为0.1944和$0.1689~mu $ m2。FA出现3个误差,而减法器出现2个误差。在这两种电路中,为了提高速度、输出摆幅和驱动性,栅极扩散输入(GDI)和动态阈值(DT)技术都是通过碳纳米管场效应晶体管(CNTFET)技术实现的。将FA和减法器依次嵌入到8位纹波进位加法器(RCA)和8位减法器中,然后进行新的近似无乘法器离散余弦变换(DCT)。8点近似DCT操作只需要加法而不需要乘法。这样,计算复杂度就降低了。DCT显示功率延迟积(PDP)、峰值信噪比(PSNR)和品质因数(FoM)分别为63.61 fJ、34.96 dB和2.39。所提出的近似DCT的特点证实了它在医学图像压缩和去噪方面的应用。
{"title":"A Multiplier-Free Discrete Cosine Transform Architecture Using Approximate Full Adder and Subtractor","authors":"Elham Esmaeili;Nabiollah Shiri;Mahmood Rafiee;Ayoub Sadeghi","doi":"10.1109/LES.2024.3395900","DOIUrl":"10.1109/LES.2024.3395900","url":null,"abstract":"A new approximate full adder (FA) and a new approximate subtractor are presented, both of them have 8 transistors, and their areas are 0.1944 and \u0000<inline-formula> <tex-math>$0.1689~mu $ </tex-math></inline-formula>\u0000m2, respectively. The FA experiences three errors, while the subtractor shows two errors. In both circuits, to improve the speed, output swing, and drivability, the gate diffusion input (GDI) and dynamic threshold (DT) techniques are implemented by carbon nanotube field effect transistor (CNTFET) technology. The FA and subtractor in order are embedded in an 8-bit ripple carry adder (RCA) and an 8-bit subtractor, then they make a new approximate multiplier-free discrete cosine transform (DCT). The 8-point approximate DCT manipulation requires only addition and no multiplication. So, computational complexity is brought down. The DCT shows power delay product (PDP), peak signal-to-noise ratio (PSNR), and a figure of merit (FoM) of 63.61 fJ, 34.96 dB, and 2.39, respectively. The features of the presented approximate DCT confirm its application for image compression and noise removal in medical images.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"16 4","pages":"441-444"},"PeriodicalIF":1.7,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140829764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Door Knock: Reverse Engineering the MPSoC Layout Through Timing Attack on NoC 敲门:通过对 NoC 的定时攻击逆向工程 MPSoC 布局
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-02-28 DOI: 10.1109/LES.2024.3371106
Dipesh;Urbi Chatterjee
Multiprocessor systems-on-chip (MPSoC) have emerged as highly versatile and efficient platforms suitable for a wide range of applications like multimedia applications and telecommunication architectures. One of the key components in MPSoC is the network-on-chip (NoC), which facilitates the interconnection of various processing elements (PEs), enabling efficient data communication. Several timing attacks, such as Earthquake attack, P+P Firecracker, and P+P Arrow, have been proposed on NoC that exploit the variations in execution times of operations to infer cryptographic keys. In this letter, we propose to leverage the timing attack on NoC to reverse engineer the mapping of each PE onto the MPSoC architecture. To the best of our knowledge, it is the first work that relies on creating the contention between the requests that are sent to different PEs and reveal the layout by just analyzing the reply latency. In the experimental setup, we are able to map PEs for MPSoC consists of Mesh, Torus, Point-to-Point, Ring, and Flattened Butterfly NoC topology with 100% accuracy that can be extremely useful for the attackers in the reconnaissance phase. Further, as the existing mitigation techniques to counter timing attacks are based on the assumption that the contention is created between the packets of secure-insecure domains, they will not be able to mitigate the proposed reverse engineering attack.
多处理器片上系统(MPSoC)已经成为一种高度通用和高效的平台,适用于广泛的应用,如多媒体应用和电信架构。MPSoC的关键组件之一是片上网络(NoC),它促进了各种处理元件(pe)的互连,从而实现高效的数据通信。在NoC上已经提出了几种定时攻击,例如Earthquake攻击、P+P爆竹和P+P箭头,它们利用操作执行时间的变化来推断加密密钥。在这封信中,我们建议利用NoC的定时攻击来逆向工程每个PE到MPSoC架构的映射。据我们所知,第一项工作依赖于在发送到不同pe的请求之间创建争用,并仅通过分析应答延迟来揭示布局。在实验设置中,我们能够以100%的精度映射由Mesh, Torus, Point-to-Point, Ring和flat Butterfly NoC拓扑组成的MPSoC的pe,这对攻击者在侦察阶段非常有用。此外,由于对抗定时攻击的现有缓解技术是基于在安全-不安全域的数据包之间创建争用的假设,因此它们将无法缓解所提议的反向工程攻击。
{"title":"Door Knock: Reverse Engineering the MPSoC Layout Through Timing Attack on NoC","authors":"Dipesh;Urbi Chatterjee","doi":"10.1109/LES.2024.3371106","DOIUrl":"https://doi.org/10.1109/LES.2024.3371106","url":null,"abstract":"Multiprocessor systems-on-chip (MPSoC) have emerged as highly versatile and efficient platforms suitable for a wide range of applications like multimedia applications and telecommunication architectures. One of the key components in MPSoC is the network-on-chip (NoC), which facilitates the interconnection of various processing elements (PEs), enabling efficient data communication. Several timing attacks, such as Earthquake attack, P+P Firecracker, and P+P Arrow, have been proposed on NoC that exploit the variations in execution times of operations to infer cryptographic keys. In this letter, we propose to leverage the timing attack on NoC to reverse engineer the mapping of each PE onto the MPSoC architecture. To the best of our knowledge, it is the first work that relies on creating the contention between the requests that are sent to different PEs and reveal the layout by just analyzing the reply latency. In the experimental setup, we are able to map PEs for MPSoC consists of Mesh, Torus, Point-to-Point, Ring, and Flattened Butterfly NoC topology with 100% accuracy that can be extremely useful for the attackers in the reconnaissance phase. Further, as the existing mitigation techniques to counter timing attacks are based on the assumption that the contention is created between the packets of secure-insecure domains, they will not be able to mitigate the proposed reverse engineering attack.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"16 4","pages":"449-452"},"PeriodicalIF":1.7,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142797914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Crypto-Coding Scheme via Dynamic Interleaver for New Communication Standards 针对新通信标准的动态交织器密码编码方案
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-02-28 DOI: 10.1109/LES.2024.3371371
Raúl Eduardo Lopresti;Jorge Castiñeira Moreira;Luciana De Micco
Emerging communication standards prioritize both the assurance of secure and reliable communications as well as the reduction of transmission delay and latency. Nevertheless, the task of achieving these objectives presents a complex and demanding challenge. Ensuring secure transmissions while simultaneously minimizing error rates requires the implementation of multistage information processing techniques that integrate coding and encryption methods, which may unfortunately lead to an undesired increase in transmission delay and latency. This letter presents an innovative crypto-coding framework capable of concurrently achieving essential encryption and coding within a single, streamlined process. By leveraging this approach, we aim to address the tradeoff between security, efficiency, and transmission delay, thus contributing to the advancement of secure and efficient communication standards. In this letter, we present a comprehensive evaluation of the proposed scheme, assessing its impact on final system error probability, encryption efficacy, and the complexity of hardware implementation. Our findings shed light on the potential benefits of this approach for future communication standards.
新出现的通信标准把确保通信安全和可靠以及减少传输延迟和延时放在首位。然而,实现这些目标是一项复杂而艰巨的挑战。要在确保安全传输的同时最大限度地降低错误率,就必须采用集成编码和加密方法的多级信息处理技术,但不幸的是,这可能会导致传输延迟和延时的增加。这封信介绍了一种创新的加密编码框架,它能够在单一、精简的流程中同时实现基本的加密和编码。通过利用这种方法,我们旨在解决安全、效率和传输延迟之间的权衡问题,从而促进安全高效通信标准的发展。在这封信中,我们对提出的方案进行了全面评估,评估了它对最终系统错误概率、加密效果和硬件实施复杂性的影响。我们的研究结果阐明了这种方法对未来通信标准的潜在好处。
{"title":"Crypto-Coding Scheme via Dynamic Interleaver for New Communication Standards","authors":"Raúl Eduardo Lopresti;Jorge Castiñeira Moreira;Luciana De Micco","doi":"10.1109/LES.2024.3371371","DOIUrl":"10.1109/LES.2024.3371371","url":null,"abstract":"Emerging communication standards prioritize both the assurance of secure and reliable communications as well as the reduction of transmission delay and latency. Nevertheless, the task of achieving these objectives presents a complex and demanding challenge. Ensuring secure transmissions while simultaneously minimizing error rates requires the implementation of multistage information processing techniques that integrate coding and encryption methods, which may unfortunately lead to an undesired increase in transmission delay and latency. This letter presents an innovative crypto-coding framework capable of concurrently achieving essential encryption and coding within a single, streamlined process. By leveraging this approach, we aim to address the tradeoff between security, efficiency, and transmission delay, thus contributing to the advancement of secure and efficient communication standards. In this letter, we present a comprehensive evaluation of the proposed scheme, assessing its impact on final system error probability, encryption efficacy, and the complexity of hardware implementation. Our findings shed light on the potential benefits of this approach for future communication standards.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"16 3","pages":"287-290"},"PeriodicalIF":1.7,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140006552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Embedded Systems Letters Publication Information IEEE Embedded Systems Letters 出版信息
IF 1.6 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-02-28 DOI: 10.1109/LES.2024.3362200
{"title":"IEEE Embedded Systems Letters Publication Information","authors":"","doi":"10.1109/LES.2024.3362200","DOIUrl":"https://doi.org/10.1109/LES.2024.3362200","url":null,"abstract":"","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"16 1","pages":"C2-C2"},"PeriodicalIF":1.6,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10453003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139993759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FPGA Implementation of the Proposed DCNN Model for Detection of Tuberculosis and Pneumonia Using CXR Images 利用 CXR 图像检测肺结核和肺炎的 DCNN 拟议模型的 FPGA 实现
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-02-27 DOI: 10.1109/LES.2024.3370833
Prabhav Guddati;Shaswati Dash;Rajesh Kumar Tripathy
The automated detection of tuberculosis (TB) and pneumonia (PN) from chest X-ray (CXR) images using artificial intelligence (AI) is challenging in clinical studies for rapid diagnosis and initiation of treatment. This letter proposes the field-programmable gate array (FPGA)-based hardware implementation of a novel lightweight deep convolutional neural network (DCNN) model to detect PN and TB ailments using CXR images. Initially, the proposed DCNN (consisting of ten layers) is trained using the Google Cloud central processing unit (CPU) to obtain the model weight and bias parameters. Then, the register transfer logic (RTL) for the trained DCNN model is generated by the VIVADO high-level synthesis (HLS) framework using HLS for machine learning (HLS4ML) with fixed-point representation (8 bit for integer and 12 bit for the fractional part). The hardware implementation of the suggested DCNN model is performed using the PYNQ-Z2 FPGA framework to detect TB and PN diseases automatically. The experimental results demonstrate that the proposed DCNN model has obtained accuracy values of 96.39% and 95.63% on the Google-Cloud CPU and PYNQ-Z2 FPGA frameworks using 422 CXR images in the inference phases. The inference time of the proposed DCNN model on the PYNQ-Z2 FPGA framework is reduced by 85.19% compared to the CPU-based implementation. The suggested DCNN model has only 1831 parameters, less than the transfer learning (TFL) and existing CNN-based models to detect TB and PN using CXR images.
在临床研究中,利用人工智能(AI)从胸部x光片(CXR)图像中自动检测结核病(TB)和肺炎(PN),以实现快速诊断和开始治疗具有挑战性。这封信提出了一种基于现场可编程门阵列(FPGA)的新型轻量级深度卷积神经网络(DCNN)模型的硬件实现,用于使用CXR图像检测PN和TB疾病。首先,使用谷歌云中央处理器(CPU)训练由10层组成的DCNN,以获得模型权重和偏置参数。然后,由VIVADO高级综合(HLS)框架生成训练好的DCNN模型的寄存器传递逻辑(RTL),该框架使用HLS用于机器学习(HLS4ML),具有不动点表示(8位整数部分,12位小数部分)。采用PYNQ-Z2 FPGA框架实现了所建议的DCNN模型的硬件实现,可自动检测TB和PN疾病。实验结果表明,在Google-Cloud CPU和PYNQ-Z2 FPGA框架上,使用422张CXR图像进行推理,所提出的DCNN模型的准确率分别达到96.39%和95.63%。该DCNN模型在PYNQ-Z2 FPGA框架上的推理时间比基于cpu的实现缩短了85.19%。建议的DCNN模型只有1831个参数,少于使用CXR图像检测TB和PN的迁移学习(TFL)和现有的基于cnn的模型。
{"title":"FPGA Implementation of the Proposed DCNN Model for Detection of Tuberculosis and Pneumonia Using CXR Images","authors":"Prabhav Guddati;Shaswati Dash;Rajesh Kumar Tripathy","doi":"10.1109/LES.2024.3370833","DOIUrl":"10.1109/LES.2024.3370833","url":null,"abstract":"The automated detection of tuberculosis (TB) and pneumonia (PN) from chest X-ray (CXR) images using artificial intelligence (AI) is challenging in clinical studies for rapid diagnosis and initiation of treatment. This letter proposes the field-programmable gate array (FPGA)-based hardware implementation of a novel lightweight deep convolutional neural network (DCNN) model to detect PN and TB ailments using CXR images. Initially, the proposed DCNN (consisting of ten layers) is trained using the Google Cloud central processing unit (CPU) to obtain the model weight and bias parameters. Then, the register transfer logic (RTL) for the trained DCNN model is generated by the VIVADO high-level synthesis (HLS) framework using HLS for machine learning (HLS4ML) with fixed-point representation (8 bit for integer and 12 bit for the fractional part). The hardware implementation of the suggested DCNN model is performed using the PYNQ-Z2 FPGA framework to detect TB and PN diseases automatically. The experimental results demonstrate that the proposed DCNN model has obtained accuracy values of 96.39% and 95.63% on the Google-Cloud CPU and PYNQ-Z2 FPGA frameworks using 422 CXR images in the inference phases. The inference time of the proposed DCNN model on the PYNQ-Z2 FPGA framework is reduced by 85.19% compared to the CPU-based implementation. The suggested DCNN model has only 1831 parameters, less than the transfer learning (TFL) and existing CNN-based models to detect TB and PN using CXR images.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"16 4","pages":"445-448"},"PeriodicalIF":1.7,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140003875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Real-Time Tomato Quality Assessment Using Hybrid CNN-SVM Model 使用混合 CNN-SVM 模型实时评估番茄质量
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-02-27 DOI: 10.1109/LES.2024.3370634
Hassan Shabani Mputu;Ahmed-Abdel Mawgood;Atsushi Shimada;Mohammed S. Sayed
The current quality assessment for fruits and vegetables relies on subjective human judgment and manual inspection, resulting in inconsistencies and inefficiencies. Due to that, there is a need for a real-time system that can accurately and efficiently assess the quality of fruits and vegetables by analyzing various parameters, such as color, texture, size, and blemishes, to ensure consistency and reduce waste in the food supply chain. This study presents the development of a real-time tomato classification system using a hybrid model that combines convolutional neural network (CNN) and support vector machines (SVMs) deployed on the embedded single-board NVIDIA Jetson TX1. The selected CNN model EfficientNetB0 was used for feature extraction and SVM for classification. Notably, the EfficientNetB0-SVM hybrid model demonstrated impressive efficiency, achieving an average accuracy of 93.54% for classifying static tomato images stored in a board into healthy or reject with a testing time of 0.0216-s per image. Also, during real-time implementation, the proposed hybrid model attained an average inference speed of 15.6 frames per second (15.6 FPS), with an accuracy of 78.57% in classifying actual tomatoes into healthy or reject. The classification decision was taken based on 5 images for each tomato captured at different angles to ensure the detection of any blemishes from almost all sides of the tomato. The performance of the proposed model outperforms that of the state-of-the-art (SOTA) methods in accuracy, testing time per image, and real-time prediction accuracy.
目前的果蔬质量评估主要依靠人的主观判断和人工检验,结果不一致,效率低下。因此,需要一个实时系统,通过分析各种参数,如颜色、质地、大小和瑕疵,准确有效地评估水果和蔬菜的质量,以确保一致性并减少食品供应链中的浪费。本研究提出了一种基于卷积神经网络(CNN)和支持向量机(svm)的混合模型的实时番茄分类系统的开发,该模型部署在嵌入式单板NVIDIA Jetson TX1上。选择CNN模型effentnetb0进行特征提取,使用SVM进行分类。值得注意的是,EfficientNetB0-SVM混合模型显示了令人印象深刻的效率,在将存储在板上的静态番茄图像分类为健康或不健康的平均准确率为93.54%,每张图像的测试时间为0.0216-s。此外,在实时实现过程中,所提出的混合模型的平均推理速度为15.6帧/秒(15.6 FPS),将实际西红柿分类为健康或不健康的准确率为78.57%。分类决策是基于每个番茄在不同角度拍摄的5张图像,以确保检测到番茄几乎所有侧面的任何瑕疵。该模型在精度、每幅图像的测试时间和实时预测精度方面优于最先进的(SOTA)方法。
{"title":"Real-Time Tomato Quality Assessment Using Hybrid CNN-SVM Model","authors":"Hassan Shabani Mputu;Ahmed-Abdel Mawgood;Atsushi Shimada;Mohammed S. Sayed","doi":"10.1109/LES.2024.3370634","DOIUrl":"10.1109/LES.2024.3370634","url":null,"abstract":"The current quality assessment for fruits and vegetables relies on subjective human judgment and manual inspection, resulting in inconsistencies and inefficiencies. Due to that, there is a need for a real-time system that can accurately and efficiently assess the quality of fruits and vegetables by analyzing various parameters, such as color, texture, size, and blemishes, to ensure consistency and reduce waste in the food supply chain. This study presents the development of a real-time tomato classification system using a hybrid model that combines convolutional neural network (CNN) and support vector machines (SVMs) deployed on the embedded single-board NVIDIA Jetson TX1. The selected CNN model EfficientNetB0 was used for feature extraction and SVM for classification. Notably, the EfficientNetB0-SVM hybrid model demonstrated impressive efficiency, achieving an average accuracy of 93.54% for classifying static tomato images stored in a board into healthy or reject with a testing time of 0.0216-s per image. Also, during real-time implementation, the proposed hybrid model attained an average inference speed of 15.6 frames per second (15.6 FPS), with an accuracy of 78.57% in classifying actual tomatoes into healthy or reject. The classification decision was taken based on 5 images for each tomato captured at different angles to ensure the detection of any blemishes from almost all sides of the tomato. The performance of the proposed model outperforms that of the state-of-the-art (SOTA) methods in accuracy, testing time per image, and real-time prediction accuracy.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"16 4","pages":"453-456"},"PeriodicalIF":1.7,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140003877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-Speed Energy-Efficient Fixed-Point Signed Multipliers for FPGA-Based DSP Applications 基于 FPGA 的 DSP 应用中的高速节能定点有符号乘法器
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-02-12 DOI: 10.1109/LES.2024.3364698
Mitul Sudhirkumar Nagar;Aditya Mathuriya;Sohan H. Patel;Pinalkumar J. Engineer
Among various platforms for computer vision algorithms, FPGA has gained popularity as a low-power solution. These algorithms involve convolution operation which are extensively performed using signed multipliers. Hence, this work proposes high-speed and energy-efficient signed fixed-point multipliers for digital signal processing (DSP) applications. This work focuses on reducing the combinational path delay (CPD) using LUT-based Booth radix-4 partial product (PP) generation with Bewick’s sign extension and Dadda-based concurrent PP reduction with carry save adder (CSA) for Xilinx (now AMD) FPGA. The proposed design eliminates the requirement of a long carry chain for PP reduction. The proposed multiplier reduces CPD by 3%, 4%, and 16% compared to the state-of-the-art (SoA) multiplier for $8times 8$ , $16times 16$ , and $32times 32$ sizes, respectively. We have also analyzed our proposed $32times 32$ multiplier by pipelining, which offers CPD and EDP reduction by 12.28% and 19.47% at the cost of a 3% and 80% increase in LUTs and flip-flops, respectively, compared to the combinatorial multiplier.
在计算机视觉算法的各种平台中,FPGA作为一种低功耗的解决方案受到了广泛的欢迎。这些算法涉及卷积运算,通常使用有符号乘法器执行卷积运算。因此,这项工作提出了用于数字信号处理(DSP)应用的高速和节能的签名定点乘法器。这项工作的重点是减少组合路径延迟(CPD),使用基于lut的Booth基数-4部分积(PP)生成与Bewick的符号扩展,以及基于dada的并行PP减少与进位保存加法器(CSA),用于Xilinx(现在的AMD) FPGA。提出的设计消除了对PP减少的长携带链的要求。与最先进的(SoA)乘法器相比,所提出的乘法器分别可将CPD降低3%、4%和16%,分别适用于8 × 8$、16 × 16$和32 × 32$尺寸。我们还通过流水线分析了我们提出的$32 × 32$乘法器,与组合乘法器相比,该乘法器的CPD和EDP分别降低了12.28%和19.47%,而lut和触发器的成本分别增加了3%和80%。
{"title":"High-Speed Energy-Efficient Fixed-Point Signed Multipliers for FPGA-Based DSP Applications","authors":"Mitul Sudhirkumar Nagar;Aditya Mathuriya;Sohan H. Patel;Pinalkumar J. Engineer","doi":"10.1109/LES.2024.3364698","DOIUrl":"10.1109/LES.2024.3364698","url":null,"abstract":"Among various platforms for computer vision algorithms, FPGA has gained popularity as a low-power solution. These algorithms involve convolution operation which are extensively performed using signed multipliers. Hence, this work proposes high-speed and energy-efficient signed fixed-point multipliers for digital signal processing (DSP) applications. This work focuses on reducing the combinational path delay (CPD) using LUT-based Booth radix-4 partial product (PP) generation with Bewick’s sign extension and Dadda-based concurrent PP reduction with carry save adder (CSA) for Xilinx (now AMD) FPGA. The proposed design eliminates the requirement of a long carry chain for PP reduction. The proposed multiplier reduces CPD by 3%, 4%, and 16% compared to the state-of-the-art (SoA) multiplier for \u0000<inline-formula> <tex-math>$8times 8$ </tex-math></inline-formula>\u0000, \u0000<inline-formula> <tex-math>$16times 16$ </tex-math></inline-formula>\u0000, and \u0000<inline-formula> <tex-math>$32times 32$ </tex-math></inline-formula>\u0000 sizes, respectively. We have also analyzed our proposed \u0000<inline-formula> <tex-math>$32times 32$ </tex-math></inline-formula>\u0000 multiplier by pipelining, which offers CPD and EDP reduction by 12.28% and 19.47% at the cost of a 3% and 80% increase in LUTs and flip-flops, respectively, compared to the combinatorial multiplier.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"16 4","pages":"417-420"},"PeriodicalIF":1.7,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139954504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Embedded Systems Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1