首页 > 最新文献

IEEE Embedded Systems Letters最新文献

英文 中文
QLlama: An FPGA-Based Microscaling Quantization Accelerator for Energy-Efficient Llama2 Inference QLlama:一种基于fpga的节能Llama2推理微尺度量化加速器
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-10-16 DOI: 10.1109/LES.2025.3600563
Hongbing Wen;Zihao Wang;Jiale Dong;Wenqi Lou;Lei Gong;Chao Wang;Xuehai Zhou
To address the computational power and energy efficiency challenges in Llama2 large-model inference, this letter proposes a hardware-software co-design method and finally implements a high energy efficiency accelerator named QLlama based on FPGA. This work first employs a novel quantization method based on a microscaling data format, which allows sharing a scaling factor with E8M0 format for each subtensor block, thus enabling quantization and dequantization operations to be completed using only shift operations. Second, on this basis, a mixed precision configuration is implemented for different layers of Llama2 to balance accuracy loss and computational efficiency. Finally, a dedicated accelerator QLlama is designed, whose core units include a quantization unit for dynamic quantization, a vector-matrix multiplication unit for high density computation of quantized weights, a scaled dot product unit, and a basic operator unit. Experimental results show that this scheme achieves energy efficiency improvements of $2.13sim 10.66times $ with negligible accuracy loss, i.e., <0.2>https://github.com/wendadawen/QLlama.
为了解决Llama2大模型推理中的计算能力和能效挑战,本文提出了一种软硬件协同设计方法,并最终实现了基于FPGA的高能效加速器QLlama。本工作首先采用了一种基于微尺度数据格式的新型量化方法,该方法允许每个子张量块与E8M0格式共享一个比例因子,从而使量化和去量化操作仅使用移位操作即可完成。其次,在此基础上,对Llama2的不同层实现混合精度配置,以平衡精度损失和计算效率。最后,设计了专用加速器QLlama,其核心单元包括用于动态量化的量化单元、用于量化权重高密度计算的向量矩阵乘法单元、缩放点积单元和基本算子单元。实验结果表明,该方案在精度损失可忽略不计的情况下,实现了2.13sim $ 10.66 $的能效改进,即https://github.com/wendadawen/QLlama。
{"title":"QLlama: An FPGA-Based Microscaling Quantization Accelerator for Energy-Efficient Llama2 Inference","authors":"Hongbing Wen;Zihao Wang;Jiale Dong;Wenqi Lou;Lei Gong;Chao Wang;Xuehai Zhou","doi":"10.1109/LES.2025.3600563","DOIUrl":"https://doi.org/10.1109/LES.2025.3600563","url":null,"abstract":"To address the computational power and energy efficiency challenges in Llama2 large-model inference, this letter proposes a hardware-software co-design method and finally implements a high energy efficiency accelerator named QLlama based on FPGA. This work first employs a novel quantization method based on a microscaling data format, which allows sharing a scaling factor with E8M0 format for each subtensor block, thus enabling quantization and dequantization operations to be completed using only shift operations. Second, on this basis, a mixed precision configuration is implemented for different layers of Llama2 to balance accuracy loss and computational efficiency. Finally, a dedicated accelerator QLlama is designed, whose core units include a quantization unit for dynamic quantization, a vector-matrix multiplication unit for high density computation of quantized weights, a scaled dot product unit, and a basic operator unit. Experimental results show that this scheme achieves energy efficiency improvements of <inline-formula> <tex-math>$2.13sim 10.66times $ </tex-math></inline-formula> with negligible accuracy loss, i.e., <0.2>https://github.com/wendadawen/QLlama</uri>.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 5","pages":"337-340"},"PeriodicalIF":2.0,"publicationDate":"2025-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145352262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Embedded Systems Letters Publication Information IEEE嵌入式系统通讯出版信息
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-10-16 DOI: 10.1109/LES.2025.3611604
{"title":"IEEE Embedded Systems Letters Publication Information","authors":"","doi":"10.1109/LES.2025.3611604","DOIUrl":"https://doi.org/10.1109/LES.2025.3611604","url":null,"abstract":"","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 5","pages":"C4-C4"},"PeriodicalIF":2.0,"publicationDate":"2025-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11205911","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145352085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accelerating LSM-Tree KV Stores via Caching Hot Keys on Hybrid Zoned Storage 在混合分区存储上通过缓存热键加速LSM-Tree KV存储
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-10-16 DOI: 10.1109/LES.2025.3599998
Shiqiang Nie;Menghan Li;Chi Zhang;Di Zhang;Weiguo Wu
key-value (KV) stores based on log-structured merge trees (LSM-trees) have become vital for managing large-scale unstructured data. Recent studies have proposed hybrid zoned storage architectures—combining host-managed shingled magnetic recording (HM-SMR) HDDs and zoned namespace (ZNS) SSDs—to balance performance and cost, making them well-suited for LSM-tree–based KV stores. Although a number of novel schemes have been developed to optimize write performance, garbage collection, and compaction overhead, read performance remains a critical challenge. Specifically, we observe that read requests often concentrate on low-performance HM-SMR HDDs, resulting in severe read bottlenecks. To address this issue, we propose hybrid zoned cache improvement (HZCI) to enhance read efficiency in hybrid zoned KV stores. First, we construct a hybrid-granularity zoned cache that leverages file access patterns to exploit the high-speed characteristics of ZNS SSDs. Second, we introduce an access-aware cache management strategy to intelligently manage the KV cache within ZNS SSDs. Finally, we design a compaction mechanism that balances read performance with compaction overhead, thereby improving cache efficiency. Experimental results show that HZCI improves average read throughput by 32%, 40%, and 52% compared to GearDB, ZoneKV, and SpanDB, respectively.
基于日志结构合并树(lsm -tree)的键值(KV)存储对于管理大规模非结构化数据已经变得至关重要。最近的研究提出了混合分区存储架构——结合主机管理的带状磁记录(HM-SMR) hdd和分区命名空间(ZNS) ssd——来平衡性能和成本,使它们非常适合基于lsm树的KV存储。尽管已经开发了许多新的方案来优化写性能、垃圾收集和压缩开销,但读性能仍然是一个关键的挑战。具体来说,我们观察到读请求通常集中在性能较低的HM-SMR hdd上,从而导致严重的读瓶颈。为了解决这个问题,我们提出了混合分区缓存改进(HZCI)来提高混合分区KV存储的读取效率。首先,我们构建了一个混合粒度分区缓存,利用文件访问模式来利用ZNS ssd的高速特性。其次,我们引入了一种访问感知缓存管理策略来智能管理ZNS ssd内的KV缓存。最后,我们设计了一种压缩机制来平衡读取性能和压缩开销,从而提高缓存效率。实验结果表明,与GearDB、ZoneKV和SpanDB相比,HZCI的平均读吞吐量分别提高了32%、40%和52%。
{"title":"Accelerating LSM-Tree KV Stores via Caching Hot Keys on Hybrid Zoned Storage","authors":"Shiqiang Nie;Menghan Li;Chi Zhang;Di Zhang;Weiguo Wu","doi":"10.1109/LES.2025.3599998","DOIUrl":"https://doi.org/10.1109/LES.2025.3599998","url":null,"abstract":"key-value (KV) stores based on log-structured merge trees (LSM-trees) have become vital for managing large-scale unstructured data. Recent studies have proposed hybrid zoned storage architectures—combining host-managed shingled magnetic recording (HM-SMR) HDDs and zoned namespace (ZNS) SSDs—to balance performance and cost, making them well-suited for LSM-tree–based KV stores. Although a number of novel schemes have been developed to optimize write performance, garbage collection, and compaction overhead, read performance remains a critical challenge. Specifically, we observe that read requests often concentrate on low-performance HM-SMR HDDs, resulting in severe read bottlenecks. To address this issue, we propose hybrid zoned cache improvement (HZCI) to enhance read efficiency in hybrid zoned KV stores. First, we construct a hybrid-granularity zoned cache that leverages file access patterns to exploit the high-speed characteristics of ZNS SSDs. Second, we introduce an access-aware cache management strategy to intelligently manage the KV cache within ZNS SSDs. Finally, we design a compaction mechanism that balances read performance with compaction overhead, thereby improving cache efficiency. Experimental results show that HZCI improves average read throughput by 32%, 40%, and 52% compared to GearDB, ZoneKV, and SpanDB, respectively.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 5","pages":"321-324"},"PeriodicalIF":2.0,"publicationDate":"2025-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145352237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Embedded Systems Letters Publication Information IEEE嵌入式系统通讯出版信息
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-08-14 DOI: 10.1109/LES.2025.3587504
{"title":"IEEE Embedded Systems Letters Publication Information","authors":"","doi":"10.1109/LES.2025.3587504","DOIUrl":"https://doi.org/10.1109/LES.2025.3587504","url":null,"abstract":"","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 4","pages":"C4-C4"},"PeriodicalIF":2.0,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11125533","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144842962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FPGA-Based RF Signal Generator for Radar Applications 基于fpga的雷达射频信号发生器
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-07-29 DOI: 10.1109/LES.2025.3589350
Sherline Y. Cruz-Nava;Mario J. Rosas-Fregoso;Francisco López-Huerta;Rosa M. Woo-García;Edith Osorio-de-la-Rosa
Advances in communications have enabled the development of various types of radars for applications, such as cartography, military industry, materials testing, air traffic control, and autonomous vehicle guidance. We present the design and implementation of an embedded system for generating radio frequency (RF) signals for radar applications. Using a Cyclone V field programmable gate array (FPGA) and the BladeRF 2.0 platform, the system employs direct digital synthesis (DDS) and in-phase/quadrature (I/Q) modulation techniques for precise signal generation. The carrier signal embeds information from lineal frequency-modulated signals (LFM), which are shifted in frequency to the S-band. Additionally, a synchronization module has been implemented to ensure precise activation during transmission. Simulation and experimental results demonstrate significant improvements in signal stability, flexibility, and precision. This development advances high-frequency embedded technologies with applications in wireless communications and radar detection systems.
通信技术的进步使各种类型的雷达得以发展,用于制图、军事工业、材料测试、空中交通管制、自动驾驶车辆制导等应用。我们提出了一个嵌入式系统的设计和实现,用于雷达应用产生射频(RF)信号。该系统采用Cyclone V现场可编程门阵列(FPGA)和BladeRF 2.0平台,采用直接数字合成(DDS)和同相/正交(I/Q)调制技术来精确生成信号。载波信号嵌入了线性调频信号(LFM)的信息,这些信号的频率被移到s波段。此外,还实现了同步模块,以确保在传输过程中精确激活。仿真和实验结果表明,该方法显著提高了信号的稳定性、灵活性和精度。这一发展推动了高频嵌入式技术在无线通信和雷达探测系统中的应用。
{"title":"FPGA-Based RF Signal Generator for Radar Applications","authors":"Sherline Y. Cruz-Nava;Mario J. Rosas-Fregoso;Francisco López-Huerta;Rosa M. Woo-García;Edith Osorio-de-la-Rosa","doi":"10.1109/LES.2025.3589350","DOIUrl":"https://doi.org/10.1109/LES.2025.3589350","url":null,"abstract":"Advances in communications have enabled the development of various types of radars for applications, such as cartography, military industry, materials testing, air traffic control, and autonomous vehicle guidance. We present the design and implementation of an embedded system for generating radio frequency (RF) signals for radar applications. Using a Cyclone V field programmable gate array (FPGA) and the BladeRF 2.0 platform, the system employs direct digital synthesis (DDS) and in-phase/quadrature (I/Q) modulation techniques for precise signal generation. The carrier signal embeds information from lineal frequency-modulated signals (LFM), which are shifted in frequency to the S-band. Additionally, a synchronization module has been implemented to ensure precise activation during transmission. Simulation and experimental results demonstrate significant improvements in signal stability, flexibility, and precision. This development advances high-frequency embedded technologies with applications in wireless communications and radar detection systems.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"365-369"},"PeriodicalIF":2.0,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Secure Protocol for Remote Testing Critical Systems Over Public Networks Using Zero-Knowledge Proofs 使用零知识证明在公共网络上远程测试关键系统的安全协议
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-07-16 DOI: 10.1109/LES.2025.3589547
Santiago Germino;Martín N. Menéndez;Ariel Lutenberg
Essential infrastructure and services depend on critical systems. To ensure that critical systems function properly, regular testing and monitoring are necessary. Establishing direct, dedicated data connections for remote testing can be expensive, while using public cellular, satellite, or fiber Internet connections can introduce privacy and security risks. Securing the medium often requires placing trust in third parties. The novel proposal introduced in this work suggests using zero-knowledge proofs, a modern cryptographic technique, to conduct secure remote testing and monitoring of critical systems over affordable public networks, which can include email or instant messaging apps. This approach guarantees both the integrity and confidentiality of the transmitted data, as well as the integrity of the processes involved in preparing the data for transmission. We will present this approach and demonstrate its implementation through a real-world use case: the remote testing of an electronic railway interlocking system.
重要的基础设施和服务依赖于关键系统。为了确保关键系统正常运行,定期测试和监控是必要的。为远程测试建立直接的、专用的数据连接可能会很昂贵,而使用公共蜂窝、卫星或光纤Internet连接可能会引入隐私和安全风险。保护媒介通常需要信任第三方。这项工作提出的新建议建议使用零知识证明,一种现代加密技术,在可负担得起的公共网络上对关键系统进行安全的远程测试和监控,其中可以包括电子邮件或即时通讯应用程序。这种方法既保证了传输数据的完整性和机密性,也保证了准备传输数据过程的完整性。我们将介绍这种方法,并通过实际用例演示其实现:电子铁路联锁系统的远程测试。
{"title":"Secure Protocol for Remote Testing Critical Systems Over Public Networks Using Zero-Knowledge Proofs","authors":"Santiago Germino;Martín N. Menéndez;Ariel Lutenberg","doi":"10.1109/LES.2025.3589547","DOIUrl":"https://doi.org/10.1109/LES.2025.3589547","url":null,"abstract":"Essential infrastructure and services depend on critical systems. To ensure that critical systems function properly, regular testing and monitoring are necessary. Establishing direct, dedicated data connections for remote testing can be expensive, while using public cellular, satellite, or fiber Internet connections can introduce privacy and security risks. Securing the medium often requires placing trust in third parties. The novel proposal introduced in this work suggests using zero-knowledge proofs, a modern cryptographic technique, to conduct secure remote testing and monitoring of critical systems over affordable public networks, which can include email or instant messaging apps. This approach guarantees both the integrity and confidentiality of the transmitted data, as well as the integrity of the processes involved in preparing the data for transmission. We will present this approach and demonstrate its implementation through a real-world use case: the remote testing of an electronic railway interlocking system.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"427-430"},"PeriodicalIF":2.0,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Embedded Systems Letters Publication Information IEEE嵌入式系统通讯出版信息
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-12 DOI: 10.1109/LES.2025.3564681
{"title":"IEEE Embedded Systems Letters Publication Information","authors":"","doi":"10.1109/LES.2025.3564681","DOIUrl":"https://doi.org/10.1109/LES.2025.3564681","url":null,"abstract":"","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 3","pages":"C4-C4"},"PeriodicalIF":1.7,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11033167","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144272739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Embedded System for Controlling Temperature, Relative Humidity, and Lighting for a Test Chamber 用于控制试验室温度、相对湿度和照明的嵌入式系统
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-02 DOI: 10.1109/LES.2025.3575321
Micaela Benavides;Nicolas Nunovero;Jimmy Tarrillo
Temperature and humidity chambers are often employed for stress testing materials. In the case of cultural heritage materials, it is also crucial to incorporate light condition in these evaluations. Consequently, controlling these parameters is essential to effectively simulate accelerated environmental conditions. This letter outlines the design and implementation of an embedded system that manages temperature, humidity, and light levels. Built on an ARM Cortex-M3 System on Chip, the system integrates various temperature and humidity sensors and actuators, along with a light controller. It also features an embedded user interface and facilitates communication with an external PC. The validity of our proposal is demonstrated through the implementation of a proportional-integer control mechanism for the regulation of temperature and relative humidity.
温度和湿度室通常用于材料的应力测试。就文化遗产材料而言,在这些评估中纳入光照条件也至关重要。因此,控制这些参数对于有效地模拟加速环境条件至关重要。这封信概述了一个管理温度、湿度和光照水平的嵌入式系统的设计和实现。该系统基于ARM Cortex-M3系统芯片,集成了各种温度和湿度传感器和执行器,以及一个光控制器。它还具有嵌入式用户界面,便于与外部PC机通信。通过实施比例整数控制机制来调节温度和相对湿度,证明了我们建议的有效性。
{"title":"Embedded System for Controlling Temperature, Relative Humidity, and Lighting for a Test Chamber","authors":"Micaela Benavides;Nicolas Nunovero;Jimmy Tarrillo","doi":"10.1109/LES.2025.3575321","DOIUrl":"https://doi.org/10.1109/LES.2025.3575321","url":null,"abstract":"Temperature and humidity chambers are often employed for stress testing materials. In the case of cultural heritage materials, it is also crucial to incorporate light condition in these evaluations. Consequently, controlling these parameters is essential to effectively simulate accelerated environmental conditions. This letter outlines the design and implementation of an embedded system that manages temperature, humidity, and light levels. Built on an ARM Cortex-M3 System on Chip, the system integrates various temperature and humidity sensors and actuators, along with a light controller. It also features an embedded user interface and facilitates communication with an external PC. The validity of our proposal is demonstrated through the implementation of a proportional-integer control mechanism for the regulation of temperature and relative humidity.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"415-418"},"PeriodicalIF":2.0,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantized Generative Autoencoder for Audio Spectrograms 音频谱图的量化生成自编码器
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-02 DOI: 10.1109/LES.2025.3575372
M. Celeste Cebedio;Lucas A. Rabioglio;Luciana De Micco
This study analyzes techniques for compressing generative autoencoders (AEs) to enable their deployment on resource-constrained devices, addressing the challenges and optimizations required for such environments. As a case study, we present a quantized generative AE optimized for efficiently generating underwater sound spectrograms. The model is evaluated across diverse scenarios, demonstrating its ability to produce low-dimensional spectrograms while adapting to various acoustic conditions. The hardware optimization process focuses on balancing computational efficiency and model accuracy, ensuring performance comparable to its nonquantized counterpart.
本研究分析了压缩生成式自动编码器(AEs)的技术,以使其能够在资源受限的设备上部署,解决此类环境所需的挑战和优化。作为案例研究,我们提出了一种优化的量化生成声发射,以有效地生成水声频谱图。该模型在不同的场景下进行了评估,证明了其在适应各种声学条件的同时产生低维频谱图的能力。硬件优化过程侧重于平衡计算效率和模型精度,确保性能与非量化的对应物相当。
{"title":"Quantized Generative Autoencoder for Audio Spectrograms","authors":"M. Celeste Cebedio;Lucas A. Rabioglio;Luciana De Micco","doi":"10.1109/LES.2025.3575372","DOIUrl":"https://doi.org/10.1109/LES.2025.3575372","url":null,"abstract":"This study analyzes techniques for compressing generative autoencoders (AEs) to enable their deployment on resource-constrained devices, addressing the challenges and optimizations required for such environments. As a case study, we present a quantized generative AE optimized for efficiently generating underwater sound spectrograms. The model is evaluated across diverse scenarios, demonstrating its ability to produce low-dimensional spectrograms while adapting to various acoustic conditions. The hardware optimization process focuses on balancing computational efficiency and model accuracy, ensuring performance comparable to its nonquantized counterpart.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"419-422"},"PeriodicalIF":2.0,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Seeding Algorithm for Bipolar Stochastic Computing for Polynomial Approximations 多项式近似双极随机计算的种子算法
IF 2 4区 计算机科学 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-04-29 DOI: 10.1109/LES.2025.3565235
Abraham Josue Delgado-Nava;Jorge Rivera;Susana Ortega-Cisneros
Application of stochastic computing (SC) for reckoning trascendental functions as $tanh (x)$ , so used as an activation function in convolutional neural networks is an active research area. Currently, most of the works for computing functions via SC are based on the unipolar encoding format $(xin [{0,1}])$ , due to this, a method based on bipolar encoding format $(xin [-1,1])$ is here proposed with the goal of reducing the implementation complexity, and the correlation between stochastic bitstreams. For that, a collection of existing methods is adapted for the purpose of this letter. Moreover, for reducing correlation between bitstream, an algorithm is proposed for the selection of different seeds for distinct linear feedback shift registers that yields to a low MSE. The seed selection along with the adaptation of methods for implementing polynomials with SC digital circuits based on a bipolar encoding format yields to more accurate results. Simulations were carried out for the polynomial approximation of several functions. Function $tanh (x)$ was compared with an existing solution, verifying in that way the superior performance of the proposed approach.
利用随机计算(SC)来推算平移函数$tanh (x)$作为卷积神经网络的激活函数是一个活跃的研究领域。目前,大多数通过SC计算函数的工作都是基于单极编码格式$(xin [{0,1}])$,因此,本文提出了一种基于双极编码格式$(xin [-1,1])$的方法,以降低实现的复杂性和随机比特流之间的相关性。为此,本文改编了现有方法的集合。此外,为了降低比特流之间的相关性,提出了一种针对不同线性反馈移位寄存器选择不同种子的算法,从而产生较低的MSE。种子选择以及基于双极编码格式的SC数字电路实现多项式的方法适应产生更准确的结果。对几个函数的多项式逼近进行了仿真。函数$tanh (x)$与现有的解决方案进行了比较,以这种方式验证了所提出方法的优越性能。
{"title":"Seeding Algorithm for Bipolar Stochastic Computing for Polynomial Approximations","authors":"Abraham Josue Delgado-Nava;Jorge Rivera;Susana Ortega-Cisneros","doi":"10.1109/LES.2025.3565235","DOIUrl":"https://doi.org/10.1109/LES.2025.3565235","url":null,"abstract":"Application of stochastic computing (SC) for reckoning trascendental functions as <inline-formula> <tex-math>$tanh (x)$ </tex-math></inline-formula>, so used as an activation function in convolutional neural networks is an active research area. Currently, most of the works for computing functions via SC are based on the unipolar encoding format <inline-formula> <tex-math>$(xin [{0,1}])$ </tex-math></inline-formula>, due to this, a method based on bipolar encoding format <inline-formula> <tex-math>$(xin [-1,1])$ </tex-math></inline-formula> is here proposed with the goal of reducing the implementation complexity, and the correlation between stochastic bitstreams. For that, a collection of existing methods is adapted for the purpose of this letter. Moreover, for reducing correlation between bitstream, an algorithm is proposed for the selection of different seeds for distinct linear feedback shift registers that yields to a low MSE. The seed selection along with the adaptation of methods for implementing polynomials with SC digital circuits based on a bipolar encoding format yields to more accurate results. Simulations were carried out for the polynomial approximation of several functions. Function <inline-formula> <tex-math>$tanh (x)$ </tex-math></inline-formula> was compared with an existing solution, verifying in that way the superior performance of the proposed approach.","PeriodicalId":56143,"journal":{"name":"IEEE Embedded Systems Letters","volume":"17 6","pages":"406-410"},"PeriodicalIF":2.0,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Embedded Systems Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1