基于FPGA的高速延迟fxlms硬件架构设计

Jun Yuan, X. Meng, Jianhua Ran, Wei Wang, Qiang Zhao, Jun Li, Qin Li
{"title":"基于FPGA的高速延迟fxlms硬件架构设计","authors":"Jun Yuan, X. Meng, Jianhua Ran, Wei Wang, Qiang Zhao, Jun Li, Qin Li","doi":"10.46300/9106.2022.16.94","DOIUrl":null,"url":null,"abstract":"In order to improve the convergence and clock speed of DFxLMS adaptive filter, a hardware architecture of fine-grained retiming DFxLMS (HS-TF-RDFXLMS) filter in the form of hardware sharing transpose is proposed. Firstly, the architecture adopts delay decomposition algorithm to solve the problem that the convergence of filter decreases due to the increase of delay and output lag. Secondly, on the premise that the algorithm performance remains unchanged, the adaptive filter module and the secondary path module are transposed to further reduce the critical path to improve the clock speed of the system. The number of registers is reduced by optimizing circuit sub-module. Finally, the area/speed tradeoff of TF-RDFXLMS filter is realized by hardware sharing on the basis of constant critical path. Experimental results show that the convergence speed of the algorithm is 3.5 times that of DFxLMS algorithm, and the critical path is shortened by ([log2N]+1)TADD. The circuit structure of adaptive filter designed in this paper is realized by Xilinx Artix7 FPGA platform. The clock speed of HS-TF-RDFXLMS filter is reduced by 4.386% compared with TF-RDFXLMS filter. However, the resources of LUT and FF are saved by 10.964% and 28.322% respectively. The power consumption is 150.73 mW. This improves the performance of the system.","PeriodicalId":13929,"journal":{"name":"International Journal of Circuits, Systems and Signal Processing","volume":"2 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Design of High-speed Delay-FXLMS Hardware Architecture Based on FPGA\",\"authors\":\"Jun Yuan, X. Meng, Jianhua Ran, Wei Wang, Qiang Zhao, Jun Li, Qin Li\",\"doi\":\"10.46300/9106.2022.16.94\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to improve the convergence and clock speed of DFxLMS adaptive filter, a hardware architecture of fine-grained retiming DFxLMS (HS-TF-RDFXLMS) filter in the form of hardware sharing transpose is proposed. Firstly, the architecture adopts delay decomposition algorithm to solve the problem that the convergence of filter decreases due to the increase of delay and output lag. Secondly, on the premise that the algorithm performance remains unchanged, the adaptive filter module and the secondary path module are transposed to further reduce the critical path to improve the clock speed of the system. The number of registers is reduced by optimizing circuit sub-module. Finally, the area/speed tradeoff of TF-RDFXLMS filter is realized by hardware sharing on the basis of constant critical path. Experimental results show that the convergence speed of the algorithm is 3.5 times that of DFxLMS algorithm, and the critical path is shortened by ([log2N]+1)TADD. The circuit structure of adaptive filter designed in this paper is realized by Xilinx Artix7 FPGA platform. The clock speed of HS-TF-RDFXLMS filter is reduced by 4.386% compared with TF-RDFXLMS filter. However, the resources of LUT and FF are saved by 10.964% and 28.322% respectively. The power consumption is 150.73 mW. This improves the performance of the system.\",\"PeriodicalId\":13929,\"journal\":{\"name\":\"International Journal of Circuits, Systems and Signal Processing\",\"volume\":\"2 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Circuits, Systems and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.46300/9106.2022.16.94\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Engineering\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Circuits, Systems and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46300/9106.2022.16.94","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0

摘要

为了提高DFxLMS自适应滤波器的收敛性和时钟速度,提出了一种硬件共享转置形式的细粒度重定时DFxLMS (HS-TF-RDFXLMS)滤波器的硬件架构。首先,该架构采用延迟分解算法,解决了由于延迟和输出滞后增大导致滤波器收敛性降低的问题。其次,在保持算法性能不变的前提下,将自适应滤波模块和副路径模块进行调换,进一步减少关键路径,提高系统时钟速度。通过优化电路子模块,减少了寄存器的数量。最后,在关键路径不变的基础上,通过硬件共享实现TF-RDFXLMS滤波器的面积/速度权衡。实验结果表明,该算法的收敛速度是DFxLMS算法的3.5倍,关键路径缩短了([log2N]+1)TADD。本文设计的自适应滤波器的电路结构是在Xilinx Artix7 FPGA平台上实现的。HS-TF-RDFXLMS滤波器的时钟速度比TF-RDFXLMS滤波器降低了4.386%。而LUT和FF的资源分别节省了10.964%和28.322%。功耗为150.73 mW。这样可以提高系统的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Design of High-speed Delay-FXLMS Hardware Architecture Based on FPGA
In order to improve the convergence and clock speed of DFxLMS adaptive filter, a hardware architecture of fine-grained retiming DFxLMS (HS-TF-RDFXLMS) filter in the form of hardware sharing transpose is proposed. Firstly, the architecture adopts delay decomposition algorithm to solve the problem that the convergence of filter decreases due to the increase of delay and output lag. Secondly, on the premise that the algorithm performance remains unchanged, the adaptive filter module and the secondary path module are transposed to further reduce the critical path to improve the clock speed of the system. The number of registers is reduced by optimizing circuit sub-module. Finally, the area/speed tradeoff of TF-RDFXLMS filter is realized by hardware sharing on the basis of constant critical path. Experimental results show that the convergence speed of the algorithm is 3.5 times that of DFxLMS algorithm, and the critical path is shortened by ([log2N]+1)TADD. The circuit structure of adaptive filter designed in this paper is realized by Xilinx Artix7 FPGA platform. The clock speed of HS-TF-RDFXLMS filter is reduced by 4.386% compared with TF-RDFXLMS filter. However, the resources of LUT and FF are saved by 10.964% and 28.322% respectively. The power consumption is 150.73 mW. This improves the performance of the system.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Circuits, Systems and Signal Processing
International Journal of Circuits, Systems and Signal Processing Engineering-Electrical and Electronic Engineering
自引率
0.00%
发文量
155
期刊最新文献
Stochastic Machine Learning Models for Mutation Rate Analysis of Malignant Cancer Cells in Patients with Acute Lymphoblastic Leukemia Detecting Small Objects Using a Smartphone and Neon Camera Optimization of New Energy Vehicle Road Noise Problem Based on Finite Element Analysis Method Base Elements for Artificial Neural Network: Structure Modeling, Production, Properties Distributed Generation Hosting Capacity Evaluation for Distribution Networks Considering Uncertainty
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1