{"title":"基于交换网络架构的高吞吐量可配置变长FFT处理器的设计","authors":"Renfeng Dou, Yifan Bo, Jun Han, Xiaoyang Zeng","doi":"10.1109/ASICON.2013.6811852","DOIUrl":null,"url":null,"abstract":"Fast Fourier transform (FFT) is one of the key operations in digital communication systems and digital signal processing platforms. This paper presents a design of high throughput variable-length FFT processor based on switch network (SN) architecture. Meanwhile, strong runtime configurability and scalability is exploited. Considering the support for variable-length FFT as well as the balance between speed and cost, the mixed-radix (MR) technique and in-place strategy are used. In addition, auto synchronization method is proposed to make stage pipelined mode work efficiently. Batch processing mode is also proposed to boost performance for small size FFTs. The results show that the throughput for 16-point to 256-point FFT can be improved from 5.8X to 1.2X, respectively. The processor supports 16- to 8192-point FFT and provides about 2GSamples/s for FFT size less than or equal to 256 by batch processing, and 1GSamples/s throughput for larger size FFT at 500MHz. The core area is 2.04 mm2 and the power consumption is 68 mW at 100MHz for 1k-point.","PeriodicalId":150654,"journal":{"name":"2013 IEEE 10th International Conference on ASIC","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Design of a high throughput configurable variable-length FFT processor based on switch network architecture\",\"authors\":\"Renfeng Dou, Yifan Bo, Jun Han, Xiaoyang Zeng\",\"doi\":\"10.1109/ASICON.2013.6811852\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fast Fourier transform (FFT) is one of the key operations in digital communication systems and digital signal processing platforms. This paper presents a design of high throughput variable-length FFT processor based on switch network (SN) architecture. Meanwhile, strong runtime configurability and scalability is exploited. Considering the support for variable-length FFT as well as the balance between speed and cost, the mixed-radix (MR) technique and in-place strategy are used. In addition, auto synchronization method is proposed to make stage pipelined mode work efficiently. Batch processing mode is also proposed to boost performance for small size FFTs. The results show that the throughput for 16-point to 256-point FFT can be improved from 5.8X to 1.2X, respectively. The processor supports 16- to 8192-point FFT and provides about 2GSamples/s for FFT size less than or equal to 256 by batch processing, and 1GSamples/s throughput for larger size FFT at 500MHz. The core area is 2.04 mm2 and the power consumption is 68 mW at 100MHz for 1k-point.\",\"PeriodicalId\":150654,\"journal\":{\"name\":\"2013 IEEE 10th International Conference on ASIC\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE 10th International Conference on ASIC\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ASICON.2013.6811852\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE 10th International Conference on ASIC","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASICON.2013.6811852","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Design of a high throughput configurable variable-length FFT processor based on switch network architecture
Fast Fourier transform (FFT) is one of the key operations in digital communication systems and digital signal processing platforms. This paper presents a design of high throughput variable-length FFT processor based on switch network (SN) architecture. Meanwhile, strong runtime configurability and scalability is exploited. Considering the support for variable-length FFT as well as the balance between speed and cost, the mixed-radix (MR) technique and in-place strategy are used. In addition, auto synchronization method is proposed to make stage pipelined mode work efficiently. Batch processing mode is also proposed to boost performance for small size FFTs. The results show that the throughput for 16-point to 256-point FFT can be improved from 5.8X to 1.2X, respectively. The processor supports 16- to 8192-point FFT and provides about 2GSamples/s for FFT size less than or equal to 256 by batch processing, and 1GSamples/s throughput for larger size FFT at 500MHz. The core area is 2.04 mm2 and the power consumption is 68 mW at 100MHz for 1k-point.