首页 > 最新文献

2013 IEEE 10th International Conference on ASIC最新文献

英文 中文
A 80-dB DR, 10-MHz BW continuous-time sigma-delta modulator with low power comparators and switch drivers 具有低功率比较器和开关驱动器的80 db DR, 10 mhz BW连续时间σ - δ调制器
Pub Date : 2013-10-01 DOI: 10.1109/ASICON.2013.6811892
Yuzhong Xiao, Chixiao Chen, Rui Wei, Fan Jiang, Jun Xu, Junyan Ren
A third-order multi-bit feedforward-feedback (FF-FB) mixed continuous time sigma-delta modulator (CTSDM) for WLAN receivers is presented. The comparators and switch drivers are simplified and therefore can be more power-efficient. The FF-FB mixed architecture obtains a signal transfer function (STF) with little out-of-band peaking. Based on the signal bandwidth and anti-aliasing requirements for the WLAN receivers, we realized the modulator with SMIC 0.18um CMOS Mixed Signal process and the core achieves 80dB SNR over 10MHz signal bandwidth while consuming 30mW at 1.8V supply.
提出了一种用于WLAN接收机的三阶多位前馈-反馈(FF-FB)混合连续时间σ - δ调制器(CTSDM)。比较器和开关驱动器被简化,因此可以更节能。FF-FB混合结构获得了带外峰值很小的信号传递函数(STF)。根据无线局域网接收机对信号带宽和抗混频的要求,采用中芯0.18um CMOS混合信号工艺实现调制器,在1.8V电源下功耗30mW,在10MHz信号带宽下实现80dB信噪比。
{"title":"A 80-dB DR, 10-MHz BW continuous-time sigma-delta modulator with low power comparators and switch drivers","authors":"Yuzhong Xiao, Chixiao Chen, Rui Wei, Fan Jiang, Jun Xu, Junyan Ren","doi":"10.1109/ASICON.2013.6811892","DOIUrl":"https://doi.org/10.1109/ASICON.2013.6811892","url":null,"abstract":"A third-order multi-bit feedforward-feedback (FF-FB) mixed continuous time sigma-delta modulator (CTSDM) for WLAN receivers is presented. The comparators and switch drivers are simplified and therefore can be more power-efficient. The FF-FB mixed architecture obtains a signal transfer function (STF) with little out-of-band peaking. Based on the signal bandwidth and anti-aliasing requirements for the WLAN receivers, we realized the modulator with SMIC 0.18um CMOS Mixed Signal process and the core achieves 80dB SNR over 10MHz signal bandwidth while consuming 30mW at 1.8V supply.","PeriodicalId":150654,"journal":{"name":"2013 IEEE 10th International Conference on ASIC","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129475138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A power-efficient network-on-chip for multi-core stream processors 用于多核流处理器的高能效片上网络
Pub Date : 2013-10-01 DOI: 10.1109/ASICON.2013.6811833
Guoyue Jiang, Fang Wang, Zhaolin Li, Shaojun Wei
Stream processors have emerged as a mainstream solution for computation intensive applications. This paper proposes a power-efficient network-on-chip (NoC) for multi-core stream processors, aiming at improving the communication performance and power consumption. In the proposed NoC, specific stream paths are proposed according to features of multi-core stream processing. Specific stream paths are constructed based on a packet-switched NoC, providing fast and power-efficient transmissions for stream communications. To support specific stream paths on the packet-switched NoC, the modified micro-architecture of the router is proposed with a negligible area overhead. A set of stream applications are exploited for evaluation. Experimental results show that, an average of 16.0% latency reduction and 35.9% power saving can be obtained.
流处理器已经成为计算密集型应用的主流解决方案。本文提出了一种面向多核流处理器的低功耗片上网络(NoC),旨在提高通信性能和功耗。在该NoC中,根据多核流处理的特点,提出了具体的流路径。特定的流路径是基于分组交换NoC构建的,为流通信提供快速和节能的传输。为了支持分组交换NoC上的特定流路径,提出了一种改进的路由器微结构,其面积开销可以忽略不计。一组流应用程序被用于评估。实验结果表明,该方法平均降低了16.0%的延迟,节省了35.9%的功耗。
{"title":"A power-efficient network-on-chip for multi-core stream processors","authors":"Guoyue Jiang, Fang Wang, Zhaolin Li, Shaojun Wei","doi":"10.1109/ASICON.2013.6811833","DOIUrl":"https://doi.org/10.1109/ASICON.2013.6811833","url":null,"abstract":"Stream processors have emerged as a mainstream solution for computation intensive applications. This paper proposes a power-efficient network-on-chip (NoC) for multi-core stream processors, aiming at improving the communication performance and power consumption. In the proposed NoC, specific stream paths are proposed according to features of multi-core stream processing. Specific stream paths are constructed based on a packet-switched NoC, providing fast and power-efficient transmissions for stream communications. To support specific stream paths on the packet-switched NoC, the modified micro-architecture of the router is proposed with a negligible area overhead. A set of stream applications are exploited for evaluation. Experimental results show that, an average of 16.0% latency reduction and 35.9% power saving can be obtained.","PeriodicalId":150654,"journal":{"name":"2013 IEEE 10th International Conference on ASIC","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126297019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A clocked differential switch logic using floating-gate MOS transistors 一种采用浮栅MOS晶体管的时钟差分开关逻辑
Pub Date : 2013-10-01 DOI: 10.1109/ASICON.2013.6811997
G. Hang, Yang Yang, P. Zhao, Xiaohui Hu, X. You
A novel differential dynamic CMOS logic using multiple-input floating-gate MOS(FGMOS) transistors is proposed. In this circuit family, a pair of n-channel multiple-input FGMOS pull down logic networks is used to replace the nMOS logic tree in the conventional dynamic differential cascode voltage switch logic circuit. A simple synthesis technique of the n-channel multiple-input FGMOS logic tree by employing summation signal is also discussed. By using multiple-input FGMOS, the logic tree can be significantly simplified. HSPICE simulations using TSMC 0.35μm 2-ploy 4-metal CMOS technology have verified the effectiveness of the proposed design scheme.
提出了一种基于多输入浮栅MOS(FGMOS)晶体管的差分动态CMOS逻辑。在该电路系列中,采用一对n通道多输入FGMOS下拉逻辑网络来取代传统动态差分级联电压开关逻辑电路中的nMOS逻辑树。本文还讨论了一种利用求和信号合成n通道多输入FGMOS逻辑树的简单方法。通过使用多输入FGMOS,可以大大简化逻辑树。采用台积电0.35μm 2-ploy 4金属CMOS技术进行HSPICE仿真,验证了该设计方案的有效性。
{"title":"A clocked differential switch logic using floating-gate MOS transistors","authors":"G. Hang, Yang Yang, P. Zhao, Xiaohui Hu, X. You","doi":"10.1109/ASICON.2013.6811997","DOIUrl":"https://doi.org/10.1109/ASICON.2013.6811997","url":null,"abstract":"A novel differential dynamic CMOS logic using multiple-input floating-gate MOS(FGMOS) transistors is proposed. In this circuit family, a pair of n-channel multiple-input FGMOS pull down logic networks is used to replace the nMOS logic tree in the conventional dynamic differential cascode voltage switch logic circuit. A simple synthesis technique of the n-channel multiple-input FGMOS logic tree by employing summation signal is also discussed. By using multiple-input FGMOS, the logic tree can be significantly simplified. HSPICE simulations using TSMC 0.35μm 2-ploy 4-metal CMOS technology have verified the effectiveness of the proposed design scheme.","PeriodicalId":150654,"journal":{"name":"2013 IEEE 10th International Conference on ASIC","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126346688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Mixed-signal verification methods for multi-power mixed-signal System-on-Chip (SoC) design 多功率混合信号片上系统(SoC)设计中的混合信号验证方法
Pub Date : 2013-10-01 DOI: 10.1109/ASICON.2013.6812042
Chao Liang
Mixed-signal design becomes more and more popular nowadays because designers are required to quickly integrate IPs, control blocks, functional blocks, and analog modules together and run through the design flow to tape out in short time. Given that the latest designs are becoming more and more complex, the increasing physical effects in advanced process nodes, and request for shorter time to market, a fast and accurate design flow will be critical to ensure the success of the project. This paper will briefly describe various mixed signal verification methods used at Freescale Kinetis MCU which include behavior modeling, AMS validation, connectivity verification, mixed-signal Verification IP (VIP), multi-power verification, SoC transistor level simulation and mixed signal functional coverage. Engineering results are discussed to demonstrate the effectiveness of those methods.
混合信号设计越来越受欢迎,因为设计人员需要快速集成ip,控制块,功能块和模拟模块,并在短时间内运行设计流程。考虑到最新的设计变得越来越复杂,先进工艺节点的物理效应越来越大,并且要求更短的上市时间,快速准确的设计流程将是确保项目成功的关键。本文将简要介绍飞思卡尔Kinetis MCU中使用的各种混合信号验证方法,包括行为建模,AMS验证,连接性验证,混合信号验证IP (VIP),多电源验证,SoC晶体管级仿真和混合信号功能覆盖。工程结果证明了这些方法的有效性。
{"title":"Mixed-signal verification methods for multi-power mixed-signal System-on-Chip (SoC) design","authors":"Chao Liang","doi":"10.1109/ASICON.2013.6812042","DOIUrl":"https://doi.org/10.1109/ASICON.2013.6812042","url":null,"abstract":"Mixed-signal design becomes more and more popular nowadays because designers are required to quickly integrate IPs, control blocks, functional blocks, and analog modules together and run through the design flow to tape out in short time. Given that the latest designs are becoming more and more complex, the increasing physical effects in advanced process nodes, and request for shorter time to market, a fast and accurate design flow will be critical to ensure the success of the project. This paper will briefly describe various mixed signal verification methods used at Freescale Kinetis MCU which include behavior modeling, AMS validation, connectivity verification, mixed-signal Verification IP (VIP), multi-power verification, SoC transistor level simulation and mixed signal functional coverage. Engineering results are discussed to demonstrate the effectiveness of those methods.","PeriodicalId":150654,"journal":{"name":"2013 IEEE 10th International Conference on ASIC","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129197314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
An extensible and real-time compressive sensing reconstruction hardware for WBANs using OMP 一种基于OMP的可扩展实时压缩感知重构硬件
Pub Date : 2013-10-01 DOI: 10.1109/ASICON.2013.6811911
Weijing Shi, Yi Li, Jun Han, Xu Cheng, Xiaoyang Zeng
Compressive sensing theory, which allows sparse signal to be sampled at sub-Nyquist rate, is introduced to Wireless Body Area Networks to reduce the hardware requirement and energy consumption of signal acquisition. However, signal recovery by software causes time delay for real-time reconstruction. In this paper, we propose a high speed hardware implementation of orthogonal matching pursuit reconstruction algorithm in SMIC 130nm CMOS. By using original multi-functional systolic arrays, it is highly parallel and extensible. Experimental result shows that it completes a reconstruction of 16-sparseness 256-length ECG signal in 45μs with maximum operating frequency of 167MHz. When the sparseness is 8, it takes 18μs to recover the signal, which is 33% faster than the state-of-art design.
将压缩感知理论引入到无线体域网络中,使稀疏信号能够以亚奈奎斯特速率进行采样,从而降低信号采集的硬件要求和能耗。但是,通过软件恢复信号会造成实时重建的时间延迟。本文提出了一种基于中芯130nm CMOS的正交匹配追踪重构算法的高速硬件实现方案。采用独创的多功能收缩阵列,具有高度的并行性和可扩展性。实验结果表明,该方法在45μs内完成了16稀疏度256长度心电信号的重构,最大工作频率为167MHz。当稀疏度为8时,信号恢复时间为18μs,比现有设计提高33%。
{"title":"An extensible and real-time compressive sensing reconstruction hardware for WBANs using OMP","authors":"Weijing Shi, Yi Li, Jun Han, Xu Cheng, Xiaoyang Zeng","doi":"10.1109/ASICON.2013.6811911","DOIUrl":"https://doi.org/10.1109/ASICON.2013.6811911","url":null,"abstract":"Compressive sensing theory, which allows sparse signal to be sampled at sub-Nyquist rate, is introduced to Wireless Body Area Networks to reduce the hardware requirement and energy consumption of signal acquisition. However, signal recovery by software causes time delay for real-time reconstruction. In this paper, we propose a high speed hardware implementation of orthogonal matching pursuit reconstruction algorithm in SMIC 130nm CMOS. By using original multi-functional systolic arrays, it is highly parallel and extensible. Experimental result shows that it completes a reconstruction of 16-sparseness 256-length ECG signal in 45μs with maximum operating frequency of 167MHz. When the sparseness is 8, it takes 18μs to recover the signal, which is 33% faster than the state-of-art design.","PeriodicalId":150654,"journal":{"name":"2013 IEEE 10th International Conference on ASIC","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117270875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Distributed task migration for thermal hot spot reduction in many-core microprocessors 多核微处理器中减少热热点的分布式任务迁移
Pub Date : 2013-10-01 DOI: 10.1109/ASICON.2013.6811821
Zao Liu, Xin Huang, S. Tan, Hai Wang, H. Tang
In this paper, we propose a new distributed task migration method to reduce the thermal hot spots and on-chip temperature variance, which leads to better thermal reliability and reduced package costs of emerging many-core processors. The novelty of the new algorithm is that the task migration is done in a fully distributed way while we can still maintain some degrees of global view to guide the process. This is enabled by recently proposed distributed state tracking technique to dynamically estimate the average temperature of all the cores, which provides the important global view of the temperature of the whole chip to efficiently guide local task migration among cores. In addition, the local task migration will be carried out based on the power, temperature, and load influence from neighboring cores. Our experimental results on a 36 core microprocessor demonstrate that the proposed method can reduce 30% more thermal hot spots compared with the existing distributed thermal management method, leading to more balanced temperature distribution of many-core microprocessor chips.
在本文中,我们提出了一种新的分布式任务迁移方法,以减少热热点和片上温度差异,从而提高新兴多核处理器的热可靠性和降低封装成本。新算法的新颖之处在于任务迁移以完全分布式的方式完成,同时我们仍然可以保持一定程度的全局视图来指导过程。最近提出的分布式状态跟踪技术可以动态估计所有内核的平均温度,为整个芯片的温度提供重要的全局视图,从而有效地指导局部任务在内核之间的迁移。此外,本地任务迁移将根据邻近核心的功率、温度和负载影响进行。在36核微处理器上的实验结果表明,与现有的分布式热管理方法相比,该方法可以减少30%的热热点,从而使多核微处理器芯片的温度分布更加均衡。
{"title":"Distributed task migration for thermal hot spot reduction in many-core microprocessors","authors":"Zao Liu, Xin Huang, S. Tan, Hai Wang, H. Tang","doi":"10.1109/ASICON.2013.6811821","DOIUrl":"https://doi.org/10.1109/ASICON.2013.6811821","url":null,"abstract":"In this paper, we propose a new distributed task migration method to reduce the thermal hot spots and on-chip temperature variance, which leads to better thermal reliability and reduced package costs of emerging many-core processors. The novelty of the new algorithm is that the task migration is done in a fully distributed way while we can still maintain some degrees of global view to guide the process. This is enabled by recently proposed distributed state tracking technique to dynamically estimate the average temperature of all the cores, which provides the important global view of the temperature of the whole chip to efficiently guide local task migration among cores. In addition, the local task migration will be carried out based on the power, temperature, and load influence from neighboring cores. Our experimental results on a 36 core microprocessor demonstrate that the proposed method can reduce 30% more thermal hot spots compared with the existing distributed thermal management method, leading to more balanced temperature distribution of many-core microprocessor chips.","PeriodicalId":150654,"journal":{"name":"2013 IEEE 10th International Conference on ASIC","volume":"10 8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122458657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Improved unified interconnect unit for high speed and scalable FPGA 改进的高速可扩展FPGA统一互连单元
Pub Date : 2013-10-01 DOI: 10.1109/ASICON.2013.6811912
Lei Li, Jian Wang, Jinmei Lai
This paper presents a completely unified interconnect unit (UINT) including unified input and output multiplexers (UIM and UOM) which are usually non-repeatable [1-3]. UINT ensures different logic modules could have exactly the identical interconnect circuit, providing higher scalability for FPGAs. Furthermore, Multi-Vt switch circuit combining low threshold voltage and high threshold voltage transistors is put forward to minimize the adverse effects brought by threshold voltage loss and decrease of Supply Voltage in Nanometer technology, attaining high speed performance of FPGA. The proposed interconnect unit is applied to own-designed Fudan Programmable (FDP5) FPGA and realized through 65 nm technology. Post-layout simulation results indicate that the proposed interconnect circuit is well-designed with up to 40% improvement of speed performance compared to the prior work [3] equivalent to the same technology, yet maintaining lower power consumption and smaller area, reduced by 12% and 35% respectively.
本文提出了一种完全统一的互连单元(UINT),包括通常不可重复的统一输入和输出复用器(UIM和UOM)[1-3]。UINT确保不同的逻辑模块可以具有完全相同的互连电路,为fpga提供更高的可扩展性。此外,为了最大限度地减少纳米技术中阈值电压损失和电源电压降低带来的不利影响,提出了结合低阈值电压和高阈值电压晶体管的Multi-Vt开关电路,实现了FPGA的高速性能。该互连单元采用自主设计的复旦可编程FPGA (FDP5),采用65nm工艺实现。布局后仿真结果表明,所提出的互连电路设计良好,与同等技术的先前工作[3]相比,速度性能提高了40%,同时保持更低的功耗和更小的面积,分别降低了12%和35%。
{"title":"Improved unified interconnect unit for high speed and scalable FPGA","authors":"Lei Li, Jian Wang, Jinmei Lai","doi":"10.1109/ASICON.2013.6811912","DOIUrl":"https://doi.org/10.1109/ASICON.2013.6811912","url":null,"abstract":"This paper presents a completely unified interconnect unit (UINT) including unified input and output multiplexers (UIM and UOM) which are usually non-repeatable [1-3]. UINT ensures different logic modules could have exactly the identical interconnect circuit, providing higher scalability for FPGAs. Furthermore, Multi-Vt switch circuit combining low threshold voltage and high threshold voltage transistors is put forward to minimize the adverse effects brought by threshold voltage loss and decrease of Supply Voltage in Nanometer technology, attaining high speed performance of FPGA. The proposed interconnect unit is applied to own-designed Fudan Programmable (FDP5) FPGA and realized through 65 nm technology. Post-layout simulation results indicate that the proposed interconnect circuit is well-designed with up to 40% improvement of speed performance compared to the prior work [3] equivalent to the same technology, yet maintaining lower power consumption and smaller area, reduced by 12% and 35% respectively.","PeriodicalId":150654,"journal":{"name":"2013 IEEE 10th International Conference on ASIC","volume":"203 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123039703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An area-efficient implementation of ΣΔ ADC multistage decimation filter 一种面积高效的ΣΔ ADC多级抽取滤波器实现
Pub Date : 2013-10-01 DOI: 10.1109/ASICON.2013.6811998
Chenxi Deng
In this paper, an area-efficient implementation of a multistage decimation filter for audio ΣΔ ADC is presented. The decimator with a decimation ratio of 256 has less than 0.005dB passband ripple and 100dB stop band attenuation. It has an audio passband of 0-20kHz and outputs 16-bit resolution signal at 48kHz. With an area-efficient architecture involving RAM and ROM, and the dedicated instruction scheduling through 256 steps in a cycle, the decimator is synthesized with fewer than 300 LUTs and fewer than 160 Slices on a Xilinx Spartan3E FPGA. An ALU with only one 32-bit processing register and one 16-bit output register is designed. The computing rate or the clock rate is equal to the input sampling rate, which lowers power consumption and simplifies clock generation design. A Matlab compiler is developed to automate the generation of ROM word bits according to the instruction scheduling. At last, the simulation result of the RTL model in Modelsim is verified by the MatlabSimulink programs to ensure that the internal 32-bit register data is `bit true' while processing the 1-bit input stream.
本文提出了一种用于音频ΣΔ ADC的多级抽取滤波器的面积高效实现方法。抽取比为256的抽取器通带纹波小于0.005dB,阻带衰减小于100dB。它具有0-20kHz的音频通带,并在48kHz输出16位分辨率信号。该decimator采用面积高效的架构,包括RAM和ROM,以及在一个周期内通过256个步骤进行专用指令调度,在Xilinx Spartan3E FPGA上使用不到300个lut和不到160个Slices来合成。设计了一个只有一个32位处理寄存器和一个16位输出寄存器的ALU。计算速率或时钟速率等于输入采样率,降低功耗,简化时钟产生设计。开发了一个Matlab编译器,根据指令调度自动生成ROM字位。最后,通过MatlabSimulink程序对Modelsim中RTL模型的仿真结果进行验证,确保在处理1位输入流时,内部32位寄存器数据是“位真”的。
{"title":"An area-efficient implementation of ΣΔ ADC multistage decimation filter","authors":"Chenxi Deng","doi":"10.1109/ASICON.2013.6811998","DOIUrl":"https://doi.org/10.1109/ASICON.2013.6811998","url":null,"abstract":"In this paper, an area-efficient implementation of a multistage decimation filter for audio ΣΔ ADC is presented. The decimator with a decimation ratio of 256 has less than 0.005dB passband ripple and 100dB stop band attenuation. It has an audio passband of 0-20kHz and outputs 16-bit resolution signal at 48kHz. With an area-efficient architecture involving RAM and ROM, and the dedicated instruction scheduling through 256 steps in a cycle, the decimator is synthesized with fewer than 300 LUTs and fewer than 160 Slices on a Xilinx Spartan3E FPGA. An ALU with only one 32-bit processing register and one 16-bit output register is designed. The computing rate or the clock rate is equal to the input sampling rate, which lowers power consumption and simplifies clock generation design. A Matlab compiler is developed to automate the generation of ROM word bits according to the instruction scheduling. At last, the simulation result of the RTL model in Modelsim is verified by the MatlabSimulink programs to ensure that the internal 32-bit register data is `bit true' while processing the 1-bit input stream.","PeriodicalId":150654,"journal":{"name":"2013 IEEE 10th International Conference on ASIC","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114063336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A small-area low-power ADC array for image sensor applications 用于图像传感器应用的小面积低功耗ADC阵列
Pub Date : 2013-10-01 DOI: 10.1109/ASICON.2013.6812052
Shengyou Zhong, L. Yao, Jiqing Zhang
Integrating image sensor and analog-to-digital converter (ADC) array on the same substrate and developing digital image sensor become more and more popular with the development of CMOS technologies. It can reduce the complexity of the whole system and decrease the cost of the system. More importantly, it can integrate more digital signal processing which the traditional image sensor cannot achieve. Integrating ADC array into the image sensor requires ADC to have characters of low power and small area. In this work, a 10-bit is proposed resistor-capacitor hybrid successive approximation register ADC (RC SAR ADC) which uses a resistor ladder to generate 3 reference voltages for last 6 least significant bits (LSB) and shares the shift register of SAR control logic for the ADC array to reduce the area. Theory analysis and simulation show that the proposed RC SAR ADC is suitable for array applications.
随着CMOS技术的发展,将图像传感器与模数转换器(ADC)阵列集成在同一衬底上,开发数字图像传感器越来越受到人们的欢迎。它可以降低整个系统的复杂性,降低系统的成本。更重要的是,它可以集成更多传统图像传感器无法实现的数字信号处理。将ADC阵列集成到图像传感器中,要求ADC具有低功耗、小面积的特点。在这项工作中,提出了一个10位的电阻-电容混合连续逼近寄存器ADC (RC SAR ADC),它使用电阻阶梯为最后6个最低有效位(LSB)产生3个参考电压,并为ADC阵列共享SAR控制逻辑的移位寄存器,以减少面积。理论分析和仿真结果表明,所提出的RC SAR ADC适用于阵列应用。
{"title":"A small-area low-power ADC array for image sensor applications","authors":"Shengyou Zhong, L. Yao, Jiqing Zhang","doi":"10.1109/ASICON.2013.6812052","DOIUrl":"https://doi.org/10.1109/ASICON.2013.6812052","url":null,"abstract":"Integrating image sensor and analog-to-digital converter (ADC) array on the same substrate and developing digital image sensor become more and more popular with the development of CMOS technologies. It can reduce the complexity of the whole system and decrease the cost of the system. More importantly, it can integrate more digital signal processing which the traditional image sensor cannot achieve. Integrating ADC array into the image sensor requires ADC to have characters of low power and small area. In this work, a 10-bit is proposed resistor-capacitor hybrid successive approximation register ADC (RC SAR ADC) which uses a resistor ladder to generate 3 reference voltages for last 6 least significant bits (LSB) and shares the shift register of SAR control logic for the ADC array to reduce the area. Theory analysis and simulation show that the proposed RC SAR ADC is suitable for array applications.","PeriodicalId":150654,"journal":{"name":"2013 IEEE 10th International Conference on ASIC","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115931136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Design of a high throughput configurable variable-length FFT processor based on switch network architecture 基于交换网络架构的高吞吐量可配置变长FFT处理器的设计
Pub Date : 2013-10-01 DOI: 10.1109/ASICON.2013.6811852
Renfeng Dou, Yifan Bo, Jun Han, Xiaoyang Zeng
Fast Fourier transform (FFT) is one of the key operations in digital communication systems and digital signal processing platforms. This paper presents a design of high throughput variable-length FFT processor based on switch network (SN) architecture. Meanwhile, strong runtime configurability and scalability is exploited. Considering the support for variable-length FFT as well as the balance between speed and cost, the mixed-radix (MR) technique and in-place strategy are used. In addition, auto synchronization method is proposed to make stage pipelined mode work efficiently. Batch processing mode is also proposed to boost performance for small size FFTs. The results show that the throughput for 16-point to 256-point FFT can be improved from 5.8X to 1.2X, respectively. The processor supports 16- to 8192-point FFT and provides about 2GSamples/s for FFT size less than or equal to 256 by batch processing, and 1GSamples/s throughput for larger size FFT at 500MHz. The core area is 2.04 mm2 and the power consumption is 68 mW at 100MHz for 1k-point.
快速傅里叶变换(FFT)是数字通信系统和数字信号处理平台的关键运算之一。提出了一种基于交换网络结构的高吞吐量变长FFT处理器的设计方案。同时,它具有较强的运行时可配置性和可扩展性。考虑到对变长FFT的支持以及速度和成本之间的平衡,采用了混合基(MR)技术和原地策略。此外,提出了自动同步方法,使阶段流水线模式高效工作。批处理模式也被提出用于提高小尺寸fft的性能。结果表明,16点到256点FFT的吞吐量分别从5.8X提高到1.2X。该处理器支持16至8192点FFT,并提供约2GSamples/s的FFT大小小于或等于256批处理,和1GSamples/s吞吐量较大的FFT在500MHz。核心面积为2.04 mm2,功耗为68 mW, 100MHz, 1k点。
{"title":"Design of a high throughput configurable variable-length FFT processor based on switch network architecture","authors":"Renfeng Dou, Yifan Bo, Jun Han, Xiaoyang Zeng","doi":"10.1109/ASICON.2013.6811852","DOIUrl":"https://doi.org/10.1109/ASICON.2013.6811852","url":null,"abstract":"Fast Fourier transform (FFT) is one of the key operations in digital communication systems and digital signal processing platforms. This paper presents a design of high throughput variable-length FFT processor based on switch network (SN) architecture. Meanwhile, strong runtime configurability and scalability is exploited. Considering the support for variable-length FFT as well as the balance between speed and cost, the mixed-radix (MR) technique and in-place strategy are used. In addition, auto synchronization method is proposed to make stage pipelined mode work efficiently. Batch processing mode is also proposed to boost performance for small size FFTs. The results show that the throughput for 16-point to 256-point FFT can be improved from 5.8X to 1.2X, respectively. The processor supports 16- to 8192-point FFT and provides about 2GSamples/s for FFT size less than or equal to 256 by batch processing, and 1GSamples/s throughput for larger size FFT at 500MHz. The core area is 2.04 mm2 and the power consumption is 68 mW at 100MHz for 1k-point.","PeriodicalId":150654,"journal":{"name":"2013 IEEE 10th International Conference on ASIC","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115476654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2013 IEEE 10th International Conference on ASIC
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1