首页 > 最新文献

2009 22nd International Conference on VLSI Design最新文献

英文 中文
Defect Aware to Power Conscious Tests - The New DFT Landscape 缺陷意识到功率意识测试-新的DFT景观
Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.111
N. Mukherjee, J. Rajski, J. Tyszer
The rapid scaling of semiconductor devices along with technological innovations including material and process changes such as high-k gate dielectric, metal gate electrodes, etc., are making conventional fault models inadequate. In addition, the evolution of interconnects from single to multiple-levels, the use of new materials to meet the wire conductivity requirements and reduce dielectric permittivity, and the scaling of conventional metal/dielectric system have had significant impact on the performance and power dissipation of devices. Multi-core designs, heterogeneous component integration, and sophisticated packaging techniques further aggravate the challenge of testing such devices effectively. This tutorial will focus on some of the advances shaping the test industry today to address the above mentioned design and process changes. New fault models that are termed as “defect aware” are being proposed and there is a demand for test vectors targeting such defects. Bridging (static and dynamic), n-detect, stuck-open, inlineresistance, propagation delay, etc., are some examples of new fault models that are being used to various extent in the industry today. At the same time, at-speed testing is becoming the norm as the industry moves towards smaller technology nodes. Methods to handle false and multi-cycle paths effectively are common in practice to prevent unnecessary yield losses. For the first time, timing information is being considered during Automatic Test Pattern Generation (ATPG) targeting small-delay defects. Each one of the fault models will be discussed and the current trends in the industry along with some preliminary silicon experiments will be presented. Another industry trend posing a serious challenge to manufacturing test is the advent of poweraware designs. Various techniques such as architecture driven voltage reduction, switching activity minimization, switched capacitance minimization, and dynamic power management are being deployed in designing low power devices. New DFT techniques are required as well to limit power dissipation during test (preferably matching the power dissipation in functional mode of operation) thereby preventing IR drops, voltage droop, or hot spots. Test pattern generation needs to be tweaked to consider power dissipation during the key steps of the algorithm. In this tutorial, methods to control power dissipation during test, both from DFT as well as test generation perspectives will be discussed. As numerous fault models are being proposed, all the different pattern sets along with poweraware test pattern generation results in a significant increase in the number of patterns. This directly impacts test cost as both test application time as well as the tester memory needed to store the vectors are increasing. Compression schemes with not only aggressive compression ratios are being adopted; they need to fit into the low power DFT methodology. This tutorial will highlight some methods to handle low
半导体器件的快速缩放以及技术创新,包括材料和工艺变化,如高k栅极电介质,金属栅极电极等,使传统的故障模型变得不合适。此外,互连从单级到多级的演变,新材料的使用以满足导线导电性要求并降低介电常数,以及传统金属/介电系统的缩放都对器件的性能和功耗产生了重大影响。多核设计、异构组件集成和复杂的封装技术进一步加剧了有效测试此类器件的挑战。本教程将重点介绍当今测试行业的一些进步,以解决上述设计和过程变化。被称为“缺陷感知”的新故障模型正在被提出,并且需要针对此类缺陷的测试向量。桥接(静态和动态)、n检测、卡开、内联电阻、传播延迟等是当今工业中不同程度使用的新故障模型的一些例子。与此同时,随着行业向更小的技术节点发展,高速测试正成为一种常态。有效处理假路径和多周期路径的方法在实践中很常见,以防止不必要的产量损失。在针对小延迟缺陷的自动测试模式生成(ATPG)过程中,时序信息第一次被考虑。我们将讨论每一种故障模型,并介绍当前工业的发展趋势以及一些初步的硅实验。另一个对制造测试构成严重挑战的行业趋势是功率感知设计的出现。各种技术,如架构驱动的电压降低、开关活动最小化、开关电容最小化和动态电源管理,正在应用于设计低功耗器件。还需要新的DFT技术来限制测试期间的功耗(最好与功能操作模式下的功耗相匹配),从而防止IR下降、电压下降或热点。测试模式生成需要调整,以考虑算法关键步骤的功耗。在本教程中,将从DFT和测试生成的角度讨论在测试过程中控制功耗的方法。由于提出了大量的故障模型,所有不同的模式集以及功率感知测试模式生成导致模式数量的显著增加。这直接影响了测试成本,因为测试应用时间和存储向量所需的测试内存都在增加。压缩方案不仅采用激进的压缩比;它们需要适应低功耗DFT方法。本教程将重点介绍一些使用压缩处理低功耗设计的方法。此外,将讨论处理非常大的压缩比的高级压缩方案。
{"title":"Defect Aware to Power Conscious Tests - The New DFT Landscape","authors":"N. Mukherjee, J. Rajski, J. Tyszer","doi":"10.1109/VLSI.Design.2009.111","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.111","url":null,"abstract":"The rapid scaling of semiconductor devices along with technological innovations including material and process changes such as high-k gate dielectric, metal gate electrodes, etc., are making conventional fault models inadequate. In addition, the evolution of interconnects from single to multiple-levels, the use of new materials to meet the wire conductivity requirements and reduce dielectric permittivity, and the scaling of conventional metal/dielectric system have had significant impact on the performance and power dissipation of devices. Multi-core designs, heterogeneous component integration, and sophisticated packaging techniques further aggravate the challenge of testing such devices effectively. This tutorial will focus on some of the advances shaping the test industry today to address the above mentioned design and process changes. New fault models that are termed as “defect aware” are being proposed and there is a demand for test vectors targeting such defects. Bridging (static and dynamic), n-detect, stuck-open, inlineresistance, propagation delay, etc., are some examples of new fault models that are being used to various extent in the industry today. At the same time, at-speed testing is becoming the norm as the industry moves towards smaller technology nodes. Methods to handle false and multi-cycle paths effectively are common in practice to prevent unnecessary yield losses. For the first time, timing information is being considered during Automatic Test Pattern Generation (ATPG) targeting small-delay defects. Each one of the fault models will be discussed and the current trends in the industry along with some preliminary silicon experiments will be presented. Another industry trend posing a serious challenge to manufacturing test is the advent of poweraware designs. Various techniques such as architecture driven voltage reduction, switching activity minimization, switched capacitance minimization, and dynamic power management are being deployed in designing low power devices. New DFT techniques are required as well to limit power dissipation during test (preferably matching the power dissipation in functional mode of operation) thereby preventing IR drops, voltage droop, or hot spots. Test pattern generation needs to be tweaked to consider power dissipation during the key steps of the algorithm. In this tutorial, methods to control power dissipation during test, both from DFT as well as test generation perspectives will be discussed. As numerous fault models are being proposed, all the different pattern sets along with poweraware test pattern generation results in a significant increase in the number of patterns. This directly impacts test cost as both test application time as well as the tester memory needed to store the vectors are increasing. Compression schemes with not only aggressive compression ratios are being adopted; they need to fit into the low power DFT methodology. This tutorial will highlight some methods to handle low ","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"520 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123077671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A General Approach to High-Level Energy and Performance Estimation in SoCs soc中高层次能量和性能评估的一般方法
Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.25
S. Penolazzi, A. Hemani, Luca Bolognino
We present a high-level methodology for efficient and accurate estimation of energy and performance in SoCs at Functional Untimed Level. We then validate the proposed method against gate level for accuracy and against TLM-PV for speed. We show that the method is within 15 % of gate-level accuracy and in average 28x faster than TLM-PV, for the benchmark applications selected.
我们提出了一种高水平的方法,用于在功能非定时水平上对soc的能量和性能进行有效和准确的估计。然后,我们针对栅极电平验证了所提出的方法的准确性,并针对TLM-PV验证了该方法的速度。我们表明,对于所选的基准应用,该方法的门级精度在15%以内,平均速度比TLM-PV快28倍。
{"title":"A General Approach to High-Level Energy and Performance Estimation in SoCs","authors":"S. Penolazzi, A. Hemani, Luca Bolognino","doi":"10.1109/VLSI.Design.2009.25","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.25","url":null,"abstract":"We present a high-level methodology for efficient and accurate estimation of energy and performance in SoCs at Functional Untimed Level. We then validate the proposed method against gate level for accuracy and against TLM-PV for speed. We show that the method is within 15 % of gate-level accuracy and in average 28x faster than TLM-PV, for the benchmark applications selected.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123086573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A Low Voltage CMOS Proportional-to-Absolute Temperature Current Reference 一种低电压CMOS比例绝对温度电流基准
Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.18
S. Wadhwa
A CMOS low voltage Proportional-to-Absolute Temperature current reference is presented. The proposed circuit can work with supply voltages as low as 1.1V. The circuit is designed in 90nm CMOS technology for 2.2µA reference current at typical corner, 27C, 1.2V. The circuit has been extensively simulated across all possible combinations of MOSFET, Resistor, BJT, supply voltage and temperature variation corners. Simulation results have been given for a wide temperature variation from -40C to 125C and supply voltage variation from 1.1V to 1.3V.
提出了一种CMOS低电压绝对温度比例电流基准。所提出的电路可以在低至1.1V的电源电压下工作。该电路采用90nm CMOS技术设计,参考电流为2.2µA,温度为27C,电压为1.2V。该电路已在MOSFET,电阻,BJT,电源电压和温度变化角的所有可能组合中进行了广泛的模拟。仿真结果表明,温度变化范围从-40℃到125℃,电源电压变化范围从1.1V到1.3V。
{"title":"A Low Voltage CMOS Proportional-to-Absolute Temperature Current Reference","authors":"S. Wadhwa","doi":"10.1109/VLSI.Design.2009.18","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.18","url":null,"abstract":"A CMOS low voltage Proportional-to-Absolute Temperature current reference is presented. The proposed circuit can work with supply voltages as low as 1.1V. The circuit is designed in 90nm CMOS technology for 2.2µA reference current at typical corner, 27C, 1.2V. The circuit has been extensively simulated across all possible combinations of MOSFET, Resistor, BJT, supply voltage and temperature variation corners. Simulation results have been given for a wide temperature variation from -40C to 125C and supply voltage variation from 1.1V to 1.3V.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"377 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122858870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Specification Driven Design of Phase Locked Loops 锁相环的规范驱动设计
Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.97
P. Easwaran, Prasenjit Bhowmik, Rupak Ghayal
The factors that impact the topology and the performance specifications for a Phase locked loop (PLL) is presented. Correct specification of the PLL is critical for optimizing the performance and power of the system. PLL specifications for different systems have been derived and the architectural tradeoffs have been discussed. Three PLL design examples have been presented for WLAN base-bandPLL application, DVB-H receiver base-band PLL application and a high speed (MIPI) transmitter application.
介绍了影响锁相环拓扑结构和性能指标的因素。正确的锁相环规格对于优化系统的性能和功率至关重要。推导了不同系统的锁相环规范,并讨论了体系结构的权衡。给出了三种锁相环设计实例,分别用于WLAN基带锁相环应用、DVB-H接收基带锁相环应用和高速(MIPI)发射机应用。
{"title":"Specification Driven Design of Phase Locked Loops","authors":"P. Easwaran, Prasenjit Bhowmik, Rupak Ghayal","doi":"10.1109/VLSI.Design.2009.97","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.97","url":null,"abstract":"The factors that impact the topology and the performance specifications for a Phase locked loop (PLL) is presented. Correct specification of the PLL is critical for optimizing the performance and power of the system. PLL specifications for different systems have been derived and the architectural tradeoffs have been discussed. Three PLL design examples have been presented for WLAN base-bandPLL application, DVB-H receiver base-band PLL application and a high speed (MIPI) transmitter application.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125138288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Novel MOS Decoupling Capacitor Optimization Technique for Nanotechnologies 新型纳米MOS去耦电容优化技术
Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.37
B. Bozorgzadeh, A. Afzali-Kusha
Designing MOS decoupling capacitors (DECAPs) in nanotechnologies provides many challenges due to the existing trade-offs among transient time response behavior, area, and gate leakage current. In this paper first it is shown that all of these challenges are functions of the MOS DECAP channel length. Then, we propose a method for optimizing the channel length of MOS DECAPs. The technique is applied to 45nm and 32nm technology nodes and the results are extracted using HSPICE simulations. Also the accuracy of the proposed technique is verified. Finally, based on the results, two optimum DECAP configurations which provide trades off among area and gate leakage for different applications in nanotechnologies are proposed.
在纳米技术中设计MOS去耦电容器(DECAPs)由于存在瞬态时间响应行为,面积和栅极泄漏电流之间的权衡而面临许多挑战。本文首先证明了所有这些挑战都是MOS DECAP信道长度的函数。然后,我们提出了一种优化MOS decap通道长度的方法。该技术应用于45nm和32nm技术节点,并使用HSPICE模拟提取结果。并验证了该方法的准确性。最后,在此基础上,提出了两种最佳的DECAP结构,可以在纳米技术的不同应用中权衡面积和栅极泄漏。
{"title":"Novel MOS Decoupling Capacitor Optimization Technique for Nanotechnologies","authors":"B. Bozorgzadeh, A. Afzali-Kusha","doi":"10.1109/VLSI.Design.2009.37","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.37","url":null,"abstract":"Designing MOS decoupling capacitors (DECAPs) in nanotechnologies provides many challenges due to the existing trade-offs among transient time response behavior, area, and gate leakage current. In this paper first it is shown that all of these challenges are functions of the MOS DECAP channel length. Then, we propose a method for optimizing the channel length of MOS DECAPs. The technique is applied to 45nm and 32nm technology nodes and the results are extracted using HSPICE simulations. Also the accuracy of the proposed technique is verified. Finally, based on the results, two optimum DECAP configurations which provide trades off among area and gate leakage for different applications in nanotechnologies are proposed.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129957372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Design-Space Exploration of Energy-Delay-Area Efficient Coarse-Grain Reconfigurable Datapath 能量-延迟-区域高效粗粒可重构数据路径的设计空间探索
Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.33
Sohan Purohit, M. Lanuzza, S. Perri, P. Corsonello, M. Margala
This paper presents the VLSI design of a high data throughput, energy and area efficient data path targeted for DSP and multimedia applications. Three different implementations of the reconfigurable data path using static, dynamic domino and D3L logic styles are presented to serve as low power, high speed, and speed-energy optimized variants of the architecture. When implemented using ST Microelectronics 90nm 1V CMOS technology, the proposed data path leads to a maximum supported clock frequency ranging from 917 MHz to 1.2 GHz with a dynamic power consumption @ 500 MHz ranging from 788 µW   to 1.02 mW.
本文针对DSP和多媒体应用,设计了一种高数据吞吐量、节能、高效的VLSI数据通道。本文提出了使用静态、动态domino和D3L逻辑样式的可重构数据路径的三种不同实现,作为该体系结构的低功耗、高速和速度-能量优化变体。当使用意法半导体90nm 1V CMOS技术实现时,所提出的数据路径导致支持的最大时钟频率范围为917 MHz至1.2 GHz,动态功耗@ 500 MHz范围为788 μ W至1.02 mW。
{"title":"Design-Space Exploration of Energy-Delay-Area Efficient Coarse-Grain Reconfigurable Datapath","authors":"Sohan Purohit, M. Lanuzza, S. Perri, P. Corsonello, M. Margala","doi":"10.1109/VLSI.Design.2009.33","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.33","url":null,"abstract":"This paper presents the VLSI design of a high data throughput, energy and area efficient data path targeted for DSP and multimedia applications. Three different implementations of the reconfigurable data path using static, dynamic domino and D3L logic styles are presented to serve as low power, high speed, and speed-energy optimized variants of the architecture. When implemented using ST Microelectronics 90nm 1V CMOS technology, the proposed data path leads to a maximum supported clock frequency ranging from 917 MHz to 1.2 GHz with a dynamic power consumption @ 500 MHz ranging from 788 µW   to 1.02 mW.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"26 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120848443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Improving scalability and per-core performance in multi-cores through resource sharing and reconfiguration 通过资源共享和重新配置提高多核中的可伸缩性和每核性能
Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.58
Tameesh Suri, Aneesh Aggarwal
Increasing the number of cores in a multi-core processor reduces per-core performance. On the other hand, providing more resources to each core limits the number of cores on a chip. In this paper, we propose a mechanism to improve the per-core performance while maintaining the scalability.In particular, we integrate a Reconfigurable Hardware Unit (RHU) in the resource-constrained cores to improve their performance. The RHU executes the frequently encountered instructions to increase the core's overall execution bandwidth, thus improving its performance. The RHU has low area overhead, and hence has minimal impact on scalabilityof the number of cores. To further limit the area overhead of this performance improving mechanism, generation of the reconfiguration bits for the RHUs of multiple cores isdelegated to a single core. Our experiments show that the proposed architecture improves the per-core performance by an average of about 23% across a wide range of applications, while incurring a small per-core area overhead.
增加多核处理器的核数会降低每核性能。另一方面,为每个核心提供更多的资源限制了芯片上的核心数量。在本文中,我们提出了一种机制来提高每核性能,同时保持可伸缩性。特别是,我们在资源受限的核心中集成了可重构硬件单元(RHU)以提高其性能。RHU执行经常遇到的指令,以增加核心的整体执行带宽,从而提高其性能。RHU具有较低的面积开销,因此对核心数量的可扩展性影响最小。为了进一步限制这种性能改进机制的面积开销,多核rhu的重新配置位的生成被委托给单个核。我们的实验表明,在广泛的应用程序中,所提出的体系结构将每核性能平均提高了约23%,同时产生了很小的每核面积开销。
{"title":"Improving scalability and per-core performance in multi-cores through resource sharing and reconfiguration","authors":"Tameesh Suri, Aneesh Aggarwal","doi":"10.1109/VLSI.Design.2009.58","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.58","url":null,"abstract":"Increasing the number of cores in a multi-core processor reduces per-core performance. On the other hand, providing more resources to each core limits the number of cores on a chip. In this paper, we propose a mechanism to improve the per-core performance while maintaining the scalability.In particular, we integrate a Reconfigurable Hardware Unit (RHU) in the resource-constrained cores to improve their performance. The RHU executes the frequently encountered instructions to increase the core's overall execution bandwidth, thus improving its performance. The RHU has low area overhead, and hence has minimal impact on scalabilityof the number of cores. To further limit the area overhead of this performance improving mechanism, generation of the reconfiguration bits for the RHUs of multiple cores isdelegated to a single core. Our experiments show that the proposed architecture improves the per-core performance by an average of about 23% across a wide range of applications, while incurring a small per-core area overhead.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133734361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
FPGA Implementation of Support Vector Machine Based Isolated Digit Recognition System 基于支持向量机的孤立数字识别系统的FPGA实现
Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.23
J. Manikandan, B. Venkataramani, V. Avanthi
In this paper, two schemes for FPGA implementation of multi-class SVM based isolated digit recognition system are proposed, one using only logic elements and another using both soft-core processor and logic elements(LEs). One of the major contributions of this paper is the proposal for implementation of the decision function using only fixed point arithmetic without compromising the recognition accuracy. Compared to the scheme which uses floating point arithmetic, the proposed scheme reduces the number of LEs required by a factor of 3.29. The second scheme proposed results in about 25 times lower area compared to the first scheme. For the soft-core processor approach, a custom instruction is proposed for floating point arithmetic. Speaker dependent TI46 database of isolated digits is used for training and testing. Features are extracted using both Linear Predictive Coefficients (LPC) and Mel Frequency Cepstral Coefficients(MFCC) and features are compressed using Self Organized Feature Mapping (SOFM). This in turn is used by the SVM classifier to evaluate the recognition accuracy and the hardware resources utilized. Both the schemes proposed result in 100% recognition accuracy when implemented on Altera Cyclone II FPGA. The proposed schemes can also be used for speaker verification and speaker authentication applications. Since the scheme which uses soft-core processor requires lower area, it can be used for systems which require a large vocabulary size.
本文提出了两种FPGA实现基于多类支持向量机的隔离数字识别系统的方案,一种方案仅使用逻辑元件,另一种方案同时使用软核处理器和逻辑元件。本文的主要贡献之一是在不影响识别精度的情况下,仅使用不动点算法实现决策函数。与使用浮点算法的方案相比,所提出的方案将所需的le数量减少了3.29倍。与第一方案相比,第二方案的面积减少了约25倍。对于软核处理器方法,提出了一个自定义的浮点运算指令。与说话人相关的TI46孤立数字数据库用于训练和测试。使用线性预测系数(LPC)和Mel频率倒谱系数(MFCC)提取特征,并使用自组织特征映射(SOFM)压缩特征。支持向量机分类器利用这一数据来评估识别精度和所使用的硬件资源。在Altera Cyclone II FPGA上实现后,两种方案的识别准确率均达到100%。所提出的方案也可用于说话人验证和说话人身份验证应用。由于采用软核处理器的方案占用的空间较小,因此可用于对词汇量要求较大的系统。
{"title":"FPGA Implementation of Support Vector Machine Based Isolated Digit Recognition System","authors":"J. Manikandan, B. Venkataramani, V. Avanthi","doi":"10.1109/VLSI.Design.2009.23","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.23","url":null,"abstract":"In this paper, two schemes for FPGA implementation of multi-class SVM based isolated digit recognition system are proposed, one using only logic elements and another using both soft-core processor and logic elements(LEs). One of the major contributions of this paper is the proposal for implementation of the decision function using only fixed point arithmetic without compromising the recognition accuracy. Compared to the scheme which uses floating point arithmetic, the proposed scheme reduces the number of LEs required by a factor of 3.29. The second scheme proposed results in about 25 times lower area compared to the first scheme. For the soft-core processor approach, a custom instruction is proposed for floating point arithmetic. Speaker dependent TI46 database of isolated digits is used for training and testing. Features are extracted using both Linear Predictive Coefficients (LPC) and Mel Frequency Cepstral Coefficients(MFCC) and features are compressed using Self Organized Feature Mapping (SOFM). This in turn is used by the SVM classifier to evaluate the recognition accuracy and the hardware resources utilized. Both the schemes proposed result in 100% recognition accuracy when implemented on Altera Cyclone II FPGA. The proposed schemes can also be used for speaker verification and speaker authentication applications. Since the scheme which uses soft-core processor requires lower area, it can be used for systems which require a large vocabulary size.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132920291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 35
Robust Circuit Design: Challenges and Solutions 稳健电路设计:挑战与解决方案
Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.119
S. Tiwary, Amith Singhee, V. Chandra
Scaling with Moore’s law is taking us to feature sizes of 32nm and smaller. At these technology nodes designers are faced with an explosion in design complexity at all levels. In this tutorial we discuss three somewhat novel and particularly confounding dimensions of this complexity: • Electrical complexity: Digital circuit designers have particularly benefited from abstractions of the underlying MOS devices while designing circuits. For them, a transistor is an ideal switch and a wire is a perfect short between two nodes. This simplified abstraction is the driving force behind our capability to design and verify chips with about a billion transistors today. However, with aggressive device scaling, the properties of the devices that are being manufactured today are moving further away from the abstractions that we have been using to verify our designs. In this section of the tutorial, we look at some of the recent trends along these lines and some of the techniques that designers use to extract ideal functionality from non-ideal devices. We use design examples, both from analog/mixed-signal (PLL, ADC design) and digital domain (clock tree, power network, static timing, etc.), as illustrative cases studies. • Manufacturing complexity: Minimum feature sizes at 45nm are already a quarter of the wavelength of light used for lithography. Consequently, imperfections in manufacturing are unavoidable and large enough to significantly change the intended design, resulting in dreaded yield loss. Any design today has to satisfy stringent manufacturability and yield requirements. At the same time, the complexity of critical variation mechanisms renders any simplified methods, like corner analysis, ineffective. Design methods and tools are being changed at all levels of the design flow to improve yield prediction and increase manufacturing robustness. In this vein, this tutorial will cover a broad spectrum of topics: 1) relevant state-of-the-art manufacturing process steps at 45 nm (193 nm lithography, ion implantation, etc.) and the physical mechanisms resulting in electrical performance variations, 2) recently proposed design techniques for mitigating the electrical variability, and 3) recently proposed design tools for increasing robustness and predicting the yield impact of this variability. We will look at various design phases from circuit architecture down to post-layout, and at several applications from SRAMs to ASICs to analog. • Reliability complexity: With nearly three decades of continued CMOS scaling, the devices have now been pushed to their physical and reliability limits. Transistors on the latest chips in 45nm technology are so small that some of their parts are just a few atoms apart. Designs manufactured correctly may become unreliable over time because of mechanisms like NBTI, gate oxide breakdown and soft errors. The impact of unreliability manifests as time-dependent variability where the electrical characteristics of the devices vary st
根据摩尔定律,我们的特征尺寸可以达到32纳米甚至更小。在这些技术节点上,设计师在各个层面上都面临着设计复杂性的爆炸。在本教程中,我们将讨论这种复杂性的三个新颖且特别令人困惑的方面:•电气复杂性:数字电路设计者在设计电路时特别受益于底层MOS器件的抽象。对他们来说,晶体管是理想的开关,导线是两个节点之间完美的短路。这种简化的抽象是我们今天设计和验证大约十亿个晶体管芯片的能力背后的驱动力。然而,随着设备规模的不断扩大,如今正在制造的设备的属性正在远离我们用来验证设计的抽象概念。在本教程的这一部分中,我们将介绍这些方面的一些最新趋势,以及设计师用来从非理想设备中提取理想功能的一些技术。我们使用模拟/混合信号(锁相环,ADC设计)和数字域(时钟树,电源网络,静态时序等)的设计示例作为说明性案例研究。•制造复杂性:45nm的最小特征尺寸已经是光刻所用光波长的四分之一。因此,制造中的缺陷是不可避免的,并且大到足以显著改变预期的设计,导致可怕的产量损失。今天的任何设计都必须满足严格的可制造性和良率要求。同时,关键变分机制的复杂性使得任何简化的方法,如拐角分析,都是无效的。设计方法和工具正在改变设计流程的各个层面,以改善良率预测并增加制造稳健性。在这方面,本教程将涵盖广泛的主题:1)相关的最先进的45纳米制造工艺步骤(193纳米光刻,离子注入等)和导致电性能变化的物理机制,2)最近提出的减轻电性能变化的设计技术,以及3)最近提出的增加稳健性和预测这种变化对产量影响的设计工具。我们将研究从电路架构到后期布局的各个设计阶段,以及从sram到asic到模拟的几个应用。•可靠性复杂性:随着近三十年CMOS的持续扩展,这些设备现在已经达到了其物理和可靠性的极限。采用45纳米技术的最新芯片上的晶体管非常小,以至于它们的一些部件之间只有几个原子的距离。由于NBTI、栅氧化击穿和软错误等机制,正确制造的设计可能会随着时间的推移而变得不可靠。不可靠性的影响表现为时间相关的可变性,其中设备的电气特性以时间方式在统计上变化,直接转化为制造芯片的设计不确定性。扩展到45纳米以下的技术节点改变了可靠性影响的性质,从突然的功能问题到性能特征的逐步退化。本节教程中提供的材料旨在让设计师形成一个彻底的
{"title":"Robust Circuit Design: Challenges and Solutions","authors":"S. Tiwary, Amith Singhee, V. Chandra","doi":"10.1109/VLSI.Design.2009.119","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.119","url":null,"abstract":"Scaling with Moore’s law is taking us to feature sizes of 32nm and smaller. At these technology nodes designers are faced with an explosion in design complexity at all levels. In this tutorial we discuss three somewhat novel and particularly confounding dimensions of this complexity: • Electrical complexity: Digital circuit designers have particularly benefited from abstractions of the underlying MOS devices while designing circuits. For them, a transistor is an ideal switch and a wire is a perfect short between two nodes. This simplified abstraction is the driving force behind our capability to design and verify chips with about a billion transistors today. However, with aggressive device scaling, the properties of the devices that are being manufactured today are moving further away from the abstractions that we have been using to verify our designs. In this section of the tutorial, we look at some of the recent trends along these lines and some of the techniques that designers use to extract ideal functionality from non-ideal devices. We use design examples, both from analog/mixed-signal (PLL, ADC design) and digital domain (clock tree, power network, static timing, etc.), as illustrative cases studies. • Manufacturing complexity: Minimum feature sizes at 45nm are already a quarter of the wavelength of light used for lithography. Consequently, imperfections in manufacturing are unavoidable and large enough to significantly change the intended design, resulting in dreaded yield loss. Any design today has to satisfy stringent manufacturability and yield requirements. At the same time, the complexity of critical variation mechanisms renders any simplified methods, like corner analysis, ineffective. Design methods and tools are being changed at all levels of the design flow to improve yield prediction and increase manufacturing robustness. In this vein, this tutorial will cover a broad spectrum of topics: 1) relevant state-of-the-art manufacturing process steps at 45 nm (193 nm lithography, ion implantation, etc.) and the physical mechanisms resulting in electrical performance variations, 2) recently proposed design techniques for mitigating the electrical variability, and 3) recently proposed design tools for increasing robustness and predicting the yield impact of this variability. We will look at various design phases from circuit architecture down to post-layout, and at several applications from SRAMs to ASICs to analog. • Reliability complexity: With nearly three decades of continued CMOS scaling, the devices have now been pushed to their physical and reliability limits. Transistors on the latest chips in 45nm technology are so small that some of their parts are just a few atoms apart. Designs manufactured correctly may become unreliable over time because of mechanisms like NBTI, gate oxide breakdown and soft errors. The impact of unreliability manifests as time-dependent variability where the electrical characteristics of the devices vary st","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131948128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variation-Aware Macromodeling and Synthesis of Analog Circuits Using Spline Center and Range Method and Dynamically Reduced Design Space 基于样条中心与范围法和动态缩减设计空间的模拟电路变化感知宏观建模与综合
Pub Date : 2009-01-05 DOI: 10.1109/VLSI.Design.2009.51
Shubhankar Basu, Balaji Kommineni, R. Vemuri
Manufacturing and process irregularities in nanometer technologies can degrade yield and severely slow down the design cycle time. Process variation aware methodologies can help in yield improvement and meeting time-to-market requirements for system-on-chip designs. Analog circuits are extremely sensitive to device mismatches and exhibit non-linear variations in their performance under the influence of manufacturing irregularities. Performance variation in blocks can lead to degraded system performance. In this work, we present a variation-aware performance macromodeling technique for analog building blocks that is fast and accurate and guarantees convergence during synthesis. The improvements in accuracy and time complexity of the macromodel generation process is achieved by constructing a target design region graph and dynamic reduction of the design space. The target design region also helps in reducing time during re-synthesis and achieving faster convergence. Experimental results demonstrate the accuracy of the macromodels and the reduction in synthesis time compared to spice based simulation-in-the-loop evaluations and static and adaptive sampling based techniques.
纳米技术中制造和工艺的不规则性会降低良率并严重减慢设计周期。过程变化感知方法可以帮助提高成品率,满足片上系统设计的上市时间要求。模拟电路对器件失配非常敏感,在制造不规则性的影响下,其性能表现出非线性变化。块中的性能变化可能导致系统性能下降。在这项工作中,我们提出了一种变化感知的模拟构建块性能宏建模技术,该技术快速准确,并保证了合成过程中的收敛性。通过构建目标设计区域图和对设计空间进行动态约简,提高了宏模型生成过程的精度和时间复杂度。目标设计区域还有助于减少重新合成期间的时间并实现更快的收敛。实验结果表明,与基于香料的环内仿真评估和基于静态和自适应采样的技术相比,宏观模型的准确性和合成时间的缩短。
{"title":"Variation-Aware Macromodeling and Synthesis of Analog Circuits Using Spline Center and Range Method and Dynamically Reduced Design Space","authors":"Shubhankar Basu, Balaji Kommineni, R. Vemuri","doi":"10.1109/VLSI.Design.2009.51","DOIUrl":"https://doi.org/10.1109/VLSI.Design.2009.51","url":null,"abstract":"Manufacturing and process irregularities in nanometer technologies can degrade yield and severely slow down the design cycle time. Process variation aware methodologies can help in yield improvement and meeting time-to-market requirements for system-on-chip designs. Analog circuits are extremely sensitive to device mismatches and exhibit non-linear variations in their performance under the influence of manufacturing irregularities. Performance variation in blocks can lead to degraded system performance. In this work, we present a variation-aware performance macromodeling technique for analog building blocks that is fast and accurate and guarantees convergence during synthesis. The improvements in accuracy and time complexity of the macromodel generation process is achieved by constructing a target design region graph and dynamic reduction of the design space. The target design region also helps in reducing time during re-synthesis and achieving faster convergence. Experimental results demonstrate the accuracy of the macromodels and the reduction in synthesis time compared to spice based simulation-in-the-loop evaluations and static and adaptive sampling based techniques.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122910826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
期刊
2009 22nd International Conference on VLSI Design
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1