One of the goals of clock tree synthesis in ASIC design flow is skew minimization. There are several approaches used in traditional clock tree synthesis tools to achieve this goal. However, many of the approaches create a large number of clock-buffer levels while others result in congested clock routing. Increase in buffer level and routing congestion essentially triggers the problem of increase in buffer area and total power. Also the performance of the circuit is degraded due to on-chip variation in such situations. For certain fan-out number restricted designs, a few proposals with H-tree routed clock nets have been proposed to reduce the skew, but those proposals can hardly be used across various designs used in industry. Here we propose a method where skew minimization is mainly achieved by structured routing of clock nets. Finally, we show that with this proposal, for a few real designs from industry, we could reduce the skew up to 6.5% with increase in total wire delay up to 1.89% compared to when simple H-tree routing was deployed.
{"title":"Clock Tree Skew Minimization with Structured Routing","authors":"Pinaki Chakrabarti","doi":"10.1109/VLSID.2012.76","DOIUrl":"https://doi.org/10.1109/VLSID.2012.76","url":null,"abstract":"One of the goals of clock tree synthesis in ASIC design flow is skew minimization. There are several approaches used in traditional clock tree synthesis tools to achieve this goal. However, many of the approaches create a large number of clock-buffer levels while others result in congested clock routing. Increase in buffer level and routing congestion essentially triggers the problem of increase in buffer area and total power. Also the performance of the circuit is degraded due to on-chip variation in such situations. For certain fan-out number restricted designs, a few proposals with H-tree routed clock nets have been proposed to reduce the skew, but those proposals can hardly be used across various designs used in industry. Here we propose a method where skew minimization is mainly achieved by structured routing of clock nets. Finally, we show that with this proposal, for a few real designs from industry, we could reduce the skew up to 6.5% with increase in total wire delay up to 1.89% compared to when simple H-tree routing was deployed.","PeriodicalId":405021,"journal":{"name":"2012 25th International Conference on VLSI Design","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121253518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper presents a companding continuous-time ΔΣ ADC for audio applications. The 3rd-order modulator uses a 3-bit companding quantizer and has an over sampling rate of 64. The companding quantizer is implemented by a log amplifier followed by a flash ADC. The modulator, in simulation, achieves a peak signal-to-noise ratio of 76 dB, a dynamic range of 110 dB in a 24 kHz bandwidth and dissipates 860 μW of power.
{"title":"A 110-dB Dynamic Range, 76-dB Peak SNR Companding Continuous-Time ?S Modulator for Audio Applications","authors":"Saravana Kumar, S. Chatterjee","doi":"10.1109/VLSID.2012.45","DOIUrl":"https://doi.org/10.1109/VLSID.2012.45","url":null,"abstract":"This paper presents a companding continuous-time ΔΣ ADC for audio applications. The 3rd-order modulator uses a 3-bit companding quantizer and has an over sampling rate of 64. The companding quantizer is implemented by a log amplifier followed by a flash ADC. The modulator, in simulation, achieves a peak signal-to-noise ratio of 76 dB, a dynamic range of 110 dB in a 24 kHz bandwidth and dissipates 860 μW of power.","PeriodicalId":405021,"journal":{"name":"2012 25th International Conference on VLSI Design","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126929923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An RF envelope detector (ED) and an asynchronous latching circuit have been designed for a wake-up receiver in 400 MHz MICS and 433 MHz / 915 MHz ISM band. The architecture is designed to tolerate significant process, supply-voltage and temperature variations. An alternative bit encoding technique has been used, which eliminates the need for symbol synchronization and the associated circuitry. The power consumption of the entire circuit, which is designed using 1.8 V supply voltage and 180 nm CMOS process, is limited to 43 uW during symbol detection and is limited to 34 uW when no input signal activity is present in the receiver RF front-end. For an input current swing of ±3 μA from the RF front-end, the circuit successfully detects up to a 2.5 Mbps input data rate.
{"title":"An Ultra-low Power Symbol Detection Methodology and Its Circuit Implementation for a Wake-up Receiver in Wireless Sensor Nodes","authors":"D. K. Meher, A. Salimath, Achintya Halder","doi":"10.1109/VLSID.2012.83","DOIUrl":"https://doi.org/10.1109/VLSID.2012.83","url":null,"abstract":"An RF envelope detector (ED) and an asynchronous latching circuit have been designed for a wake-up receiver in 400 MHz MICS and 433 MHz / 915 MHz ISM band. The architecture is designed to tolerate significant process, supply-voltage and temperature variations. An alternative bit encoding technique has been used, which eliminates the need for symbol synchronization and the associated circuitry. The power consumption of the entire circuit, which is designed using 1.8 V supply voltage and 180 nm CMOS process, is limited to 43 uW during symbol detection and is limited to 34 uW when no input signal activity is present in the receiver RF front-end. For an input current swing of ±3 μA from the RF front-end, the circuit successfully detects up to a 2.5 Mbps input data rate.","PeriodicalId":405021,"journal":{"name":"2012 25th International Conference on VLSI Design","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125684861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Resonant rotary clocking is a high-frequency, low-power technology for high performance integrated circuits (IC). The implementation of the rotary clocking technology requires long interconnects with varying geometric shape segments on the chip, which are modeled by transmission lines. The parasitics exhibited by the transmission line interconnects play a major role in characterizing the high frequency operation. To this end, the impact of parasitics on the operating characteristics of the rotary rings due to the different interconnect segments are identified. The interconnect parasitics are analyzed using a 3D finite element method based full wave electromagnetic analysis. Simulations performed for the rotary ring with 3D full wave based parasitic analysis results in 23.68% reduced clock frequency when compared with a conventional 2D based parasitic analysis. The power dissipated on the rotary ring simulated using the 3D full wave based parasitic analysis is around 84% less than the clock tree and is within 5% of the power dissipated on the ring simulated using the 2D based parasitic analysis.
{"title":"3-D Parasitic Modeling for Rotary Interconnects","authors":"V. Honkote, A. More, B. Taskin","doi":"10.1109/VLSID.2012.60","DOIUrl":"https://doi.org/10.1109/VLSID.2012.60","url":null,"abstract":"Resonant rotary clocking is a high-frequency, low-power technology for high performance integrated circuits (IC). The implementation of the rotary clocking technology requires long interconnects with varying geometric shape segments on the chip, which are modeled by transmission lines. The parasitics exhibited by the transmission line interconnects play a major role in characterizing the high frequency operation. To this end, the impact of parasitics on the operating characteristics of the rotary rings due to the different interconnect segments are identified. The interconnect parasitics are analyzed using a 3D finite element method based full wave electromagnetic analysis. Simulations performed for the rotary ring with 3D full wave based parasitic analysis results in 23.68% reduced clock frequency when compared with a conventional 2D based parasitic analysis. The power dissipated on the rotary ring simulated using the 3D full wave based parasitic analysis is around 84% less than the clock tree and is within 5% of the power dissipated on the ring simulated using the 2D based parasitic analysis.","PeriodicalId":405021,"journal":{"name":"2012 25th International Conference on VLSI Design","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129189672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this paper, we propose a technique to improve the accuracy of the final design predicted by Geometric Programming based CMOS analog circuit sizing methodology. Here we use a multi-level AC performance modeling paradigm to develop the empirical models of circuit performance metrics. Performance models are then upgraded over iterations of design cycle. This iterative model up gradation in a sequence of geometric programming guides the final design to converge with better accuracy. The methodology is validated by designing a two-stage amplifier cascaded with a Class-A (source-follower) output buffer stage in UMC 0.18 μm technology.
{"title":"Iterative Performance Model Upgradation in Geometric Programming Based Analog Circuit Sizing for Improved Design Accuracy","authors":"S. Dam, P. Mandal","doi":"10.1109/VLSID.2012.100","DOIUrl":"https://doi.org/10.1109/VLSID.2012.100","url":null,"abstract":"In this paper, we propose a technique to improve the accuracy of the final design predicted by Geometric Programming based CMOS analog circuit sizing methodology. Here we use a multi-level AC performance modeling paradigm to develop the empirical models of circuit performance metrics. Performance models are then upgraded over iterations of design cycle. This iterative model up gradation in a sequence of geometric programming guides the final design to converge with better accuracy. The methodology is validated by designing a two-stage amplifier cascaded with a Class-A (source-follower) output buffer stage in UMC 0.18 μm technology.","PeriodicalId":405021,"journal":{"name":"2012 25th International Conference on VLSI Design","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131914816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
R. Mukherjee, H. Rahaman, I. Banerjee, T. Samanta, P. Dasgupta
Design automation in Digital micro fluidic biochip is of immense importance in to days clinical diagnosis process. In this paper, we try to build a heuristic algorithm to simultaneously perform droplet routing and electrode actuation. The proposed method is capable of performing (i) droplet routing with minimal electrode usages in optimized routing completion time, and (ii) minimal number of control pin assignment on the routing path for successful droplet transportation. The proposed method is a co-optimization technique that finds the possible shortest path between the source and the target pair for a droplet and assigns control pins in an optimal manner to actuate the routing path. Intersection regions for multiple droplets are also assigned with pins in an efficient manner to avoid unnecessary mixing between several droplets. The proposed method is tested on various benchmarks and random test sets, and experimental results are quite encouraging.
{"title":"A Heuristic Method for Co-optimization of Pin Assignment and Droplet Routing in Digital Microfluidic Biochip","authors":"R. Mukherjee, H. Rahaman, I. Banerjee, T. Samanta, P. Dasgupta","doi":"10.1109/VLSID.2012.75","DOIUrl":"https://doi.org/10.1109/VLSID.2012.75","url":null,"abstract":"Design automation in Digital micro fluidic biochip is of immense importance in to days clinical diagnosis process. In this paper, we try to build a heuristic algorithm to simultaneously perform droplet routing and electrode actuation. The proposed method is capable of performing (i) droplet routing with minimal electrode usages in optimized routing completion time, and (ii) minimal number of control pin assignment on the routing path for successful droplet transportation. The proposed method is a co-optimization technique that finds the possible shortest path between the source and the target pair for a droplet and assigns control pins in an optimal manner to actuate the routing path. Intersection regions for multiple droplets are also assigned with pins in an efficient manner to avoid unnecessary mixing between several droplets. The proposed method is tested on various benchmarks and random test sets, and experimental results are quite encouraging.","PeriodicalId":405021,"journal":{"name":"2012 25th International Conference on VLSI Design","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116953682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Digital Subscriber Line (DSL) is a family of standards that allow existing twisted pair copper lines to carry modulated digital signals which uses telephone network and unused frequency Spectrum. In this tutorial we describe DSL (Digital Subscriber Line), comparing with POTS and ISDN. We explain the different DSL flavors available with the modulation techniques used and also discuss challenges in getting high performance and throughput, achieving xDSL rate and meeting the DSL standard. We begin by introducing DSL having a pair of modems CO (Central office) and CPE (Customer Premises Equipment) and talk about frequency spectrums. We explain the limitations of POTS (Plain Old Telephone System) with Dial-up Connection and ISDN (Integrated Services Digital Network), and advantages of DSL over POTS. Next we describe various flavors of DSL with their frequency spectrums, power enhancements, standardization, profiles, and band plans. We discuss different modulation techniques with their advantages and disadvantages, including Single carrier Modulation like CAP and QAM and Multi Carrier Modulation like DMT. We explain the interferers/noise: Line Attenuation, The channel attenuation, Bridged taps, Impulse noise, White noise, NEXT, FEXT, RE Interference. We conclude the tutorial with a description of full activation and initialization phases. The targeted audience for the tutorial is designers, developers, testers, people working in the area of VDSL, ADSL technology and People working on different platforms like DSLAM's, Central office, Customer premises equipment, Gateway products as well as products related to access network who are familiar with copper line and modems.
{"title":"Embedded Tutorial ET2: Digital Subscriber Line","authors":"M. K. K. Rao, V. ShanthaKumariP., B. Sellappan","doi":"10.1109/VLSID.2012.41","DOIUrl":"https://doi.org/10.1109/VLSID.2012.41","url":null,"abstract":"Digital Subscriber Line (DSL) is a family of standards that allow existing twisted pair copper lines to carry modulated digital signals which uses telephone network and unused frequency Spectrum. In this tutorial we describe DSL (Digital Subscriber Line), comparing with POTS and ISDN. We explain the different DSL flavors available with the modulation techniques used and also discuss challenges in getting high performance and throughput, achieving xDSL rate and meeting the DSL standard. We begin by introducing DSL having a pair of modems CO (Central office) and CPE (Customer Premises Equipment) and talk about frequency spectrums. We explain the limitations of POTS (Plain Old Telephone System) with Dial-up Connection and ISDN (Integrated Services Digital Network), and advantages of DSL over POTS. Next we describe various flavors of DSL with their frequency spectrums, power enhancements, standardization, profiles, and band plans. We discuss different modulation techniques with their advantages and disadvantages, including Single carrier Modulation like CAP and QAM and Multi Carrier Modulation like DMT. We explain the interferers/noise: Line Attenuation, The channel attenuation, Bridged taps, Impulse noise, White noise, NEXT, FEXT, RE Interference. We conclude the tutorial with a description of full activation and initialization phases. The targeted audience for the tutorial is designers, developers, testers, people working in the area of VDSL, ADSL technology and People working on different platforms like DSLAM's, Central office, Customer premises equipment, Gateway products as well as products related to access network who are familiar with copper line and modems.","PeriodicalId":405021,"journal":{"name":"2012 25th International Conference on VLSI Design","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126228679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper presents the implementation of a 3GPP standards compliant configurable turbo decoder on a GPU. The challenge in implementing a turbo decoder on a GPU is in suitably parallelizing the Log-MAP decoding algorithm and doing an architecture aware mapping of it on to the GPU. The approximations in parallelizing the Log-MAP algorithm come at the cost of reduced BER performance. To mitigate this reduction, different guarding mechanisms of varying computational complexity have been presented. The limited shared memory and registers available on GPUs are carefully allocated to obtain a high real-time decoding rate without requiring several independent data streams in parallel.
{"title":"GPU Implementation of a Programmable Turbo Decoder for Software Defined Radio Applications","authors":"Dhiraj Reddy Nallapa Yoge, N. Chandrachoodan","doi":"10.1109/VLSID.2012.62","DOIUrl":"https://doi.org/10.1109/VLSID.2012.62","url":null,"abstract":"This paper presents the implementation of a 3GPP standards compliant configurable turbo decoder on a GPU. The challenge in implementing a turbo decoder on a GPU is in suitably parallelizing the Log-MAP decoding algorithm and doing an architecture aware mapping of it on to the GPU. The approximations in parallelizing the Log-MAP algorithm come at the cost of reduced BER performance. To mitigate this reduction, different guarding mechanisms of varying computational complexity have been presented. The limited shared memory and registers available on GPUs are carefully allocated to obtain a high real-time decoding rate without requiring several independent data streams in parallel.","PeriodicalId":405021,"journal":{"name":"2012 25th International Conference on VLSI Design","volume":"129 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124232089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
High performance video standards use prediction techniques to achieve high picture quality at low bit rates. The type of prediction decides the bit rates and the image quality. Intra Prediction achieves high video quality with significant reduction in bit rate. This paper presents novel area optimized architecture for Intra prediction of H.264 decoding at HDTV resolution. The architecture has been validated on a Xilinx Virtex-5 FPGA based platform and achieved a frame rate of 64 fps. The architecture is based on multi-level memory hierarchy to reduce latency and ensure optimum resources utilization. It removes redundancy by reusing same functional blocks across different modes. The proposed architecture uses only 13% of the total LUTs available on the Xilinx FPGA XC5VLX50T.
{"title":"HD Resolution Intra Prediction Architecture for H.264 Decoder","authors":"Jimit Shah, K. S. Raghunandan, Kuruvilla Varghese","doi":"10.1109/VLSID.2012.55","DOIUrl":"https://doi.org/10.1109/VLSID.2012.55","url":null,"abstract":"High performance video standards use prediction techniques to achieve high picture quality at low bit rates. The type of prediction decides the bit rates and the image quality. Intra Prediction achieves high video quality with significant reduction in bit rate. This paper presents novel area optimized architecture for Intra prediction of H.264 decoding at HDTV resolution. The architecture has been validated on a Xilinx Virtex-5 FPGA based platform and achieved a frame rate of 64 fps. The architecture is based on multi-level memory hierarchy to reduce latency and ensure optimum resources utilization. It removes redundancy by reusing same functional blocks across different modes. The proposed architecture uses only 13% of the total LUTs available on the Xilinx FPGA XC5VLX50T.","PeriodicalId":405021,"journal":{"name":"2012 25th International Conference on VLSI Design","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131708822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Among different multi-gate transistors, Fin FETs and Trigate FETs have set themselves apart as the most promising candidates for the upcoming 22nm technology node and beyond owing to their superior device performance, lower leakage power consumption and cost-effective fabrication process. Innovative circuit design and optimization techniques will be required to harness the power of multi-gate transistors, which in turn will depend on accurate leakage and timing characterization of these devices under spatial and environmental variations. Hence, in order to aid circuit designers, we present accurate analytical models using central composite rotatable design (CCRD) based on response surface methodology (RSM) to estimate the leakage current in Fin FET standard cells under the effect of variations in gate length (LG), fin thickness (TSI), gate-oxide thickness (TOX) and gate-work function (ΦG). To the best of our knowledge, this is the first attempt to develop analytical models for leakage estimation of Fin FET devices/logic gates based on TCAD simulations of adjusted 2D device cross-sections that have been shown to track TCAD simulations of 3D device behavior within a 1-3% error range. This drastically reduces the CPU time of our modeling technique (by several orders of magnitude) without much loss in accuracy. We present analytical leakage models for different logic styles, e.g., shorted-gate (SG) and independent-gate (IG) Fin FETs, at the 22nm technology node. The leakage estimates derived from the analytical models are in close agreement with quasi-Monte Carlo (QMC) simulation results obtained for different adjusted-2D (3D) devices/logic gates with a maximum root mean square error (RMSE) of 5.28% (7.03%).
在不同的多栅极晶体管中,Fin fet和Trigate fet由于其优越的器件性能、更低的泄漏功耗和具有成本效益的制造工艺,已成为即将到来的22nm技术节点最有前途的候选者。创新的电路设计和优化技术将需要利用多栅极晶体管的功率,这反过来又取决于这些器件在空间和环境变化下的准确泄漏和定时特性。因此,为了帮助电路设计者,我们提出了基于响应面方法(RSM)的中心复合可旋转设计(CCRD)的精确分析模型,以估计在栅极长度(LG),鳍片厚度(TSI),栅极氧化厚度(TOX)和栅极功函数(ΦG)变化的影响下,Fin FET标准电池中的泄漏电流。据我们所知,这是基于调整后的2D器件横截面的TCAD模拟开发Fin FET器件/逻辑门泄漏估计的分析模型的第一次尝试,该模型已被证明可以在1-3%的误差范围内跟踪3D器件行为的TCAD模拟。这极大地减少了我们建模技术的CPU时间(减少了几个数量级),而精度没有很大的损失。我们提出了不同逻辑风格的解析泄漏模型,例如在22nm技术节点上的短门(SG)和独立门(IG) Fin fet。从分析模型中得出的泄漏估计与准蒙特卡罗(QMC)模拟结果非常吻合,不同的可调2d (3D)器件/逻辑门的最大均方根误差(RMSE)为5.28%(7.03%)。
{"title":"Accurate Leakage Estimation for FinFET Standard Cells Using the Response Surface Methodology","authors":"S. Chaudhuri, Prateek Mishra, N. Jha","doi":"10.1109/VLSID.2012.77","DOIUrl":"https://doi.org/10.1109/VLSID.2012.77","url":null,"abstract":"Among different multi-gate transistors, Fin FETs and Trigate FETs have set themselves apart as the most promising candidates for the upcoming 22nm technology node and beyond owing to their superior device performance, lower leakage power consumption and cost-effective fabrication process. Innovative circuit design and optimization techniques will be required to harness the power of multi-gate transistors, which in turn will depend on accurate leakage and timing characterization of these devices under spatial and environmental variations. Hence, in order to aid circuit designers, we present accurate analytical models using central composite rotatable design (CCRD) based on response surface methodology (RSM) to estimate the leakage current in Fin FET standard cells under the effect of variations in gate length (LG), fin thickness (TSI), gate-oxide thickness (TOX) and gate-work function (ΦG). To the best of our knowledge, this is the first attempt to develop analytical models for leakage estimation of Fin FET devices/logic gates based on TCAD simulations of adjusted 2D device cross-sections that have been shown to track TCAD simulations of 3D device behavior within a 1-3% error range. This drastically reduces the CPU time of our modeling technique (by several orders of magnitude) without much loss in accuracy. We present analytical leakage models for different logic styles, e.g., shorted-gate (SG) and independent-gate (IG) Fin FETs, at the 22nm technology node. The leakage estimates derived from the analytical models are in close agreement with quasi-Monte Carlo (QMC) simulation results obtained for different adjusted-2D (3D) devices/logic gates with a maximum root mean square error (RMSE) of 5.28% (7.03%).","PeriodicalId":405021,"journal":{"name":"2012 25th International Conference on VLSI Design","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120938921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}