首页 > 最新文献

2018 19th International Symposium on Quality Electronic Design (ISQED)最新文献

英文 中文
Energy-optimal dynamic voltage scaling in multicore platforms with reconfigurable power distribution network 基于可重构配电网的多核平台能量最优动态电压缩放
Pub Date : 2018-03-01 DOI: 10.1109/ISQED.2018.8357261
Juyeon Kim, Taewhan Kim
This work addresses a new problem of dynamic voltage scaling (DVS) in multicore platforms. We solve the multicore DVS problem, i.e., simultaneously scheduling execution of tasks assigned to cores and determining dynamically-varying voltage levels, with the objective of minimizing total energy consumption of the cores and voltage regulators (VRs) in the reconfigurable VR-to-core power distribution network (PDN) of platform while meeting the arrival/deadline constraint of tasks. Here, the key factors to be exploited for energy saving are (1) available voltage levels, (2) power conversion efficiency curve of VRs, and (3) turning on/off VRs. Specifically, we formulate the problem of task scheduling with the relation between factors 1, 2, and 3 into a linear programming problem and solve optimally in polynomial time.
这项工作解决了多核平台中动态电压缩放(DVS)的新问题。我们解决了多核分布式交换机问题,即同时调度分配给核心的任务的执行并确定动态变化的电压水平,目的是在满足任务到达/截止时间约束的情况下,在可重构的平台VR-to-core配电网络(PDN)中最小化核心和稳压器(VRs)的总能耗。在这里,节能的关键因素是(1)可用电压水平,(2)vr的功率转换效率曲线,(3)打开/关闭vr。具体来说,我们将具有因子1、2、3之间关系的任务调度问题化为一个线性规划问题,并在多项式时间内得到最优解。
{"title":"Energy-optimal dynamic voltage scaling in multicore platforms with reconfigurable power distribution network","authors":"Juyeon Kim, Taewhan Kim","doi":"10.1109/ISQED.2018.8357261","DOIUrl":"https://doi.org/10.1109/ISQED.2018.8357261","url":null,"abstract":"This work addresses a new problem of dynamic voltage scaling (DVS) in multicore platforms. We solve the multicore DVS problem, i.e., simultaneously scheduling execution of tasks assigned to cores and determining dynamically-varying voltage levels, with the objective of minimizing total energy consumption of the cores and voltage regulators (VRs) in the reconfigurable VR-to-core power distribution network (PDN) of platform while meeting the arrival/deadline constraint of tasks. Here, the key factors to be exploited for energy saving are (1) available voltage levels, (2) power conversion efficiency curve of VRs, and (3) turning on/off VRs. Specifically, we formulate the problem of task scheduling with the relation between factors 1, 2, and 3 into a linear programming problem and solve optimally in polynomial time.","PeriodicalId":213351,"journal":{"name":"2018 19th International Symposium on Quality Electronic Design (ISQED)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123549408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ultra-low swing CMOS transceiver for 2.5-D integrated systems 用于2.5 d集成系统的超低摆CMOS收发器
Pub Date : 2018-03-01 DOI: 10.1109/ISQED.2018.8357298
P. Mroszczyk, V. Pavlidis
This paper presents the design of a low swing transceiver for chip-to-chip communication in 2.5-D integrated systems using a passive interposer. High speed and low power operation is achieved through a new dynamic low swing tunable transmitter (DLST-TX) and inverter-based tunable receiver (INVT-RX) circuits. The novelty of the proposed solution lies in the digital trimming for PVT corners and random parameter variability allowing significant reduction of the voltage swing down to 120 mV with single ended signaling. The compensation method has negligible impact on the circuit performance and silicon area, not typically achievable by device geometry scaling. The proof-of-concept transceiver is implemented in a 65 nm CMOS technology and exhibits up to 4∗ higher energy efficiency at 1 Gb/s speed for 2.5 mm long chip-to-chip interconnect, as compared to state-of-the-art full swing communication schemes operating under the same conditions. The transceiver is suitable for parallel interfaces in 2.5-D integrated systems.
本文介绍了一种采用无源中介器的低摆幅收发器,用于2.5维集成系统的片对片通信。高速和低功耗的操作是通过一个新的动态低摆幅可调谐发射机(dst - tx)和基于逆变器的可调谐接收机(INVT-RX)电路实现的。该解决方案的新颖之处在于对PVT角和随机参数可变性进行了数字修剪,从而可以将单端信号的电压摆幅显著降低至120 mV。补偿方法对电路性能和硅面积的影响可以忽略不计,这通常不是通过器件几何缩放实现的。这款概念验证型收发器采用65纳米CMOS技术实现,在2.5 mm长片对片互连速度为1 Gb/s时,与在相同条件下运行的最先进的全波通信方案相比,其能量效率高达4 *。该收发器适用于2.5维集成系统中的并行接口。
{"title":"Ultra-low swing CMOS transceiver for 2.5-D integrated systems","authors":"P. Mroszczyk, V. Pavlidis","doi":"10.1109/ISQED.2018.8357298","DOIUrl":"https://doi.org/10.1109/ISQED.2018.8357298","url":null,"abstract":"This paper presents the design of a low swing transceiver for chip-to-chip communication in 2.5-D integrated systems using a passive interposer. High speed and low power operation is achieved through a new dynamic low swing tunable transmitter (DLST-TX) and inverter-based tunable receiver (INVT-RX) circuits. The novelty of the proposed solution lies in the digital trimming for PVT corners and random parameter variability allowing significant reduction of the voltage swing down to 120 mV with single ended signaling. The compensation method has negligible impact on the circuit performance and silicon area, not typically achievable by device geometry scaling. The proof-of-concept transceiver is implemented in a 65 nm CMOS technology and exhibits up to 4∗ higher energy efficiency at 1 Gb/s speed for 2.5 mm long chip-to-chip interconnect, as compared to state-of-the-art full swing communication schemes operating under the same conditions. The transceiver is suitable for parallel interfaces in 2.5-D integrated systems.","PeriodicalId":213351,"journal":{"name":"2018 19th International Symposium on Quality Electronic Design (ISQED)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130492252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Low cost and power CNN/deep learning solution for automated driving 用于自动驾驶的低成本和功耗CNN/深度学习解决方案
Pub Date : 2018-03-01 DOI: 10.1109/ISQED.2018.8357325
Mihir Mody, Kumar Desappan, P. Swami, Manu Mathew, S. Nagori
Automated driving functions, like highway driving and parking assist, are increasingly getting deployed in high-end cars with the ultimate goal of realizing self-driving car using Deep learning techniques like convolution neural network (CNN). For mass-market deployment, the embedded solution is required to address the right cost and performance envelope along with security and safety. In the case of automated driving, one of the key functionality is “finding drivable free space”, which is addressed using deep learning techniques like CNN. These CNN networks pose huge computing requirements in terms of hundreds of GOPS/TOPS (Giga or Tera operations per second), which seems beyond the capability of today's embedded SoC. This paper covers various techniques consisting of fixed-point conversion, sparse multiplication, fusing of layers and network pruning, for tailoring on the embedded solution. These techniques are implemented on the device by means of optimized Deep learning library for inference. The paper concludes by demonstrating the results of a CNN network running in real time on TI's TDA2X embedded platform producing a high-quality drivable space output for automated driving.
高速公路驾驶、停车辅助等自动驾驶功能越来越多地部署在高端汽车上,最终目标是利用卷积神经网络(CNN)等深度学习技术实现自动驾驶汽车。对于大众市场部署,嵌入式解决方案需要解决适当的成本和性能以及安全性问题。就自动驾驶而言,其中一个关键功能是“寻找可驾驶的自由空间”,这是通过CNN等深度学习技术来解决的。这些CNN网络提出了数以百计的GOPS/TOPS(每秒千兆或兆级操作)的巨大计算需求,这似乎超出了当今嵌入式SoC的能力。本文介绍了各种技术,包括不动点转换、稀疏乘法、层融合和网络修剪,以定制嵌入式解决方案。这些技术通过优化的深度学习推理库在设备上实现。论文最后展示了CNN网络在TI的TDA2X嵌入式平台上实时运行的结果,为自动驾驶产生了高质量的可驾驶空间输出。
{"title":"Low cost and power CNN/deep learning solution for automated driving","authors":"Mihir Mody, Kumar Desappan, P. Swami, Manu Mathew, S. Nagori","doi":"10.1109/ISQED.2018.8357325","DOIUrl":"https://doi.org/10.1109/ISQED.2018.8357325","url":null,"abstract":"Automated driving functions, like highway driving and parking assist, are increasingly getting deployed in high-end cars with the ultimate goal of realizing self-driving car using Deep learning techniques like convolution neural network (CNN). For mass-market deployment, the embedded solution is required to address the right cost and performance envelope along with security and safety. In the case of automated driving, one of the key functionality is “finding drivable free space”, which is addressed using deep learning techniques like CNN. These CNN networks pose huge computing requirements in terms of hundreds of GOPS/TOPS (Giga or Tera operations per second), which seems beyond the capability of today's embedded SoC. This paper covers various techniques consisting of fixed-point conversion, sparse multiplication, fusing of layers and network pruning, for tailoring on the embedded solution. These techniques are implemented on the device by means of optimized Deep learning library for inference. The paper concludes by demonstrating the results of a CNN network running in real time on TI's TDA2X embedded platform producing a high-quality drivable space output for automated driving.","PeriodicalId":213351,"journal":{"name":"2018 19th International Symposium on Quality Electronic Design (ISQED)","volume":"455 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133524297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
High performance training of deep neural networks using pipelined hardware acceleration and distributed memory 基于流水线硬件加速和分布式内存的深度神经网络高性能训练
Pub Date : 2018-03-01 DOI: 10.1109/ISQED.2018.8357317
Raghav Mehta, Yuyang Huang, Mingxi Cheng, S. Bagga, Nishant Mathur, Ji Li, J. Draper, Shahin Nazarian
Recently, Deep Neural Networks (DNNs) have made unprecedented progress in various tasks. However, there is a timely need to accelerate the training process in DNNs specifically for real-time applications that demand high performance, energy efficiency and compactness. Numerous algorithms have been proposed to improve the accuracy, however the network training process is computationally slow. In this paper, we present a scalable pipelined hardware architecture with distributed memories for a digital neuron to implement deep neural networks. We also explore various functions and algorithms as well as different memory topologies, to optimize the performance of our training architecture. The power, area, and delay of our proposed model are evaluated with respect to software implementation. Experimental results on the MNIST dataset demonstrate that compared with the software training, our proposed hardware-based approach for training process achieves 33X runtime reduction, 5X power reduction, and nearly 168X energy reduction.
近年来,深度神经网络(dnn)在各种任务中取得了前所未有的进展。然而,迫切需要加快深度神经网络的训练过程,特别是对于需要高性能、能效和紧凑性的实时应用。为了提高准确率,已经提出了许多算法,但是网络训练过程的计算速度很慢。在本文中,我们提出了一种可扩展的流水线硬件架构,具有分布式存储器,用于数字神经元实现深度神经网络。我们还探索了各种函数和算法以及不同的内存拓扑,以优化我们的训练架构的性能。我们提出的模型的功耗、面积和延迟在软件实现方面进行了评估。在MNIST数据集上的实验结果表明,与软件训练相比,我们提出的基于硬件的训练过程方法的运行时间减少了33倍,功耗降低了5倍,能耗降低了近168倍。
{"title":"High performance training of deep neural networks using pipelined hardware acceleration and distributed memory","authors":"Raghav Mehta, Yuyang Huang, Mingxi Cheng, S. Bagga, Nishant Mathur, Ji Li, J. Draper, Shahin Nazarian","doi":"10.1109/ISQED.2018.8357317","DOIUrl":"https://doi.org/10.1109/ISQED.2018.8357317","url":null,"abstract":"Recently, Deep Neural Networks (DNNs) have made unprecedented progress in various tasks. However, there is a timely need to accelerate the training process in DNNs specifically for real-time applications that demand high performance, energy efficiency and compactness. Numerous algorithms have been proposed to improve the accuracy, however the network training process is computationally slow. In this paper, we present a scalable pipelined hardware architecture with distributed memories for a digital neuron to implement deep neural networks. We also explore various functions and algorithms as well as different memory topologies, to optimize the performance of our training architecture. The power, area, and delay of our proposed model are evaluated with respect to software implementation. Experimental results on the MNIST dataset demonstrate that compared with the software training, our proposed hardware-based approach for training process achieves 33X runtime reduction, 5X power reduction, and nearly 168X energy reduction.","PeriodicalId":213351,"journal":{"name":"2018 19th International Symposium on Quality Electronic Design (ISQED)","volume":"28 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114359606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A study on NBTI-induced delay degradation considering stress frequency dependence 考虑应力频率依赖性的nbti延迟退化研究
Pub Date : 2018-03-01 DOI: 10.1109/ISQED.2018.8357296
Zuitoku Shin, Shumpei Morita, S. Bian, Michihiro Shintani, Masayuki Hiromoto, Takashi Sato
The degradation of transistors in integrated circuits is known to be dependent on stress frequency in addition to the well-known stress duty cycle. This paper analyzes the impact of frequency dependence of the NBTI degradation on a processor-scale circuit under various workload scenarios by using different levels of available information. A simple estimation for wire switching frequency from duty cycle is also proposed. Using real workloads running on MIPS processor, it is found that frequency dependency of the worst path delay is not large since there are many DC stress components independent of frequency. However, frequency dependency of path delay increases when DC component decreases due to execution of multiple applications.
除了众所周知的应力占空比外,集成电路中晶体管的退化还取决于应力频率。本文通过使用不同级别的可用信息,分析了不同工作负载场景下NBTI退化的频率依赖性对处理器级电路的影响。本文还提出了一种从占空比估计导线开关频率的简单方法。通过在MIPS处理器上运行的实际工作负载,发现由于存在许多与频率无关的直流应力分量,因此最坏路径延迟的频率依赖性不大。然而,由于多个应用程序的执行,当直流分量减少时,路径延迟的频率依赖性增加。
{"title":"A study on NBTI-induced delay degradation considering stress frequency dependence","authors":"Zuitoku Shin, Shumpei Morita, S. Bian, Michihiro Shintani, Masayuki Hiromoto, Takashi Sato","doi":"10.1109/ISQED.2018.8357296","DOIUrl":"https://doi.org/10.1109/ISQED.2018.8357296","url":null,"abstract":"The degradation of transistors in integrated circuits is known to be dependent on stress frequency in addition to the well-known stress duty cycle. This paper analyzes the impact of frequency dependence of the NBTI degradation on a processor-scale circuit under various workload scenarios by using different levels of available information. A simple estimation for wire switching frequency from duty cycle is also proposed. Using real workloads running on MIPS processor, it is found that frequency dependency of the worst path delay is not large since there are many DC stress components independent of frequency. However, frequency dependency of path delay increases when DC component decreases due to execution of multiple applications.","PeriodicalId":213351,"journal":{"name":"2018 19th International Symposium on Quality Electronic Design (ISQED)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122058108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Comparative study and prediction modeling of photonic ring Network on Chip architectures 基于芯片架构的光子环网络的比较研究与预测建模
Pub Date : 2018-03-01 DOI: 10.1109/ISQED.2018.8357275
Sara Karimi, Jelena Trajkovic
Multicore systems are becoming state-of-the-art and therefore need fast and energy efficient interconnects to take full advantage of the computational capabilities. Integration of silicon photonics with traditional electrical interconnect in Network on Chip (NoC) proposes a promising solution for overcoming the scalability issues of electrical interconnect. In this paper, we implement the simulation model for two Optical NoC architectures and compare their performance. We also derive and evaluate a prediction modeling technique for the design space exploration of ONoCs. Our proposed model accurately predicts packet latency, static and dynamic energy consumption of the network. This work specifically addresses the challenge of accurately estimating performance metrics without having to incur high costs of exhaustive simulations. Our case study shows that by using only 10% of the entire design space, our proposed technique builds a prediction model that achieved average error rates as low as 5.44%, 2.67% and 3.24% for network packet latency, static and dynamic energy consumption respectively in six different benchmarks from Splash-2 benchmark suite.
多核系统正在成为最先进的技术,因此需要快速和节能的互连,以充分利用计算能力。在片上网络(NoC)中,硅光子学与传统电互连的集成为克服电互连的可扩展性问题提供了一种很有前途的解决方案。在本文中,我们实现了两种光学NoC架构的仿真模型,并比较了它们的性能。我们还推导并评估了一种用于onoc设计空间探索的预测建模技术。我们提出的模型准确地预测了网络的数据包延迟、静态和动态能量消耗。这项工作特别解决了准确估计性能指标的挑战,而不必承担详尽模拟的高成本。我们的案例研究表明,仅使用整个设计空间的10%,我们提出的技术建立了一个预测模型,在来自Splash-2基准测试套件的六个不同基准测试中,网络数据包延迟、静态和动态能耗的平均错误率分别达到5.44%、2.67%和3.24%。
{"title":"Comparative study and prediction modeling of photonic ring Network on Chip architectures","authors":"Sara Karimi, Jelena Trajkovic","doi":"10.1109/ISQED.2018.8357275","DOIUrl":"https://doi.org/10.1109/ISQED.2018.8357275","url":null,"abstract":"Multicore systems are becoming state-of-the-art and therefore need fast and energy efficient interconnects to take full advantage of the computational capabilities. Integration of silicon photonics with traditional electrical interconnect in Network on Chip (NoC) proposes a promising solution for overcoming the scalability issues of electrical interconnect. In this paper, we implement the simulation model for two Optical NoC architectures and compare their performance. We also derive and evaluate a prediction modeling technique for the design space exploration of ONoCs. Our proposed model accurately predicts packet latency, static and dynamic energy consumption of the network. This work specifically addresses the challenge of accurately estimating performance metrics without having to incur high costs of exhaustive simulations. Our case study shows that by using only 10% of the entire design space, our proposed technique builds a prediction model that achieved average error rates as low as 5.44%, 2.67% and 3.24% for network packet latency, static and dynamic energy consumption respectively in six different benchmarks from Splash-2 benchmark suite.","PeriodicalId":213351,"journal":{"name":"2018 19th International Symposium on Quality Electronic Design (ISQED)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123015454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Extracting hardware assertions including word-level relations over multiple clock cycles 提取硬件断言,包括多个时钟周期的字级关系
Pub Date : 2018-03-01 DOI: 10.1109/ISQED.2018.8357295
Mami Miyamoto, K. Hamaguchi
Various mining approaches have been proposed for the automatic generation of temporal assertions from execution traces of hardware designs. These approaches can handle assertions based on LTL formulas or PSL, and many of them can represent word-level relations such as inequalities, additions, and so on. In the existing methods, however, such relations are searched only within a clock cycle. They cannot extract a property such that two values at inputs are added, and its result appears two clock cycles later at an output. We propose a method to extract relations over multiple clock cycles between variables as atomic propositions by analyzing execution traces and to generate assertions including the relations. Our method can also efficiently generate assertions by extracting frequent relations between atomic propositions over multiple clock cycles as propositions, that is, conjunctives of atomic propositions. The experimental results demonstrate the feasibility of the proposed method.
为了从硬件设计的执行轨迹自动生成时态断言,已经提出了各种挖掘方法。这些方法可以处理基于LTL公式或PSL的断言,其中许多方法可以表示单词级关系,如不等式、加法等。然而,在现有的方法中,这种关系只能在一个时钟周期内搜索。它们不能提取这样的属性,即在输入端添加两个值,其结果在两个时钟周期后的输出端出现。我们提出了一种方法,通过分析执行轨迹来提取变量之间多个时钟周期的关系作为原子命题,并生成包含这些关系的断言。我们的方法还可以通过在多个时钟周期内提取原子命题之间的频繁关系作为命题(即原子命题的合取词)来有效地生成断言。实验结果证明了该方法的可行性。
{"title":"Extracting hardware assertions including word-level relations over multiple clock cycles","authors":"Mami Miyamoto, K. Hamaguchi","doi":"10.1109/ISQED.2018.8357295","DOIUrl":"https://doi.org/10.1109/ISQED.2018.8357295","url":null,"abstract":"Various mining approaches have been proposed for the automatic generation of temporal assertions from execution traces of hardware designs. These approaches can handle assertions based on LTL formulas or PSL, and many of them can represent word-level relations such as inequalities, additions, and so on. In the existing methods, however, such relations are searched only within a clock cycle. They cannot extract a property such that two values at inputs are added, and its result appears two clock cycles later at an output. We propose a method to extract relations over multiple clock cycles between variables as atomic propositions by analyzing execution traces and to generate assertions including the relations. Our method can also efficiently generate assertions by extracting frequent relations between atomic propositions over multiple clock cycles as propositions, that is, conjunctives of atomic propositions. The experimental results demonstrate the feasibility of the proposed method.","PeriodicalId":213351,"journal":{"name":"2018 19th International Symposium on Quality Electronic Design (ISQED)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121410179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A 125mV 2ns-access-time 16Kb SRAM design based on a 6T hybrid TFET-FinFET cell 基于6T混合TFET-FinFET单元的125mV 2ns访问时间16Kb SRAM设计
Pub Date : 2018-03-01 DOI: 10.1109/ISQED.2018.8357301
H. Afzali-Kusha, A. Shafaei, M. Pedram
This paper proposes a robust and energy-efficient hybrid TFET-FinFET 6T SRAM cell which takes advantage of the higher ON/OFF current ratio of TFETs compared to that of FinFETs to reliably hold and access data at ultra-low supply voltages. More precisely, in the proposed hybrid cell, to achieve low static currents along with high noise margins, TFETs are used for cross-coupled inverters, and to speed up the access time, high-performance FinFETs are utilized for access transistors. The paper also presents a dual-Vt 6T SRAM, in which low-power (high-Vt) and high-performance (low-Vt) FinFETs are used for cross-coupled inverters and access transistors, respectively. For both SRAM cells, the Vdd boost read-assist technique is employed to improve the read stability. Characteristics of both SRAMs are analyzed using HSPICE simulations for technologies with the gate length of 20 nm for a 128times 128 SRAM array. Simulation results reveal that the lowest operating Vdd for the dual-Vt cell is 225 mV, whereas that of the hybrid cell is 125 mV. Moreover, to further decrease the access delay of the hybrid cell for 125 mV ≤ Vdd ≤ 225 mV, negative Gnd read-assist technique and a boosted voltage for the row decoder are used. Finally, the paper presents a 125mV 2ns-access-time 16Kb SRAM array based on the proposed hybrid TFET-FinFET SRAM cell.
本文提出了一种鲁棒和节能的混合TFET-FinFET 6T SRAM单元,它利用tfet比finfet更高的ON/OFF电流比,在超低电源电压下可靠地保持和访问数据。更准确地说,在提出的混合单元中,为了实现低静态电流和高噪声余量,将tfet用于交叉耦合逆变器,并且为了加快访问时间,将高性能finfet用于访问晶体管。本文还介绍了一种双vt 6T SRAM,其中低功率(高vt)和高性能(低vt) finfet分别用于交叉耦合逆变器和接入晶体管。对于这两个SRAM单元,采用了Vdd boost读辅助技术来提高读稳定性。对于栅极长度为20 nm的128 × 128 SRAM阵列,采用HSPICE模拟分析了这两种SRAM的特性。仿真结果表明,双vt电池的最低工作电压为225 mV,而混合电池的最低工作电压为125 mV。此外,为了进一步降低混合单元在125 mV≤Vdd≤225 mV时的接入延迟,采用了负Gnd读取辅助技术和行解码器升压。最后,本文提出了一个基于TFET-FinFET混合SRAM单元的125mV 2ns访问时间16Kb SRAM阵列。
{"title":"A 125mV 2ns-access-time 16Kb SRAM design based on a 6T hybrid TFET-FinFET cell","authors":"H. Afzali-Kusha, A. Shafaei, M. Pedram","doi":"10.1109/ISQED.2018.8357301","DOIUrl":"https://doi.org/10.1109/ISQED.2018.8357301","url":null,"abstract":"This paper proposes a robust and energy-efficient hybrid TFET-FinFET 6T SRAM cell which takes advantage of the higher ON/OFF current ratio of TFETs compared to that of FinFETs to reliably hold and access data at ultra-low supply voltages. More precisely, in the proposed hybrid cell, to achieve low static currents along with high noise margins, TFETs are used for cross-coupled inverters, and to speed up the access time, high-performance FinFETs are utilized for access transistors. The paper also presents a dual-Vt 6T SRAM, in which low-power (high-Vt) and high-performance (low-Vt) FinFETs are used for cross-coupled inverters and access transistors, respectively. For both SRAM cells, the Vdd boost read-assist technique is employed to improve the read stability. Characteristics of both SRAMs are analyzed using HSPICE simulations for technologies with the gate length of 20 nm for a 128times 128 SRAM array. Simulation results reveal that the lowest operating Vdd for the dual-Vt cell is 225 mV, whereas that of the hybrid cell is 125 mV. Moreover, to further decrease the access delay of the hybrid cell for 125 mV ≤ Vdd ≤ 225 mV, negative Gnd read-assist technique and a boosted voltage for the row decoder are used. Finally, the paper presents a 125mV 2ns-access-time 16Kb SRAM array based on the proposed hybrid TFET-FinFET SRAM cell.","PeriodicalId":213351,"journal":{"name":"2018 19th International Symposium on Quality Electronic Design (ISQED)","volume":"170 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123156287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Verification methodology to guarantee low routing resistance to well taps 验证方法,以保证较低的走线阻力
Pub Date : 2018-03-01 DOI: 10.1109/ISQED.2018.8357297
M. Fakhruddin, Kuok-Khian Lo, J. Karp, M. Hart, Min-Hsing P. Chen
The proposed verification methodology enables designers to meet a maximum resistance specification for the well taps routing. Key strengths of the flow are: automatic identification of both well taps and VDD/VSS grid; comparison of the extracted resistance to a user defined specification value; review of results with a graphical interface; no marker layers to identify the extraction path.
所提出的验证方法使设计人员能够满足井口走向的最大阻力规范。该流的主要优势是:自动识别井的抽头和VDD/VSS网格;将所提取的电阻与用户定义的规格值进行比较;用图形界面审查结果;没有标记层来识别提取路径。
{"title":"Verification methodology to guarantee low routing resistance to well taps","authors":"M. Fakhruddin, Kuok-Khian Lo, J. Karp, M. Hart, Min-Hsing P. Chen","doi":"10.1109/ISQED.2018.8357297","DOIUrl":"https://doi.org/10.1109/ISQED.2018.8357297","url":null,"abstract":"The proposed verification methodology enables designers to meet a maximum resistance specification for the well taps routing. Key strengths of the flow are: automatic identification of both well taps and VDD/VSS grid; comparison of the extracted resistance to a user defined specification value; review of results with a graphical interface; no marker layers to identify the extraction path.","PeriodicalId":213351,"journal":{"name":"2018 19th International Symposium on Quality Electronic Design (ISQED)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127801198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing circuit operation using analog floating gates 利用模拟浮门增强电路操作
Pub Date : 2018-03-01 DOI: 10.1109/ISQED.2018.8357291
Ujas Patel, S. Nimmalapudi, H. Stiegler, A. Marshall, Keith Jarreau
Consideration of random device mismatch is an important factor in the design of high performance analog circuits. Floating gate transistors have long been used for digital nonvolatile memory applications (such as flash memory), but a variant to this technology using an “analog floating gate” allows for higher precision programming. Analog floating gate devices can address mismatch observed in small geometry analog circuits. Enabling smaller devices to be used allows lower operating currents and higher frequencies. This property is exploited here to compensate for input mismatch and device parameter variations in an Operational Transconductance Amplifier.
考虑随机器件失配是设计高性能模拟电路的一个重要因素。浮栅晶体管长期以来一直用于数字非易失性存储器应用(如闪存),但这种技术的一种变体使用“模拟浮栅”允许更高精度的编程。模拟浮门器件可以解决小几何模拟电路中观察到的失配问题。使更小的设备使用允许更低的工作电流和更高的频率。这个特性在这里被用来补偿运算跨导放大器的输入失配和器件参数变化。
{"title":"Enhancing circuit operation using analog floating gates","authors":"Ujas Patel, S. Nimmalapudi, H. Stiegler, A. Marshall, Keith Jarreau","doi":"10.1109/ISQED.2018.8357291","DOIUrl":"https://doi.org/10.1109/ISQED.2018.8357291","url":null,"abstract":"Consideration of random device mismatch is an important factor in the design of high performance analog circuits. Floating gate transistors have long been used for digital nonvolatile memory applications (such as flash memory), but a variant to this technology using an “analog floating gate” allows for higher precision programming. Analog floating gate devices can address mismatch observed in small geometry analog circuits. Enabling smaller devices to be used allows lower operating currents and higher frequencies. This property is exploited here to compensate for input mismatch and device parameter variations in an Operational Transconductance Amplifier.","PeriodicalId":213351,"journal":{"name":"2018 19th International Symposium on Quality Electronic Design (ISQED)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114086425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2018 19th International Symposium on Quality Electronic Design (ISQED)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1