首页 > 最新文献

2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)最新文献

英文 中文
Self Timed SRAM Array with Enhanced low Voltage Read and Write Capability 具有增强低电压读写能力的自定时SRAM阵列
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00117
Prasad Vernekar, Nithin Kumar Yernad Balachandra, V. M. Harishchandra
The scaling of technology resulted in shrinking of device sizes, however suffers from threshold variation. These variations impact the performance and stability of static random access memory (SRAM). The proposed work takes the advantage of the conventional Replica-Bit-Line (RBL) technique to design a self-timed SRAM control circuit which keeps track of timing variations and generated control signals. A variable wordline scheme is proposed to ensure successful read and write operation of SRAM at low voltages. The 2 Kb (2048 bits) SRAM designed in 65 nm UMC technology, operates with clock frequency reaching upto 1 GHz at the nominal supply voltage of 1.2 V and at 50 MHz with the supply voltage of 0.58 V. The variations in the Sense Amplifier Enable (SAE) generation are reduced upto 92% compared to the conventional RBL technique.
技术的规模化导致了设备尺寸的缩小,但也受到阈值变化的影响。这些变化会影响静态随机存取存储器(SRAM)的性能和稳定性。本文利用传统的复制位线(RBL)技术设计了一种自定时SRAM控制电路,该电路可以跟踪时间变化并产生控制信号。提出了一种可变字线方案,以保证SRAM在低电压下能够成功地进行读写操作。采用65纳米UMC技术设计的2kb(2048位)SRAM,在标称电源电压为1.2 V时时钟频率最高可达1 GHz,在50 MHz时电源电压为0.58 V。与传统的RBL技术相比,感应放大器使能(SAE)产生的变化减少了92%。
{"title":"Self Timed SRAM Array with Enhanced low Voltage Read and Write Capability","authors":"Prasad Vernekar, Nithin Kumar Yernad Balachandra, V. M. Harishchandra","doi":"10.1109/ISVLSI.2019.00117","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00117","url":null,"abstract":"The scaling of technology resulted in shrinking of device sizes, however suffers from threshold variation. These variations impact the performance and stability of static random access memory (SRAM). The proposed work takes the advantage of the conventional Replica-Bit-Line (RBL) technique to design a self-timed SRAM control circuit which keeps track of timing variations and generated control signals. A variable wordline scheme is proposed to ensure successful read and write operation of SRAM at low voltages. The 2 Kb (2048 bits) SRAM designed in 65 nm UMC technology, operates with clock frequency reaching upto 1 GHz at the nominal supply voltage of 1.2 V and at 50 MHz with the supply voltage of 0.58 V. The variations in the Sense Amplifier Enable (SAE) generation are reduced upto 92% compared to the conventional RBL technique.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"46 1","pages":"627-631"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78169814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Distributed Pulse Rotary Traveling Wave VCO: Architecture and Design 分布式脉冲旋转行波压控振荡器:结构与设计
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00051
Prashansa Mukim, Aditya Dalakoti, D. McCarthy, Brandon Pon, Carrie Segal, Merritt Miller, J. Buckwalter, F. Brewer
This paper describes the architecture and design of pulse rotary traveling wave voltage controlled oscillators that preserve wave shape, and thus wave harmonics using non-linear amplification. These oscillators can provide multiple low dutycycle clock phases and architectural modifications can allow for the same clock phase to be present at multiple physical locations. A design fabricated in GFUS 130nm (8RF) technology operates at 5.32 GHz with a 10 MHz offset phase noise of -128.15 dBc/Hz at 45.4 mW while generating 12 driven phase outputs with 15.66 ps phase resolution and less than 500 fs cycle-to-cycle jitter. It can be coarse or fine tuned within a frequency range of 4.35 GHz to 5.4 GHz with KV CO of 1.7 GHz/V and 470 MHz/V respectively. The start-up mechanism of the oscillator minimizes transmission line reflections and allows maintenance of the traveling wave shape, yielding an average 3 dB figure of merit improvement over existing designs.
本文介绍了脉冲旋转行波压控振荡器的结构和设计,该振荡器采用非线性放大技术来保持波形,从而保持波的谐波。这些振荡器可以提供多个低占空比时钟相位,并且结构修改可以允许在多个物理位置出现相同的时钟相位。采用GFUS 130nm (8RF)技术制作的设计工作频率为5.32 GHz, 45.4 mW时10 MHz偏置相位噪声为-128.15 dBc/Hz,同时产生12个驱动相位输出,相位分辨率为15.66 ps,周期间抖动小于500 fs。可在4.35 GHz ~ 5.4 GHz的频率范围内进行粗调或精调,KV CO分别为1.7 GHz/V和470 MHz/V。振荡器的启动机制最大限度地减少了传输线反射,并允许保持行波形状,与现有设计相比,产生平均3 dB的性能改进。
{"title":"Distributed Pulse Rotary Traveling Wave VCO: Architecture and Design","authors":"Prashansa Mukim, Aditya Dalakoti, D. McCarthy, Brandon Pon, Carrie Segal, Merritt Miller, J. Buckwalter, F. Brewer","doi":"10.1109/ISVLSI.2019.00051","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00051","url":null,"abstract":"This paper describes the architecture and design of pulse rotary traveling wave voltage controlled oscillators that preserve wave shape, and thus wave harmonics using non-linear amplification. These oscillators can provide multiple low dutycycle clock phases and architectural modifications can allow for the same clock phase to be present at multiple physical locations. A design fabricated in GFUS 130nm (8RF) technology operates at 5.32 GHz with a 10 MHz offset phase noise of -128.15 dBc/Hz at 45.4 mW while generating 12 driven phase outputs with 15.66 ps phase resolution and less than 500 fs cycle-to-cycle jitter. It can be coarse or fine tuned within a frequency range of 4.35 GHz to 5.4 GHz with KV CO of 1.7 GHz/V and 470 MHz/V respectively. The start-up mechanism of the oscillator minimizes transmission line reflections and allows maintenance of the traveling wave shape, yielding an average 3 dB figure of merit improvement over existing designs.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"18 1","pages":"235-240"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89546675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MRAM-Based Stochastic Oscillators for Adaptive Non-Uniform Sampling of Sparse Signals in IoT Applications 物联网应用中基于mram的稀疏信号自适应非均匀采样随机振荡器
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00079
Soheil Salehi, Alireza Zaeemzadeh, Adrian Tatulian, N. Rahnavard, R. Demara
Recent advances to hardware integration and realization of highly-efficient Compressive Sensing (CS) approaches have inspired novel circuit and architectural-level approaches. These embrace the challenge to design more optimal nonuniform CS solutions that consider device-level constraints for IoT applications wherein lifetime energy, device area, and manufacturing costs are highly-constrained, but meanwhile, the sensing environment is rapidly changing. In this manuscript, we develop a novel adaptive hardware-based approach for non-uniform compressive sampling of sparse and time-varying signals. The proposed Adaptive Sampling of Sparse IoT signals via STochastic-oscillators (ASSIST) approach intelligently generates the CS measurement matrix by distributing the sensing energy among coefficients by considering the signal characteristics such as sparsity rate and noise level obtained in the previous time step. In our proposed approach, Magnetic Random Access Memory (MRAM)-based stochastic oscillators are utilized to generate the random bitstreams used in the CS measurement matrix. SPICE and MATLAB circuit-algorithm simulation results indicate that ASSIST efficiently achieves the desired non-uniform recovery of the original signals with varying sparsity rates and noise levels.
硬件集成和高效压缩感知(CS)方法的最新进展激发了新的电路和架构级方法。这些挑战包括设计更优的非统一CS解决方案,考虑到物联网应用的设备级限制,其中生命周期能量,设备面积和制造成本受到高度限制,但同时,传感环境正在迅速变化。在本文中,我们开发了一种新的基于硬件的自适应方法,用于稀疏和时变信号的非均匀压缩采样。本文提出的基于随机振荡器的物联网稀疏信号自适应采样(ASSIST)方法,通过考虑前一时间步长获得的信号稀疏率和噪声水平等特征,将感知能量分配到系数中,智能地生成CS测量矩阵。在我们提出的方法中,利用基于磁随机存取存储器(MRAM)的随机振荡器来生成CS测量矩阵中使用的随机比特流。SPICE和MATLAB电路算法仿真结果表明,ASSIST有效地实现了原始信号在不同稀疏率和噪声水平下的非均匀恢复。
{"title":"MRAM-Based Stochastic Oscillators for Adaptive Non-Uniform Sampling of Sparse Signals in IoT Applications","authors":"Soheil Salehi, Alireza Zaeemzadeh, Adrian Tatulian, N. Rahnavard, R. Demara","doi":"10.1109/ISVLSI.2019.00079","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00079","url":null,"abstract":"Recent advances to hardware integration and realization of highly-efficient Compressive Sensing (CS) approaches have inspired novel circuit and architectural-level approaches. These embrace the challenge to design more optimal nonuniform CS solutions that consider device-level constraints for IoT applications wherein lifetime energy, device area, and manufacturing costs are highly-constrained, but meanwhile, the sensing environment is rapidly changing. In this manuscript, we develop a novel adaptive hardware-based approach for non-uniform compressive sampling of sparse and time-varying signals. The proposed Adaptive Sampling of Sparse IoT signals via STochastic-oscillators (ASSIST) approach intelligently generates the CS measurement matrix by distributing the sensing energy among coefficients by considering the signal characteristics such as sparsity rate and noise level obtained in the previous time step. In our proposed approach, Magnetic Random Access Memory (MRAM)-based stochastic oscillators are utilized to generate the random bitstreams used in the CS measurement matrix. SPICE and MATLAB circuit-algorithm simulation results indicate that ASSIST efficiently achieves the desired non-uniform recovery of the original signals with varying sparsity rates and noise levels.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"28 1","pages":"403-408"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89941504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Hardware Implementation of Improved Fast-SSC-Flip Decoder for Polar Codes 改进的快速ssc翻转译码器的硬件实现
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00109
Jing Zeng, Yangcan Zhou, Jun Lin, Zhongfeng Wang
Polar code has gained worldwide recognition due to its capacity-achieving property. The fast simplified successive cancellation flip (Fast-SSC-Flip) algorithm improves the frame error rate (FER) performance of successive cancellation (SC) decoders with extra metric computations. However, the metric includes exponential operations, which is inefficient to implement. In this work, one shift operation and one addition are employed to approximate the exponential operations with negligible performance loss. Besides, the metric computations of more constituent codes (Type-I to Type-V) are derived that the complexity of Fast-SSC-Flip is further reduced. Simulation results show that the FER performance of the improved Fast-SSC-Flip algorithm with list size λ = 8 outperforms that of the Fast-SSC algorithm by 0.7dB. Moreover, based on the proposed algorithm, an efficient hardware architecture is first developed. It uses less look up tables and achieves a maximum throughput of 140.6 Mbps at high SNR on the Altera Stratix IV EP4SGX530KH40C2 FPGA. Compared with the state of art Fast-SSC implementation, the proposed architecture has a higher resource utilization efficiency (throughput per look-up tables). Under the 28 nm CMOS technology, the throughput can reach up to 483.8 Mbps at a clock frequency of 704 MHz.
Polar码由于其容量实现特性而获得了全世界的认可。快速简化连续抵消翻转(fast - ssc - flip)算法通过额外的度量计算提高了连续抵消解码器的帧误码率性能。然而,度量包括指数运算,实现起来效率很低。在这项工作中,使用一个移位运算和一个加法运算来近似指数运算,性能损失可以忽略不计。此外,推导了更多组成码(Type-I到Type-V)的度量计算,进一步降低了Fast-SSC-Flip的复杂度。仿真结果表明,当链表大小λ = 8时,改进的Fast-SSC- flip算法的FER性能比Fast-SSC算法高0.7dB。在此基础上,提出了一种高效的硬件结构。它使用较少的查找表,并在Altera Stratix IV EP4SGX530KH40C2 FPGA上实现高信噪比下的最大吞吐量140.6 Mbps。与现有的Fast-SSC实现相比,所提出的体系结构具有更高的资源利用效率(每个查找表的吞吐量)。在28纳米CMOS技术下,时钟频率为704 MHz时的吞吐量可达483.8 Mbps。
{"title":"Hardware Implementation of Improved Fast-SSC-Flip Decoder for Polar Codes","authors":"Jing Zeng, Yangcan Zhou, Jun Lin, Zhongfeng Wang","doi":"10.1109/ISVLSI.2019.00109","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00109","url":null,"abstract":"Polar code has gained worldwide recognition due to its capacity-achieving property. The fast simplified successive cancellation flip (Fast-SSC-Flip) algorithm improves the frame error rate (FER) performance of successive cancellation (SC) decoders with extra metric computations. However, the metric includes exponential operations, which is inefficient to implement. In this work, one shift operation and one addition are employed to approximate the exponential operations with negligible performance loss. Besides, the metric computations of more constituent codes (Type-I to Type-V) are derived that the complexity of Fast-SSC-Flip is further reduced. Simulation results show that the FER performance of the improved Fast-SSC-Flip algorithm with list size λ = 8 outperforms that of the Fast-SSC algorithm by 0.7dB. Moreover, based on the proposed algorithm, an efficient hardware architecture is first developed. It uses less look up tables and achieves a maximum throughput of 140.6 Mbps at high SNR on the Altera Stratix IV EP4SGX530KH40C2 FPGA. Compared with the state of art Fast-SSC implementation, the proposed architecture has a higher resource utilization efficiency (throughput per look-up tables). Under the 28 nm CMOS technology, the throughput can reach up to 483.8 Mbps at a clock frequency of 704 MHz.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"88 1","pages":"580-585"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85633660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Enabling Microarchitectural Randomization in Serialized AES Implementations to Mitigate Side Channel Susceptibility 在序列化AES实现中启用微架构随机化以减轻侧信道易感性
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00064
S. Dhanuskodi, Daniel E. Holcomb
Highly serialized implementations of the AES block cipher are used in lightweight applications where low area and low power are the primary concerns. Security of these lightweight designs becomes increasingly critical on resource-constrained devices in the Internet of Things era. The AES algorithm does not have any significant known cryptanalytic weaknesses, but keys can often be extracted by attacking implementation weaknesses using side channel information leakage or fault injection. Highly serialized AES implementations compute on individual bytes/words of data in each cycle which leaves them especially sensitive to side channel key extraction because there is less overall power consumption to obscure side channel leakages. In this work, we present an efficient AES microarchitecture that randomizes sub-round operations and reduces susceptibility to power side channel attacks. The architecture we propose is compatible with, and complementary to, all existing circuit-level side channel countermeasures. We design an 8-bit AES architecture in a commercial 16nm FinFET technology and observe an order of magnitude improvement in side channel protection at a cost of 36% more area and 25% more energy per encryption. Testchip measurement shows 0.93pJ/bit energy consumption at 10MHz.
AES分组密码的高度序列化实现用于主要关注低面积和低功耗的轻量级应用程序。在物联网时代,这些轻量级设计的安全性在资源受限的设备上变得越来越重要。AES算法没有任何重大的已知密码分析弱点,但通常可以通过使用侧信道信息泄漏或故障注入攻击实现弱点来提取密钥。高度序列化的AES实现在每个周期中对单个字节/字的数据进行计算,这使得它们对侧信道密钥提取特别敏感,因为用于掩盖侧信道泄漏的总功耗更小。在这项工作中,我们提出了一种高效的AES微架构,可以随机化次轮操作并降低对功率侧信道攻击的易感性。我们提出的架构与所有现有的电路级侧信道对抗兼容并互补。我们在商用16nm FinFET技术中设计了一个8位AES架构,并观察到在侧信道保护方面的数量级改进,每次加密的成本增加了36%的面积和25%的能量。测试芯片测量显示10MHz时0.93pJ/bit的能量消耗。
{"title":"Enabling Microarchitectural Randomization in Serialized AES Implementations to Mitigate Side Channel Susceptibility","authors":"S. Dhanuskodi, Daniel E. Holcomb","doi":"10.1109/ISVLSI.2019.00064","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00064","url":null,"abstract":"Highly serialized implementations of the AES block cipher are used in lightweight applications where low area and low power are the primary concerns. Security of these lightweight designs becomes increasingly critical on resource-constrained devices in the Internet of Things era. The AES algorithm does not have any significant known cryptanalytic weaknesses, but keys can often be extracted by attacking implementation weaknesses using side channel information leakage or fault injection. Highly serialized AES implementations compute on individual bytes/words of data in each cycle which leaves them especially sensitive to side channel key extraction because there is less overall power consumption to obscure side channel leakages. In this work, we present an efficient AES microarchitecture that randomizes sub-round operations and reduces susceptibility to power side channel attacks. The architecture we propose is compatible with, and complementary to, all existing circuit-level side channel countermeasures. We design an 8-bit AES architecture in a commercial 16nm FinFET technology and observe an order of magnitude improvement in side channel protection at a cost of 36% more area and 25% more energy per encryption. Testchip measurement shows 0.93pJ/bit energy consumption at 10MHz.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"20 1","pages":"314-319"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85935098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Minimization of Flare in EUVL by Simultaneous Wire Segment Perturbation and Dummification 同时线段摄动和伪化的EUVL中耀斑最小化
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00047
S. Paul, Pritha Banerjee, S. Sur-Kolay
Extreme Ultraviolet Lithography (EUVL) suffers from pattern distortion defects caused by flare, which is irregular reflection from the surface of the mask used. While techniques based on (a) dummification and (b) perturbation of wire segments have reduced flare notably, each has its own merits and demerits. Unlike earlier works where each method for flare reduction is applied independently, in this paper we extensively study the effects on flare and the amount of dummy fills required by simultaneous dummification and wire segment perturbation using an Integer Linear Programming (ILP) based formulation in a multilevel framework at the post-routing stage. Experimental results show an average reduction of maximum flare level by 29% compared to that in the initial routed layout. In addition to that, an average reduction of maximum flare by 19% is observed compared to the method of wire perturbation alone. Moreover, in our method 35% reduction in dummy requirement on average is achieved compared to the application of dummification technique alone for the reduction of flare.
极紫外光刻(EUVL)存在由光斑引起的图案畸变缺陷,这是由所用掩模表面的不规则反射引起的。虽然基于(a)伪化和(b)导线段扰动的技术显著减少了耀斑,但每种技术都有自己的优点和缺点。与早期的工作不同,每种减少耀斑的方法都是独立应用的,在本文中,我们使用基于整数线性规划(ILP)的公式,在路由后阶段的多层框架中广泛研究了同时伪化和线段扰动对耀斑的影响以及所需的虚拟填充量。实验结果表明,与初始路由布局相比,最大耀斑电平平均降低了29%。除此之外,与单独的线摄动方法相比,观察到最大耀斑平均减少19%。此外,在我们的方法中,与单独应用伪化技术相比,平均降低了35%的伪要求,以减少耀斑。
{"title":"Minimization of Flare in EUVL by Simultaneous Wire Segment Perturbation and Dummification","authors":"S. Paul, Pritha Banerjee, S. Sur-Kolay","doi":"10.1109/ISVLSI.2019.00047","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00047","url":null,"abstract":"Extreme Ultraviolet Lithography (EUVL) suffers from pattern distortion defects caused by flare, which is irregular reflection from the surface of the mask used. While techniques based on (a) dummification and (b) perturbation of wire segments have reduced flare notably, each has its own merits and demerits. Unlike earlier works where each method for flare reduction is applied independently, in this paper we extensively study the effects on flare and the amount of dummy fills required by simultaneous dummification and wire segment perturbation using an Integer Linear Programming (ILP) based formulation in a multilevel framework at the post-routing stage. Experimental results show an average reduction of maximum flare level by 29% compared to that in the initial routed layout. In addition to that, an average reduction of maximum flare by 19% is observed compared to the method of wire perturbation alone. Moreover, in our method 35% reduction in dummy requirement on average is achieved compared to the application of dummification technique alone for the reduction of flare.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"8 1","pages":"212-217"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73575290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Aging Analysis of Low Dropout Regulator for Universal Recycled IC Detection 通用回收IC检测用低差稳压器的老化分析
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00113
Sreeja Chowdhury, Haoting Shen, Beomsoo Park, N. Maghari, Domenic Forte
Recycled counterfeit integrated circuits (ICs) are previously used chips that originate from improperly disposed electronics (i.e., e-waste) and are then sold in the market as new. Recycled ICs are dangerous and prone to early failure due to mishandling and deterioration from prior use. With the number of IoT devices expected to reach 125 billion in 2030, ewaste and therefore recycled ICs will face a commensurate rise. Although recycled IC detection and avoidance methods have been emerging over the last decade, there still lacks an all-inone solution. Previously, we discovered low dropout regulator (LDO) aging can be detected with high confidence by measuring the LDO's power supply rejection ratio (PSRR). Since LDOs are embedded in the power management circuitry of virtually all ICs, our method could be applied to avoid recycled chips in any IoT setup, thus bolstering IoT security. Since commercial LDO designs are proprietary to individual design houses, it is difficult to determine the most important sources of degradation. Thus, in order to study this phenomenon further, we have fabricated a custom design generic LDO in 65nm process. Our analysis in this paper reveals ways to improve counterfeit IC detection based on LDOs as well as to develop LDO designs that are even more sensitive to aging/use.
回收的假冒集成电路(ic)是以前使用过的芯片,这些芯片来自处理不当的电子产品(即电子垃圾),然后作为新的在市场上出售。回收的集成电路是危险的,由于处理不当和先前使用的退化,容易出现早期故障。到2030年,物联网设备的数量预计将达到1250亿,电子垃圾和回收ic将面临相应的增长。虽然在过去的十年中已经出现了回收IC检测和避免方法,但仍然缺乏一种一体化的解决方案。之前,我们发现通过测量LDO的电源抑制比(PSRR)可以高置信度地检测低差稳压器(LDO)的老化。由于ldo嵌入在几乎所有ic的电源管理电路中,因此我们的方法可以用于避免在任何物联网设置中回收芯片,从而增强物联网安全性。由于商业LDO设计是由个别设计公司专有的,因此很难确定最重要的降解来源。因此,为了进一步研究这一现象,我们在65nm工艺中制造了定制设计的通用LDO。我们在本文中的分析揭示了改进基于LDO的假冒IC检测以及开发对老化/使用更敏感的LDO设计的方法。
{"title":"Aging Analysis of Low Dropout Regulator for Universal Recycled IC Detection","authors":"Sreeja Chowdhury, Haoting Shen, Beomsoo Park, N. Maghari, Domenic Forte","doi":"10.1109/ISVLSI.2019.00113","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00113","url":null,"abstract":"Recycled counterfeit integrated circuits (ICs) are previously used chips that originate from improperly disposed electronics (i.e., e-waste) and are then sold in the market as new. Recycled ICs are dangerous and prone to early failure due to mishandling and deterioration from prior use. With the number of IoT devices expected to reach 125 billion in 2030, ewaste and therefore recycled ICs will face a commensurate rise. Although recycled IC detection and avoidance methods have been emerging over the last decade, there still lacks an all-inone solution. Previously, we discovered low dropout regulator (LDO) aging can be detected with high confidence by measuring the LDO's power supply rejection ratio (PSRR). Since LDOs are embedded in the power management circuitry of virtually all ICs, our method could be applied to avoid recycled chips in any IoT setup, thus bolstering IoT security. Since commercial LDO designs are proprietary to individual design houses, it is difficult to determine the most important sources of degradation. Thus, in order to study this phenomenon further, we have fabricated a custom design generic LDO in 65nm process. Our analysis in this paper reveals ways to improve counterfeit IC detection based on LDOs as well as to develop LDO designs that are even more sensitive to aging/use.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"49 1","pages":"604-609"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86704120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Machine Learning-Based Processor Adaptability Targeting Energy, Performance, and Reliability 基于机器学习的处理器适应性:能量、性能和可靠性
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00037
A. L. Sartor, P. H. E. Becker, Stephan Wong, R. Marculescu, A. C. S. Beck
Adaptive processors can dynamically change their hardware configuration by tuning several knobs that optimize a given metric, according to the current application. However, the complexity of choosing the best setup at runtime increases exponentially as more adaptive resources become available. Therefore, we propose a polymorphic VLIW processor coupled to a machine learning-based decision mechanism that quickly and accurately delivers the best trade-off in terms of energy, performance, and reliability. The proposed system predicts the best processor configuration in 97.37% of the test cases and achieves an efficiency that is close to an oracle (more than 93.30% on all benchmarks).
自适应处理器可以根据当前的应用程序,通过调整几个旋钮来优化给定的指标,从而动态地改变它们的硬件配置。然而,在运行时选择最佳设置的复杂性随着更多的自适应资源的可用性呈指数增长。因此,我们提出了一种多态VLIW处理器,结合基于机器学习的决策机制,快速准确地在能源、性能和可靠性方面提供最佳权衡。提出的系统在97.37%的测试用例中预测出最佳的处理器配置,并实现了接近oracle的效率(在所有基准测试中超过93.30%)。
{"title":"Machine Learning-Based Processor Adaptability Targeting Energy, Performance, and Reliability","authors":"A. L. Sartor, P. H. E. Becker, Stephan Wong, R. Marculescu, A. C. S. Beck","doi":"10.1109/ISVLSI.2019.00037","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00037","url":null,"abstract":"Adaptive processors can dynamically change their hardware configuration by tuning several knobs that optimize a given metric, according to the current application. However, the complexity of choosing the best setup at runtime increases exponentially as more adaptive resources become available. Therefore, we propose a polymorphic VLIW processor coupled to a machine learning-based decision mechanism that quickly and accurately delivers the best trade-off in terms of energy, performance, and reliability. The proposed system predicts the best processor configuration in 97.37% of the test cases and achieves an efficiency that is close to an oracle (more than 93.30% on all benchmarks).","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"12 1","pages":"158-163"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83889576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Multi-phase Time-to-Digital Converter Differential Vernier Ring Oscillato 多相时间-数字转换器差分游标环振荡器
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00069
A. Annagrebah, E. Bechetoille, I. Laktineh, H. Chanal, P. Russo, H. Mathez
is paper reports the development of an adjustable, Time-to-Digital Converter (TDC) based on two vernier Ring Oscillators (RO). The TDC aims to measure timing in Resistive Plate Chamber (RPC) detector for CMS experiment. Considering previous designs, the contribution from power supply noise and intrinsic transistor noise had been minimizing with differential stages and proper transistor sizing. In order to reduce the dead time inherent to Vernier TDC architecture, as many Phase Detector (PD) as possible had been implemented. Such functionality permits to choose whether reducing the dead time or measuring redundantly the start-stop time difference for an improved resolution. The 81-PD Matrix is the originality of the design. Indeed, the information of the start-stop time is present at each inverter output with a predictable offset in term of time. The inverter architecture introduces a constant delay at each stage of the chain with phase inversion. By recording the counter when a phase detection occurs at each inverter output permitted to revert back to the start-stop time. The prototype TDC fabricated in a 130-nm technology consumes 8.5 mW power under 1.2-V supply. The measurement of this chip shown a timing accuracy of 5.48 ps at a timing resolution of 8 ps for the first data allowed by the first phase detection
本文报道了一种基于两个游标环振荡器的可调时间-数字转换器(TDC)的研制。TDC旨在测量CMS实验中电阻板室(RPC)探测器的定时。考虑到以前的设计,通过不同的级和适当的晶体管尺寸,电源噪声和晶体管固有噪声的贡献已经最小化。为了减少游标TDC结构固有的死区时间,人们设计了尽可能多的鉴相器(PD)。这样的功能允许选择是否减少死区时间或冗余测量启动停止时间差,以提高分辨率。81-PD矩阵是设计的独创性。实际上,启停时间的信息存在于每个逆变器的输出,在时间方面具有可预测的偏移。逆变器结构在链的每个阶段引入恒定的相位反转延迟。通过记录计数器,当相位检测发生在每个逆变器输出时,允许恢复到启停时间。采用130纳米技术制造的原型TDC在1.2 v电源下功耗为8.5 mW。该芯片的测量显示,第一相位检测允许的第一个数据的定时分辨率为8 ps,定时精度为5.48 ps
{"title":"A Multi-phase Time-to-Digital Converter Differential Vernier Ring Oscillato","authors":"A. Annagrebah, E. Bechetoille, I. Laktineh, H. Chanal, P. Russo, H. Mathez","doi":"10.1109/ISVLSI.2019.00069","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00069","url":null,"abstract":"is paper reports the development of an adjustable, Time-to-Digital Converter (TDC) based on two vernier Ring Oscillators (RO). The TDC aims to measure timing in Resistive Plate Chamber (RPC) detector for CMS experiment. Considering previous designs, the contribution from power supply noise and intrinsic transistor noise had been minimizing with differential stages and proper transistor sizing. In order to reduce the dead time inherent to Vernier TDC architecture, as many Phase Detector (PD) as possible had been implemented. Such functionality permits to choose whether reducing the dead time or measuring redundantly the start-stop time difference for an improved resolution. The 81-PD Matrix is the originality of the design. Indeed, the information of the start-stop time is present at each inverter output with a predictable offset in term of time. The inverter architecture introduces a constant delay at each stage of the chain with phase inversion. By recording the counter when a phase detection occurs at each inverter output permitted to revert back to the start-stop time. The prototype TDC fabricated in a 130-nm technology consumes 8.5 mW power under 1.2-V supply. The measurement of this chip shown a timing accuracy of 5.48 ps at a timing resolution of 8 ps for the first data allowed by the first phase detection","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"22 1","pages":"344-347"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85404019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Test Point Insertion Using Artificial Neural Networks 使用人工神经网络插入测试点
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00054
Yang Sun, S. Millican
A method of data collecting, training, and using artificial neural networks (ANNs) for evaluating test point (TP) quality for TP insertion (TPI) is presented in this study. The TPI method analyzes a digital circuit and determines where to insert TPs to improve fault coverage under pseudo-random stimulus, but in contrast to conventional TPI algorithms using heuristically-calculated testability measures, the proposed method uses an ANN trained through fault simulation to evaluate a TP's quality. The time of feature extraction is demonstrated to be significantly faster compared to heuristic-based TP evaluation, and the impact of inserted TPs is shown to provide superior stuck-at fault coverage compared to conventional heuristic-based testability analysis.
本文提出了一种数据收集、训练和使用人工神经网络(ann)来评估TP插入(TPI)测试点(TP)质量的方法。TPI方法分析数字电路并确定在何处插入TPI以提高伪随机刺激下的故障覆盖率,但与传统的TPI算法使用启发式计算的可测试性度量不同,该方法使用经过故障模拟训练的神经网络来评估TP的质量。与启发式TP评估相比,特征提取的时间明显更快,插入TP的影响与传统的基于启发式的可测试性分析相比,提供了更好的卡在故障覆盖率。
{"title":"Test Point Insertion Using Artificial Neural Networks","authors":"Yang Sun, S. Millican","doi":"10.1109/ISVLSI.2019.00054","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00054","url":null,"abstract":"A method of data collecting, training, and using artificial neural networks (ANNs) for evaluating test point (TP) quality for TP insertion (TPI) is presented in this study. The TPI method analyzes a digital circuit and determines where to insert TPs to improve fault coverage under pseudo-random stimulus, but in contrast to conventional TPI algorithms using heuristically-calculated testability measures, the proposed method uses an ANN trained through fault simulation to evaluate a TP's quality. The time of feature extraction is demonstrated to be significantly faster compared to heuristic-based TP evaluation, and the impact of inserted TPs is shown to provide superior stuck-at fault coverage compared to conventional heuristic-based testability analysis.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"31 1","pages":"253-258"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88852526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
期刊
2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1