首页 > 最新文献

2014 IEEE Computer Society Annual Symposium on VLSI最新文献

英文 中文
Moving Network Protection from Software to Hardware: An Energy Efficiency Analysis 将网络保护从软件转移到硬件:能源效率分析
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.89
Andre Luiz Pereira de Franca, R. Jasinski, V. Pedroni, A. Santin
Software-based network security is constantly challenged by the increase in network speeds and number of attacks. At the same time, mobile network access underscores the need for energy efficiency. In this paper, we present a new way to improve the throughput and to reduce the energy consumption of an anomaly-based intrusion detection system for probing attacks. Our framework implements the same classifier algorithm in software (C++) and in hardware (synthesizable VHDL), and then compares the energy efficiency of the two approaches. Our results for a decision tree classifier show that the hardware version consumed only 0.03% of the energy used by the same algorithm in software, even though the hardware version operates with a throughput that is 15 times that of the software version.
随着网络速度的提高和网络攻击的增多,基于软件的网络安全受到了不断的挑战。与此同时,移动网络接入强调了对能源效率的需求。在本文中,我们提出了一种新的方法来提高基于异常的入侵检测系统在探测攻击时的吞吐量和降低能量消耗。我们的框架在软件(c++)和硬件(可合成的VHDL)中实现了相同的分类器算法,然后比较了两种方法的能效。我们对决策树分类器的研究结果表明,尽管硬件版本的吞吐量是软件版本的15倍,但硬件版本的能耗仅为软件版本相同算法的0.03%。
{"title":"Moving Network Protection from Software to Hardware: An Energy Efficiency Analysis","authors":"Andre Luiz Pereira de Franca, R. Jasinski, V. Pedroni, A. Santin","doi":"10.1109/ISVLSI.2014.89","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.89","url":null,"abstract":"Software-based network security is constantly challenged by the increase in network speeds and number of attacks. At the same time, mobile network access underscores the need for energy efficiency. In this paper, we present a new way to improve the throughput and to reduce the energy consumption of an anomaly-based intrusion detection system for probing attacks. Our framework implements the same classifier algorithm in software (C++) and in hardware (synthesizable VHDL), and then compares the energy efficiency of the two approaches. Our results for a decision tree classifier show that the hardware version consumed only 0.03% of the energy used by the same algorithm in software, even though the hardware version operates with a throughput that is 15 times that of the software version.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122736002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Removing the Root of Trust: Secure Oblivious Key Establishment for FPGAs 消除信任的根源:fpga的安全遗忘密钥建立
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.49
Lei Xu, W. Shi
FPGAs are widely deployed nowadays. Besides offering powerful computation capacity, contemporary FPGAs also provide many security features such as bitstream protection. The security of these features is dependent on the security of the keys embedded in the FPGA, which is usually generated by the vendor. This type of architecture has a shortcoming that the FPGA vendor knows everything and becomes the root of trust. In this work, we propose a key generation method utilizing bilinear pairing that enables the user of the FPGA to interact with the device to generate keys. The generated keys depend on both the input from the user and the device so vendor cannot learn the keys. Furthermore, we offer a method to allow the user to verify the generated keys to make sure that the keys are related to his input. Finally we conduct some experiments and indicate the effectiveness of our scheme.
如今fpga得到了广泛的应用。除了提供强大的计算能力外,现代fpga还提供许多安全特性,如比特流保护。这些特性的安全性取决于FPGA中嵌入的密钥的安全性,这些密钥通常由供应商生成。这种类型的架构有一个缺点,即FPGA供应商知道一切,并成为信任的根源。在这项工作中,我们提出了一种利用双线性配对的密钥生成方法,该方法使FPGA的用户能够与设备交互以生成密钥。生成的密钥依赖于用户和设备的输入,因此供应商无法学习密钥。此外,我们还提供了一种方法,允许用户验证生成的密钥,以确保这些密钥与他的输入相关。最后通过实验验证了该方案的有效性。
{"title":"Removing the Root of Trust: Secure Oblivious Key Establishment for FPGAs","authors":"Lei Xu, W. Shi","doi":"10.1109/ISVLSI.2014.49","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.49","url":null,"abstract":"FPGAs are widely deployed nowadays. Besides offering powerful computation capacity, contemporary FPGAs also provide many security features such as bitstream protection. The security of these features is dependent on the security of the keys embedded in the FPGA, which is usually generated by the vendor. This type of architecture has a shortcoming that the FPGA vendor knows everything and becomes the root of trust. In this work, we propose a key generation method utilizing bilinear pairing that enables the user of the FPGA to interact with the device to generate keys. The generated keys depend on both the input from the user and the device so vendor cannot learn the keys. Furthermore, we offer a method to allow the user to verify the generated keys to make sure that the keys are related to his input. Finally we conduct some experiments and indicate the effectiveness of our scheme.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"210 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133536154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Low-Noise Variable-Gain Amplifier for in-Probe 3D Imaging Applications Based on CMUT Transducers 基于CMUT换能器的探头内三维成像低噪声变增益放大器
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.113
H. Attarzadeh, T. Ytterdal
This paper presents the design of a low power low noise variable gain amplifier(VGA) interface circuit. The VGA circuit proposed is designed for interface with Capacitive Micro-Machined Ultrasonic Transducer (CMUT). Due to the small area and low power consumption, the circuit is suitable for in-probe imaging where the VGA is interfaced with the in-prob ADC which does all the digital conversion inside probe. The VGA circuit maps the attenuated received signal from CMUT to the full dynamic range of the ADC. The circuit is able to produce a differential output from an ultrasound sensor which is based on a Zero-Bias CMUT, where the requirement for an external high voltage dc bias is eliminated. Therefore, the single to differential conversion is carried out through steering the current from both the electrodes of the CMUT without the need for high voltage design. The VGA is designed and simulated with 65nm CMOS technology. The VGA gain varies in linear from 0 -- 20db. A noise figure (NF) of 3dB for a CMUT with 5MHz center frequency is estimated, where the power consumption of only 80uW and the total area of 0:008mm2 is achieved which makes it perfect for the interface to the in probe ADC circuit. The circuit layout design is based on the standard unit shapes which results in pattern regularity and density uniformity. This will assure the result from the post-layout simulation close to the pre-layout simulation and also gives a better matching in the layout.
本文设计了一种低功耗低噪声可变增益放大器(VGA)接口电路。所提出的VGA电路用于与电容式微加工超声换能器(CMUT)接口。由于面积小,功耗低,该电路适用于探头内成像,其中VGA与探头内ADC接口,探头内完成所有数字转换。VGA电路将衰减后的CMUT接收信号映射到ADC的全动态范围。该电路能够从基于零偏置CMUT的超声波传感器产生差分输出,消除了对外部高压直流偏置的要求。因此,通过控制来自CMUT两个电极的电流来进行单到差动转换,而无需进行高压设计。VGA采用65nm CMOS技术进行设计和仿真。VGA增益从0 - 20db线性变化。对于中心频率为5MHz的CMUT,估计噪声系数(NF)为3dB,功耗仅为80uW,总面积为0:008mm2,使其非常适合与内探头ADC电路的接口。电路布局设计是基于标准单元形状,从而导致图案规则和密度均匀。这将确保布局后仿真的结果接近布局前仿真,并在布局中提供更好的匹配。
{"title":"A Low-Noise Variable-Gain Amplifier for in-Probe 3D Imaging Applications Based on CMUT Transducers","authors":"H. Attarzadeh, T. Ytterdal","doi":"10.1109/ISVLSI.2014.113","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.113","url":null,"abstract":"This paper presents the design of a low power low noise variable gain amplifier(VGA) interface circuit. The VGA circuit proposed is designed for interface with Capacitive Micro-Machined Ultrasonic Transducer (CMUT). Due to the small area and low power consumption, the circuit is suitable for in-probe imaging where the VGA is interfaced with the in-prob ADC which does all the digital conversion inside probe. The VGA circuit maps the attenuated received signal from CMUT to the full dynamic range of the ADC. The circuit is able to produce a differential output from an ultrasound sensor which is based on a Zero-Bias CMUT, where the requirement for an external high voltage dc bias is eliminated. Therefore, the single to differential conversion is carried out through steering the current from both the electrodes of the CMUT without the need for high voltage design. The VGA is designed and simulated with 65nm CMOS technology. The VGA gain varies in linear from 0 -- 20db. A noise figure (NF) of 3dB for a CMUT with 5MHz center frequency is estimated, where the power consumption of only 80uW and the total area of 0:008mm2 is achieved which makes it perfect for the interface to the in probe ADC circuit. The circuit layout design is based on the standard unit shapes which results in pattern regularity and density uniformity. This will assure the result from the post-layout simulation close to the pre-layout simulation and also gives a better matching in the layout.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114492713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
SET Susceptibility Analysis of Clock Tree and Clock Mesh Topologies 时钟树和时钟网拓扑的SET敏感性分析
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.33
R. Chipana, F. Kastensmidt
Clock distribution networks represent one of the most important signals in a synchronous integrated circuit, this signal may be altered by radiation effects and generate an abnormal behavior in the system. In this work we analyzed two types of clock distribution network using the same circuit to know which of one is more sensitive to radiation threats. Using a case study we compare clock tree with clock mesh. Finally we found that clock mesh topology is more sensitive to radiation effects in comparison with traditional tree distribution. This may occur because clock mesh has a uniform distribution of capacitance and this allows a good distribution of the signal even to transient pulse.
时钟分布网络是同步集成电路中最重要的信号之一,该信号可能受到辐射效应的影响而发生改变,从而在系统中产生异常行为。在这项工作中,我们分析了使用同一电路的两种时钟分配网络,以了解哪一种对辐射威胁更敏感。通过一个案例研究,我们比较了时钟树和时钟网格。与传统的树状分布相比,时钟网格拓扑对辐射效应更敏感。这可能发生,因为时钟网格具有均匀分布的电容,这允许一个良好的信号分布,甚至瞬态脉冲。
{"title":"SET Susceptibility Analysis of Clock Tree and Clock Mesh Topologies","authors":"R. Chipana, F. Kastensmidt","doi":"10.1109/ISVLSI.2014.33","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.33","url":null,"abstract":"Clock distribution networks represent one of the most important signals in a synchronous integrated circuit, this signal may be altered by radiation effects and generate an abnormal behavior in the system. In this work we analyzed two types of clock distribution network using the same circuit to know which of one is more sensitive to radiation threats. Using a case study we compare clock tree with clock mesh. Finally we found that clock mesh topology is more sensitive to radiation effects in comparison with traditional tree distribution. This may occur because clock mesh has a uniform distribution of capacitance and this allows a good distribution of the signal even to transient pulse.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"106 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114609410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Mach-Zehnder Interferometer Based All Optical Reversible Carry-Lookahead Adder 基于Mach-Zehnder干涉仪的全光可逆超前加法器
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.102
Pratik Dutta, Chandan Bandyopadhyay, C. Giri, H. Rahaman
In this work, we present an efficient reversible implementation of Carry-Lookahead Adder (CLA) in all-optical domain. Now-a-days, semiconductor optical amplifier (SOA)-based Mach-Zehnder interferometer (MZI) plays a vital role in the field of ultra-fast all-optical signal processing. We have used all optical based Mach-Zehnder Interferometer (MZI) switches to design the CLA circuit implementing reversible functionality. Two approaches are proposed for designing the CLA circuit. First, we propose a hierarchical approach for implementation of 2n-bit reversible CLA. In the second approach, we remove the drawback of hierarchical CLA and improve the design by implementing non-modular staircase structure of n-bit reversible CLA. The design complexities of both the approaches are computed. Experimental result shows that the optical cost and delay incurred in staircase structured reversible implementation of CLA are much less than those proposed in the recently reported works.
在这项工作中,我们提出了一种在全光域中高效可逆的携带-前瞻加法器(CLA)。目前,基于半导体光放大器(SOA)的马赫-曾德尔干涉仪(MZI)在超快全光信号处理领域发挥着至关重要的作用。我们使用所有基于光学的Mach-Zehnder干涉仪(MZI)开关来设计实现可逆功能的CLA电路。提出了两种设计CLA电路的方法。首先,我们提出了一种实现2n位可逆CLA的分层方法。在第二种方法中,我们消除了分层CLA的缺点,并通过实现n位可逆CLA的非模块化阶梯结构来改进设计。计算了两种方法的设计复杂度。实验结果表明,阶梯结构可逆实现CLA的光成本和延迟比目前文献中提出的要小得多。
{"title":"Mach-Zehnder Interferometer Based All Optical Reversible Carry-Lookahead Adder","authors":"Pratik Dutta, Chandan Bandyopadhyay, C. Giri, H. Rahaman","doi":"10.1109/ISVLSI.2014.102","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.102","url":null,"abstract":"In this work, we present an efficient reversible implementation of Carry-Lookahead Adder (CLA) in all-optical domain. Now-a-days, semiconductor optical amplifier (SOA)-based Mach-Zehnder interferometer (MZI) plays a vital role in the field of ultra-fast all-optical signal processing. We have used all optical based Mach-Zehnder Interferometer (MZI) switches to design the CLA circuit implementing reversible functionality. Two approaches are proposed for designing the CLA circuit. First, we propose a hierarchical approach for implementation of 2n-bit reversible CLA. In the second approach, we remove the drawback of hierarchical CLA and improve the design by implementing non-modular staircase structure of n-bit reversible CLA. The design complexities of both the approaches are computed. Experimental result shows that the optical cost and delay incurred in staircase structured reversible implementation of CLA are much less than those proposed in the recently reported works.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116738041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Physical vs. Physically-Aware Estimation Flow: Case Study of Design Space Exploration of Adders 物理vs.物理感知评估流程:《加法器》设计空间探索案例研究
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.14
Ivan Ratković, Oscar Palomar, Milan Stanic, O. Unsal, A. Cristal, M. Valero
Selecting an appropriate estimation method for a given technology and design is of crucial interest as the estimations guide future project and design decisions. The accuracy of the estimations of area, timing, and power (metrics of interest) depends on the phase of the design flow and the fidelity of the models. In this research, we use design space exploration of low-power adders as a case study for comparative analysis of two estimation flows: Physical layout Aware Synthesis (PAS) and Place and Route (PnR). We study and compare post-PAS and post-PnR estimations of the metrics of interest and the impact of various design parameters and input switching activity factor (αI). Adders are particularly interesting for this study because they are fundamental microprocessor units, and their design involves many parameters that create a vast design space. We show cases when the post-PAS and post-PnR estimations could lead to different design decisions, especially from a low-power designer point of view. Our experiments reveal that post-PAS results underestimate the side-effects of clock-gating, pipelining, and extensive timing optimizations compared to post-PnR results. We also observe that PnR estimation flow sometimes reports counterintuitive results.
为给定的技术和设计选择适当的评估方法是至关重要的,因为评估可以指导未来的项目和设计决策。估计面积、时间和功率(感兴趣的度量)的准确性取决于设计流程的阶段和模型的保真度。在本研究中,我们以低功耗加法器的设计空间探索为例,对两种评估流程:物理布局感知综合(PAS)和位置和路径(PnR)进行了比较分析。我们研究并比较了pas后和pnr后对感兴趣指标的估计以及各种设计参数和输入切换活性因子(αI)的影响。加法器对这项研究特别有趣,因为它们是基本的微处理器单元,它们的设计涉及许多参数,创造了巨大的设计空间。我们展示了后pas和后pnr评估可能导致不同设计决策的情况,特别是从低功耗设计师的角度来看。我们的实验表明,与pnr后的结果相比,pas后的结果低估了时钟门控、流水线和广泛的时序优化的副作用。我们还观察到,PnR估计流有时报告反直觉的结果。
{"title":"Physical vs. Physically-Aware Estimation Flow: Case Study of Design Space Exploration of Adders","authors":"Ivan Ratković, Oscar Palomar, Milan Stanic, O. Unsal, A. Cristal, M. Valero","doi":"10.1109/ISVLSI.2014.14","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.14","url":null,"abstract":"Selecting an appropriate estimation method for a given technology and design is of crucial interest as the estimations guide future project and design decisions. The accuracy of the estimations of area, timing, and power (metrics of interest) depends on the phase of the design flow and the fidelity of the models. In this research, we use design space exploration of low-power adders as a case study for comparative analysis of two estimation flows: Physical layout Aware Synthesis (PAS) and Place and Route (PnR). We study and compare post-PAS and post-PnR estimations of the metrics of interest and the impact of various design parameters and input switching activity factor (αI). Adders are particularly interesting for this study because they are fundamental microprocessor units, and their design involves many parameters that create a vast design space. We show cases when the post-PAS and post-PnR estimations could lead to different design decisions, especially from a low-power designer point of view. Our experiments reveal that post-PAS results underestimate the side-effects of clock-gating, pipelining, and extensive timing optimizations compared to post-PnR results. We also observe that PnR estimation flow sometimes reports counterintuitive results.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114068685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Variation-Aware Analysis and Test Pattern Generation Based on Functional Faults 基于功能故障的变化感知分析与测试模式生成
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.116
M. Fujita
Due to the continuous shrinking of semiconductor technology, there are more and more variations in the process of manufacturing chips. From the viewpoint of analyzing the functionality of a chip, variation may change the overall "observed" behavior of the chip. In this paper, we discuss additional delays caused by variation that may generate changes of observed behaviors. In the first part of the paper, we discuss functional changes caused by additional delays on the inputs of each gate in the circuit. Unlike stuck-at faults, such additional delays can introduce many different faulty functions on a gate. For example, in the cases of two-input AND/OR gate, all possible logic functions with two-input, which are 222=16 different functions, can potentially be observed. This indicates that it may make sense to model faulty behaviors caused by variation as general functional faults rather than structurally defined faults, such as stuck-at faults. Also, such additional delays by variation can happen in multiple locations simultaneously. As a result, there can be so many possible fault combinations to be considered, and it is not easy at all to analyze them with traditional automatic test pattern generation (ATPG) methods which drop detectable faults by fault simulators using explicit representation of faults. So in the second part of the paper, we discuss about ATPG methods where test pattern generation and fault dropping processes are unified. As faults are represented implicitly, even if numbers of simultaneous faults are large, we may still be able to successfully perform ATPG processes.
由于半导体技术的不断萎缩,在制造芯片的过程中有越来越多的变化。从分析芯片功能的角度来看,变化可能会改变芯片的整体“观察”行为。在本文中,我们讨论了可能产生观测行为变化的变异引起的附加延迟。在本文的第一部分中,我们讨论了电路中每个门的输入上的额外延迟引起的功能变化。与卡滞故障不同,这种额外的延迟会在门上引入许多不同的故障功能。例如,在双输入与或门的情况下,可以潜在地观察到所有可能的双输入逻辑函数,即222=16种不同的函数。这表明,将由变异引起的错误行为建模为一般功能错误而不是结构定义的错误(如卡滞错误)可能是有意义的。此外,这种由变化引起的额外延迟可能同时发生在多个位置。因此,需要考虑的故障组合非常多,传统的自动测试模式生成(ATPG)方法很难对其进行分析,这种方法通过故障模拟器使用显式故障表示来减少可检测故障。因此,本文的第二部分讨论了将测试模式生成和故障排除过程统一起来的ATPG方法。由于故障是隐式表示的,即使同时发生的故障数量很大,我们仍然可以成功地执行ATPG进程。
{"title":"Variation-Aware Analysis and Test Pattern Generation Based on Functional Faults","authors":"M. Fujita","doi":"10.1109/ISVLSI.2014.116","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.116","url":null,"abstract":"Due to the continuous shrinking of semiconductor technology, there are more and more variations in the process of manufacturing chips. From the viewpoint of analyzing the functionality of a chip, variation may change the overall \"observed\" behavior of the chip. In this paper, we discuss additional delays caused by variation that may generate changes of observed behaviors. In the first part of the paper, we discuss functional changes caused by additional delays on the inputs of each gate in the circuit. Unlike stuck-at faults, such additional delays can introduce many different faulty functions on a gate. For example, in the cases of two-input AND/OR gate, all possible logic functions with two-input, which are 222=16 different functions, can potentially be observed. This indicates that it may make sense to model faulty behaviors caused by variation as general functional faults rather than structurally defined faults, such as stuck-at faults. Also, such additional delays by variation can happen in multiple locations simultaneously. As a result, there can be so many possible fault combinations to be considered, and it is not easy at all to analyze them with traditional automatic test pattern generation (ATPG) methods which drop detectable faults by fault simulators using explicit representation of faults. So in the second part of the paper, we discuss about ATPG methods where test pattern generation and fault dropping processes are unified. As faults are represented implicitly, even if numbers of simultaneous faults are large, we may still be able to successfully perform ATPG processes.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121554667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Computing with Spin-Transfer-Torque Devices: Prospects and Perspectives 自旋-传递-扭矩装置的计算:展望与展望
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.120
K. Roy, M. Sharad, Deliang Fan, K. Yogendra
In this paper we discuss the potential of emerging spin-torque devices for computing applications. Recent proposals for spin-based computing schemes may be differentiated as all-spin? vs. hybrid, programmable vs. fixed, and, Boolean vs. non-Boolean. All-spin logic-styles may offer high area-density due to small form-factor of nano-magnetic devices. However, circuit and system-level design techniques need to be explored that leaverage the specific spin-device characterisitcs to achieve energy-efficiency, performance and reliability comparable to those of CMOS. The non-volatility of nano-magnets can be exploited in the design of energy and area-efficient programmable logic. In such logic-styles, spin-devices may play the dual-role of computing as well as memory-elements that provide field-programmability. Spin-based threshold logic design is presented as an example. Emerging spintronic phenomena may lead to ultra-low-voltage, current-mode, spin-torque switches that can offer attractive computing capabilities, beyond digital switches. Such devices may be suitable for non-Boolean data-processing applications which involve analog processing leading to highly energy-efficient information processing hardware for applicatons like pattern-matching, neuromorphic-computing, image-processing and data-conversion. Towards the end, we discuss the possibility of applying emerging spin-torque switches in the design of energy-efficient global interconnects, for future chip multiprocessors.
在本文中,我们讨论了新兴的自旋扭矩装置在计算应用中的潜力。最近提出的基于自旋的计算方案可能被区分为全自旋?混合vs.可编程vs.固定,布尔vs.非布尔。由于纳米磁性器件的小尺寸,全自旋逻辑器件可以提供高的面积密度。然而,需要探索电路和系统级设计技术,以利用特定的自旋器件特性来实现与CMOS相当的能效,性能和可靠性。纳米磁体的非易失性可用于节能和节能的可编程逻辑设计。在这种逻辑风格中,自旋设备可以扮演计算和存储元素的双重角色,提供现场可编程性。给出了一个基于自旋的阈值逻辑设计实例。新兴的自旋电子现象可能导致超低电压,电流模式,自旋扭矩开关,可以提供有吸引力的计算能力,超越数字开关。这种器件可能适用于涉及模拟处理的非布尔数据处理应用,从而为模式匹配、神经形态计算、图像处理和数据转换等应用提供高能效的信息处理硬件。最后,我们讨论了将新兴的自旋扭矩开关应用于节能全球互连设计的可能性,用于未来的芯片多处理器。
{"title":"Computing with Spin-Transfer-Torque Devices: Prospects and Perspectives","authors":"K. Roy, M. Sharad, Deliang Fan, K. Yogendra","doi":"10.1109/ISVLSI.2014.120","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.120","url":null,"abstract":"In this paper we discuss the potential of emerging spin-torque devices for computing applications. Recent proposals for spin-based computing schemes may be differentiated as all-spin? vs. hybrid, programmable vs. fixed, and, Boolean vs. non-Boolean. All-spin logic-styles may offer high area-density due to small form-factor of nano-magnetic devices. However, circuit and system-level design techniques need to be explored that leaverage the specific spin-device characterisitcs to achieve energy-efficiency, performance and reliability comparable to those of CMOS. The non-volatility of nano-magnets can be exploited in the design of energy and area-efficient programmable logic. In such logic-styles, spin-devices may play the dual-role of computing as well as memory-elements that provide field-programmability. Spin-based threshold logic design is presented as an example. Emerging spintronic phenomena may lead to ultra-low-voltage, current-mode, spin-torque switches that can offer attractive computing capabilities, beyond digital switches. Such devices may be suitable for non-Boolean data-processing applications which involve analog processing leading to highly energy-efficient information processing hardware for applicatons like pattern-matching, neuromorphic-computing, image-processing and data-conversion. Towards the end, we discuss the possibility of applying emerging spin-torque switches in the design of energy-efficient global interconnects, for future chip multiprocessors.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121587172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
High Performance Low Swing Clock Tree Synthesis with Custom D Flip-Flop Design 高性能低摆时钟树合成与定制D触发器设计
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.53
Can Sitik, Leo Filippini, E. Salman, B. Taskin
Low swing clocking is a low power design methodology that scales the clock voltage to decrease power consumption of the clock distribution networks, with an expected degradation in the performance. In this work, a novel low swing clock tree synthesis methodology is combined with a custom low swing clock-aware D flip-flop (DFF) design. The low swing clocking serves to reduce the power dissipation whereas the custom low swing-aware DFF serves to preserve the performance of the IC. The experimental results performed on the three largest circuits of ISCAS'89 benchmarks operating at 1GHz in the 32nm technology show that the proposed methodology can achieve an average of 16% power savings in the clock tree compared to its full swing counterpart, while satisfying the same clock skew (50ps) and slew (150ps) constraints at the worst case corner of operation. Moreover, the clock-to-output delay of the low swing DFF does not increase compared to traditional full swing DFF, while consuming only 1% more power.
低摆幅时钟是一种低功耗设计方法,它通过缩放时钟电压来降低时钟分配网络的功耗,从而降低性能的预期下降。在这项工作中,一种新颖的低摆时钟树合成方法与定制的低摆时钟感知D触发器(DFF)设计相结合。低摆幅时钟用于降低功耗,而定制的低摆幅感知DFF用于保持IC的性能。在ISCAS'89基准测试的三个最大电路上进行的实验结果表明,与全摆幅技术相比,所提出的方法可以在时钟树中平均节省16%的功耗。同时在最坏的情况下满足相同的时钟偏差(50ps)和反转(150ps)约束。此外,与传统的全摆幅DFF相比,低摆幅DFF的时钟到输出延迟不会增加,而功耗仅增加1%。
{"title":"High Performance Low Swing Clock Tree Synthesis with Custom D Flip-Flop Design","authors":"Can Sitik, Leo Filippini, E. Salman, B. Taskin","doi":"10.1109/ISVLSI.2014.53","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.53","url":null,"abstract":"Low swing clocking is a low power design methodology that scales the clock voltage to decrease power consumption of the clock distribution networks, with an expected degradation in the performance. In this work, a novel low swing clock tree synthesis methodology is combined with a custom low swing clock-aware D flip-flop (DFF) design. The low swing clocking serves to reduce the power dissipation whereas the custom low swing-aware DFF serves to preserve the performance of the IC. The experimental results performed on the three largest circuits of ISCAS'89 benchmarks operating at 1GHz in the 32nm technology show that the proposed methodology can achieve an average of 16% power savings in the clock tree compared to its full swing counterpart, while satisfying the same clock skew (50ps) and slew (150ps) constraints at the worst case corner of operation. Moreover, the clock-to-output delay of the low swing DFF does not increase compared to traditional full swing DFF, while consuming only 1% more power.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125973329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
System-Level Power and Energy Estimation Methodology for Open Multimedia Applications Platforms 开放多媒体应用平台的系统级功率和能量估算方法
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.38
S. Rethinagiri, Oscar Palomar, J. Moreno, O. Unsal, A. Cristal, M. Biglari-Abhari
In this paper, we propose a power and energy estimation methodology at system-level for Open Multimedia Applications Platforms (OMAPs). Within this methodology, the Functional-Level Power Analysis (FLPA) is extended to create generic power models of the platform under test. Then, a simulation framework is developed at the transactional-level to accurately evaluate the activities used in the previously developed power and energy models. The proposed methodology has several benefits: it considers the power and energy consumption of the entire platform including peripherals and leads to accurate estimates. The efficiency of the proposed system-level methodology is validated using mono-processor and heterogeneous multiprocessor embedded architectures designed around OMAPs. The estimated power and energy results provide a maximum error of 5% for mono-processor and 9% for heterogeneous multiprocessor based system when compared against the real board measurements.
在本文中,我们提出了一种开放多媒体应用平台(OMAPs)系统级的功率和能量估计方法。在这种方法中,功能级功率分析(FLPA)被扩展到创建被测平台的通用功率模型。然后,在事务级别开发一个模拟框架,以准确地评估先前开发的电力和能源模型中使用的活动。提出的方法有几个好处:它考虑了整个平台(包括外围设备)的功率和能源消耗,并导致准确的估计。采用围绕omap设计的单处理器和异构多处理器嵌入式架构验证了所提出的系统级方法的效率。与实际电路板测量结果相比,估计的功率和能量结果为单处理器系统提供了5%的最大误差,为异构多处理器系统提供了9%的最大误差。
{"title":"System-Level Power and Energy Estimation Methodology for Open Multimedia Applications Platforms","authors":"S. Rethinagiri, Oscar Palomar, J. Moreno, O. Unsal, A. Cristal, M. Biglari-Abhari","doi":"10.1109/ISVLSI.2014.38","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.38","url":null,"abstract":"In this paper, we propose a power and energy estimation methodology at system-level for Open Multimedia Applications Platforms (OMAPs). Within this methodology, the Functional-Level Power Analysis (FLPA) is extended to create generic power models of the platform under test. Then, a simulation framework is developed at the transactional-level to accurately evaluate the activities used in the previously developed power and energy models. The proposed methodology has several benefits: it considers the power and energy consumption of the entire platform including peripherals and leads to accurate estimates. The efficiency of the proposed system-level methodology is validated using mono-processor and heterogeneous multiprocessor embedded architectures designed around OMAPs. The estimated power and energy results provide a maximum error of 5% for mono-processor and 9% for heterogeneous multiprocessor based system when compared against the real board measurements.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127525537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
2014 IEEE Computer Society Annual Symposium on VLSI
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1