首页 > 最新文献

2014 IEEE Computer Society Annual Symposium on VLSI最新文献

英文 中文
Design of a Flexible, Energy Efficient (Auto)Correlator Block for Timing Synchronization 一种灵活、节能(自动)的定时同步相关器块设计
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.27
F. Campi, Roberto Airoldi, J. Nurmi
Multi-mode and multi-standard connectivity has become a necessity for portable communication systems. A convenient architectural solution is to build flexible systems that can be reprogrammed to meet requirements of multiple standards. One of the major issues in this context is the resource overhead required by programmability. In particular, in the latest VLSI technology nodes, energy consumption has become a very severe problem, greatly impacting the reliability of the hardware. Therefore, any design aimed at the implementation of multi-mode multi-standard communication systems must be strictly targeted at the lowest power consumption without jeopardizing peak performance, while, at the same time, retaining a high degree of flexibility. This work presents the design and implementation of a (auto)correlator block for timing synchronization. The design is composed of a scalable computational unit, which allows to meet real-time requirements of different wireless communication standards (e.g. W-CDMA, IEEE 802.11a/g/n). Moreover, dynamic power management allows to dynamically trade-off energy consumption versus performance, adapting power dissipation to the specific requirements of each supported standard, as well as to follow dynamic variations of the computation load.
多模式、多标准连接已成为便携式通信系统的必然要求。一种方便的架构解决方案是构建灵活的系统,这些系统可以重新编程以满足多种标准的需求。这方面的主要问题之一是可编程性所需的资源开销。特别是在最新的VLSI技术节点中,能耗已经成为一个非常严重的问题,极大地影响了硬件的可靠性。因此,任何旨在实现多模多标准通信系统的设计都必须严格以最低功耗为目标,同时不损害峰值性能,同时保持高度的灵活性。本文介绍了一种用于定时同步的(自动)相关器块的设计和实现。该设计由一个可扩展的计算单元组成,可以满足不同无线通信标准(如W-CDMA、IEEE 802.11a/g/n)的实时性要求。此外,动态电源管理允许动态权衡能耗与性能,使功耗适应每个支持标准的特定要求,并遵循计算负载的动态变化。
{"title":"Design of a Flexible, Energy Efficient (Auto)Correlator Block for Timing Synchronization","authors":"F. Campi, Roberto Airoldi, J. Nurmi","doi":"10.1109/ISVLSI.2014.27","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.27","url":null,"abstract":"Multi-mode and multi-standard connectivity has become a necessity for portable communication systems. A convenient architectural solution is to build flexible systems that can be reprogrammed to meet requirements of multiple standards. One of the major issues in this context is the resource overhead required by programmability. In particular, in the latest VLSI technology nodes, energy consumption has become a very severe problem, greatly impacting the reliability of the hardware. Therefore, any design aimed at the implementation of multi-mode multi-standard communication systems must be strictly targeted at the lowest power consumption without jeopardizing peak performance, while, at the same time, retaining a high degree of flexibility. This work presents the design and implementation of a (auto)correlator block for timing synchronization. The design is composed of a scalable computational unit, which allows to meet real-time requirements of different wireless communication standards (e.g. W-CDMA, IEEE 802.11a/g/n). Moreover, dynamic power management allows to dynamically trade-off energy consumption versus performance, adapting power dissipation to the specific requirements of each supported standard, as well as to follow dynamic variations of the computation load.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"455 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131856026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Delay Probability Metric for Input Pattern Ranking Under Process Variation and Supply Noise 过程变化和供应噪声下输入模式排序的延迟概率度量
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.42
A. Asokan, A. Todri, A. Bosio, L. Dilillo, P. Girard, S. Pravossoudovitch, A. Virazel
Ongoing technology scaling has increased delay defects in integrated circuits. Some of the delay defects are due to crosstalk, supply noise, process variations, etc. They degrade the performance and field reliability of circuits. However, testing the circuits with path delay patterns under worst-case conditions helps to detect such defects. Estimation of patterns with worst-case path delay becomes difficult using the conventional techniques due to their unpredictable behavior. In this paper, we first describe the problem and then propose our approach in identifying a worst-case path delay pattern under the impact of process variations and supply noise. A delay probability metric ispresented in this work, for an efficient identification of worst-case path delay pattern, which is the basis of our ranking method. The simulation results of ITC'99 benchmark circuits show the feasibility of our delay probability metric.
不断发展的技术规模增加了集成电路中的延迟缺陷。有些延迟缺陷是由串扰、电源噪声、工艺变化等引起的。它们降低了电路的性能和现场可靠性。然而,在最坏情况下测试具有路径延迟模式的电路有助于检测此类缺陷。具有最坏路径延迟的模式由于其不可预测的行为,使得使用传统技术来估计它们变得困难。在本文中,我们首先描述了这个问题,然后提出了在过程变化和供应噪声影响下识别最坏情况路径延迟模式的方法。为了有效地识别最坏路径的延迟模式,本文提出了一个延迟概率度量,这是我们的排序方法的基础。ITC’99基准电路的仿真结果表明了延迟概率度量的可行性。
{"title":"A Delay Probability Metric for Input Pattern Ranking Under Process Variation and Supply Noise","authors":"A. Asokan, A. Todri, A. Bosio, L. Dilillo, P. Girard, S. Pravossoudovitch, A. Virazel","doi":"10.1109/ISVLSI.2014.42","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.42","url":null,"abstract":"Ongoing technology scaling has increased delay defects in integrated circuits. Some of the delay defects are due to crosstalk, supply noise, process variations, etc. They degrade the performance and field reliability of circuits. However, testing the circuits with path delay patterns under worst-case conditions helps to detect such defects. Estimation of patterns with worst-case path delay becomes difficult using the conventional techniques due to their unpredictable behavior. In this paper, we first describe the problem and then propose our approach in identifying a worst-case path delay pattern under the impact of process variations and supply noise. A delay probability metric ispresented in this work, for an efficient identification of worst-case path delay pattern, which is the basis of our ranking method. The simulation results of ITC'99 benchmark circuits show the feasibility of our delay probability metric.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123445666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Function Extraction from Arithmetic Bit-Level Circuits 算术位级电路的函数提取
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.43
M. Ciesielski, W. Brown, Duo Liu, A. Rossi
The paper describes a method to derive a polynomial function computed by an arithmetic bit-level circuit. The circuit is modeled as a bit-level network composed of adders and logic gates and computation performed by the circuit is viewed as a flow of binary data through the network. The problem is cast as a Network Flow problem and solved using standard algebraic techniques. Extraction of the arithmetic function from the circuit is accomplished by transforming the expression at the primary outputs into an expression at the primary inputs. Experimental results show application of the method to certain classes of large arithmetic circuits.
本文介绍了一种用算术位级电路计算多项式函数的推导方法。电路被建模为由加法器和逻辑门组成的位级网络,电路执行的计算被视为通过网络的二进制数据流。该问题被视为网络流问题,并使用标准代数技术解决。从电路中提取算术函数是通过将主要输出端的表达式转换为主要输入端的表达式来完成的。实验结果表明,该方法适用于某些类型的大型算术电路。
{"title":"Function Extraction from Arithmetic Bit-Level Circuits","authors":"M. Ciesielski, W. Brown, Duo Liu, A. Rossi","doi":"10.1109/ISVLSI.2014.43","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.43","url":null,"abstract":"The paper describes a method to derive a polynomial function computed by an arithmetic bit-level circuit. The circuit is modeled as a bit-level network composed of adders and logic gates and computation performed by the circuit is viewed as a flow of binary data through the network. The problem is cast as a Network Flow problem and solved using standard algebraic techniques. Extraction of the arithmetic function from the circuit is accomplished by transforming the expression at the primary outputs into an expression at the primary inputs. Experimental results show application of the method to certain classes of large arithmetic circuits.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124040350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Enabling Side Channel Secure FSMs in the Presence of Low Power Requirements 在低功耗要求下启用侧信道安全fsm
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.78
M. Borowczak, R. Vemuri
As silicon-based technology feature sizes continue to decrease and designs remain susceptible to novel attacks designers face competing goals when creating secure, low power, integrated circuits (ICs). Often, low power designs rely on heavy minimization and optimization procedures while many secure designs use low-level duplication mechanisms to thwart attacks. An area that requires special attention, and is crucial in both realms, is the power consumption profile of Finite State Machines (FSM). This work specifically addresses the key concern of creating secure, low-power, FSM encodings. This work details a flexible, secure, encoding strategy which, in conjunction with security-based structural modifications, can provide low-power security solutions against side channel attacks. The secure encoding strategy includes methods that modify the original constraints in order to provide varying levels of protection that approach traditional low power encoding methods. Specifically, this work uses the MCNC benchmark suite to compare the state space and encoding requirement for secure (70% increase) and relaxed encoding methods (53-67% increase) aimed at increasing overall device security while reducing state-state transition cost (Npeak = 2).
随着硅基技术的特征尺寸不断减小,设计仍然容易受到新的攻击,设计人员在创建安全、低功耗的集成电路(ic)时面临着相互竞争的目标。通常,低功耗设计依赖于大量的最小化和优化过程,而许多安全设计使用低级复制机制来阻止攻击。需要特别注意的一个领域是有限状态机(FSM)的功耗概况,这在两个领域都是至关重要的。这项工作特别解决了创建安全、低功耗、FSM编码的关键问题。这项工作详细介绍了一种灵活、安全的编码策略,该策略与基于安全的结构修改相结合,可以提供低功耗的安全解决方案,防止侧信道攻击。安全编码策略包括修改原始约束的方法,以便提供接近传统低功耗编码方法的不同级别的保护。具体来说,本工作使用MCNC基准套件来比较安全(增加70%)和宽松编码方法(增加53-67%)的状态空间和编码需求,旨在提高整体设备安全性,同时降低状态-状态转换成本(Npeak = 2)。
{"title":"Enabling Side Channel Secure FSMs in the Presence of Low Power Requirements","authors":"M. Borowczak, R. Vemuri","doi":"10.1109/ISVLSI.2014.78","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.78","url":null,"abstract":"As silicon-based technology feature sizes continue to decrease and designs remain susceptible to novel attacks designers face competing goals when creating secure, low power, integrated circuits (ICs). Often, low power designs rely on heavy minimization and optimization procedures while many secure designs use low-level duplication mechanisms to thwart attacks. An area that requires special attention, and is crucial in both realms, is the power consumption profile of Finite State Machines (FSM). This work specifically addresses the key concern of creating secure, low-power, FSM encodings. This work details a flexible, secure, encoding strategy which, in conjunction with security-based structural modifications, can provide low-power security solutions against side channel attacks. The secure encoding strategy includes methods that modify the original constraints in order to provide varying levels of protection that approach traditional low power encoding methods. Specifically, this work uses the MCNC benchmark suite to compare the state space and encoding requirement for secure (70% increase) and relaxed encoding methods (53-67% increase) aimed at increasing overall device security while reducing state-state transition cost (Npeak = 2).","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127534418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Novel Class of Linear MIMO Detectors with Boosted Communications Performance: Algorithm and VLSI Architecture 一类提高通信性能的新型线性MIMO检测器:算法和VLSI架构
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.16
Dominik Auras, R. Leupers, G. Ascheid
This paper introduces a novel class of linear soft-input soft-output detectors with boosted communications performance. The detector showed an SNR gain of up to 2.4 dB compared to state-of-the-art linear detectors. We introduce a low-complexity algorithm tailored for VLSI implementation, and propose a suitable architecture. The developed ASIC demonstrates the feasibility and efficiency of the concept, achieving the IEEE 802.11n standard's peak data rate of 600 Mbit/s.
本文介绍了一类提高通信性能的新型线性软输入软输出检测器。与最先进的线性检测器相比,该检测器显示了高达2.4 dB的信噪比增益。我们介绍了一种适合VLSI实现的低复杂度算法,并提出了一种合适的架构。开发的ASIC验证了该概念的可行性和效率,实现了IEEE 802.11n标准的峰值数据速率600 Mbit/s。
{"title":"A Novel Class of Linear MIMO Detectors with Boosted Communications Performance: Algorithm and VLSI Architecture","authors":"Dominik Auras, R. Leupers, G. Ascheid","doi":"10.1109/ISVLSI.2014.16","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.16","url":null,"abstract":"This paper introduces a novel class of linear soft-input soft-output detectors with boosted communications performance. The detector showed an SNR gain of up to 2.4 dB compared to state-of-the-art linear detectors. We introduce a low-complexity algorithm tailored for VLSI implementation, and propose a suitable architecture. The developed ASIC demonstrates the feasibility and efficiency of the concept, achieving the IEEE 802.11n standard's peak data rate of 600 Mbit/s.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130598355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Variation Aware Design of Post-Silicon Tunable Clock Buffer 后硅可调时钟缓冲器的变化感知设计
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.95
Vikram B. Suresh, W. Burleson
Process variation is a major limiting factor in designing high performance circuits in advanced CMOS technologies. Over optimizing data paths to provide adequate design margin is leading to increased area and power overhead. In this work, we present a variation aware design of Post-Silicon Tunable (PST) clock buffers to redistribute clock skew and mitigate impact of process variation on chip performance. Conventional PST buffers are designed for linear delay values. We estimate a set of non-linear delay intervals of PST buffer based on slack variation in each critical path. The configuration device sizes are mapped to the non-linear delay intervals using a set of equality conditions and solved using Linear Programming (LP). The variation aware device sizing provides many small delay intervals within one standard deviation of slack distribution and fewer large delay intervals to compensate for larger slack variations. This helps in optimal tuning of PST buffers to fix hold time, as well as better skew distribution to achieve maximum possible performance in a chip. The proposed PST buffer design technique was implemented for ISCAS'89 benchmark circuits. Non-linear delay PST buffers improve performance binning yield by more than 4%. They also provide optimum buffer circuits with an area reduction of 30% and leakage power reduction of 20%. The proposed technique can be implemented by designing dedicated buffers for each critical path or using a smaller set of pre-designed buffers with non-linear delay values.
在先进的CMOS技术中,工艺变化是设计高性能电路的主要限制因素。为了提供足够的设计余量而过度优化数据路径会导致面积和功率开销的增加。在这项工作中,我们提出了一种变化感知设计的后硅可调谐(PST)时钟缓冲器,以重新分配时钟倾斜并减轻工艺变化对芯片性能的影响。传统的PST缓冲器是为线性延迟值设计的。我们根据每个关键路径上的松弛变化估计了PST缓冲区的一组非线性延迟区间。利用一组等价条件将组态器件尺寸映射到非线性延迟区间,并用线性规划方法求解。变化感知装置尺寸在松弛分布的一个标准差内提供了许多小的延迟间隔和更少的大延迟间隔,以补偿较大的松弛变化。这有助于优化调整PST缓冲器,以固定保持时间,以及更好的倾斜分布,以实现芯片中最大可能的性能。提出的PST缓冲器设计技术在ISCAS'89基准电路中实现。非线性延迟PST缓冲器将性能分频率提高了4%以上。它们还提供最佳的缓冲电路,面积减少30%,泄漏功率减少20%。所提出的技术可以通过为每个关键路径设计专用缓冲区或使用一组较小的预先设计的具有非线性延迟值的缓冲区来实现。
{"title":"Variation Aware Design of Post-Silicon Tunable Clock Buffer","authors":"Vikram B. Suresh, W. Burleson","doi":"10.1109/ISVLSI.2014.95","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.95","url":null,"abstract":"Process variation is a major limiting factor in designing high performance circuits in advanced CMOS technologies. Over optimizing data paths to provide adequate design margin is leading to increased area and power overhead. In this work, we present a variation aware design of Post-Silicon Tunable (PST) clock buffers to redistribute clock skew and mitigate impact of process variation on chip performance. Conventional PST buffers are designed for linear delay values. We estimate a set of non-linear delay intervals of PST buffer based on slack variation in each critical path. The configuration device sizes are mapped to the non-linear delay intervals using a set of equality conditions and solved using Linear Programming (LP). The variation aware device sizing provides many small delay intervals within one standard deviation of slack distribution and fewer large delay intervals to compensate for larger slack variations. This helps in optimal tuning of PST buffers to fix hold time, as well as better skew distribution to achieve maximum possible performance in a chip. The proposed PST buffer design technique was implemented for ISCAS'89 benchmark circuits. Non-linear delay PST buffers improve performance binning yield by more than 4%. They also provide optimum buffer circuits with an area reduction of 30% and leakage power reduction of 20%. The proposed technique can be implemented by designing dedicated buffers for each critical path or using a smaller set of pre-designed buffers with non-linear delay values.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"156 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132896849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
FDPIC: Generation of Functional Test Sequences Based on Fault-Dependent Primary Input Cubes 基于故障相关主输入立方体的功能测试序列生成
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.23
I. Pomeranz
A subclass of efficient simulation-based sequential test generation procedures are guided by information about primary input values that effective test subsequences should use. This information is represented by a primary input cube c. In all the earlier procedures that are based on this concept, the computation of c is fault-independent. This paper introduces an approach for a fault-dependent computation of c. This allows the test generation procedure to generate test subsequences that match the detection conditions of specific faults, thus increasing the fault coverage that it can achieve. The computation of the primary input cube c for a fault f is based on a process that unspecifies a primary input subsequence, which does not detect f, in order to identify values that need to be avoided in a test subsequence for f. These values are used in the construction of c for f. The paper also discusses the applicability of the fault-dependent primary input cubes to the built-in generation of functional broadside tests.
有效的基于模拟的顺序测试生成过程的一个子类是由有效的测试子序列应该使用的主要输入值信息指导的。该信息由主输入立方c表示。在基于此概念的所有早期过程中,c的计算是与故障无关的。本文介绍了一种c的故障相关计算方法。该方法允许测试生成过程生成与特定故障的检测条件相匹配的测试子序列,从而增加了它可以实现的故障覆盖率。故障f的主输入立方c的计算是基于一个过程,该过程不指定主输入子序列,该子序列不检测f,以便识别在f的测试子序列中需要避免的值。这些值用于f的c的构造。本文还讨论了故障相关的主输入立方对内置生成功能宽边测试的适用性。
{"title":"FDPIC: Generation of Functional Test Sequences Based on Fault-Dependent Primary Input Cubes","authors":"I. Pomeranz","doi":"10.1109/ISVLSI.2014.23","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.23","url":null,"abstract":"A subclass of efficient simulation-based sequential test generation procedures are guided by information about primary input values that effective test subsequences should use. This information is represented by a primary input cube c. In all the earlier procedures that are based on this concept, the computation of c is fault-independent. This paper introduces an approach for a fault-dependent computation of c. This allows the test generation procedure to generate test subsequences that match the detection conditions of specific faults, thus increasing the fault coverage that it can achieve. The computation of the primary input cube c for a fault f is based on a process that unspecifies a primary input subsequence, which does not detect f, in order to identify values that need to be avoided in a test subsequence for f. These values are used in the construction of c for f. The paper also discusses the applicability of the fault-dependent primary input cubes to the built-in generation of functional broadside tests.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116138790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chip Health Monitoring Using Machine Learning 使用机器学习的芯片健康监测
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.119
F. Firouzi, Fangming Ye, K. Chakrabarty, M. Tahoori
In nanoscale technology nodes, process and runtime variations have emerged as the major sources of timing uncertainties which may ultimately result in circuit failure due to timing violation. Therefore, in-field chip health monitoring is essential to track workload-induced variations at runtime in a per-chip basis. There exist a variety of monitoring circuits to track the delay changes of different on-chip components. However, existing techniques either need to stop normal execution of the chip or introduce a significant overhead unless they are carefully placed for very selective locations. Another challenge is to infer the information regarding the health of every critical paths of the chip with limited information obtained by the monitoring system. We address these challenges in this work using a representative path-selection technique based on machine learning. This technique allows us to measure the delay of a small subset of paths and assess the circuit-level impact of workload for a larger pool of reliability-critical paths.
在纳米技术节点中,工艺和运行时的变化已成为时间不确定性的主要来源,这可能最终导致电路因时间违反而失效。因此,现场芯片运行状况监控对于在运行时以每个芯片为基础跟踪工作负载引起的变化至关重要。目前存在多种监控电路来跟踪不同片上元件的时延变化。然而,现有的技术要么需要停止芯片的正常执行,要么引入显著的开销,除非它们被小心地放置在非常有选择的位置。另一个挑战是利用监测系统获得的有限信息推断芯片每条关键路径的健康状况。我们在这项工作中使用基于机器学习的代表性路径选择技术来解决这些挑战。该技术允许我们测量一小部分路径的延迟,并评估工作负载对更大的可靠性关键路径池的电路级影响。
{"title":"Chip Health Monitoring Using Machine Learning","authors":"F. Firouzi, Fangming Ye, K. Chakrabarty, M. Tahoori","doi":"10.1109/ISVLSI.2014.119","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.119","url":null,"abstract":"In nanoscale technology nodes, process and runtime variations have emerged as the major sources of timing uncertainties which may ultimately result in circuit failure due to timing violation. Therefore, in-field chip health monitoring is essential to track workload-induced variations at runtime in a per-chip basis. There exist a variety of monitoring circuits to track the delay changes of different on-chip components. However, existing techniques either need to stop normal execution of the chip or introduce a significant overhead unless they are carefully placed for very selective locations. Another challenge is to infer the information regarding the health of every critical paths of the chip with limited information obtained by the monitoring system. We address these challenges in this work using a representative path-selection technique based on machine learning. This technique allows us to measure the delay of a small subset of paths and assess the circuit-level impact of workload for a larger pool of reliability-critical paths.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"2673 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121451977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Low Power and Scalable Many-Core Architecture for Big-Data Stream Computing 面向大数据流计算的低功耗、可扩展多核架构
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.77
Karim Kanoun, M. Ruggiero, David Atienza Alonso, M. Schaar
In the last years the process of examining large amounts of different types of data, or Big-Data, in an effort to uncover hidden patterns or unknown correlations has become a major need in our society. In this context, stream mining applications are now widely used in several domains such as financial analysis, video annotation, surveillance, medical services, traffic prediction, etc. In order to cope with the Big-Data stream input and its high variability, modern stream mining applications implement systems with heterogeneous classifiers and adapt online to its input data stream characteristics variation. Moreover, unlike existing architectures for video processing and compression applications, where the processing units are reconfigurable in terms of parameters and possibly even functions as the input data is changing, in Big-Data stream mining applications the complete computing pipeline is changing, as entirely new classifiers and processing functions are invoked depending on the input stream. As a result, new approaches of reconfigurable hardware platform architectures are needed to handle Big-Data streams. However, hardware solutions that have been proposed so far for stream mining applications either target high performance computing without any power consideration (i.e., limiting their applicability in small-scale computing infrastructures or current embedded systems), or they are simply dedicated to a specific learning algorithm (i.e., limited to run with a single type of classifiers). Therefore, in this paper we propose a novel low-power many-core architecture for stream mining applications that is able to cope with the dynamic data-driven nature of stream mining applications while consuming limited power. Our exploration indicates that this new proposed architecture is able to adapt to different classifiers complexities thanks to its multiple scalable vector processing units and their re-configurability feature at run-time. Moreover, our platform architecture includes a memory hierarchy optimized for Big-Data streaming and implements modern fine-grained power management techniques over all the different types of cores allowing then minimum energy consumption for each type of executed classifier.
在过去的几年里,为了发现隐藏的模式或未知的相关性,检查大量不同类型的数据或大数据的过程已经成为我们社会的主要需求。在这种背景下,流挖掘应用现在被广泛应用于金融分析、视频注释、监控、医疗服务、交通预测等多个领域。为了应对大数据流输入及其高可变性,现代流挖掘应用采用异构分类器实现系统,并在线适应其输入数据流特征的变化。此外,与现有的视频处理和压缩应用体系结构不同,在视频处理和压缩应用中,处理单元在参数方面是可重构的,甚至可能随着输入数据的变化而改变功能,而在大数据流挖掘应用中,完整的计算管道正在发生变化,因为根据输入流调用了全新的分类器和处理功能。因此,需要新的可重构硬件平台架构来处理大数据流。然而,迄今为止为流挖掘应用程序提出的硬件解决方案要么是针对高性能计算而不考虑任何功率(即限制其在小规模计算基础设施或当前嵌入式系统中的适用性),要么是专门针对特定的学习算法(即限制在单一类型的分类器上运行)。因此,在本文中,我们为流挖掘应用程序提出了一种新颖的低功耗多核架构,该架构能够在消耗有限功率的同时处理流挖掘应用程序的动态数据驱动特性。我们的研究表明,由于其多个可扩展的向量处理单元及其在运行时的可重构特性,这种新提出的体系结构能够适应不同的分类器复杂性。此外,我们的平台架构包括针对大数据流优化的内存层次结构,并在所有不同类型的内核上实现现代细粒度电源管理技术,从而使每种类型的执行分类器的能耗最小。
{"title":"Low Power and Scalable Many-Core Architecture for Big-Data Stream Computing","authors":"Karim Kanoun, M. Ruggiero, David Atienza Alonso, M. Schaar","doi":"10.1109/ISVLSI.2014.77","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.77","url":null,"abstract":"In the last years the process of examining large amounts of different types of data, or Big-Data, in an effort to uncover hidden patterns or unknown correlations has become a major need in our society. In this context, stream mining applications are now widely used in several domains such as financial analysis, video annotation, surveillance, medical services, traffic prediction, etc. In order to cope with the Big-Data stream input and its high variability, modern stream mining applications implement systems with heterogeneous classifiers and adapt online to its input data stream characteristics variation. Moreover, unlike existing architectures for video processing and compression applications, where the processing units are reconfigurable in terms of parameters and possibly even functions as the input data is changing, in Big-Data stream mining applications the complete computing pipeline is changing, as entirely new classifiers and processing functions are invoked depending on the input stream. As a result, new approaches of reconfigurable hardware platform architectures are needed to handle Big-Data streams. However, hardware solutions that have been proposed so far for stream mining applications either target high performance computing without any power consideration (i.e., limiting their applicability in small-scale computing infrastructures or current embedded systems), or they are simply dedicated to a specific learning algorithm (i.e., limited to run with a single type of classifiers). Therefore, in this paper we propose a novel low-power many-core architecture for stream mining applications that is able to cope with the dynamic data-driven nature of stream mining applications while consuming limited power. Our exploration indicates that this new proposed architecture is able to adapt to different classifiers complexities thanks to its multiple scalable vector processing units and their re-configurability feature at run-time. Moreover, our platform architecture includes a memory hierarchy optimized for Big-Data streaming and implements modern fine-grained power management techniques over all the different types of cores allowing then minimum energy consumption for each type of executed classifier.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132370728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Cost-Effective Test Optimized Scheme of TSV-Based 3D SoCs for Pre-Bond Test 基于tsv的3D soc预粘接测试优化方案
Pub Date : 2014-07-09 DOI: 10.1109/ISVLSI.2014.57
Kele Shen, D. Xiang, Z. Jiang
Three-dimensional (3D) SoC is becoming one of the most promising approaches for extending Moore's Law. However, managing test optimized scheme to reduce the cost of 3D SoCs is a significant challenge. In this paper, we propose a cost-effective optimized scheme of 3D SoCs for pre-bond test based on a generic cost model we defined. Both test time and number of TSV are considered in our novel scheme. Experimental results on ITC'02 SoC benchmark circuits show that our scheme is superior to one baseline solution and can effectively achieve good performance on test optimization.
三维(3D) SoC正成为扩展摩尔定律最有前途的方法之一。然而,管理测试优化方案以降低3D soc的成本是一个重大挑战。在本文中,我们提出了一种具有成本效益的3D soc预粘合测试优化方案,该方案基于我们定义的通用成本模型。该方案同时考虑了测试时间和TSV数量。在ITC’02 SoC基准电路上的实验结果表明,该方案优于单一基准方案,能够有效地实现良好的测试优化性能。
{"title":"Cost-Effective Test Optimized Scheme of TSV-Based 3D SoCs for Pre-Bond Test","authors":"Kele Shen, D. Xiang, Z. Jiang","doi":"10.1109/ISVLSI.2014.57","DOIUrl":"https://doi.org/10.1109/ISVLSI.2014.57","url":null,"abstract":"Three-dimensional (3D) SoC is becoming one of the most promising approaches for extending Moore's Law. However, managing test optimized scheme to reduce the cost of 3D SoCs is a significant challenge. In this paper, we propose a cost-effective optimized scheme of 3D SoCs for pre-bond test based on a generic cost model we defined. Both test time and number of TSV are considered in our novel scheme. Experimental results on ITC'02 SoC benchmark circuits show that our scheme is superior to one baseline solution and can effectively achieve good performance on test optimization.","PeriodicalId":405755,"journal":{"name":"2014 IEEE Computer Society Annual Symposium on VLSI","volume":"323 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113985522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2014 IEEE Computer Society Annual Symposium on VLSI
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1