首页 > 最新文献

2009 IEEE International Conference on Computer Design最新文献

英文 中文
Interconnect performance corners considering crosstalk noise 考虑串扰噪声的互连性能角
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413148
Ravikishore Gandikota, D. Blaauw, D. Sylvester
Process induced variations in the interconnect capacitance and resistance have resulted in significant uncertainly in the interconnect delay. In this work, we propose a new method to compute the interconnect corner considering coupling-noise due to simultaneous switching of aggressors. In prior approaches, the interconnect corners were computed under the assumption that the aggressor nets are not switching and no coupling-noise is injected on the victim net. In this paper, we first show that the interconnect corners obtained under such assumptions could in reality be much different from the true interconnect corner and could therefore result in optimistic delay analysis, particularly for fast-path analysis performed to check hold time violations. We also show that in some cases, the interconnect corner may not lie at an extreme point of the process variation range. In this work, we use the Elmore delay metric to efficiently search for the correct interconnect corner of the victim stage considering delay noise. We then show experimental results to verify the effectiveness of our proposed approach and demonstrate that the traditional approaches of computing the interconnect corners could lead to errors of up to 60% on a net by net basis.
工艺引起的互连电容和电阻的变化导致了互连延迟的显著不确定性。在这项工作中,我们提出了一种考虑入侵源同时切换引起的耦合噪声的互连角点计算新方法。在先前的方法中,互连角的计算是在假设攻击网没有切换和没有耦合噪声注入到受害网的情况下进行的。在本文中,我们首先表明,在这些假设下获得的互连角实际上可能与真实互连角有很大不同,因此可能导致乐观延迟分析,特别是用于检查保持时间违规的快速路径分析。我们还表明,在某些情况下,互连角可能不在工艺变化范围的极端点上。在这项工作中,我们使用Elmore延迟度量在考虑延迟噪声的情况下有效地搜索受害级的正确互连角。然后,我们展示了实验结果来验证我们提出的方法的有效性,并证明传统的互连角点计算方法在逐净的基础上可能导致高达60%的误差。
{"title":"Interconnect performance corners considering crosstalk noise","authors":"Ravikishore Gandikota, D. Blaauw, D. Sylvester","doi":"10.1109/ICCD.2009.5413148","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413148","url":null,"abstract":"Process induced variations in the interconnect capacitance and resistance have resulted in significant uncertainly in the interconnect delay. In this work, we propose a new method to compute the interconnect corner considering coupling-noise due to simultaneous switching of aggressors. In prior approaches, the interconnect corners were computed under the assumption that the aggressor nets are not switching and no coupling-noise is injected on the victim net. In this paper, we first show that the interconnect corners obtained under such assumptions could in reality be much different from the true interconnect corner and could therefore result in optimistic delay analysis, particularly for fast-path analysis performed to check hold time violations. We also show that in some cases, the interconnect corner may not lie at an extreme point of the process variation range. In this work, we use the Elmore delay metric to efficiently search for the correct interconnect corner of the victim stage considering delay noise. We then show experimental results to verify the effectiveness of our proposed approach and demonstrate that the traditional approaches of computing the interconnect corners could lead to errors of up to 60% on a net by net basis.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114411318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Timing variation-aware high-level synthesis considering accurate yield computation 考虑精确良率计算的时序变化敏感高级综合
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413152
Jongyoon Jung, Taewhan Kim
This work proposes a new yield computation technique dedicated to HLS, which is an essential component in timing variation-aware HLS research field. The SSTAs used by the current timing variation-aware HLS techniques cannot support the following two critical factors at all: (i) non-Gaussian delay distribution of ‘module patterns’ used in scheduling and binding and (ii) correlation of timing variation between module patterns. However, without considering these factors, the synthesis results would be far less accurate in timing, being very likely to fail in timing closure. Even though there are advances in the logic level for SSTAs that support (i) and (ii), the manipulation and computation of (i) and (ii) in the course of scheduling and binding in HLS are unique in that there are no concepts of module sharing and performance yield computation in the logic level. Specifically, we propose a novel yield computation technique to handle the non-Gaussian timing variation of module patterns, where the sum and max operations are closed-form formulas and the timing correlation between modules used in computing performance yield is preserved to the first-order form. Experimental results show that our synthesis using the proposed yield computation technique reduces the latency by 24.1% and 28.8% under 95% and 90% performance yield constraints over that by the conventional HLS, respectively. Further, it is confirmed that our synthesis results are near optimal with less than 3.1% error on average.
本文提出了一种新的HLS产率计算方法,是时变敏感HLS研究领域的重要组成部分。当前感知时序变化的HLS技术所使用的ssta根本不能支持以下两个关键因素:(i)调度和绑定中使用的“模块模式”的非高斯延迟分布;(ii)模块模式之间时序变化的相关性。然而,如果不考虑这些因素,合成结果在计时上的准确性将大大降低,很可能在计时关闭中失败。尽管支持(i)和(ii)的ssta在逻辑层面上有所进步,但HLS在调度和绑定过程中对(i)和(ii)的操作和计算是独特的,因为在逻辑层面没有模块共享和性能良率计算的概念。具体而言,我们提出了一种新的产率计算技术来处理模块模式的非高斯时序变化,其中和和和最大运算是封闭形式的公式,并且用于计算性能产率的模块之间的时序相关性保持为一阶形式。实验结果表明,在95%和90%的性能良率约束下,我们的合成比传统的HLS分别减少了24.1%和28.8%的延迟。进一步证实,我们的合成结果接近最优,平均误差小于3.1%。
{"title":"Timing variation-aware high-level synthesis considering accurate yield computation","authors":"Jongyoon Jung, Taewhan Kim","doi":"10.1109/ICCD.2009.5413152","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413152","url":null,"abstract":"This work proposes a new yield computation technique dedicated to HLS, which is an essential component in timing variation-aware HLS research field. The SSTAs used by the current timing variation-aware HLS techniques cannot support the following two critical factors at all: (i) non-Gaussian delay distribution of ‘module patterns’ used in scheduling and binding and (ii) correlation of timing variation between module patterns. However, without considering these factors, the synthesis results would be far less accurate in timing, being very likely to fail in timing closure. Even though there are advances in the logic level for SSTAs that support (i) and (ii), the manipulation and computation of (i) and (ii) in the course of scheduling and binding in HLS are unique in that there are no concepts of module sharing and performance yield computation in the logic level. Specifically, we propose a novel yield computation technique to handle the non-Gaussian timing variation of module patterns, where the sum and max operations are closed-form formulas and the timing correlation between modules used in computing performance yield is preserved to the first-order form. Experimental results show that our synthesis using the proposed yield computation technique reduces the latency by 24.1% and 28.8% under 95% and 90% performance yield constraints over that by the conventional HLS, respectively. Further, it is confirmed that our synthesis results are near optimal with less than 3.1% error on average.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123949475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Iterative built-in testing and tuning of mixed-signal/RF systems 混合信号/射频系统的迭代内置测试和调谐
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413136
A. Chatterjee, Donghoon Han, Vishwanath Natarajan, S. Devarakond, Shreyas Sen, H. Choi, R. Senguttuvan, S. Bhattacharya, A. Goyal, Deuk Lee, M. Swaminathan
Design and test of high-speed mixed-signal/RF circuits and systems is undergoing a transformation due to the effects of process variations stemming from the use of scaled CMOS technologies that result in significant yield loss. To this effect, postmanufacture tuning for yield recovery is now a necessity for many high-speed electronic circuits and systems and is typically driven by iterative test-and-tune procedures. Such procedures create new challenges for manufacturing test and built-in self-test of advanced mixed-signal/RF systems. In this paper, key test challenges are discussed and promising solutions are presented in the hope that it will be possible to design, manufacture and test “truly self-healing” systems in the near future.
高速混合信号/射频电路和系统的设计和测试正在经历转型,这是由于使用缩放CMOS技术产生的工艺变化的影响,导致重大的产量损失。为此,现在许多高速电子电路和系统都需要在制造后进行调整以恢复良率,并且通常由迭代测试和调整程序驱动。这些程序为先进混合信号/射频系统的制造测试和内置自检带来了新的挑战。本文讨论了关键的测试挑战,并提出了有希望的解决方案,希望在不久的将来能够设计、制造和测试“真正的自我修复”系统。
{"title":"Iterative built-in testing and tuning of mixed-signal/RF systems","authors":"A. Chatterjee, Donghoon Han, Vishwanath Natarajan, S. Devarakond, Shreyas Sen, H. Choi, R. Senguttuvan, S. Bhattacharya, A. Goyal, Deuk Lee, M. Swaminathan","doi":"10.1109/ICCD.2009.5413136","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413136","url":null,"abstract":"Design and test of high-speed mixed-signal/RF circuits and systems is undergoing a transformation due to the effects of process variations stemming from the use of scaled CMOS technologies that result in significant yield loss. To this effect, postmanufacture tuning for yield recovery is now a necessity for many high-speed electronic circuits and systems and is typically driven by iterative test-and-tune procedures. Such procedures create new challenges for manufacturing test and built-in self-test of advanced mixed-signal/RF systems. In this paper, key test challenges are discussed and promising solutions are presented in the hope that it will be possible to design, manufacture and test “truly self-healing” systems in the near future.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121487548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Reincarnate historic systems on FPGA with novel design methodology 用新颖的设计方法在FPGA上再现历史系统
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413182
N. Shimizu
In this paper, I introduce my and my students projects to reincarnate historic systems on FPGA. Our projects are not replica nor paper-model of historic systems, but reorganized and working system on FPGA with novel and progressive design methodology. I mean progressive as under the development, because I have developed them and I am still improving the methodology and tools very often to use them by myself. In this paper, I also introduce my design methodology and tools which is used in my and my students projects.
在本文中,我介绍了我和我的学生在FPGA上再现历史系统的项目。我们的项目不是历史系统的复制或纸上模型,而是在FPGA上重组和工作的系统,采用新颖和先进的设计方法。我说的渐进式是指在开发过程中,因为我已经开发了它们,而且我还在不断改进方法和工具,以便自己使用它们。在本文中,我还介绍了我的设计方法和工具,这是在我和我的学生的项目中使用的。
{"title":"Reincarnate historic systems on FPGA with novel design methodology","authors":"N. Shimizu","doi":"10.1109/ICCD.2009.5413182","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413182","url":null,"abstract":"In this paper, I introduce my and my students projects to reincarnate historic systems on FPGA. Our projects are not replica nor paper-model of historic systems, but reorganized and working system on FPGA with novel and progressive design methodology. I mean progressive as under the development, because I have developed them and I am still improving the methodology and tools very often to use them by myself. In this paper, I also introduce my design methodology and tools which is used in my and my students projects.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123683594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reliable cache design with detection of gate oxide breakdown using BIST 采用BIST检测栅极氧化物击穿的可靠缓存设计
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413131
Fahad Ahmed, L. Milor
Scaling of device sizes has reduced gate oxide thickness to a few atomic layers, increasing the vulnerability of the gate oxide to breakdown. During breakdown, devices go through a gradual wearout process after an initial gate leakage increase leading to device failure. It is proposed that if wearout can be monitored, cache arrays with failing cells can be reliably operated after reconfiguration given available memory redundancy. Using experimentally verified gate oxide breakdown models, a detailed analysis of the effect of progressive gate oxide breakdown on the performance of a conventional 6T SRAM cell is presented for 45nm predictive technology. The DC margin trends (Read, Write and Retention) and access times (Read and Write) during wearout are analyzed, and a cell breakdown point due to degradation in each of these parameters is defined. A combination of these results is used to formulate a practical definition for the hard-breakdown point of a cell. Using an on-chip PVT (process, voltage, and temperature) tolerant monitoring scheme, it has been shown that gradual wearout in SRAM cells, due to gate oxide breakdown, is detectible, and cell failure can be predicted before its occurrence.
器件尺寸的缩放使栅极氧化物的厚度减少到几个原子层,增加了栅极氧化物击穿的脆弱性。在击穿过程中,器件在初始栅极泄漏增加后,经历一个逐渐的磨损过程,导致器件失效。在给定可用内存冗余的情况下,如果能够监测到损耗,则具有失效单元的缓存阵列可以在重构后可靠地运行。利用实验验证的栅极氧化物击穿模型,详细分析了45纳米预测技术下栅极氧化物递进击穿对传统6T SRAM电池性能的影响。分析了磨损期间的DC余量趋势(Read, Write和Retention)和访问时间(Read和Write),并定义了由于这些参数的退化而导致的电池击穿点。这些结果的组合被用来制定一个实用的定义为一个电池的硬击穿点。使用片上PVT(过程,电压和温度)耐受监测方案,已经证明SRAM单元中由于栅极氧化物击穿而逐渐磨损是可检测的,并且可以在其发生之前预测单元失效。
{"title":"Reliable cache design with detection of gate oxide breakdown using BIST","authors":"Fahad Ahmed, L. Milor","doi":"10.1109/ICCD.2009.5413131","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413131","url":null,"abstract":"Scaling of device sizes has reduced gate oxide thickness to a few atomic layers, increasing the vulnerability of the gate oxide to breakdown. During breakdown, devices go through a gradual wearout process after an initial gate leakage increase leading to device failure. It is proposed that if wearout can be monitored, cache arrays with failing cells can be reliably operated after reconfiguration given available memory redundancy. Using experimentally verified gate oxide breakdown models, a detailed analysis of the effect of progressive gate oxide breakdown on the performance of a conventional 6T SRAM cell is presented for 45nm predictive technology. The DC margin trends (Read, Write and Retention) and access times (Read and Write) during wearout are analyzed, and a cell breakdown point due to degradation in each of these parameters is defined. A combination of these results is used to formulate a practical definition for the hard-breakdown point of a cell. Using an on-chip PVT (process, voltage, and temperature) tolerant monitoring scheme, it has been shown that gradual wearout in SRAM cells, due to gate oxide breakdown, is detectible, and cell failure can be predicted before its occurrence.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"67 23","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121001683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Reducing register file size through instruction pre-execution enhanced by value prediction 通过值预测增强的指令预执行减少寄存器文件大小
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413149
Yusuke Tanaka, H. Ando
Two-step physical register deallocation (TSD) is an architectural scheme, which enhances memory-level parallelism (MLP) by pre-executing instructions. Ideally, the TSD allows MLP under the unlimited number of physical registers to be exploited, and consequently only a small register file is necessary for MLP. In practice, however, the amount of MLP exploitable is limited, because there are cases where pre-execution is not performed or timing of pre-execution is delayed. This is caused by data dependencies among the pre-executed instructions. This paper proposes the use of value prediction to solve these problems. Our way of the value prediction usage has the advantage over the conventional way of the usage for enhancing ILP, that there is no need to recover from misspeculation. Our evaluation results using SPECfp2000 benchmark show that our scheme can achieve equivalent performance to that of the previous TSD scheme without value prediction, with 75% of the register file size.
两步物理寄存器释放(TSD)是一种通过预执行指令来增强内存级并行性(MLP)的体系结构方案。理想情况下,TSD允许在无限数量的物理寄存器下使用MLP,因此MLP只需要一个小的寄存器文件。然而,在实践中,可利用的MLP数量是有限的,因为在某些情况下不执行预执行或延迟预执行的时间。这是由于预执行指令之间的数据依赖关系造成的。本文提出利用价值预测来解决这些问题。我们的价值预测使用方法与传统的提高ILP的使用方法相比,具有不需要从错误猜测中恢复的优点。我们使用SPECfp2000基准测试的评估结果表明,我们的方案在没有值预测的情况下可以达到与以前的TSD方案相当的性能,寄存器文件大小为75%。
{"title":"Reducing register file size through instruction pre-execution enhanced by value prediction","authors":"Yusuke Tanaka, H. Ando","doi":"10.1109/ICCD.2009.5413149","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413149","url":null,"abstract":"Two-step physical register deallocation (TSD) is an architectural scheme, which enhances memory-level parallelism (MLP) by pre-executing instructions. Ideally, the TSD allows MLP under the unlimited number of physical registers to be exploited, and consequently only a small register file is necessary for MLP. In practice, however, the amount of MLP exploitable is limited, because there are cases where pre-execution is not performed or timing of pre-execution is delayed. This is caused by data dependencies among the pre-executed instructions. This paper proposes the use of value prediction to solve these problems. Our way of the value prediction usage has the advantage over the conventional way of the usage for enhancing ILP, that there is no need to recover from misspeculation. Our evaluation results using SPECfp2000 benchmark show that our scheme can achieve equivalent performance to that of the previous TSD scheme without value prediction, with 75% of the register file size.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121567361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
A robust pulsed flip-flop and its use in enhanced scan design 一种鲁棒脉冲触发器及其在增强扫描设计中的应用
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413168
R. Kumar, Kalyana C. Bollapalli, Rajesh Garg, Tarun Soni, S. Khatri
Delay faults are frequently encountered in nanometer technologies. Therefore, it is critical to detect these faults during factory test. Testing for a delay fault requires the application of a pair of test vectors in an at-speed manner. To maximize the delay fault detection capability, it is desired that the vectors in this pair are independent. Independent vector pairs cannot always be applied to a circuit implemented with standard scan design approaches. However, this can be achieved by using enhanced scan flip-flops, which store two bits of data. This paper has two contributions. First, we develop a pulsed flip-flop (PFF) design. Second, we present an enhanced scan flipflop design, based on our PFF circuit. We have compared the performance of our pulse based flip-flop with recently published pulse based flip-flop designs, as well as a traditional master-slave D flip-flop. Our PFF shows significant improvements in power and timing compared to the other designs. Our pulse based enhanced scan flip-flop (PESFF) has 13% lower power dissipation and 26% better timing than a conventional D flipflop based enhanced scan flip-flop (DESFF). The layout area of our PESFF is 5.2% smaller than the DESFF. Monte Carlo simulations demonstrate that our design is more robust to process variations than the DESFF.
延迟故障是纳米技术中经常遇到的问题。因此,在工厂测试中检测这些故障是至关重要的。延迟故障的测试需要以高速的方式应用一对测试向量。为了使延迟故障检测能力最大化,要求这对向量是独立的。独立向量对并不总是适用于用标准扫描设计方法实现的电路。然而,这可以通过使用增强扫描触发器来实现,它可以存储两位数据。这篇论文有两个贡献。首先,我们开发了一个脉冲触发器(PFF)设计。其次,我们提出了一个增强扫描触发器设计,基于我们的PFF电路。我们将我们的脉冲触发器与最近发表的脉冲触发器设计以及传统的主从D触发器的性能进行了比较。与其他设计相比,我们的PFF在功率和时序方面有显着改进。我们基于脉冲的增强扫描触发器(PESFF)比传统的基于D触发器的增强扫描触发器(DESFF)功耗低13%,时序好26%。我们的PESFF的布局面积比DESFF小5.2%。蒙特卡罗模拟表明,我们的设计比DESFF对过程变化的鲁棒性更强。
{"title":"A robust pulsed flip-flop and its use in enhanced scan design","authors":"R. Kumar, Kalyana C. Bollapalli, Rajesh Garg, Tarun Soni, S. Khatri","doi":"10.1109/ICCD.2009.5413168","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413168","url":null,"abstract":"Delay faults are frequently encountered in nanometer technologies. Therefore, it is critical to detect these faults during factory test. Testing for a delay fault requires the application of a pair of test vectors in an at-speed manner. To maximize the delay fault detection capability, it is desired that the vectors in this pair are independent. Independent vector pairs cannot always be applied to a circuit implemented with standard scan design approaches. However, this can be achieved by using enhanced scan flip-flops, which store two bits of data. This paper has two contributions. First, we develop a pulsed flip-flop (PFF) design. Second, we present an enhanced scan flipflop design, based on our PFF circuit. We have compared the performance of our pulse based flip-flop with recently published pulse based flip-flop designs, as well as a traditional master-slave D flip-flop. Our PFF shows significant improvements in power and timing compared to the other designs. Our pulse based enhanced scan flip-flop (PESFF) has 13% lower power dissipation and 26% better timing than a conventional D flipflop based enhanced scan flip-flop (DESFF). The layout area of our PESFF is 5.2% smaller than the DESFF. Monte Carlo simulations demonstrate that our design is more robust to process variations than the DESFF.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126039076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Computer-aided design for microfluidic chips based on multilayer soft lithography 基于多层软光刻技术的微流控芯片计算机辅助设计
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413185
Nada Amin, W. Thies, Saman P. Amarasinghe
Microfluidic chips are emerging as a powerful platform for automating biology experiments. As it becomes possible to integrate tens of thousands of components on a single chip, researchers will require design automation tools to push the scale and complexity of their designs to match the capabilities of the substrate. However, to date such tools have focused only on droplet-based devices, leaving out the popular class of chips that are based on multilayer soft lithography. In this paper, we develop design automation techniques for microfluidic chips based on multilayer soft lithography. We focus our attention on the control layer, which is driven by pressure actuators to invoke the desired flows on chip. We present a language in which designers can specify the Instruction Set Architecture (ISA) of a microfluidic device. Given an ISA, we automatically infer the locations of valves needed to implement the ISA. We also present novel algorithms for minimizing the number of control lines needed to drive the valves, as well as for routing valves to control ports while admitting sharing between the control lines. To the microfluidic community, we offer a free computer-aided design tool, Micado, which implements a subset of our algorithms as a practical plug-in to AutoCAD. Micado is being used successfully by microfluidic designers. We demonstrate its performance on three realistic chips.
微流控芯片正在成为生物实验自动化的强大平台。随着在单个芯片上集成数万个组件成为可能,研究人员将需要设计自动化工具来推动其设计的规模和复杂性,以匹配基板的能力。然而,到目前为止,这些工具只专注于基于液滴的设备,而忽略了基于多层软光刻技术的流行芯片。在本文中,我们开发了基于多层软光刻技术的微流控芯片设计自动化技术。我们将注意力集中在控制层上,控制层由压力执行器驱动,以调用芯片上所需的流量。我们提出了一种语言,其中设计人员可以指定的指令集架构(ISA)的微流控装置。给定一个ISA,我们自动推断实现ISA所需的阀门的位置。我们还提出了新颖的算法,以最大限度地减少驱动阀门所需的控制线数量,以及在允许控制线之间共享的情况下将阀门路由到控制端口。对于微流体社区,我们提供了一个免费的计算机辅助设计工具Micado,它实现了我们的算法的一个子集,作为AutoCAD的实用插件。Micado正在被微流体设计师成功地使用。我们在三个实际芯片上演示了它的性能。
{"title":"Computer-aided design for microfluidic chips based on multilayer soft lithography","authors":"Nada Amin, W. Thies, Saman P. Amarasinghe","doi":"10.1109/ICCD.2009.5413185","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413185","url":null,"abstract":"Microfluidic chips are emerging as a powerful platform for automating biology experiments. As it becomes possible to integrate tens of thousands of components on a single chip, researchers will require design automation tools to push the scale and complexity of their designs to match the capabilities of the substrate. However, to date such tools have focused only on droplet-based devices, leaving out the popular class of chips that are based on multilayer soft lithography. In this paper, we develop design automation techniques for microfluidic chips based on multilayer soft lithography. We focus our attention on the control layer, which is driven by pressure actuators to invoke the desired flows on chip. We present a language in which designers can specify the Instruction Set Architecture (ISA) of a microfluidic device. Given an ISA, we automatically infer the locations of valves needed to implement the ISA. We also present novel algorithms for minimizing the number of control lines needed to drive the valves, as well as for routing valves to control ports while admitting sharing between the control lines. To the microfluidic community, we offer a free computer-aided design tool, Micado, which implements a subset of our algorithms as a practical plug-in to AutoCAD. Micado is being used successfully by microfluidic designers. We demonstrate its performance on three realistic chips.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128069381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 60
Adaptive online testing for efficient hard fault detection 自适应在线测试,有效检测硬故障
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413132
S. Gupta, Amin Ansari, Shuguang Feng, S. Mahlke
With growing semiconductor integration, the reliability of individual transistors is expected to rapidly decline in future technology generations. In such a scenario, processors would need to be equipped with fault tolerance mechanisms to tolerate in-field silicon defects. Periodic online testing is a popular technique to detect such failures; however, it tends to impose a heavy testing penalty. In this paper, we propose an adaptive online testing framework to significantly reduce the testing overhead. The proposed approach is unique in its ability to assess the hardware health and apply suitably detailed tests. Thus, a significant chunk of the testing time can be saved for the healthy components. We further extend the framework to work with the StageNet CMP fabric, which provides the flexibility to group together pipeline stages with similar health conditions, thereby reducing the overall testing burden. For a modest 2.6% sensor area overhead, the proposed scheme was able to achieve an 80% reduction in software test instructions over the lifetime of a 16-core CMP.
随着半导体集成度的不断提高,预计在未来的技术世代中,单个晶体管的可靠性将迅速下降。在这种情况下,处理器需要配备容错机制来容忍现场硅缺陷。定期在线测试是检测此类故障的一种流行技术;然而,它倾向于施加沉重的测试惩罚。在本文中,我们提出了一个自适应在线测试框架,以显着降低测试开销。该方法的独特之处在于能够评估硬件运行状况并应用适当详细的测试。因此,可以为健康组件节省大量的测试时间。我们进一步扩展了该框架,使其与StageNet CMP结构一起工作,该结构提供了将具有相似健康状况的管道阶段组合在一起的灵活性,从而减少了总体测试负担。对于适度的2.6%传感器面积开销,所提出的方案能够在16核CMP的生命周期内实现软件测试指令减少80%。
{"title":"Adaptive online testing for efficient hard fault detection","authors":"S. Gupta, Amin Ansari, Shuguang Feng, S. Mahlke","doi":"10.1109/ICCD.2009.5413132","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413132","url":null,"abstract":"With growing semiconductor integration, the reliability of individual transistors is expected to rapidly decline in future technology generations. In such a scenario, processors would need to be equipped with fault tolerance mechanisms to tolerate in-field silicon defects. Periodic online testing is a popular technique to detect such failures; however, it tends to impose a heavy testing penalty. In this paper, we propose an adaptive online testing framework to significantly reduce the testing overhead. The proposed approach is unique in its ability to assess the hardware health and apply suitably detailed tests. Thus, a significant chunk of the testing time can be saved for the healthy components. We further extend the framework to work with the StageNet CMP fabric, which provides the flexibility to group together pipeline stages with similar health conditions, thereby reducing the overall testing burden. For a modest 2.6% sensor area overhead, the proposed scheme was able to achieve an 80% reduction in software test instructions over the lifetime of a 16-core CMP.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130643450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Transaction-based debugging of system-on-chips with patterns 带模式的片上系统的基于事务的调试
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413157
A. M. Gharehbaghi, M. Fujita
This paper presents a debug method for system communications in post-silicon verification. First, we extract transaction sequences at run-time using on-chip circuits and store them in a trace buffer. Then, we read the stored transactions and analyze them with software. The analysis software tries to find certain patterns in the extracted transactions that are defined by our transaction debug pattern specification language (TDPSL). We have also defined a number of standard patterns for common communication problems such as race and deadlock in TDPSL. To show the feasibility of the method, it is applied to a number of on chip buses. It is shown that the area overhead of the method is very low. Also we have implemented the analysis software and shown that it is memory efficient, scalable and effective to find bugs. The proposed method can also be applied to fault analysis including transient faults.
本文提出了一种后硅验证系统通信的调试方法。首先,我们在运行时使用片上电路提取事务序列,并将它们存储在跟踪缓冲区中。然后,我们读取存储的事务并使用软件进行分析。分析软件试图在提取的事务中找到由我们的事务调试模式规范语言(TDPSL)定义的某些模式。我们还为TDPSL中的竞争和死锁等常见通信问题定义了许多标准模式。为了证明该方法的可行性,将其应用于多个片上总线。结果表明,该方法的面积开销非常低。此外,我们已经实现了分析软件,并表明它是内存高效,可扩展和有效的发现漏洞。该方法也可用于包括暂态故障在内的故障分析。
{"title":"Transaction-based debugging of system-on-chips with patterns","authors":"A. M. Gharehbaghi, M. Fujita","doi":"10.1109/ICCD.2009.5413157","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413157","url":null,"abstract":"This paper presents a debug method for system communications in post-silicon verification. First, we extract transaction sequences at run-time using on-chip circuits and store them in a trace buffer. Then, we read the stored transactions and analyze them with software. The analysis software tries to find certain patterns in the extracted transactions that are defined by our transaction debug pattern specification language (TDPSL). We have also defined a number of standard patterns for common communication problems such as race and deadlock in TDPSL. To show the feasibility of the method, it is applied to a number of on chip buses. It is shown that the area overhead of the method is very low. Also we have implemented the analysis software and shown that it is memory efficient, scalable and effective to find bugs. The proposed method can also be applied to fault analysis including transient faults.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130659058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
期刊
2009 IEEE International Conference on Computer Design
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1