首页 > 最新文献

2009 IEEE International Conference on Computer Design最新文献

英文 中文
Test-wrapper optimization for embedded cores in TSV-based three-dimensional SOCs 基于tsv的三维soc中嵌入式内核的测试包装优化
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413172
Brandon Noia, K. Chakrabarty, Yuan Xie
System-on-chip (SOC) designs comprised of a number of embedded cores are widespread in today's integrated circuits. Embedded core-based design is likely to be equally popular for three-dimensional integrated circuits (3D ICs), the manufacture of which has become feasible in recent years. 3D integration offers a number of advantages over traditional two-dimensional (2D) technologies, such as the reduction in the average interconnect length, higher performance, lower interconnect power consumption, and smaller IC footprint. Despite recent advances in 3D fabrication and design methods, no attempt has been made thus far to design a 1500-style test wrapper for an embedded core that spans multiple layers in a 3D SOC. This paper addresses wrapper optimization in 3D ICs based on through-silicon vias (TSVs) for vertical interconnects. Our objective is to minimize the scan-test time for a core under constraints on the total number of TSVs available for testing. We present two polynomial-time heuristic solutions. Simulation results are presented for embedded cores from the ITC 2002 SOC test benchmarks.
由许多嵌入式内核组成的系统芯片(SOC)设计在当今的集成电路中非常普遍。基于嵌入式核心的设计可能同样流行于三维集成电路(3D ic),其制造在最近几年变得可行。与传统的二维(2D)技术相比,3D集成提供了许多优势,例如缩短平均互连长度、提高性能、降低互连功耗和减小IC占地面积。尽管最近在3D制造和设计方法方面取得了进展,但迄今为止还没有尝试为3D SOC中跨越多层的嵌入式核心设计1500风格的测试封装器。本文讨论了垂直互连中基于硅通孔(tsv)的3D集成电路封装优化。我们的目标是在可用于测试的tsv总数的限制下,最大限度地减少堆芯的扫描测试时间。我们提出了两个多项式时间启发式解。从ITC 2002 SOC测试基准中给出了嵌入式内核的仿真结果。
{"title":"Test-wrapper optimization for embedded cores in TSV-based three-dimensional SOCs","authors":"Brandon Noia, K. Chakrabarty, Yuan Xie","doi":"10.1109/ICCD.2009.5413172","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413172","url":null,"abstract":"System-on-chip (SOC) designs comprised of a number of embedded cores are widespread in today's integrated circuits. Embedded core-based design is likely to be equally popular for three-dimensional integrated circuits (3D ICs), the manufacture of which has become feasible in recent years. 3D integration offers a number of advantages over traditional two-dimensional (2D) technologies, such as the reduction in the average interconnect length, higher performance, lower interconnect power consumption, and smaller IC footprint. Despite recent advances in 3D fabrication and design methods, no attempt has been made thus far to design a 1500-style test wrapper for an embedded core that spans multiple layers in a 3D SOC. This paper addresses wrapper optimization in 3D ICs based on through-silicon vias (TSVs) for vertical interconnects. Our objective is to minimize the scan-test time for a core under constraints on the total number of TSVs available for testing. We present two polynomial-time heuristic solutions. Simulation results are presented for embedded cores from the ITC 2002 SOC test benchmarks.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116665775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 50
Performance analysis of decimal floating-point libraries and its impact on decimal hardware and software solutions 十进制浮点库的性能分析及其对十进制硬件和软件解决方案的影响
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413114
Michael J. Anderson, C. Tsen, Liang-Kai Wang, Katherine Compton, M. Schulte
The IEEE Standards Committee recently approved the IEEE 754–2008 Standard for Floating-point Arithmetic, which includes specifications for decimal floating-point (DFP) arithmetic. A growing number of DFP solutions have emerged, and developers now have many DFP design choices including arbitrary or fixed precision, binary or decimal significand encodings, 64-bit or 128-bit DFP operands, and software or hardware implementations. There is a need for accurate analysis of these solutions on representative DFP benchmarks. In this paper, we expand previous DFP benchmark and performance analysis research. We employ a DFP benchmark suite that currently supports several DFP solutions and is easily extendable. We also present performance analysis that (1) provides execution profiles for various DFP encodings and types, (2) gives the average number cycles for common DFP operations and the total number of each DFP operation in each benchmark, and (3) highlights the tradeoffs between using 64-bit and 128-bit DFP operands for both binary and decimal significand encodings. This analysis can help guide the design of future DFP hardware and software solutions.
IEEE标准委员会最近批准了IEEE 754-2008浮点算术标准,其中包括十进制浮点(DFP)算术的规范。越来越多的DFP解决方案已经出现,开发人员现在有许多DFP设计选择,包括任意或固定精度、二进制或十进制有效编码、64位或128位DFP操作数以及软件或硬件实现。有必要在代表性DFP基准上对这些解决方案进行准确的分析。在本文中,我们扩展了以往的DFP基准和性能分析研究。我们使用了一个DFP基准套件,该套件目前支持多个DFP解决方案,并且易于扩展。我们还提供了性能分析:(1)提供了各种DFP编码和类型的执行概况,(2)给出了常见DFP操作的平均数字周期和每个基准中每个DFP操作的总数,以及(3)强调了使用64位和128位DFP操作数进行二进制和十进制有效编码之间的权衡。这种分析可以帮助指导未来DFP硬件和软件解决方案的设计。
{"title":"Performance analysis of decimal floating-point libraries and its impact on decimal hardware and software solutions","authors":"Michael J. Anderson, C. Tsen, Liang-Kai Wang, Katherine Compton, M. Schulte","doi":"10.1109/ICCD.2009.5413114","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413114","url":null,"abstract":"The IEEE Standards Committee recently approved the IEEE 754–2008 Standard for Floating-point Arithmetic, which includes specifications for decimal floating-point (DFP) arithmetic. A growing number of DFP solutions have emerged, and developers now have many DFP design choices including arbitrary or fixed precision, binary or decimal significand encodings, 64-bit or 128-bit DFP operands, and software or hardware implementations. There is a need for accurate analysis of these solutions on representative DFP benchmarks. In this paper, we expand previous DFP benchmark and performance analysis research. We employ a DFP benchmark suite that currently supports several DFP solutions and is easily extendable. We also present performance analysis that (1) provides execution profiles for various DFP encodings and types, (2) gives the average number cycles for common DFP operations and the total number of each DFP operation in each benchmark, and (3) highlights the tradeoffs between using 64-bit and 128-bit DFP operands for both binary and decimal significand encodings. This analysis can help guide the design of future DFP hardware and software solutions.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117190484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
The salvage cache: A fault-tolerant cache architecture for next-generation memory technologies 回收缓存:用于下一代内存技术的容错缓存架构
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413145
Cheng-Kok Koh, W. Wong, Yiran Chen, Hai Helen Li
There has been much work on the next generation of memory technologies such as MRAM, RRAM and PRAM. Most of these are non-volatile in nature, and compared to SRAM, they are often denser, just as fast, and have much lower energy consumption. Using 3-D stacking technology, it has been proposed that they can be used instead of SRAM in large level 2 caches prevalent in today's microprocessors. However, one of the key challenges in the use of these technologies, such as MRAM, is their higher fault probabilities arising from the larger process variation, defects in its fabrication, and the fact that the cache is much larger. This seriously affect yield. In this paper, we propose a fault resilient set associative cache architecture which we called the salvage cache. In the salvage cache, a faulty cache block is sacrificed and used to repair faults found in other blocks. We will describe in detail the architecture of the salvage cache as well as provide results of yield simulations that show that a much higher yield can be achieved viz-a-viz other fault tolerant techniques. We will also show the performance savings that arise from the use of a large next-generation L2 cache.
在MRAM、RRAM和PRAM等下一代存储技术方面已经有了很多工作。它们中的大多数本质上是非易失性的,与SRAM相比,它们通常密度更大,速度一样快,能耗更低。利用3-D堆叠技术,有人提出它们可以在当今微处理器中流行的大型2级缓存中取代SRAM。然而,使用这些技术(如MRAM)的关键挑战之一是,由于较大的工艺变化、制造缺陷以及缓存更大,它们的故障概率更高。这严重影响了产量。本文提出了一种故障弹性集关联缓存体系结构,我们称之为打捞缓存。在回收缓存中,一个有故障的缓存块被牺牲,用来修复在其他块中发现的故障。我们将详细描述打捞缓存的体系结构,并提供产量模拟的结果,这些结果表明,与其他容错技术相比,可以实现更高的产量。我们还将展示使用大型下一代L2缓存所带来的性能节省。
{"title":"The salvage cache: A fault-tolerant cache architecture for next-generation memory technologies","authors":"Cheng-Kok Koh, W. Wong, Yiran Chen, Hai Helen Li","doi":"10.1109/ICCD.2009.5413145","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413145","url":null,"abstract":"There has been much work on the next generation of memory technologies such as MRAM, RRAM and PRAM. Most of these are non-volatile in nature, and compared to SRAM, they are often denser, just as fast, and have much lower energy consumption. Using 3-D stacking technology, it has been proposed that they can be used instead of SRAM in large level 2 caches prevalent in today's microprocessors. However, one of the key challenges in the use of these technologies, such as MRAM, is their higher fault probabilities arising from the larger process variation, defects in its fabrication, and the fact that the cache is much larger. This seriously affect yield. In this paper, we propose a fault resilient set associative cache architecture which we called the salvage cache. In the salvage cache, a faulty cache block is sacrificed and used to repair faults found in other blocks. We will describe in detail the architecture of the salvage cache as well as provide results of yield simulations that show that a much higher yield can be achieved viz-a-viz other fault tolerant techniques. We will also show the performance savings that arise from the use of a large next-generation L2 cache.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115560292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
A Technology-Agnostic Simulation Environment (TASE) for iterative custom IC design across processes 一个技术不可知的仿真环境(TASE)迭代定制集成电路设计的过程
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413107
Satyanand Nalam, M. Bhargava, Kyle Ringgenberg, K. Mai, B. Calhoun
A designer's intent and knowledge about the critical issues and trade-offs underlying a custom circuit design are implicit in the simulations she sets up for design creation and verification. However, this knowledge is tightly conjoined with technology-specific features and decoupled from the final schematic in traditional design flows. As a result, this knowledge is easily lost when the technology specifics change. This paper presents a Technology Agnostic Simulation Environment (TASE), which is a tool that uses simulation templates to capture the designer's knowledge and separate it from the technology-specific components of a simulation. TASE also allows the designer to form groups of related simulations and port them as a unit to a new technology. This allows an actual design schematic to remain tied to the analyses that illuminate the underlying trade-offs and design issues, unlike the case where schematics are ported alone. Giving the designer immediate access to the trade-offs, which are likely to change in new technologies, accelerates the re-design that often must accompany porting of complicated custom circuits. We demonstrate the usefulness of TASE by investigating Read and Write noise margins for a 6T SRAM in predictive technologies down to 16 nm.
设计师对定制电路设计的关键问题和权衡的意图和知识隐含在她为设计创建和验证设置的模拟中。然而,这些知识与特定于技术的特性紧密结合,并与传统设计流程中的最终原理图分离。因此,当技术细节发生变化时,这些知识很容易丢失。本文提出了一个与技术无关的仿真环境(TASE),它是一个工具,使用仿真模板来捕获设计人员的知识,并将其与仿真的特定技术组件分离。TASE还允许设计者形成相关的模拟组,并将它们作为一个单元移植到新技术上。这允许实际的设计原理图与阐明潜在权衡和设计问题的分析保持联系,而不像单独移植原理图的情况。让设计人员能够立即获得可能在新技术中发生变化的权衡,从而加速了通常伴随复杂定制电路移植而必须进行的重新设计。通过研究预测技术中6T SRAM的读写噪声裕度,我们证明了TASE的实用性。
{"title":"A Technology-Agnostic Simulation Environment (TASE) for iterative custom IC design across processes","authors":"Satyanand Nalam, M. Bhargava, Kyle Ringgenberg, K. Mai, B. Calhoun","doi":"10.1109/ICCD.2009.5413107","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413107","url":null,"abstract":"A designer's intent and knowledge about the critical issues and trade-offs underlying a custom circuit design are implicit in the simulations she sets up for design creation and verification. However, this knowledge is tightly conjoined with technology-specific features and decoupled from the final schematic in traditional design flows. As a result, this knowledge is easily lost when the technology specifics change. This paper presents a Technology Agnostic Simulation Environment (TASE), which is a tool that uses simulation templates to capture the designer's knowledge and separate it from the technology-specific components of a simulation. TASE also allows the designer to form groups of related simulations and port them as a unit to a new technology. This allows an actual design schematic to remain tied to the analyses that illuminate the underlying trade-offs and design issues, unlike the case where schematics are ported alone. Giving the designer immediate access to the trade-offs, which are likely to change in new technologies, accelerates the re-design that often must accompany porting of complicated custom circuits. We demonstrate the usefulness of TASE by investigating Read and Write noise margins for a 6T SRAM in predictive technologies down to 16 nm.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114180877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Low-overhead error detection for Networks-on-Chip 片上网络的低开销错误检测
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413150
A. Berman, I. Keidar
In the current deep sub-micron age, interconnect reliability is a subject of major concern, and is crucial for a successful product. Coding is a widely-used method to achieve communication reliability, which can be very useful in a Network-on-Chip (NoC). A key challenge for NoC error detection is to provide a defined detection level, while minimizing the number of redundant parity bits, using small encoder and decoder circuits, and ensuring shortest path routing. We present Parity Routing (PaR), a novel method to reduce the number of redundant bits transmitted. PaR exploits NoC path diversity to reduce the number of redundant parity bits. Our analysis shows that, for example, on a 4×4 NoC with a demand of one parity bit, PaR reduces the redundant information transmitted by 75%, and the savings increase asymptotically to 100% with the size of the NoC. In addition, we show that PaR can yield power savings due to the reduced number of bit transmissions and simple decoding process. Furthermore, PaR utilizes low complexity, small-area circuits.
在当前的深亚微米时代,互连可靠性是一个主要关注的主题,对一个成功的产品至关重要。编码是实现通信可靠性的一种广泛使用的方法,它在片上网络(NoC)中非常有用。NoC错误检测的一个关键挑战是提供一个定义的检测级别,同时尽量减少冗余奇偶位的数量,使用较小的编码器和解码器电路,并确保最短路径路由。提出了奇偶校验路由(PaR),这是一种减少传输冗余比特数的新方法。PaR利用NoC路径分集来减少冗余奇偶校验位的数量。我们的分析表明,例如,在需要一个奇偶校验位的4×4 NoC上,PaR将传输的冗余信息减少了75%,并且随着NoC的大小,节省量渐近增加到100%。此外,我们还证明了PaR可以节省功率,因为它减少了比特传输的数量和简单的解码过程。此外,PaR利用低复杂度、小面积电路。
{"title":"Low-overhead error detection for Networks-on-Chip","authors":"A. Berman, I. Keidar","doi":"10.1109/ICCD.2009.5413150","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413150","url":null,"abstract":"In the current deep sub-micron age, interconnect reliability is a subject of major concern, and is crucial for a successful product. Coding is a widely-used method to achieve communication reliability, which can be very useful in a Network-on-Chip (NoC). A key challenge for NoC error detection is to provide a defined detection level, while minimizing the number of redundant parity bits, using small encoder and decoder circuits, and ensuring shortest path routing. We present Parity Routing (PaR), a novel method to reduce the number of redundant bits transmitted. PaR exploits NoC path diversity to reduce the number of redundant parity bits. Our analysis shows that, for example, on a 4×4 NoC with a demand of one parity bit, PaR reduces the redundant information transmitted by 75%, and the savings increase asymptotically to 100% with the size of the NoC. In addition, we show that PaR can yield power savings due to the reduced number of bit transmissions and simple decoding process. Furthermore, PaR utilizes low complexity, small-area circuits.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114259753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
3D stacked power distribution considering substrate coupling 考虑衬底耦合的三维叠加功率分布
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413151
A. S. Arani, Xiang Hu, Wanping Zhang, Chung-Kuan Cheng, A. Engin, Xiaoming Chen, M. Popovich
Reliable design of power distribution network for stacked integrated circuits introduces new challenges i.e., substrate coupling among through silicon vias (TSVs) and tiers grid in addition to reliability issues such as electromigration and thermo-mechanical stress, compared to conventional System on Chip (SoC). In this paper a comprehensive modeling of the TSV and stacked power grid with frequency dependent parasitic is proposed. The analytical model considers the impact of the substrate coupling between the TSVs and layers grid. A frequency domain based analysis flow is introduced to incorporate frequency dependent parasitics. The design of a reliable power distribution network is formulated as an optimization problem to minimize power noise under reliability and electro-migration constraints. Experimental results demonstrate the efficacy of the problem formulation and solution technique.
与传统的片上系统(SoC)相比,用于堆叠集成电路的配电网络的可靠设计引入了新的挑战,即,除了电迁移和热机械应力等可靠性问题外,还包括硅通孔(tsv)和分层电网之间的衬底耦合。本文提出了一种具有频率依赖寄生的TSV和堆叠电网的综合建模方法。分析模型考虑了tsv与层网格之间衬底耦合的影响。引入了一种基于频域的分析流程来考虑频率相关的寄生效应。可靠配电网的设计是在可靠性和电迁移约束下最小化功率噪声的优化问题。实验结果证明了该方法的有效性。
{"title":"3D stacked power distribution considering substrate coupling","authors":"A. S. Arani, Xiang Hu, Wanping Zhang, Chung-Kuan Cheng, A. Engin, Xiaoming Chen, M. Popovich","doi":"10.1109/ICCD.2009.5413151","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413151","url":null,"abstract":"Reliable design of power distribution network for stacked integrated circuits introduces new challenges i.e., substrate coupling among through silicon vias (TSVs) and tiers grid in addition to reliability issues such as electromigration and thermo-mechanical stress, compared to conventional System on Chip (SoC). In this paper a comprehensive modeling of the TSV and stacked power grid with frequency dependent parasitic is proposed. The analytical model considers the impact of the substrate coupling between the TSVs and layers grid. A frequency domain based analysis flow is introduced to incorporate frequency dependent parasitics. The design of a reliable power distribution network is formulated as an optimization problem to minimize power noise under reliability and electro-migration constraints. Experimental results demonstrate the efficacy of the problem formulation and solution technique.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117083793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Framework for massively parallel testing at wafer and package test 用于晶圆和封装测试的大规模并行测试框架
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413134
A. H. Baba, Kee Sup Kim
A novel DFT approach is introduced that enables massively parallel testing of logic devices at both wafer and package test. Parallelism is achieved by utilizing interconnection networks that are built onto a wafer probe or a tester interface unit. The financial benefits of this method in a realistic setting are also presented.
介绍了一种新的DFT方法,可以在晶圆和封装测试中对逻辑器件进行大规模并行测试。并行性是通过利用建立在晶圆探头或测试接口单元上的互连网络来实现的。本文还介绍了这种方法在实际应用中的经济效益。
{"title":"Framework for massively parallel testing at wafer and package test","authors":"A. H. Baba, Kee Sup Kim","doi":"10.1109/ICCD.2009.5413134","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413134","url":null,"abstract":"A novel DFT approach is introduced that enables massively parallel testing of logic devices at both wafer and package test. Parallelism is achieved by utilizing interconnection networks that are built onto a wafer probe or a tester interface unit. The financial benefits of this method in a realistic setting are also presented.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124585975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Fault-tolerant synthesis using non-uniform redundancy 使用非均匀冗余的容错综合
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413153
Keven L. Woo, Matthew R. Guthaus
As process technologies continue to scale into the nanometer regime, devices are becoming significantly more unreliable. Many forms of unreliability manifest as transient faults and can cause intermittent random logic upsets. These logic upsets are often caused by natural radiation (neutrons and alpha particles) or on-chip noise (cross-coupling, supply drop, or flicker noise). This research improves reliability by using non-uniform redundancy. Specifically, we present a dynamic programming algorithm that considers many possible topological redundancies, yet maintains a linear run-time due to efficient pruning of suboptimal solutions. Our algorithm provides designers with a Pareto-optimal set of solutions that trade reliability for area. Compared to existing Triple Modular Redundancy (TMR), we see similar reliability with only 35% area overhead instead of 326%.
随着工艺技术不断扩展到纳米级,设备变得越来越不可靠。许多形式的不可靠性表现为瞬态故障,并可能导致间歇性随机逻辑失稳。这些逻辑干扰通常是由自然辐射(中子和α粒子)或芯片上噪声(交叉耦合、电源下降或闪烁噪声)引起的。该研究通过使用非均匀冗余来提高可靠性。具体来说,我们提出了一种动态规划算法,该算法考虑了许多可能的拓扑冗余,但由于次优解的有效修剪而保持线性运行时间。我们的算法为设计人员提供了一组帕累托最优的解决方案,以可靠性换取面积。与现有的三模冗余(TMR)相比,我们发现可靠性相似,面积开销只有35%,而不是326%。
{"title":"Fault-tolerant synthesis using non-uniform redundancy","authors":"Keven L. Woo, Matthew R. Guthaus","doi":"10.1109/ICCD.2009.5413153","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413153","url":null,"abstract":"As process technologies continue to scale into the nanometer regime, devices are becoming significantly more unreliable. Many forms of unreliability manifest as transient faults and can cause intermittent random logic upsets. These logic upsets are often caused by natural radiation (neutrons and alpha particles) or on-chip noise (cross-coupling, supply drop, or flicker noise). This research improves reliability by using non-uniform redundancy. Specifically, we present a dynamic programming algorithm that considers many possible topological redundancies, yet maintains a linear run-time due to efficient pruning of suboptimal solutions. Our algorithm provides designers with a Pareto-optimal set of solutions that trade reliability for area. Compared to existing Triple Modular Redundancy (TMR), we see similar reliability with only 35% area overhead instead of 326%.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126898565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Hierarchical parametric test metrics estimation: A ΣΔ converter BIST case study 分层参数测试度量估计:ΣΔ转换器BIST案例研究
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413173
M. Dubois, H. Stratigopoulos, S. Mir
In this paper we propose a method for evaluating test measurements for complex circuits that are difficult to simulate. The evaluation aims at estimating test metrics, such as parametric test escape and yield loss, with parts per million (ppm) accuracy. To achieve this, the method combines behavioral modeling, density estimation, and regression. The method is demonstrated for a previously proposed Built-In Self-Test (BIST) technique for ΣΔ Analog-to-Digital Converters (ADC) explaining in detail the derivation of a behavioral model that captures the main nonidealities in the circuit. The estimated test metrics are further analyzed in order to uncover trends in a large device sample that explain the source of erroneous test decisions.
在本文中,我们提出了一种评估难以模拟的复杂电路测试测量的方法。评估的目的是估计测试指标,如参数测试逃逸和产量损失,以百万分之一(ppm)的精度。为了实现这一点,该方法结合了行为建模、密度估计和回归。该方法演示了先前提出的用于ΣΔ模数转换器(ADC)的内置自检(BIST)技术,详细解释了捕获电路中主要非理想性的行为模型的推导。进一步分析估计的测试指标,以揭示大型设备样本中解释错误测试决策来源的趋势。
{"title":"Hierarchical parametric test metrics estimation: A ΣΔ converter BIST case study","authors":"M. Dubois, H. Stratigopoulos, S. Mir","doi":"10.1109/ICCD.2009.5413173","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413173","url":null,"abstract":"In this paper we propose a method for evaluating test measurements for complex circuits that are difficult to simulate. The evaluation aims at estimating test metrics, such as parametric test escape and yield loss, with parts per million (ppm) accuracy. To achieve this, the method combines behavioral modeling, density estimation, and regression. The method is demonstrated for a previously proposed Built-In Self-Test (BIST) technique for ΣΔ Analog-to-Digital Converters (ADC) explaining in detail the derivation of a behavioral model that captures the main nonidealities in the circuit. The estimated test metrics are further analyzed in order to uncover trends in a large device sample that explain the source of erroneous test decisions.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130246237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A distributed concurrent on-line test scheduling protocol for many-core NoC-based systems 面向多核计算机系统的分布式并发在线测试调度协议
Pub Date : 2009-10-04 DOI: 10.1109/ICCD.2009.5413156
J. Lee, R. Mahapatra, Praveen Bhojwani
Concurrent on-line testing (COLT) of many-core systems-on-chip (SoC) has been recently proposed by researchers in response to the growing threat of electronic wear-out to system operational lifetimes and to the increasing reliability and availability demands of safety-critical applications. Previous research in concurrent on-line testing has focused on centralized approaches to manage core testing while the system is available to execute normal user applications. However, as technology scaling allows dozens and hundreds of processing cores to be placed on a single chip, these centralized approaches are not scalable solutions. In this paper, a distributed concurrent on-line test scheduling protocol is proposed and evaluated against previously developed solutions. Our experiments show that a distributed COLT scheduler can test a moderately-sized SoC with a speedup of 3.85 over centralized approaches while consuming 84% less energy, and performance benefits improve as the number of cores per chip increases. This research also presents a core test ordering algorithm — Code-Division Core Test Scheduling — that provides an additional 40% reduction in system test latency compared to other schedulers.
多核片上系统(SoC)的并发在线测试(COLT)最近被研究人员提出,以应对日益严重的电子磨损对系统使用寿命的威胁以及安全关键应用日益增长的可靠性和可用性需求。以前对并发在线测试的研究主要集中在集中管理核心测试的方法上,同时系统可用于执行正常的用户应用程序。然而,由于技术的扩展允许在单个芯片上放置数十甚至数百个处理核心,这些集中式方法不是可扩展的解决方案。本文提出了一种分布式并发在线测试调度协议,并对已有的解决方案进行了评估。我们的实验表明,分布式COLT调度器可以测试中等大小的SoC,速度比集中式方法提高3.85,同时消耗的能量减少84%,并且随着每个芯片内核数量的增加,性能优势也会提高。这项研究还提出了一个核心测试排序算法——代码划分核心测试调度——与其他调度程序相比,它可以额外减少40%的系统测试延迟。
{"title":"A distributed concurrent on-line test scheduling protocol for many-core NoC-based systems","authors":"J. Lee, R. Mahapatra, Praveen Bhojwani","doi":"10.1109/ICCD.2009.5413156","DOIUrl":"https://doi.org/10.1109/ICCD.2009.5413156","url":null,"abstract":"Concurrent on-line testing (COLT) of many-core systems-on-chip (SoC) has been recently proposed by researchers in response to the growing threat of electronic wear-out to system operational lifetimes and to the increasing reliability and availability demands of safety-critical applications. Previous research in concurrent on-line testing has focused on centralized approaches to manage core testing while the system is available to execute normal user applications. However, as technology scaling allows dozens and hundreds of processing cores to be placed on a single chip, these centralized approaches are not scalable solutions. In this paper, a distributed concurrent on-line test scheduling protocol is proposed and evaluated against previously developed solutions. Our experiments show that a distributed COLT scheduler can test a moderately-sized SoC with a speedup of 3.85 over centralized approaches while consuming 84% less energy, and performance benefits improve as the number of cores per chip increases. This research also presents a core test ordering algorithm — Code-Division Core Test Scheduling — that provides an additional 40% reduction in system test latency compared to other schedulers.","PeriodicalId":256908,"journal":{"name":"2009 IEEE International Conference on Computer Design","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131129329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2009 IEEE International Conference on Computer Design
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1