首页 > 最新文献

2008 45th ACM/IEEE Design Automation Conference最新文献

英文 中文
Protecting bus-based hardware IP by secret sharing 通过秘密共享保护基于总线的硬件IP
Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391684
Jarrod A. Roy, F. Koushanfar, I. Markov
Our work addresses protection of hardware IP at the mask level with the goal of preventing unauthorized manufacturing. The proposed protocol based on chip locking and activation is applicable to a broad category of electronic systems with a primary bus. Such designs include (1) numerous IP offerings for USB, PCI, PCI-E, AMBA and other bus standards typically used in system-on-a-chip designs and computer peripherals, (2) SRAM-based FPGAs that are programmed through an input bus, (3) general-purpose and embedded microprocessors, including soft cores, (4) DSPs, (5) network processors, and (6) game consoles. Our key insight is that such designs can be locked by scrambling the central bus by controlled reversible bit-permutations and substitutions. To securely establish a unique code per chip to control bus scrambling, we employ true random number generators and Dime-Hellman cryptography during activation.
我们的工作涉及掩码级别的硬件IP保护,目标是防止未经授权的制造。所提出的基于芯片锁定和激活的协议适用于具有主总线的广泛电子系统。此类设计包括(1)用于USB、PCI、PCI- e、AMBA和其他通常用于片上系统设计和计算机外设的总线标准的众多IP产品,(2)通过输入总线编程的基于sram的fpga,(3)通用和嵌入式微处理器,包括软核,(4)dsp,(5)网络处理器和(6)游戏机。我们的关键见解是,这样的设计可以通过控制可逆的位置换和替换打乱中央总线来锁定。为了在每个芯片上建立一个唯一的代码来控制总线置乱,我们在激活过程中使用了真随机数生成器和Dime-Hellman加密。
{"title":"Protecting bus-based hardware IP by secret sharing","authors":"Jarrod A. Roy, F. Koushanfar, I. Markov","doi":"10.1145/1391469.1391684","DOIUrl":"https://doi.org/10.1145/1391469.1391684","url":null,"abstract":"Our work addresses protection of hardware IP at the mask level with the goal of preventing unauthorized manufacturing. The proposed protocol based on chip locking and activation is applicable to a broad category of electronic systems with a primary bus. Such designs include (1) numerous IP offerings for USB, PCI, PCI-E, AMBA and other bus standards typically used in system-on-a-chip designs and computer peripherals, (2) SRAM-based FPGAs that are programmed through an input bus, (3) general-purpose and embedded microprocessors, including soft cores, (4) DSPs, (5) network processors, and (6) game consoles. Our key insight is that such designs can be locked by scrambling the central bus by controlled reversible bit-permutations and substitutions. To securely establish a unique code per chip to control bus scrambling, we employ true random number generators and Dime-Hellman cryptography during activation.","PeriodicalId":412696,"journal":{"name":"2008 45th ACM/IEEE Design Automation Conference","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117171429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
A power and temperature aware DRAM architecture 一种功率和温度敏感的DRAM架构
Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391691
Song Liu, S. Memik, Yu Zhang, G. Memik
Technological advances enable modern processors to utilize increasingly larger DRAMs with rising access frequencies. This is leading to high power consumption and operating temperature in DRAM chips. As a result, temperature management has become a real and pressing issue in high performance DRAM systems. Traditional low power techniques are not suitable for high performance DRAM systems with high bandwidth. In this paper, we propose and evaluate a customized DRAM low power technique based on page hit aware write buffer (PHA-WB). Our proposed approach reduces DRAM system power consumption and temperature without any performance penalty. Our experiments show that a system with a 64-entry PHA-WB could reduce the total DRAM power consumption by up to 22.0% (9.6% on average). The peak and average temperature reductions are 6.1degC and 2.1degC, respectively.
技术进步使现代处理器能够利用越来越大的内存和不断上升的访问频率。这导致DRAM芯片的高功耗和高工作温度。因此,温度管理已成为高性能DRAM系统中一个现实而紧迫的问题。传统的低功耗技术不适合高带宽的高性能DRAM系统。本文提出并评估了一种基于页面命中感知写缓冲器(PHA-WB)的定制DRAM低功耗技术。我们提出的方法降低了DRAM系统的功耗和温度,而没有任何性能损失。我们的实验表明,采用64入口PHA-WB的系统可以将DRAM总功耗降低22.0%(平均为9.6%)。峰值降温6.1℃,平均降温2.1℃。
{"title":"A power and temperature aware DRAM architecture","authors":"Song Liu, S. Memik, Yu Zhang, G. Memik","doi":"10.1145/1391469.1391691","DOIUrl":"https://doi.org/10.1145/1391469.1391691","url":null,"abstract":"Technological advances enable modern processors to utilize increasingly larger DRAMs with rising access frequencies. This is leading to high power consumption and operating temperature in DRAM chips. As a result, temperature management has become a real and pressing issue in high performance DRAM systems. Traditional low power techniques are not suitable for high performance DRAM systems with high bandwidth. In this paper, we propose and evaluate a customized DRAM low power technique based on page hit aware write buffer (PHA-WB). Our proposed approach reduces DRAM system power consumption and temperature without any performance penalty. Our experiments show that a system with a 64-entry PHA-WB could reduce the total DRAM power consumption by up to 22.0% (9.6% on average). The peak and average temperature reductions are 6.1degC and 2.1degC, respectively.","PeriodicalId":412696,"journal":{"name":"2008 45th ACM/IEEE Design Automation Conference","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127278711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Analysis and implications of parasitic and screening effects on the high-frequency/RF performance of tunneling-carbon nanotube FETs 寄生效应和筛选效应对隧道碳纳米管场效应管高频/射频性能的影响
Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391533
C. Kshirsagar, Mohamed N. El-Zeftawi, K. Banerjee
Intrinsic and parasitic capacitances play an important role in determining the high-frequency RF performance of devices. Recently, a new type of carbon nanotube field effect transistor (CNFET) based on tunneling principle has been proposed, which shows impressive device properties and overcomes some of the limitations of previously proposed CNFET devices. Although carbon nanotube based devices have been optimized for DC performance so far, little has been done to optimize them for high-frequency operation. In this paper, we present, detailed modeling and analysis of device geometry based intrinsic and parasitic capacitances of tunneling carbon nanotube field effect transistors (T-CNFETs) with both single nanotube as well as nanotube-array based channel. Based on the model, we analyze scaling of parasitic capacitances with device geometry for two different scaling scenarios of T-CNFETs. We show that in order to reduce the impact of parasitic capacitance, nanotube density has to be optimized. Furthermore, for the first time, we analyze various factors affecting the high-frequency/RF performance of back gated T-CNFETs and study the impact of parasitic and screening effects on the high-frequency/RF performance of these devices.
固有电容和寄生电容在决定器件的高频射频性能方面起着重要作用。近年来,提出了一种基于隧道原理的新型碳纳米管场效应晶体管(CNFET),它具有令人印象深刻的器件性能,克服了以前提出的CNFET器件的一些局限性。尽管目前基于碳纳米管的器件已经针对直流性能进行了优化,但在高频工作方面却做得很少。在本文中,我们提出了详细的建模和分析基于固有和寄生电容的隧道碳纳米管场效应晶体管(t - cnfet)的器件几何结构,包括单纳米管和纳米管阵列通道。基于该模型,我们分析了两种不同的t - cnfet缩放场景下寄生电容随器件几何尺寸的缩放。我们表明,为了减少寄生电容的影响,必须优化纳米管密度。此外,我们首次分析了影响后门控t - cnfet高频/射频性能的各种因素,并研究了寄生效应和筛选效应对这些器件高频/射频性能的影响。
{"title":"Analysis and implications of parasitic and screening effects on the high-frequency/RF performance of tunneling-carbon nanotube FETs","authors":"C. Kshirsagar, Mohamed N. El-Zeftawi, K. Banerjee","doi":"10.1145/1391469.1391533","DOIUrl":"https://doi.org/10.1145/1391469.1391533","url":null,"abstract":"Intrinsic and parasitic capacitances play an important role in determining the high-frequency RF performance of devices. Recently, a new type of carbon nanotube field effect transistor (CNFET) based on tunneling principle has been proposed, which shows impressive device properties and overcomes some of the limitations of previously proposed CNFET devices. Although carbon nanotube based devices have been optimized for DC performance so far, little has been done to optimize them for high-frequency operation. In this paper, we present, detailed modeling and analysis of device geometry based intrinsic and parasitic capacitances of tunneling carbon nanotube field effect transistors (T-CNFETs) with both single nanotube as well as nanotube-array based channel. Based on the model, we analyze scaling of parasitic capacitances with device geometry for two different scaling scenarios of T-CNFETs. We show that in order to reduce the impact of parasitic capacitance, nanotube density has to be optimized. Furthermore, for the first time, we analyze various factors affecting the high-frequency/RF performance of back gated T-CNFETs and study the impact of parasitic and screening effects on the high-frequency/RF performance of these devices.","PeriodicalId":412696,"journal":{"name":"2008 45th ACM/IEEE Design Automation Conference","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115041523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
SystemCoDesigner: Automatic design space exploration and rapid prototyping from behavioral models SystemCoDesigner:自动设计空间探索和快速原型从行为模型
Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391616
C. Haubelt, T. Schlichter, J. Keinert, M. Meredith
SystemCoDesigner is an ESL tool developed at the University of Erlangen-Nuremberg, Germany. SystemCoDesigner offers a fast design space exploration and rapid prototyping of behavioral SystemC models. Together with Forte Design Systems, a fully automated approach was developed by integrating behavioral synthesis into the design flow. Starting from a behavioral SystemC model, hardware accelerators can be generated automatically using Forte Cynthesizer and can be added to the design space. The resulting design space is explored automatically by optimizing several objectives simultaneously using state of the art multi-objective optimization algorithms. As a result, SystemCoDesigner presents optimized hardware/software solutions to the designer who can select any of them for rapid prototyping on an FPGA basis. Thus, SystemCoDesigner bridges the gap from ESL to RTL and increases the confidence in early design decisions.
SystemCoDesigner是德国埃尔兰根-纽伦堡大学开发的ESL工具。SystemCoDesigner提供了一个快速的设计空间探索和行为SystemC模型的快速原型。与Forte Design Systems一起,通过将行为合成集成到设计流程中,开发了一种完全自动化的方法。从行为SystemC模型开始,硬件加速器可以使用Forte synthesizer自动生成,并可以添加到设计空间中。利用最先进的多目标优化算法,通过同时优化多个目标来自动探索最终的设计空间。因此,SystemCoDesigner为设计人员提供了优化的硬件/软件解决方案,设计人员可以选择其中任何一个在FPGA基础上进行快速原型设计。因此,SystemCoDesigner弥补了ESL和RTL之间的差距,并增加了对早期设计决策的信心。
{"title":"SystemCoDesigner: Automatic design space exploration and rapid prototyping from behavioral models","authors":"C. Haubelt, T. Schlichter, J. Keinert, M. Meredith","doi":"10.1145/1391469.1391616","DOIUrl":"https://doi.org/10.1145/1391469.1391616","url":null,"abstract":"SystemCoDesigner is an ESL tool developed at the University of Erlangen-Nuremberg, Germany. SystemCoDesigner offers a fast design space exploration and rapid prototyping of behavioral SystemC models. Together with Forte Design Systems, a fully automated approach was developed by integrating behavioral synthesis into the design flow. Starting from a behavioral SystemC model, hardware accelerators can be generated automatically using Forte Cynthesizer and can be added to the design space. The resulting design space is explored automatically by optimizing several objectives simultaneously using state of the art multi-objective optimization algorithms. As a result, SystemCoDesigner presents optimized hardware/software solutions to the designer who can select any of them for rapid prototyping on an FPGA basis. Thus, SystemCoDesigner bridges the gap from ESL to RTL and increases the confidence in early design decisions.","PeriodicalId":412696,"journal":{"name":"2008 45th ACM/IEEE Design Automation Conference","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115511760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 91
Cache modeling in probabilistic execution time analysis 概率执行时间分析中的缓存建模
Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391551
Yun Liang, T. Mitra
Multimedia-dominated consumer electronics devices (such as cellular phone, digital camera, etc.) operate under soft real-time constraints. Overly pessimistic worst-case execution time analysis techniques borrowed from hard real-time systems domain are not particularly suitable in this context. Instead, the execution time distribution of a task provides a more valuable input to the system-level performance analysis frameworks. Both program inputs and underlying architecture contribute to the execution time variation of a task. But existing probabilistic execution time analysis approaches mostly ignore architectural modeling. In this paper, we take the first step towards remedying this situation through instruction cache modeling. We introduce the notion of probabilistic cache states to model the evolution of cache content during program execution over multiple inputs. In particular, we estimate the mean and variance of execution time of a program across inputs in the presence of instruction cache. The experimental evaluation confirms the scalability and accuracy of our probabilistic cache modeling approach.
多媒体主导的消费电子设备(如蜂窝电话、数码相机等)在软实时约束下运行。从硬实时系统领域借来的过于悲观的最坏情况执行时间分析技术并不特别适用于这种情况。相反,任务的执行时间分布为系统级性能分析框架提供了更有价值的输入。程序输入和底层体系结构都会影响任务的执行时间变化。但是现有的概率执行时间分析方法大多忽略了体系结构建模。在本文中,我们通过指令缓存建模迈出了纠正这种情况的第一步。我们引入了概率缓存状态的概念来模拟在多个输入的程序执行过程中缓存内容的演变。特别是,我们估计了在存在指令缓存的情况下,程序跨输入执行时间的平均值和方差。实验结果验证了概率缓存建模方法的可扩展性和准确性。
{"title":"Cache modeling in probabilistic execution time analysis","authors":"Yun Liang, T. Mitra","doi":"10.1145/1391469.1391551","DOIUrl":"https://doi.org/10.1145/1391469.1391551","url":null,"abstract":"Multimedia-dominated consumer electronics devices (such as cellular phone, digital camera, etc.) operate under soft real-time constraints. Overly pessimistic worst-case execution time analysis techniques borrowed from hard real-time systems domain are not particularly suitable in this context. Instead, the execution time distribution of a task provides a more valuable input to the system-level performance analysis frameworks. Both program inputs and underlying architecture contribute to the execution time variation of a task. But existing probabilistic execution time analysis approaches mostly ignore architectural modeling. In this paper, we take the first step towards remedying this situation through instruction cache modeling. We introduce the notion of probabilistic cache states to model the evolution of cache content during program execution over multiple inputs. In particular, we estimate the mean and variance of execution time of a program across inputs in the presence of instruction cache. The experimental evaluation confirms the scalability and accuracy of our probabilistic cache modeling approach.","PeriodicalId":412696,"journal":{"name":"2008 45th ACM/IEEE Design Automation Conference","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115563523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
Concurrent topology and routing optimization in automotive network integration 汽车网络集成中的并发拓扑和路由优化
Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391629
M. Lukasiewycz, M. Glaß, C. Haubelt, J. Teich, Richard Regler, Bardo Lang
In this paper, a novel automatic approach for the concurrent topology and routing optimization that achieves a high quality network layout is proposed. This optimization is based on a specialized binary Integer Linear Program (ILP) in combination with a Multi-Objective Evolutionary Algorithm (MOEA). The ILP is formulated such that each solution represents a topology and routing that fulfills all requirements and demands of the network. Thus, in an iterative process, this ILP is solved to obtain feasible networks whereas the MOEA is used for the optimization of multiple even non-linear objectives and ensures a fast convergence towards the optimal solutions. Additionally, a domain specific preprocessing algorithm for the ILP is presented that decreases the problem complexity and, thus, allows to optimize large and complex networks efficiently. The experimental results validate the performance of this methodology on two state-of-the-art prototype automotive networks.
本文提出了一种自动化的并行拓扑和路由优化方法,以实现高质量的网络布局。这种优化是基于一个专门的二进制整数线性规划(ILP)和多目标进化算法(MOEA)相结合的。ILP是这样制定的:每个解决方案都代表一种拓扑和路由,满足网络的所有需求。因此,在迭代过程中,求解该ILP以获得可行网络,而MOEA用于优化多个均匀非线性目标,并确保快速收敛到最优解。此外,提出了一种针对特定领域的ILP预处理算法,降低了问题的复杂性,从而可以有效地优化大型复杂网络。实验结果验证了该方法在两个最先进的原型汽车网络上的性能。
{"title":"Concurrent topology and routing optimization in automotive network integration","authors":"M. Lukasiewycz, M. Glaß, C. Haubelt, J. Teich, Richard Regler, Bardo Lang","doi":"10.1145/1391469.1391629","DOIUrl":"https://doi.org/10.1145/1391469.1391629","url":null,"abstract":"In this paper, a novel automatic approach for the concurrent topology and routing optimization that achieves a high quality network layout is proposed. This optimization is based on a specialized binary Integer Linear Program (ILP) in combination with a Multi-Objective Evolutionary Algorithm (MOEA). The ILP is formulated such that each solution represents a topology and routing that fulfills all requirements and demands of the network. Thus, in an iterative process, this ILP is solved to obtain feasible networks whereas the MOEA is used for the optimization of multiple even non-linear objectives and ensures a fast convergence towards the optimal solutions. Additionally, a domain specific preprocessing algorithm for the ILP is presented that decreases the problem complexity and, thus, allows to optimize large and complex networks efficiently. The experimental results validate the performance of this methodology on two state-of-the-art prototype automotive networks.","PeriodicalId":412696,"journal":{"name":"2008 45th ACM/IEEE Design Automation Conference","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114262760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
A methodology for statistical estimation of read access yield in SRAMs 一种统计估计sram读存取率的方法
Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391522
M. Abu-Rahma, K. Chowdhury, Joseph Wang, Zhiqin Chen, S. Yoon, M. Anis
The increase of process variations in advanced CMOS technologies is considered one of the biggest challenges for SRAM designers. This is aggravated by the strong demand for lower cost and power consumption, higher performance and density which complicates SRAM design process. In this paper, we present a methodology for statistical simulation of SRAM read access yield, which is tightly related to SRAM performance and power consumption. The proposed flow enables early SRAM yield predication and performance/power optimization in the design time, which is important for SRAM in nanometer technologies. The methodology is verified using measured silicon yield data from a 1 Mb memory fabricated in an industrial 45 nm technology.
先进CMOS技术中工艺变化的增加被认为是SRAM设计人员面临的最大挑战之一。对低成本和功耗、更高性能和密度的强烈需求加剧了这一点,这使SRAM设计过程复杂化。在本文中,我们提出了一种统计模拟SRAM读访问良率的方法,这与SRAM的性能和功耗密切相关。该流程可以实现SRAM的早期良率预测和设计时的性能/功耗优化,这对于纳米技术中的SRAM非常重要。采用工业45nm技术制造的1mb存储器测量硅产率数据验证了该方法。
{"title":"A methodology for statistical estimation of read access yield in SRAMs","authors":"M. Abu-Rahma, K. Chowdhury, Joseph Wang, Zhiqin Chen, S. Yoon, M. Anis","doi":"10.1145/1391469.1391522","DOIUrl":"https://doi.org/10.1145/1391469.1391522","url":null,"abstract":"The increase of process variations in advanced CMOS technologies is considered one of the biggest challenges for SRAM designers. This is aggravated by the strong demand for lower cost and power consumption, higher performance and density which complicates SRAM design process. In this paper, we present a methodology for statistical simulation of SRAM read access yield, which is tightly related to SRAM performance and power consumption. The proposed flow enables early SRAM yield predication and performance/power optimization in the design time, which is important for SRAM in nanometer technologies. The methodology is verified using measured silicon yield data from a 1 Mb memory fabricated in an industrial 45 nm technology.","PeriodicalId":412696,"journal":{"name":"2008 45th ACM/IEEE Design Automation Conference","volume":"189 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114587853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Enhancing timing-driven FPGA placement for pipelined netlists 为流水线网络列表增强时序驱动的FPGA布局
Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391480
Ken Eguro, S. Hauck
FPGA application developers often use pipelining, C-slowing and retiming to improve the performance of their designs. Unfortunately, registered netlists present a fundamentally different problem to CAD tools, potentially limiting the benefit of these techniques. In this paper we discuss some of the inherent issues pipelined netlists pose to existing timing-driven placement approaches. We then present two algorithmic modifications that reduce post-routing critical path delay by an average of 40%.
FPGA应用程序开发人员经常使用流水线、c -慢化和重新定时来提高其设计的性能。不幸的是,注册的网络列表对CAD工具提出了一个根本不同的问题,潜在地限制了这些技术的好处。在本文中,我们讨论了流水线网络列表对现有的时间驱动放置方法所带来的一些固有问题。然后,我们提出了两种算法修改,可将路由后关键路径延迟平均减少40%。
{"title":"Enhancing timing-driven FPGA placement for pipelined netlists","authors":"Ken Eguro, S. Hauck","doi":"10.1145/1391469.1391480","DOIUrl":"https://doi.org/10.1145/1391469.1391480","url":null,"abstract":"FPGA application developers often use pipelining, C-slowing and retiming to improve the performance of their designs. Unfortunately, registered netlists present a fundamentally different problem to CAD tools, potentially limiting the benefit of these techniques. In this paper we discuss some of the inherent issues pipelined netlists pose to existing timing-driven placement approaches. We then present two algorithmic modifications that reduce post-routing critical path delay by an average of 40%.","PeriodicalId":412696,"journal":{"name":"2008 45th ACM/IEEE Design Automation Conference","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122118288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Precise failure localization using automated layout analysis of diagnosis candidates 使用自动布局分析诊断候选者的精确故障定位
Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391568
W. Tam, O. Poku, R. D. Blanton
Traditional software-based diagnosis of failing chips typically identifies several lines where the failure is believed to reside. However, these lines can span across multiple layers and can be very long in length. This makes physical failure analysis difficult. hi contrast, there are emerging diagnosis techniques that identify both the faulty lines as well as the neighboring conditions for which an affected line becomes faulty, hi this paper, an approach is presented to improve failure localization by automatically analyzing the information associated with the outcome of diagnosis. Experimental results show a significant improvement in failure localization when this method is applied to 106 real IC failures.
传统的基于软件的故障芯片诊断通常会识别出被认为存在故障的几条线路。然而,这些线条可以跨越多个层,长度可以很长。这使得物理失效分析变得困难。相比之下,新兴的诊断技术既可以识别故障线路,也可以识别受影响线路发生故障的邻近条件,因此,本文提出了一种通过自动分析与诊断结果相关的信息来提高故障定位的方法。实验结果表明,该方法对106个实际集成电路故障的定位精度有显著提高。
{"title":"Precise failure localization using automated layout analysis of diagnosis candidates","authors":"W. Tam, O. Poku, R. D. Blanton","doi":"10.1145/1391469.1391568","DOIUrl":"https://doi.org/10.1145/1391469.1391568","url":null,"abstract":"Traditional software-based diagnosis of failing chips typically identifies several lines where the failure is believed to reside. However, these lines can span across multiple layers and can be very long in length. This makes physical failure analysis difficult. hi contrast, there are emerging diagnosis techniques that identify both the faulty lines as well as the neighboring conditions for which an affected line becomes faulty, hi this paper, an approach is presented to improve failure localization by automatically analyzing the information associated with the outcome of diagnosis. Experimental results show a significant improvement in failure localization when this method is applied to 106 real IC failures.","PeriodicalId":412696,"journal":{"name":"2008 45th ACM/IEEE Design Automation Conference","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117281127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
Optimizing imprecise fixed-point arithmetic circuits specified by Taylor Series through Arithmetic Transform 通过算术变换优化泰勒级数指定的不精确定点算术电路
Pub Date : 2008-06-08 DOI: 10.1145/1391469.1391574
Yu Pang, K. Radecka
We consider synthesis of arithmetic DSP circuits with finite precision fixed-point operations. The aim is to choose the lowest cost implementation that matches a real-valued specification within the allowed imprecision. Starting from Taylor series or real-valued polynomials, we demonstrate first a method to obtain satisfying implementations that uses intermediate arithmetic transform polynomials as an analytical apparatus suitable to precision analysis for both the quantization (bit-width) and approximation sources of imprecision. We then derive the precision optimization algorithm that explores multiple precision parameters in a branch-and-bound search.
研究了具有有限精度不动点运算的DSP算术电路的综合。其目的是在允许的不精确范围内选择与实值规范匹配的成本最低的实现。从泰勒级数或实值多项式开始,我们首先展示了一种获得满意实现的方法,该方法使用中间算术变换多项式作为适合于量化(位宽度)和不精确近似源的精度分析的分析工具。然后,我们推导了在分支定界搜索中探索多个精度参数的精度优化算法。
{"title":"Optimizing imprecise fixed-point arithmetic circuits specified by Taylor Series through Arithmetic Transform","authors":"Yu Pang, K. Radecka","doi":"10.1145/1391469.1391574","DOIUrl":"https://doi.org/10.1145/1391469.1391574","url":null,"abstract":"We consider synthesis of arithmetic DSP circuits with finite precision fixed-point operations. The aim is to choose the lowest cost implementation that matches a real-valued specification within the allowed imprecision. Starting from Taylor series or real-valued polynomials, we demonstrate first a method to obtain satisfying implementations that uses intermediate arithmetic transform polynomials as an analytical apparatus suitable to precision analysis for both the quantization (bit-width) and approximation sources of imprecision. We then derive the precision optimization algorithm that explores multiple precision parameters in a branch-and-bound search.","PeriodicalId":412696,"journal":{"name":"2008 45th ACM/IEEE Design Automation Conference","volume":"28 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129950403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
期刊
2008 45th ACM/IEEE Design Automation Conference
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1