首页 > 最新文献

IEEE International SOC Conference, 2004. Proceedings.最新文献

英文 中文
Battery-efficient task execution on portable reconfigurable computing 便携式可重构计算上的高效电池任务执行
Pub Date : 2004-11-30 DOI: 10.1109/SOCC.2004.1362420
B. Sethuraman, J. Khan, R. Vemuri
We present a battery-efficient task execution methodology for portable reconfigurable computing (RC) platforms. We implement a given algorithm with varying power-performance levels: we call these implementations, Cores and each core is characterized in terms of its power and performance levels. Core change and/or the frequency change are the two mechanisms used to vary the battery consumption. We consider two cases for single task execution: one increasing the performance with a battery life constraint and the other prolonging the battery' life with a performance constraint. The execution time of each task is divided into equal time intervals, called slots. A simulated annealing based algorithm is used to find the best constraint-satisfying sequence of cores offline. Our results show a 50% increase in the total work done (case 1) and 61% increase in battery life (case 2), by using this methodology when compared to a system not using it. The combined effect of both cases is applied to a multiple task execution and the results are reported.
我们提出了一种便携式可重构计算(RC)平台的高效任务执行方法。我们实现具有不同功率性能水平的给定算法:我们将这些实现称为核心,每个核心都根据其功率和性能水平进行表征。核心变化和/或频率变化是用来改变电池消耗的两种机制。我们考虑了单任务执行的两种情况:一种是在电池寿命限制下提高性能,另一种是在性能限制下延长电池寿命。每个任务的执行时间被划分为相等的时间间隔,称为插槽。采用基于模拟退火的算法寻找最优的满足约束的核序列。我们的结果显示,与不使用该方法的系统相比,使用该方法的系统完成的总功增加了50%(案例1),电池寿命增加了61%(案例2)。将这两种情况的综合效果应用于多任务执行,并报告结果。
{"title":"Battery-efficient task execution on portable reconfigurable computing","authors":"B. Sethuraman, J. Khan, R. Vemuri","doi":"10.1109/SOCC.2004.1362420","DOIUrl":"https://doi.org/10.1109/SOCC.2004.1362420","url":null,"abstract":"We present a battery-efficient task execution methodology for portable reconfigurable computing (RC) platforms. We implement a given algorithm with varying power-performance levels: we call these implementations, Cores and each core is characterized in terms of its power and performance levels. Core change and/or the frequency change are the two mechanisms used to vary the battery consumption. We consider two cases for single task execution: one increasing the performance with a battery life constraint and the other prolonging the battery' life with a performance constraint. The execution time of each task is divided into equal time intervals, called slots. A simulated annealing based algorithm is used to find the best constraint-satisfying sequence of cores offline. Our results show a 50% increase in the total work done (case 1) and 61% increase in battery life (case 2), by using this methodology when compared to a system not using it. The combined effect of both cases is applied to a multiple task execution and the results are reported.","PeriodicalId":184894,"journal":{"name":"IEEE International SOC Conference, 2004. Proceedings.","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132734637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
ChipPower: an architecture-level leakage simulator ChipPower:一个架构级泄漏模拟器
Pub Date : 2004-11-30 DOI: 10.1109/SOCC.2004.1362476
Y. Tsai, Ananth Hegde, N. Vijaykrishnan, M. J. Irwin, T. Theocharides
Leakage power is projected to be one of the major challenges in future technology generations. The temperature profile, process variation, and transistor count all have strong impact on the leakage power distribution of a processor. We have built a simulator to estimate the dynamic/leakage power for a VLIW architecture considering dynamic temperature feedback and process variation. The framework is based on architecture similar to the Intel Itanium IA64 and is extended to simulate its power when implemented in 65nm technology. Our experimental results show that leakage power will become more than 50% of the power budget in 65nm technology. Moreover, without including the process variation, the total leakage power will be underestimated by as much as 30%.
泄漏功率预计将是未来几代技术的主要挑战之一。温度分布、工艺变化和晶体管数量都对处理器的泄漏功率分布有很大的影响。我们建立了一个模拟器来估计VLIW体系结构的动态/泄漏功率,考虑动态温度反馈和过程变化。该框架基于类似于英特尔安腾IA64的架构,并扩展以模拟其在65纳米技术中实现时的性能。我们的实验结果表明,在65nm技术中,泄漏功率将超过功率预算的50%。此外,如果不包括工艺变化,总泄漏功率将被低估多达30%。
{"title":"ChipPower: an architecture-level leakage simulator","authors":"Y. Tsai, Ananth Hegde, N. Vijaykrishnan, M. J. Irwin, T. Theocharides","doi":"10.1109/SOCC.2004.1362476","DOIUrl":"https://doi.org/10.1109/SOCC.2004.1362476","url":null,"abstract":"Leakage power is projected to be one of the major challenges in future technology generations. The temperature profile, process variation, and transistor count all have strong impact on the leakage power distribution of a processor. We have built a simulator to estimate the dynamic/leakage power for a VLIW architecture considering dynamic temperature feedback and process variation. The framework is based on architecture similar to the Intel Itanium IA64 and is extended to simulate its power when implemented in 65nm technology. Our experimental results show that leakage power will become more than 50% of the power budget in 65nm technology. Moreover, without including the process variation, the total leakage power will be underestimated by as much as 30%.","PeriodicalId":184894,"journal":{"name":"IEEE International SOC Conference, 2004. Proceedings.","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133544209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
A low clock load conditional flip-flop 低时钟负载条件触发器
Pub Date : 2004-11-30 DOI: 10.1109/SOCC.2004.1362394
M. Hansson, A. Alvandpour
We describe a low clock load conditional transmission-gate flip-flop aimed at reducing on-chip clock power consumption. It utilizes a scalable and simple leakage compensation technique, which injects additional leakage current in opposite direction, thus compensating for the worst-case leakage. During any low frequency operation, the flip-flop is configured as a static flip-flop with increased functional robustness. Post-layout simulations show a 30 % clock power reduction compared to a conventional static flip-flop.
我们描述了一种低时钟负载条件传输门触发器,旨在降低片上时钟功耗。它采用一种可扩展和简单的泄漏补偿技术,在相反方向注入额外的泄漏电流,从而补偿最坏情况下的泄漏。在任何低频操作期间,触发器被配置为具有增强功能鲁棒性的静态触发器。布局后仿真显示,与传统的静态触发器相比,时钟功耗降低了30%。
{"title":"A low clock load conditional flip-flop","authors":"M. Hansson, A. Alvandpour","doi":"10.1109/SOCC.2004.1362394","DOIUrl":"https://doi.org/10.1109/SOCC.2004.1362394","url":null,"abstract":"We describe a low clock load conditional transmission-gate flip-flop aimed at reducing on-chip clock power consumption. It utilizes a scalable and simple leakage compensation technique, which injects additional leakage current in opposite direction, thus compensating for the worst-case leakage. During any low frequency operation, the flip-flop is configured as a static flip-flop with increased functional robustness. Post-layout simulations show a 30 % clock power reduction compared to a conventional static flip-flop.","PeriodicalId":184894,"journal":{"name":"IEEE International SOC Conference, 2004. Proceedings.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121868208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Signal integrity implications of inductor-to-circuit proximity 电感-电路接近的信号完整性含义
Pub Date : 2004-11-30 DOI: 10.1109/SOCC.2004.1362354
R. Secareanu, Qiang Li, S. Bharatan, C. Kyono, R. Thoma, Mel Miller, O. Hartin
Signal integrity is one of the major challenges in system-on-a-chip (SOC) integration. The complexity of the associated problems increases when RF circuits are integrated together with other circuits, such as digital or large signal circuits. In this material, an analysis of possible interactions between an on-chip inductor and various types of circuit blocks is described. Conclusions on the feasibility of an inductor-circuit system are outlined.
信号完整性是片上系统(SOC)集成的主要挑战之一。当射频电路与其他电路(如数字或大信号电路)集成在一起时,相关问题的复杂性增加。在本材料中,对片上电感器和各种类型的电路块之间可能的相互作用进行了分析。总结了电感电路系统的可行性。
{"title":"Signal integrity implications of inductor-to-circuit proximity","authors":"R. Secareanu, Qiang Li, S. Bharatan, C. Kyono, R. Thoma, Mel Miller, O. Hartin","doi":"10.1109/SOCC.2004.1362354","DOIUrl":"https://doi.org/10.1109/SOCC.2004.1362354","url":null,"abstract":"Signal integrity is one of the major challenges in system-on-a-chip (SOC) integration. The complexity of the associated problems increases when RF circuits are integrated together with other circuits, such as digital or large signal circuits. In this material, an analysis of possible interactions between an on-chip inductor and various types of circuit blocks is described. Conclusions on the feasibility of an inductor-circuit system are outlined.","PeriodicalId":184894,"journal":{"name":"IEEE International SOC Conference, 2004. Proceedings.","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125112079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VLSI design and analysis of a critical-band processor for speech recognition 一种用于语音识别的临界带处理器的VLSI设计与分析
Pub Date : 2004-11-30 DOI: 10.1109/SOCC.2004.1362466
Chao Wang, Y. Tong, Yu Shao
The critical-band analysis plays a very important role in the frond-end feature extraction for speech recognition. In this paper, a generic low-power and low-voltage VLSI design of a critical-band transform (CBT) processor is proposed. Design and analysis of a 21-band Munich Bark scale CBT processor showed that it can achieve significant power-efficiency by the reduction in computational complexity, implementation of pipeline and parallel processing, and application of supply voltage scaling technique. Simulation results showed that it can complete a CBT analysis (including I/O process) in 4.99ms on one 160-point segment of speech at a very low system clock frequency of 234 kHz. This would support a CBT analysis for 50% overlap 10 ms segments of speech at a sampling frequency of 16 kHz. The power dissipation is 1413.6 /spl mu/W/MHz and 66.7 /spl mu/W/MHz for supply voltage of 3.3 V and 1.1 V, respectively. It contains 206273 transistors and occupies 2.69 mm/sup 2/ for a 0.35 /spl mu/m CMOS technology.
临界频带分析在语音识别的前端特征提取中起着非常重要的作用。本文提出了一种通用的低功耗、低电压的临界频带变换(CBT)处理器VLSI设计方案。对21波段慕尼黑树皮规模CBT处理器的设计和分析表明,该处理器通过降低计算复杂度、实现流水线和并行处理以及应用电源电压缩放技术,可以实现显著的功耗效率。仿真结果表明,在极低的系统时钟频率234 kHz下,该算法可以在4.99ms内完成一个160点语音段的CBT分析(包括I/O过程)。这将支持在16khz采样频率下对50%重叠的10ms语音片段进行CBT分析。电源电压为3.3 V和1.1 V时,功耗分别为1413.6 /spl mu/W/MHz和66.7 /spl mu/W/MHz。它包含206273个晶体管,对于0.35 /spl mu/m的CMOS技术,占地2.69 mm/sup 2/。
{"title":"VLSI design and analysis of a critical-band processor for speech recognition","authors":"Chao Wang, Y. Tong, Yu Shao","doi":"10.1109/SOCC.2004.1362466","DOIUrl":"https://doi.org/10.1109/SOCC.2004.1362466","url":null,"abstract":"The critical-band analysis plays a very important role in the frond-end feature extraction for speech recognition. In this paper, a generic low-power and low-voltage VLSI design of a critical-band transform (CBT) processor is proposed. Design and analysis of a 21-band Munich Bark scale CBT processor showed that it can achieve significant power-efficiency by the reduction in computational complexity, implementation of pipeline and parallel processing, and application of supply voltage scaling technique. Simulation results showed that it can complete a CBT analysis (including I/O process) in 4.99ms on one 160-point segment of speech at a very low system clock frequency of 234 kHz. This would support a CBT analysis for 50% overlap 10 ms segments of speech at a sampling frequency of 16 kHz. The power dissipation is 1413.6 /spl mu/W/MHz and 66.7 /spl mu/W/MHz for supply voltage of 3.3 V and 1.1 V, respectively. It contains 206273 transistors and occupies 2.69 mm/sup 2/ for a 0.35 /spl mu/m CMOS technology.","PeriodicalId":184894,"journal":{"name":"IEEE International SOC Conference, 2004. Proceedings.","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127168023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Bandgap yield loss due to dislocations on RFSiGe transceiver ICs: failure analysis, design RFSiGe收发器ic位错造成的带隙良率损失:失效分析与设计
Pub Date : 2004-11-30 DOI: 10.1109/SOCC.2004.1362385
R. Oberhuber, C. Hechtl, K. Schimpf, B. Staufer
New design and layout methods were developed to overcome yield loss from dislocation defects, which are omnipresent in SiGe technologies as a penalty for the higher speed compared to pure Si. This paper presents the failure analysis on bandgap malfunctions in a RF-SiGe transceiver device which is currently ramped to production. The resulting yieldloss was significant. In-circuit fault analysis identified collector-emitter leakage of the SiGe-HBT's as the electrical root cause. All existing failure patterns were explained with SPICE circuit simulation. Isolation and characterization of the bipolar transistor and high resolution TEM showed dislocations originating from the high strain at the STI-moat edge. Design and layout improvements were applied to reduce the sensitivity of the bandgap reference circuit to the defects: a 5V HBT with superior yield performance was introduced, and variations in the sizing of the matched npn were investigated. With these improvements, the bandgap fails were drastically reduced.
为了克服位错缺陷造成的良率损失,开发了新的设计和布局方法,位错缺陷是SiGe技术中普遍存在的,与纯Si相比,位错缺陷是更高速度的惩罚。本文介绍了一种已投入生产的RF-SiGe收发器带隙故障的失效分析。由此造成的产量损失是显著的。在电路故障分析中发现SiGe-HBT的集电极-发射极漏电是电气的根本原因。通过SPICE电路仿真对现有的失效模式进行了解释。双极晶体管的隔离和表征以及高分辨率透射电镜显示,位错源于sti -护城河边缘的高应变。为了降低带隙参考电路对缺陷的敏感性,采用了改进的设计和布局:引入了良率性能优异的5V HBT,并研究了匹配npn尺寸的变化。通过这些改进,带隙失效大大减少。
{"title":"Bandgap yield loss due to dislocations on RFSiGe transceiver ICs: failure analysis, design","authors":"R. Oberhuber, C. Hechtl, K. Schimpf, B. Staufer","doi":"10.1109/SOCC.2004.1362385","DOIUrl":"https://doi.org/10.1109/SOCC.2004.1362385","url":null,"abstract":"New design and layout methods were developed to overcome yield loss from dislocation defects, which are omnipresent in SiGe technologies as a penalty for the higher speed compared to pure Si. This paper presents the failure analysis on bandgap malfunctions in a RF-SiGe transceiver device which is currently ramped to production. The resulting yieldloss was significant. In-circuit fault analysis identified collector-emitter leakage of the SiGe-HBT's as the electrical root cause. All existing failure patterns were explained with SPICE circuit simulation. Isolation and characterization of the bipolar transistor and high resolution TEM showed dislocations originating from the high strain at the STI-moat edge. Design and layout improvements were applied to reduce the sensitivity of the bandgap reference circuit to the defects: a 5V HBT with superior yield performance was introduced, and variations in the sizing of the matched npn were investigated. With these improvements, the bandgap fails were drastically reduced.","PeriodicalId":184894,"journal":{"name":"IEEE International SOC Conference, 2004. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129452077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A scalable and robust rail-to-rail delay cell for DLLs 一种可扩展且鲁棒的dll轨对轨延迟单元
Pub Date : 2004-11-30 DOI: 10.1109/SOCC.2004.1362378
Håkan Bengtson, C. Svensson
This paper describes a scalable and robust differential rail-to-rail delay cell. The delay cell is fabricated in a 3.3 V 0.35 /spl mu/m CMOS process. The delay cell shows a wide-range operation and low power supply sensitivity. The delay range is 0.31 ps to 21.8 ns. For 0.5 ns delay, when the clock period is 500 MHz, the power supply sensitivity is 0.033 ps/mV. The delay cell is used in a DLL for clock generation of a four times interleaved 2 Gb/s decision feedback equalizer.
本文描述了一种可扩展的鲁棒差分轨间延迟单元。该延迟单元采用3.3 V 0.35 /spl mu/m CMOS工艺制作。延时单元工作范围宽,电源灵敏度低。延迟范围为0.31 ps至21.8 ns。延时0.5 ns,时钟周期为500mhz时,电源灵敏度为0.033 ps/mV。延时单元用于DLL中的四倍交错2gb /s决策反馈均衡器的时钟生成。
{"title":"A scalable and robust rail-to-rail delay cell for DLLs","authors":"Håkan Bengtson, C. Svensson","doi":"10.1109/SOCC.2004.1362378","DOIUrl":"https://doi.org/10.1109/SOCC.2004.1362378","url":null,"abstract":"This paper describes a scalable and robust differential rail-to-rail delay cell. The delay cell is fabricated in a 3.3 V 0.35 /spl mu/m CMOS process. The delay cell shows a wide-range operation and low power supply sensitivity. The delay range is 0.31 ps to 21.8 ns. For 0.5 ns delay, when the clock period is 500 MHz, the power supply sensitivity is 0.033 ps/mV. The delay cell is used in a DLL for clock generation of a four times interleaved 2 Gb/s decision feedback equalizer.","PeriodicalId":184894,"journal":{"name":"IEEE International SOC Conference, 2004. Proceedings.","volume":"172 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120865655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A memory allocation and assignment method using multiway partitioning 一种使用多路分区的内存分配和分配方法
Pub Date : 2004-11-30 DOI: 10.1109/SOCC.2004.1362382
Nam-Hoon Kim, P. Beerel, Ralph Peng
In many data-intensive applications, an area and power efficient memory architecture is significant for the overall system in terms of area and power consumption. We present a new memory allocation and assignment method using memory partitioning to customize the memory architecture. The method consists of two main steps. The useful exploration region to trade off area with energy consumption is first extracted. Secondly, based on this exploration space, an iterative multi-way partitioning is performed to optimize area and power. Memory partitioning is an effective memory optimization strategy that seeks to customize the memory architecture for a given application. Our experimental results on several examples demonstrate that our method can find the promising memory architecture.
在许多数据密集型应用程序中,就面积和功耗而言,面积和功耗效率高的内存体系结构对整个系统非常重要。我们提出了一种新的内存分配和分配方法,使用内存分区来定制内存体系结构。该方法包括两个主要步骤。首先提取出能够权衡面积与能耗的有用勘探区域。其次,基于该探索空间进行迭代多路划分,优化面积和功率;内存分区是一种有效的内存优化策略,它旨在为给定的应用程序定制内存架构。几个实例的实验结果表明,我们的方法可以找到有前途的存储结构。
{"title":"A memory allocation and assignment method using multiway partitioning","authors":"Nam-Hoon Kim, P. Beerel, Ralph Peng","doi":"10.1109/SOCC.2004.1362382","DOIUrl":"https://doi.org/10.1109/SOCC.2004.1362382","url":null,"abstract":"In many data-intensive applications, an area and power efficient memory architecture is significant for the overall system in terms of area and power consumption. We present a new memory allocation and assignment method using memory partitioning to customize the memory architecture. The method consists of two main steps. The useful exploration region to trade off area with energy consumption is first extracted. Secondly, based on this exploration space, an iterative multi-way partitioning is performed to optimize area and power. Memory partitioning is an effective memory optimization strategy that seeks to customize the memory architecture for a given application. Our experimental results on several examples demonstrate that our method can find the promising memory architecture.","PeriodicalId":184894,"journal":{"name":"IEEE International SOC Conference, 2004. Proceedings.","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131652198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Reducing crosstalk noise in high speed FPGAs 降低高速fpga串扰噪声
Pub Date : 2004-11-30 DOI: 10.1109/SOCC.2004.1362391
A. Mukherjee
Narrowing time-to-market windows are driving the design community toward FPGAs. Whereas quick prototype implementations are possible using FPGAs, circuit delays have always been a major concern. Moreover, achieving high performance in FPGAs with densely packed routing resources is difficult because of crosstalk noise. In this paper we describe a very high performance FPGA, and show a simple and practical technique of almost reducing crosstalk noise by using a two-phase nonoverlapping complimentary clocking scheme. An efficient integer linear programming formulation has been proposed to find an optimum solution to a constrained problem, and we have studied the effects and costs of applying our idea to different architectures. Experiments with MCNC benchmark circuits in different architectures of our FPGA show that, on average, we could reduce crosstalk induced delay increases to less than 4% of the clock period. With a minimal increase of 3% in area due to this optimization, our results seem very promising.
缩短上市时间窗口正推动设计界转向fpga。虽然使用fpga可以实现快速原型,但电路延迟一直是一个主要问题。此外,由于串扰噪声,在路由资源密集的fpga中实现高性能是困难的。在本文中,我们描述了一个非常高性能的FPGA,并展示了一种简单实用的技术,通过使用两相非重叠互补时钟方案几乎降低串扰噪声。我们提出了一个有效的整数线性规划公式来寻找约束问题的最优解,并研究了将我们的思想应用于不同体系结构的效果和代价。在不同架构的FPGA上对MCNC基准电路进行的实验表明,平均而言,我们可以将串扰引起的延迟增加减少到时钟周期的4%以下。由于这种优化,面积最小增加了3%,我们的结果看起来非常有希望。
{"title":"Reducing crosstalk noise in high speed FPGAs","authors":"A. Mukherjee","doi":"10.1109/SOCC.2004.1362391","DOIUrl":"https://doi.org/10.1109/SOCC.2004.1362391","url":null,"abstract":"Narrowing time-to-market windows are driving the design community toward FPGAs. Whereas quick prototype implementations are possible using FPGAs, circuit delays have always been a major concern. Moreover, achieving high performance in FPGAs with densely packed routing resources is difficult because of crosstalk noise. In this paper we describe a very high performance FPGA, and show a simple and practical technique of almost reducing crosstalk noise by using a two-phase nonoverlapping complimentary clocking scheme. An efficient integer linear programming formulation has been proposed to find an optimum solution to a constrained problem, and we have studied the effects and costs of applying our idea to different architectures. Experiments with MCNC benchmark circuits in different architectures of our FPGA show that, on average, we could reduce crosstalk induced delay increases to less than 4% of the clock period. With a minimal increase of 3% in area due to this optimization, our results seem very promising.","PeriodicalId":184894,"journal":{"name":"IEEE International SOC Conference, 2004. Proceedings.","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126809348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Clock tree layout design for reduced delay uncertainty 时钟树布局设计,减少延迟不确定性
Pub Date : 2004-11-30 DOI: 10.1109/SOCC.2004.1362399
D. Velenis, M. Papaefthymiou, E. Friedman
The design of clock distribution networks in synchronous digital systems presents enormous challenges. Controlling the clock signal delay in the presence of various noise sources, process parameter variations, and environmental effects represents a fundamental problem in the design of high speed synchronous circuits. Two different approaches for enhancing the layout of the clock tree in order to reduce the uncertainty of the clock signal are presented in this paper. The application of these techniques on a set of benchmark circuits demonstrates interesting tradeoffs among the aggregate clock buffer size, the total wire length of the clock tree, and the power dissipation.
同步数字系统中时钟分配网络的设计提出了巨大的挑战。在存在各种噪声源、工艺参数变化和环境影响的情况下控制时钟信号延迟是高速同步电路设计中的一个基本问题。本文提出了两种不同的增强时钟树布局的方法,以降低时钟信号的不确定性。这些技术在一组基准电路上的应用演示了在总时钟缓冲区大小、时钟树的总导线长度和功耗之间的有趣权衡。
{"title":"Clock tree layout design for reduced delay uncertainty","authors":"D. Velenis, M. Papaefthymiou, E. Friedman","doi":"10.1109/SOCC.2004.1362399","DOIUrl":"https://doi.org/10.1109/SOCC.2004.1362399","url":null,"abstract":"The design of clock distribution networks in synchronous digital systems presents enormous challenges. Controlling the clock signal delay in the presence of various noise sources, process parameter variations, and environmental effects represents a fundamental problem in the design of high speed synchronous circuits. Two different approaches for enhancing the layout of the clock tree in order to reduce the uncertainty of the clock signal are presented in this paper. The application of these techniques on a set of benchmark circuits demonstrates interesting tradeoffs among the aggregate clock buffer size, the total wire length of the clock tree, and the power dissipation.","PeriodicalId":184894,"journal":{"name":"IEEE International SOC Conference, 2004. Proceedings.","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126879999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
IEEE International SOC Conference, 2004. Proceedings.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1