首页 > 最新文献

2008 Asia and South Pacific Design Automation Conference最新文献

英文 中文
1-cc computer using UWB-IR for wireless sensor network 1cc电脑使用UWB-IR进行无线传感器网络
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483982
T. Nakagawa, M. Miyazaki, G. Ono, R. Fujiwara, T. Norimatsu, T. Terada, A. Maeki, Y. Ogata, Shinsuke Kobayashi, N. Koshizuka, K. Sakamura
An ultra-small, high-data-rate, low-power 1-cc computer (OCCC) with an UWB-IR (ultra-wideband impulse-radio) transceiver was developed for a wireless sensor network. Thanks to bear-chip implementation and a flexible printed circuit board, the size of the computer is only 1 cm3. To achieve 10-Mbps data rate, a middle-class 32-bit microcontroller, which has both a bus interface and a USB 2.0 controller, was selected. Low-power techniques, such as transition of microcontroller status to standby mode by using an external real-time clock during wait times, power shutdown of halted circuits, and detailed control of UWB-IR transceiver status, are applied. The effect of these low-power techniques is verified by measuring the time history of current consumption of the OCCC. It was confirmed that the OCCC can provide wireless communication at a transmission rate of 258 kbps over a distance of 30 m.
为无线传感器网络开发了一种超小型、高数据速率、低功耗的1-cc计算机(OCCC),具有UWB-IR(超宽带脉冲无线电)收发器。由于采用了熊掌芯片和柔性印刷电路板,计算机的尺寸仅为1立方厘米。为了实现10-Mbps的数据速率,选择了具有总线接口和USB 2.0控制器的中产阶级32位微控制器。应用了低功耗技术,例如在等待时间内使用外部实时时钟将微控制器状态转换为待机模式,停止电路的电源关闭以及UWB-IR收发器状态的详细控制。通过测量OCCC电流消耗的时间历史,验证了这些低功耗技术的效果。据证实,OCCC可以在30米的距离上以258kbps的传输速率进行无线通信。
{"title":"1-cc computer using UWB-IR for wireless sensor network","authors":"T. Nakagawa, M. Miyazaki, G. Ono, R. Fujiwara, T. Norimatsu, T. Terada, A. Maeki, Y. Ogata, Shinsuke Kobayashi, N. Koshizuka, K. Sakamura","doi":"10.1109/ASPDAC.2008.4483982","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483982","url":null,"abstract":"An ultra-small, high-data-rate, low-power 1-cc computer (OCCC) with an UWB-IR (ultra-wideband impulse-radio) transceiver was developed for a wireless sensor network. Thanks to bear-chip implementation and a flexible printed circuit board, the size of the computer is only 1 cm3. To achieve 10-Mbps data rate, a middle-class 32-bit microcontroller, which has both a bus interface and a USB 2.0 controller, was selected. Low-power techniques, such as transition of microcontroller status to standby mode by using an external real-time clock during wait times, power shutdown of halted circuits, and detailed control of UWB-IR transceiver status, are applied. The effect of these low-power techniques is verified by measuring the time history of current consumption of the OCCC. It was confirmed that the OCCC can provide wireless communication at a transmission rate of 258 kbps over a distance of 30 m.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131714464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Technology modeling and characterization beyond the 45nm node 超过45nm节点的技术建模和表征
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483944
S. Nassif
The semiconductor industry is unique in that it produces products with little or no prototyping! While a car company will build (and crash) many prototypes before converging on a final design, integrated circuits are built almost entirely on a basis of computer predictions. These predictions are based on models of performance based on simulation performed at multiple hierarchical levels, but always rooted in the end in classical circuit simulation using tools like the venerable Spice [1]. But as we continue to scale technology further, we observe a diminishing rate of performance return which is in turn causing a spiral of increasing manufacturing process complexity in an attempt to maintain performance per historical trends. This increase in technology complexity is introducing a number of systematic (i.e. design dependent) sources of design variability which demand modeling and characterization resources. At the same time, we are entering a regime where the averaging effect of the law of large numbers is becoming weaker, resulting in an increase in influence of fundamental atomistic variations. Phenomena like channel dopant fluctuations [2] and line-edge roughness [3] are creating a random variability noise floor which is difficult to get around without significant process impact. The result of the increase in these, and other sources of variability is a corresponding increase in important circuit phenomena like SRAM stability and leakage power variations. The net result is a gradual breakdown of the traditional ";device model + design rule"; contact between design and manufacturing, and a corresponding lack of predictability in fabrication outcome that is endangering the profitability of Silicon semiconductor manufacturing as we enter what may be the last handful of generations of CMOS. This lack of predictability is happening because of two important factors. ldr The overall CMOS technology slowdown has led to rapidly increasing complexity in the process and in its interaction with design. This has in turn caused an increase in the number and magnitude of systematic sources of mismatch between simulation models (both at the circuit simulation and timing levels) and hardware measurements. ldr Manufacturing variability, both systematic and random, -long a source of concern only for analog design- is becoming important for digital designs as well and thus its prediction is now a first order priority. However, it is competing for the attention of researchers and CAD developers with a host of other so-called nm effects, thus slowing down the delivery of needed solutions. The result is (a) our ability to arbitrarily compose a design out of disparate components is compromised because of a high degree of interaction between these components , and (b) our ability to predict the nominal performance of a design as well as its tolerances and sensitivities is in danger. In this talk, we will review these issues and show how they are all related to the cor
半导体行业的独特之处在于,它生产的产品很少或没有原型!虽然汽车公司在最终设计之前会制造(和碰撞)许多原型,但集成电路几乎完全是在计算机预测的基础上制造的。这些预测是基于在多个层次上进行的基于仿真的性能模型,但始终植根于使用诸如古老的Spice[1]等工具的经典电路仿真。但随着我们继续扩大技术规模,我们观察到性能回报率的下降,这反过来又导致了制造工艺复杂性的螺旋式增长,以保持每历史趋势的性能。这种技术复杂性的增加引入了许多系统的(即依赖于设计的)设计可变性的来源,这需要建模和表征资源。同时,我们正在进入一个大数定律的平均效应变得越来越弱的状态,导致基本原子变化的影响增加。通道掺杂波动[2]和线边缘粗糙度[3]等现象正在产生随机可变性噪声底,在不产生重大工艺影响的情况下很难绕过。这些和其他可变性来源增加的结果是SRAM稳定性和泄漏功率变化等重要电路现象的相应增加。最终的结果是传统的“设备模型+设计规则”逐渐被打破;设计和制造之间的联系,以及相应的制造结果缺乏可预测性,这正在危及硅半导体制造业的盈利能力,因为我们进入了可能是最后几代CMOS。这种缺乏可预测性的现象是由两个重要因素造成的。整体CMOS技术的放缓导致工艺及其与设计交互的复杂性迅速增加。这反过来又导致了仿真模型(在电路仿真和时序级别)和硬件测量之间不匹配的系统来源的数量和程度的增加。ldr制造的可变性,无论是系统的还是随机的,长期以来只在模拟设计中引起关注,现在对数字设计也变得很重要,因此它的预测现在是第一优先级。然而,它正在与许多其他所谓的纳米效应争夺研究人员和CAD开发人员的注意力,从而减慢了所需解决方案的交付速度。结果是:(a)由于这些组件之间的高度交互,我们将不同组件任意组合设计的能力受到损害,并且(b)我们预测设计的标称性能及其容差和灵敏度的能力处于危险之中。在这次演讲中,我们将回顾这些问题,并展示它们是如何与模型与硬件匹配的核心问题相关的。我们还将展示该问题的潜在解决方案的示例,其中一些解决方案目前正在IBM开发中,而一些解决方案则是长期的,并且将从学术界的关注中受益匪浅。
{"title":"Technology modeling and characterization beyond the 45nm node","authors":"S. Nassif","doi":"10.1109/ASPDAC.2008.4483944","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483944","url":null,"abstract":"The semiconductor industry is unique in that it produces products with little or no prototyping! While a car company will build (and crash) many prototypes before converging on a final design, integrated circuits are built almost entirely on a basis of computer predictions. These predictions are based on models of performance based on simulation performed at multiple hierarchical levels, but always rooted in the end in classical circuit simulation using tools like the venerable Spice [1]. But as we continue to scale technology further, we observe a diminishing rate of performance return which is in turn causing a spiral of increasing manufacturing process complexity in an attempt to maintain performance per historical trends. This increase in technology complexity is introducing a number of systematic (i.e. design dependent) sources of design variability which demand modeling and characterization resources. At the same time, we are entering a regime where the averaging effect of the law of large numbers is becoming weaker, resulting in an increase in influence of fundamental atomistic variations. Phenomena like channel dopant fluctuations [2] and line-edge roughness [3] are creating a random variability noise floor which is difficult to get around without significant process impact. The result of the increase in these, and other sources of variability is a corresponding increase in important circuit phenomena like SRAM stability and leakage power variations. The net result is a gradual breakdown of the traditional \";device model + design rule\"; contact between design and manufacturing, and a corresponding lack of predictability in fabrication outcome that is endangering the profitability of Silicon semiconductor manufacturing as we enter what may be the last handful of generations of CMOS. This lack of predictability is happening because of two important factors. ldr The overall CMOS technology slowdown has led to rapidly increasing complexity in the process and in its interaction with design. This has in turn caused an increase in the number and magnitude of systematic sources of mismatch between simulation models (both at the circuit simulation and timing levels) and hardware measurements. ldr Manufacturing variability, both systematic and random, -long a source of concern only for analog design- is becoming important for digital designs as well and thus its prediction is now a first order priority. However, it is competing for the attention of researchers and CAD developers with a host of other so-called nm effects, thus slowing down the delivery of needed solutions. The result is (a) our ability to arbitrarily compose a design out of disparate components is compromised because of a high degree of interaction between these components , and (b) our ability to predict the nominal performance of a design as well as its tolerances and sensitivities is in danger. In this talk, we will review these issues and show how they are all related to the cor","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129572803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Statistical gate delay model for Multiple Input Switching 多输入开关的统计门延迟模型
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483959
Takayuki Fukuoka, A. Tsuchiya, H. Onodera
In this paper, we propose a calculation method of gate delay for SSTA (Statistical Static Timing Analysis) considering MIS (Multiple Input Switching). Most SSTA approaches assume a single input switching model and ignore the effect of MIS on gate delay. MIS occurs when multiple inputs of a gate switch nearly simultaneously. Thus, ignoring MIS causes error in MAX operation in SSTA. We propose a statistical gate delay model considering MIS. We verify the proposed method by SPICE based Monte Carlo simulations and experimental results show that the proposed method improves the error due to ignoring MIS.
在本文中,我们提出了一种考虑多输入开关(MIS)的SSTA(统计静态时序分析)的门延迟计算方法。大多数SSTA方法采用单输入开关模型,忽略了MIS对门延迟的影响。当一个门开关的多个输入几乎同时发生时,MIS就发生了。因此,忽略MIS会导致SSTA中MAX操作出错。我们提出了一个考虑MIS的统计门延迟模型。通过基于SPICE的蒙特卡罗仿真验证了所提方法的有效性,实验结果表明,所提方法能有效地改善由于忽略MIS而导致的误差。
{"title":"Statistical gate delay model for Multiple Input Switching","authors":"Takayuki Fukuoka, A. Tsuchiya, H. Onodera","doi":"10.1109/ASPDAC.2008.4483959","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483959","url":null,"abstract":"In this paper, we propose a calculation method of gate delay for SSTA (Statistical Static Timing Analysis) considering MIS (Multiple Input Switching). Most SSTA approaches assume a single input switching model and ignore the effect of MIS on gate delay. MIS occurs when multiple inputs of a gate switch nearly simultaneously. Thus, ignoring MIS causes error in MAX operation in SSTA. We propose a statistical gate delay model considering MIS. We verify the proposed method by SPICE based Monte Carlo simulations and experimental results show that the proposed method improves the error due to ignoring MIS.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132143205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
ORB: An on-chip optical ring bus communication architecture for multi-processor systems-on-chip ORB:用于片上多处理器系统的片上光环总线通信体系结构
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484059
S. Pasricha, N. Dutt
As application complexity continues to increase, multiprocessor systems-on-chip (MPSoC) with tens to hundreds of processing cores are becoming the norm. While computational cores have become faster with each successive technology generation, communication between them has become a bottleneck that limits overall chip performance. On-chip optical interconnects can overcome this bottleneck by replacing electrical wires with optical waveguides. In this paper we propose an optical ring bus (ORB) based on-chip communication architecture for next generation MPSoCs. ORB uses an optical ring waveguide to replace global pipelined electrical interconnects while preserving the interface with today's bus protocol standards such as AMBA AXI. We present experiments to show how ORB has the potential to provide superior performance (more than 2times) and significantly lower power consumption (a reduction of more than 10times) compared to traditionally used pipelined, all-electrical bus-based communication architectures, for 65-22 nm technology nodes.
随着应用复杂性的不断增加,具有数十到数百个处理核心的多处理器片上系统(MPSoC)正成为常态。随着每一代技术的发展,计算核心的速度越来越快,但它们之间的通信已经成为限制芯片整体性能的瓶颈。片上光互连可以通过用光波导取代电线来克服这一瓶颈。本文提出了一种基于光环总线(ORB)的下一代mpsoc片上通信架构。ORB使用光环波导来取代全球流水线电气互连,同时保留当今总线协议标准(如AMBA AXI)的接口。我们提出的实验表明,与传统使用的流水线、基于全电总线的通信架构相比,ORB在65-22 nm技术节点上具有提供卓越性能(超过2倍)和显着降低功耗(降低10倍以上)的潜力。
{"title":"ORB: An on-chip optical ring bus communication architecture for multi-processor systems-on-chip","authors":"S. Pasricha, N. Dutt","doi":"10.1109/ASPDAC.2008.4484059","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4484059","url":null,"abstract":"As application complexity continues to increase, multiprocessor systems-on-chip (MPSoC) with tens to hundreds of processing cores are becoming the norm. While computational cores have become faster with each successive technology generation, communication between them has become a bottleneck that limits overall chip performance. On-chip optical interconnects can overcome this bottleneck by replacing electrical wires with optical waveguides. In this paper we propose an optical ring bus (ORB) based on-chip communication architecture for next generation MPSoCs. ORB uses an optical ring waveguide to replace global pipelined electrical interconnects while preserving the interface with today's bus protocol standards such as AMBA AXI. We present experiments to show how ORB has the potential to provide superior performance (more than 2times) and significantly lower power consumption (a reduction of more than 10times) compared to traditionally used pipelined, all-electrical bus-based communication architectures, for 65-22 nm technology nodes.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129394749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 55
Localized random access scan: Towards low area and routing overhead 局部随机访问扫描:面向低区域和路由开销
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484016
Yu Hu, Xiang Fu, Xiaoxin Fan, H. Fujiwara
Conventional random access scan (RAS) designs, although economic in test power dissipation, test application time and test data volume, are expensive in area and routing overhead. In this paper, we present a localized RAS architecture (LRAS) to address this issue. A novel scan cell structure, which has fewer transistors than the multiplexer-type scan cell, is proposed to eliminate the global test enable signal and to localize the row enable and the column enable signals. Experimental results on ISCAS'89 and ITC'99 benchmark circuits demonstrate that LRAS has 54% less area overhead than multiplexer-type scan chain based designs, while significantly outperforms the state-of-the-art RAS scheme in routing overhead.
传统的随机存取扫描(RAS)设计虽然在测试功耗、测试应用时间和测试数据量方面具有经济性,但在面积和路由开销方面却很昂贵。在本文中,我们提出了一个本地化的RAS体系结构(LRAS)来解决这个问题。提出了一种新型扫描单元结构,该结构比复用器型扫描单元具有更少的晶体管,可以消除全局测试使能信号,并对行使能信号和列使能信号进行局部化。在ISCAS'89和ITC'99基准电路上的实验结果表明,LRAS比基于多路复用器型扫描链的设计减少了54%的面积开销,同时在路由开销方面显著优于最先进的RAS方案。
{"title":"Localized random access scan: Towards low area and routing overhead","authors":"Yu Hu, Xiang Fu, Xiaoxin Fan, H. Fujiwara","doi":"10.1109/ASPDAC.2008.4484016","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4484016","url":null,"abstract":"Conventional random access scan (RAS) designs, although economic in test power dissipation, test application time and test data volume, are expensive in area and routing overhead. In this paper, we present a localized RAS architecture (LRAS) to address this issue. A novel scan cell structure, which has fewer transistors than the multiplexer-type scan cell, is proposed to eliminate the global test enable signal and to localize the row enable and the column enable signals. Experimental results on ISCAS'89 and ITC'99 benchmark circuits demonstrate that LRAS has 54% less area overhead than multiplexer-type scan chain based designs, while significantly outperforms the state-of-the-art RAS scheme in routing overhead.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134276666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Full-chip thermal analysis for the early design stage via generalized integral transforms 基于广义积分变换的早期设计全芯片热分析
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483995
Pei-Yu Huang, Chih-Kang Lin, Yu-Min Lee
The capability of predicting the temperature profile is critically important for circuit timing estimation, leakage reduction, power estimation, hotspot avoidance, and reliability concerns during modern IC designs. This paper presents an accurate and fast analytical full-chip thermal simulator for the early-stage temperature-aware chip design. By using the technique of generalized integral transforms (GIT), our proposed method can accurately estimate the temperature distribution of full-chip with very small truncation points of bases in the spatial domain. We also develop a fast Fourier transform (FFT) like evaluating algorithm to efficiently evaluate the temperature distribution. Experimental results confirm that our GIT based analyzer can achieve an order of magnitude speedup compared with a highly efficient Green's function based method.
在现代IC设计中,预测温度分布的能力对于电路时序估计,减少泄漏,功率估计,热点避免和可靠性问题至关重要。本文提出了一种精确、快速的全芯片热模拟器,用于早期温度感知芯片的设计。该方法利用广义积分变换技术,可以在极小的碱基截断点的情况下准确估计出全芯片的温度分布。我们还开发了一种快速傅立叶变换(FFT)评估算法来有效地评估温度分布。实验结果表明,与基于格林函数的高效方法相比,基于GIT的分析仪可以实现一个数量级的加速。
{"title":"Full-chip thermal analysis for the early design stage via generalized integral transforms","authors":"Pei-Yu Huang, Chih-Kang Lin, Yu-Min Lee","doi":"10.1109/ASPDAC.2008.4483995","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483995","url":null,"abstract":"The capability of predicting the temperature profile is critically important for circuit timing estimation, leakage reduction, power estimation, hotspot avoidance, and reliability concerns during modern IC designs. This paper presents an accurate and fast analytical full-chip thermal simulator for the early-stage temperature-aware chip design. By using the technique of generalized integral transforms (GIT), our proposed method can accurately estimate the temperature distribution of full-chip with very small truncation points of bases in the spatial domain. We also develop a fast Fourier transform (FFT) like evaluating algorithm to efficiently evaluate the temperature distribution. Experimental results confirm that our GIT based analyzer can achieve an order of magnitude speedup compared with a highly efficient Green's function based method.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133219577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
REWIRED - Register Write Inhibition by Resource Dedication 通过资源奉献来抑制寄存器写
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483960
Pushkar Tripathi, Rohan Jain, S. Kurra, P. Panda
We propose REWIRED (register write inhibition by resource dedication), a technique for reducing power during high level synthesis (HLS) by selectively inhibiting the storage of function unit (FU) output data into registers. Registers are generally inferred in HLS when data produced in one clock cycle is used in a later cycle. However, when it can be established that the input registers to an FU are not changing values during a certain period, the outputs during this period can be directly read off the FU output pins without needing to store them in registers. When the life-times of such data are short, it may be possible to completely eliminate the register storage operation, thereby reducing power. We present a genetic algorithm formulation and a heuristic for maximizing the number of register stores that can be inhibited in a scheduled data flow graph (DFG) during behavioral synthesis.
我们提出REWIRED(寄存器写入抑制资源分配),这是一种通过选择性地抑制功能单元(FU)输出数据存储到寄存器中来降低高级合成(HLS)期间功耗的技术。当在一个时钟周期中产生的数据在下一个周期中使用时,HLS通常推断寄存器。然而,当可以确定FU的输入寄存器在一段时间内不改变值时,这段时间内的输出可以直接从FU的输出引脚读取,而无需将其存储在寄存器中。当这些数据的寿命很短时,可以完全消除寄存器存储操作,从而降低功耗。我们提出了一种遗传算法公式和一种启发式算法,用于在行为合成过程中最大化可在调度数据流图(DFG)中被抑制的寄存器存储数量。
{"title":"REWIRED - Register Write Inhibition by Resource Dedication","authors":"Pushkar Tripathi, Rohan Jain, S. Kurra, P. Panda","doi":"10.1109/ASPDAC.2008.4483960","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483960","url":null,"abstract":"We propose REWIRED (register write inhibition by resource dedication), a technique for reducing power during high level synthesis (HLS) by selectively inhibiting the storage of function unit (FU) output data into registers. Registers are generally inferred in HLS when data produced in one clock cycle is used in a later cycle. However, when it can be established that the input registers to an FU are not changing values during a certain period, the outputs during this period can be directly read off the FU output pins without needing to store them in registers. When the life-times of such data are short, it may be possible to completely eliminate the register storage operation, thereby reducing power. We present a genetic algorithm formulation and a heuristic for maximizing the number of register stores that can be inhibited in a scheduled data flow graph (DFG) during behavioral synthesis.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125851024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Ordered escape routing based on Boolean satisfiability 基于布尔可满足性的有序转义路由
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483950
Lijuan Luo, Martin D. F. Wong
Routing for high-speed boards is largely a time-consuming manual task today. In this work we consider the ordered escape routing problem which is a key problem in board-level routing. All existing approaches to this problem cannot guarantee to find a routing solution even if one exists. We present an algorithm to exactly solve this problem based on Boolean satisfiability. Experimental results on escape routing problems from industry show that our algorithm performs well.
今天,高速板的路由在很大程度上是一项耗时的手工任务。本文研究了有序逃逸路由问题,这是板级路由中的一个关键问题。所有现有的解决这个问题的方法都不能保证找到一个路由解决方案,即使它存在。提出了一种基于布尔可满足性的精确求解算法。对工业中逃逸路由问题的实验结果表明,该算法具有良好的性能。
{"title":"Ordered escape routing based on Boolean satisfiability","authors":"Lijuan Luo, Martin D. F. Wong","doi":"10.1109/ASPDAC.2008.4483950","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483950","url":null,"abstract":"Routing for high-speed boards is largely a time-consuming manual task today. In this work we consider the ordered escape routing problem which is a key problem in board-level routing. All existing approaches to this problem cannot guarantee to find a routing solution even if one exists. We present an algorithm to exactly solve this problem based on Boolean satisfiability. Experimental results on escape routing problems from industry show that our algorithm performs well.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130243295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 43
Reconfigurable RTD-based circuit elements of complete logic functionality 可重构的基于rtd的电路元件的完整逻辑功能
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484045
Yexin Zheng, Chao Huang
Resonant tunneling diodes (RTDs) have demonstrated promising circuit characteristics of high speed switching property and versatile functionality with negative differential resistance (NDR). In this paper, we propose novel programmable logic elements (PLEs) that can be configured to realize all three- or four-input logic functions. These simple RTD-based circuit elements are implemented with threshold gates (TGs) and multi-threshold threshold gates (MTTGs) by employing programmable monostable-bistable logic element (MOBILE) principles. We also developed a dynamically reconfigurable scheme based on our PLE structures which facilitate nanopipelining without incurring delay overheads.
谐振隧道二极管(rtd)具有高速开关特性和具有负差分电阻(NDR)的多用途功能,具有良好的电路特性。在本文中,我们提出了新的可编程逻辑元件(ple),可以配置实现所有三或四输入逻辑功能。这些简单的基于rtd的电路元件采用可编程单稳-双稳逻辑元件(MOBILE)原理,由阈值门(tg)和多阈值门(mttg)实现。我们还开发了一种基于我们的PLE结构的动态可重构方案,该方案促进了纳米管道而不会产生延迟开销。
{"title":"Reconfigurable RTD-based circuit elements of complete logic functionality","authors":"Yexin Zheng, Chao Huang","doi":"10.1109/ASPDAC.2008.4484045","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4484045","url":null,"abstract":"Resonant tunneling diodes (RTDs) have demonstrated promising circuit characteristics of high speed switching property and versatile functionality with negative differential resistance (NDR). In this paper, we propose novel programmable logic elements (PLEs) that can be configured to realize all three- or four-input logic functions. These simple RTD-based circuit elements are implemented with threshold gates (TGs) and multi-threshold threshold gates (MTTGs) by employing programmable monostable-bistable logic element (MOBILE) principles. We also developed a dynamically reconfigurable scheme based on our PLE structures which facilitate nanopipelining without incurring delay overheads.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129971273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
LP based white space redistribution for thermal via planning and performance optimization in 3D ICs 基于LP的白色空间再分配,通过规划和性能优化的3D集成电路
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483942
Xin Li, Yuchun Ma, Xianlong Hong, Sheqin Dong, J. Cong
Thermal issue is a critical challenge in 3D IC circuit design. Incorporating thermal vias into 3D IC is a promising way to mitigate thermal issues by lowering down the thermal resistances between device layers. However, it is usually difficult to get enough space at target regions to insert thermal vias. In this paper, we propose a novel analytical algorithm to re-allocate white space for 3D ICs to facilitate via insertion. Experimental results show that after reallocating whitespaces, thermal vias and total wirelength could be reduced by 14% and by 2%, respectively. It also shows that whitespace distribution with via planning alone will degrade performance by 9% while performance-aware via planning method can reduce thermal via number by 60% and the performance is kept nearly unchanged.
热问题是三维集成电路设计中的一个关键问题。通过降低器件层之间的热阻,将热通孔集成到3D集成电路中是一种很有前途的解决热问题的方法。然而,通常很难在目标区域获得足够的空间来插入热过孔。在本文中,我们提出了一种新的解析算法来重新分配三维集成电路的空白空间,以方便通过插入。实验结果表明,重新分配空白空间后,热通孔和总波长分别减少14%和2%。研究还表明,单独采用间隙规划的空白分布将使性能下降9%,而性能敏感的间隙规划方法可以减少60%的热孔数,并且性能几乎保持不变。
{"title":"LP based white space redistribution for thermal via planning and performance optimization in 3D ICs","authors":"Xin Li, Yuchun Ma, Xianlong Hong, Sheqin Dong, J. Cong","doi":"10.1109/ASPDAC.2008.4483942","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483942","url":null,"abstract":"Thermal issue is a critical challenge in 3D IC circuit design. Incorporating thermal vias into 3D IC is a promising way to mitigate thermal issues by lowering down the thermal resistances between device layers. However, it is usually difficult to get enough space at target regions to insert thermal vias. In this paper, we propose a novel analytical algorithm to re-allocate white space for 3D ICs to facilitate via insertion. Experimental results show that after reallocating whitespaces, thermal vias and total wirelength could be reduced by 14% and by 2%, respectively. It also shows that whitespace distribution with via planning alone will degrade performance by 9% while performance-aware via planning method can reduce thermal via number by 60% and the performance is kept nearly unchanged.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116986601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
期刊
2008 Asia and South Pacific Design Automation Conference
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1