首页 > 最新文献

Proceedings Design, Automation and Test in Europe Conference and Exhibition最新文献

英文 中文
Impact of data transformations on memory bank locality 数据转换对内存库局部性的影响
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268896
M. Kandemir
High-energy consumption presents a problem for sustainable clock frequency and deliverable performance. In particular, memory energy consumption of array-intensive applications can be overwhelming due to poor cache locality. One option for reducing memory energy is to adopt a banked memory architecture, where memory space is divided into banks and each bank can be powered down if it is not in active use. An important issue here is the bank access pattern, which determines opportunities for saving energy. In this paper, we present a compiler-based data layout transformation strategy for increasing the effectiveness of a banked memory architecture. The idea is to transform the array layouts in memory in such a way that two loop iterations executed one after another access the data in the same bank as much as possible; the remaining banks can be placed into a low-power mode. Our simulation-based experiments with nine array-intensive applications show significant savings in memory energy consumption.
高能耗对可持续时钟频率和可交付性能提出了一个问题。特别是,由于缓存局部性差,存储密集型应用程序的内存能量消耗可能会非常大。减少内存能量的一种选择是采用存储内存架构,其中内存空间被划分为多个存储库,如果每个存储库没有被积极使用,则可以关闭电源。这里的一个重要问题是银行访问模式,它决定了节约能源的机会。在本文中,我们提出了一种基于编译器的数据布局转换策略,以提高存储内存体系结构的有效性。其思想是转换内存中的数组布局,使两个循环迭代一个接一个地访问同一银行中的数据尽可能多;其余的银行可以设置为低功耗模式。我们对9个阵列密集型应用程序进行了基于模拟的实验,结果显示内存能耗显著降低。
{"title":"Impact of data transformations on memory bank locality","authors":"M. Kandemir","doi":"10.1109/DATE.2004.1268896","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268896","url":null,"abstract":"High-energy consumption presents a problem for sustainable clock frequency and deliverable performance. In particular, memory energy consumption of array-intensive applications can be overwhelming due to poor cache locality. One option for reducing memory energy is to adopt a banked memory architecture, where memory space is divided into banks and each bank can be powered down if it is not in active use. An important issue here is the bank access pattern, which determines opportunities for saving energy. In this paper, we present a compiler-based data layout transformation strategy for increasing the effectiveness of a banked memory architecture. The idea is to transform the array layouts in memory in such a way that two loop iterations executed one after another access the data in the same bank as much as possible; the remaining banks can be placed into a low-power mode. Our simulation-based experiments with nine array-intensive applications show significant savings in memory energy consumption.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114305133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Impact of test point insertion on silicon area and timing during layout 测试点插入对布局时硅面积和时间的影响
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268981
H. Vranken, Ferry Syafei Sapei, H. Wunderlich
This paper presents an experimental investigation on the impact of test point insertion on circuit size and performance. Often test points are inserted into a circuit in order to improve the circuit's testability, which results in smaller test data volume, shorter test time, and higher fault coverage. Inserting test points however requires additional silicon area and influences the timing of a circuit. The paper shows how placement and routing is affected by test point insertion during layout generation. Experimental data for industrial circuits show that inserting 1% test points in general increases the silicon area after layout by less than 0.5% while the performance of the circuit may be reduced by 5% or more.
本文对测试点插入对电路尺寸和性能的影响进行了实验研究。为了提高电路的可测试性,通常会在电路中插入测试点,从而实现更小的测试数据量、更短的测试时间和更高的故障覆盖率。然而,插入测试点需要额外的硅面积,并影响电路的时序。本文阐述了在版图生成过程中,测试点的插入对布局和布线的影响。工业电路的实验数据表明,一般来说,插入1%的测试点可使布局后的硅面积增加不到0.5%,而电路的性能可能下降5%或更多。
{"title":"Impact of test point insertion on silicon area and timing during layout","authors":"H. Vranken, Ferry Syafei Sapei, H. Wunderlich","doi":"10.1109/DATE.2004.1268981","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268981","url":null,"abstract":"This paper presents an experimental investigation on the impact of test point insertion on circuit size and performance. Often test points are inserted into a circuit in order to improve the circuit's testability, which results in smaller test data volume, shorter test time, and higher fault coverage. Inserting test points however requires additional silicon area and influences the timing of a circuit. The paper shows how placement and routing is affected by test point insertion during layout generation. Experimental data for industrial circuits show that inserting 1% test points in general increases the silicon area after layout by less than 0.5% while the performance of the circuit may be reduced by 5% or more.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114524513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
A game theoretic approach to low energy wireless video streaming 低能量无线视频流的博弈论方法
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268935
A. Iranli, Kihwan Choi, Massoud Pedram
This paper presents a dynamic energy management policy for a wireless video streaming system, consisting of battery-powered client and server. The paper starts from the observation that the video quality in wireless streaming is a function of three factors: encoding aptitude of the server, decoding aptitude of the client, and the wireless channel. Based on this observation, the energy consumption of a wireless video streaming system is modeled and analyzed. Using the proposed model, the optimal energy assignment to each video frame is done such that the maximum system lifetime is achieved while satisfying a given minimum video quality requirement. Experimental results show that the proposed policy increases the system lifetime by 20%.
本文提出了一种由电池供电的客户端和服务器组成的无线视频流系统的动态能量管理策略。本文从观察到无线流媒体中的视频质量是三个因素的函数:服务器的编码能力、客户端的解码能力和无线信道。在此基础上,对无线视频流系统的能耗进行了建模和分析。利用所提出的模型,对每个视频帧进行最优的能量分配,从而在满足给定的最低视频质量要求的同时实现最大的系统寿命。实验结果表明,该策略使系统寿命提高了20%。
{"title":"A game theoretic approach to low energy wireless video streaming","authors":"A. Iranli, Kihwan Choi, Massoud Pedram","doi":"10.1109/DATE.2004.1268935","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268935","url":null,"abstract":"This paper presents a dynamic energy management policy for a wireless video streaming system, consisting of battery-powered client and server. The paper starts from the observation that the video quality in wireless streaming is a function of three factors: encoding aptitude of the server, decoding aptitude of the client, and the wireless channel. Based on this observation, the energy consumption of a wireless video streaming system is modeled and analyzed. Using the proposed model, the optimal energy assignment to each video frame is done such that the maximum system lifetime is achieved while satisfying a given minimum video quality requirement. Experimental results show that the proposed policy increases the system lifetime by 20%.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114564930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Aspects of formal and graphical design of a bus system 公交系统的形式设计和图形设计
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268879
T. Seceleanu, T. Westerlund
This study shows the derivation of a local segmented bus arbiter from an original single segment bus arbiter. The operations are performed in the formal framework of action systems and illustrated in a graphical manner using the corresponding action systems - UML profile notations. The derivation is useful both to demonstrate the capability of preserving correctness when considering an important hardware design decision and also to identify means through which this kind of decisions can be performed in a graphical environment.
本研究展示了从原始的单段总线仲裁器衍生出的局部分段总线仲裁器。操作在操作系统的正式框架中执行,并使用相应的操作系统UML概要符号以图形方式进行说明。这个推导对于演示在考虑重要的硬件设计决策时保持正确性的能力以及确定在图形环境中执行此类决策的方法都很有用。
{"title":"Aspects of formal and graphical design of a bus system","authors":"T. Seceleanu, T. Westerlund","doi":"10.1109/DATE.2004.1268879","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268879","url":null,"abstract":"This study shows the derivation of a local segmented bus arbiter from an original single segment bus arbiter. The operations are performed in the formal framework of action systems and illustrated in a graphical manner using the corresponding action systems - UML profile notations. The derivation is useful both to demonstrate the capability of preserving correctness when considering an important hardware design decision and also to identify means through which this kind of decisions can be performed in a graphical environment.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124315332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Timing correction and optimization with adaptive delay sequential elements 自适应延迟序列元件的时序校正与优化
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269114
K. Rahimi, S. Bridges, C. Diorio
This paper introduces adaptive delay sequential elements (ADSEs). ADSEs are registers that use nonvolatile, floating-gate transistors to tune their internal clock delays. We propose ADSEs for correcting timing violations and optimizing circuit performance. We present an ADSE circuit example, system architecture, and tuning methodology. We present experimental results that demonstrate the correct operation of our example circuit and discuss the die-area impact of using ADSEs. Our experiments also show that voltage and temperature sensitivity of ADSEs are comparable to non-adaptive flip-flops.
介绍了自适应延迟顺序元件(ADSEs)。ads是使用非易失性、浮栅晶体管来调整其内部时钟延迟的寄存器。我们提出了用于纠正时序违规和优化电路性能的ads。我们提出了一个ADSE电路示例,系统架构和调谐方法。我们给出了实验结果,证明了我们的示例电路的正确工作,并讨论了使用adse对模面积的影响。我们的实验还表明,adse的电压和温度灵敏度与非自适应触发器相当。
{"title":"Timing correction and optimization with adaptive delay sequential elements","authors":"K. Rahimi, S. Bridges, C. Diorio","doi":"10.1109/DATE.2004.1269114","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269114","url":null,"abstract":"This paper introduces adaptive delay sequential elements (ADSEs). ADSEs are registers that use nonvolatile, floating-gate transistors to tune their internal clock delays. We propose ADSEs for correcting timing violations and optimizing circuit performance. We present an ADSE circuit example, system architecture, and tuning methodology. We present experimental results that demonstrate the correct operation of our example circuit and discuss the die-area impact of using ADSEs. Our experiments also show that voltage and temperature sensitivity of ADSEs are comparable to non-adaptive flip-flops.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127873890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Guaranteed bandwidth using looped containers in temporally disjoint networks within the nostrum network on chip 在芯片上的nostrum网络内的暂时不连接的网络中使用环形容器保证带宽
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269001
Mikael Millberg, E. Nilsson, R. Thid, A. Jantsch
In today's emerging network-on-chips, there is a need for different traffic classes with different quality-of-service guarantees. Within our NoC architecture nostrum, we have implemented a service of guaranteed bandwidth (GB), and latency, in addition to the already existing service of best-effort (BE) packet delivery. The guaranteed bandwidth is accessed via virtual circuits (VC). The VCs are implemented using a combination of two concepts that we call 'Looped Containers' and 'Temporally Disjoint Networks'. The looped containers are used to guarantee access to the network-independently of the current network load without dropping packets; and the TDNs are used in order to achieve several VCs, plus ordinary BE traffic, in the network. The TDNs are a consequence of the deflective routing policy used, and gives rise to an explicit time-division-multiplexing within the network. To prove our concept an HDL implementation has been synthesised and simulated. The cost in terms of additional hardware needed, as well as additional bandwidth is very low-less than 2 percent in both cases! Simulations showed that ordinary BE traffic is practically unaffected by the VCs.
在当今新兴的片上网络中,需要具有不同服务质量保证的不同流量类别。在我们的NoC架构中,除了现有的尽力而为(best-effort, BE)包交付服务之外,我们还实现了保证带宽(GB)和延迟的服务。保证的带宽是通过虚拟电路(VC)访问的。vc的实现结合了两个概念,我们称之为“环形容器”和“暂时分离网络”。环形容器用于保证对网络的访问——独立于当前网络负载而不丢弃数据包;tdn用于在网络中实现多个vc和普通BE流量。tdn是使用偏转路由策略的结果,并在网络中产生显式的时分多路复用。为了证明我们的概念,合成了一个HDL实现并进行了仿真。就所需的额外硬件和额外带宽而言,成本非常低——在这两种情况下都不到2% !仿真表明,普通的BE流量实际上不受vc的影响。
{"title":"Guaranteed bandwidth using looped containers in temporally disjoint networks within the nostrum network on chip","authors":"Mikael Millberg, E. Nilsson, R. Thid, A. Jantsch","doi":"10.1109/DATE.2004.1269001","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269001","url":null,"abstract":"In today's emerging network-on-chips, there is a need for different traffic classes with different quality-of-service guarantees. Within our NoC architecture nostrum, we have implemented a service of guaranteed bandwidth (GB), and latency, in addition to the already existing service of best-effort (BE) packet delivery. The guaranteed bandwidth is accessed via virtual circuits (VC). The VCs are implemented using a combination of two concepts that we call 'Looped Containers' and 'Temporally Disjoint Networks'. The looped containers are used to guarantee access to the network-independently of the current network load without dropping packets; and the TDNs are used in order to achieve several VCs, plus ordinary BE traffic, in the network. The TDNs are a consequence of the deflective routing policy used, and gives rise to an explicit time-division-multiplexing within the network. To prove our concept an HDL implementation has been synthesised and simulated. The cost in terms of additional hardware needed, as well as additional bandwidth is very low-less than 2 percent in both cases! Simulations showed that ordinary BE traffic is practically unaffected by the VCs.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126218891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 429
Polynomial abstraction for verification of sequentially implemented combinational circuits 验证顺序实现组合电路的多项式抽象
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268933
Tarvo Raudvere, A. Singh, I. Sander, A. Jantsch
Today's integrated circuits with increasing complexity cause the well known state space explosion problem in verification tools. In order to handle this problem a much simpler abstract model of the design has to be created for verification. We introduce the polynomial abstraction technique, which efficiently simplifies the verification task of sequential design blocks whose functionality can be expressed as a polynomial. Through our technique, the domains of possible values of data input signals can be reduced. This is done in such a way that the abstract model is still valid for model checking of the design functionality in terms of the system's control and data properties. We incorporate polynomial abstraction into the ForSyDe methodology, for the verification of clock domain design refinements.
当今集成电路的复杂性日益增加,导致了验证工具中众所周知的状态空间爆炸问题。为了处理这个问题,必须创建一个更简单的设计抽象模型来进行验证。引入多项式抽象技术,有效地简化了功能可以用多项式表示的顺序设计块的验证任务。通过我们的技术,可以减小数据输入信号可能值的域。这是通过这样一种方式完成的,即抽象模型对于根据系统的控制和数据属性对设计功能进行模型检查仍然有效。我们将多项式抽象纳入到ForSyDe方法中,以验证时钟域设计的改进。
{"title":"Polynomial abstraction for verification of sequentially implemented combinational circuits","authors":"Tarvo Raudvere, A. Singh, I. Sander, A. Jantsch","doi":"10.1109/DATE.2004.1268933","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268933","url":null,"abstract":"Today's integrated circuits with increasing complexity cause the well known state space explosion problem in verification tools. In order to handle this problem a much simpler abstract model of the design has to be created for verification. We introduce the polynomial abstraction technique, which efficiently simplifies the verification task of sequential design blocks whose functionality can be expressed as a polynomial. Through our technique, the domains of possible values of data input signals can be reduced. This is done in such a way that the abstract model is still valid for model checking of the design functionality in terms of the system's control and data properties. We incorporate polynomial abstraction into the ForSyDe methodology, for the verification of clock domain design refinements.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125575149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A fast word-level statistical estimator of intra-bus crosstalk 总线内串扰的快速字级统计估计
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269041
Suvodeep Gupta, S. Katkoori
Given word-level statistics, namely mean, standard deviation, and lag-one temporal correlation of input data, we estimate the bit-level crosstalk probability on a system bus using a non-enumerative statistical approach. We introduce a sampling technique for fast evaluation of integrals during the estimation process. We had proposed two techniques previously - (a) a stream-based estimator that counts crosstalk events on a bus; and (b) a statistical enumeration technique that enumerates crosstalk-producing values on a bus and computes their occurrence probability. Both these techniques suffer from exponential time complexity with respect to the bus-width. In this work, we propose a statistical non-enumerative technique that has linear time complexity with respect to the bus-width. We achieve the linear complexity by resorting to: (1) manipulating the data stream to make the crosstalk-producing values contiguous and (2) sampling the distribution function and storing it as a lookup table. Experimental results for data streams from different data environments are presented, compared against the stream-based approach. Average errors of less than 12% are obtained for bus-widths ranging from 8b to 32b.
给定字级统计,即输入数据的平均值、标准差和滞后时间相关性,我们使用非枚举统计方法估计系统总线上的位级串扰概率。在估计过程中,我们引入了一种快速求积分的采样技术。我们之前提出了两种技术:(a)基于流的估计器,用于统计总线上的串扰事件;(b)统计枚举技术,枚举总线上产生串音的值并计算它们的发生概率。这两种技术都有相对于总线宽度的指数时间复杂度。在这项工作中,我们提出了一种统计非枚举技术,它具有相对于总线宽度的线性时间复杂度。我们通过以下方式实现线性复杂性:(1)操纵数据流使产生串扰的值相邻;(2)对分布函数进行采样并将其存储为查找表。给出了不同数据环境下的数据流的实验结果,并与基于流的方法进行了比较。母线宽度在8b ~ 32b范围内,平均误差小于12%。
{"title":"A fast word-level statistical estimator of intra-bus crosstalk","authors":"Suvodeep Gupta, S. Katkoori","doi":"10.1109/DATE.2004.1269041","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269041","url":null,"abstract":"Given word-level statistics, namely mean, standard deviation, and lag-one temporal correlation of input data, we estimate the bit-level crosstalk probability on a system bus using a non-enumerative statistical approach. We introduce a sampling technique for fast evaluation of integrals during the estimation process. We had proposed two techniques previously - (a) a stream-based estimator that counts crosstalk events on a bus; and (b) a statistical enumeration technique that enumerates crosstalk-producing values on a bus and computes their occurrence probability. Both these techniques suffer from exponential time complexity with respect to the bus-width. In this work, we propose a statistical non-enumerative technique that has linear time complexity with respect to the bus-width. We achieve the linear complexity by resorting to: (1) manipulating the data stream to make the crosstalk-producing values contiguous and (2) sampling the distribution function and storing it as a lookup table. Experimental results for data streams from different data environments are presented, compared against the stream-based approach. Average errors of less than 12% are obtained for bus-widths ranging from 8b to 32b.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128202231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Re-configurable bus encoding scheme for reducing power consumption of the cross coupling capacitance for deep sub-micron instruction bus 降低深亚微米指令总线交叉耦合电容功耗的可重构总线编码方案
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1268838
S. Wong, C. Tsui
In very deep sub-micron designs, cross coupling capacitances become the dominant factor of the total bus loading and have a significant impact on the power consumption. In this paper, we propose two reconfigurable bus encoding schemes, which are based on the correlation among the bit lines, to reduce the power consumption at the cross coupling capacitances of the instruction buses. The instruction is encoded by flipping and reordering the bit lines during compilation time to reduce the total switching capacitances. A crossbar is used to map back the data to the original instruction code before sending to the instruction decoder. The reordering can be re-configured during run-time by using different configurations in the crossbar. We propose two types of re-configuration, static and dynamic. Static coding uses a fix flipping and re-configuring pattern after the corresponding program is compiled. Dynamic coding allows different re-configuring patterns during program execution. Experimental results show that by using the proposed schemes, significant energy reduction, 17-23%, can be achieved. Comparisons with existing bit lines reordering encoding scheme have also been made and on average more than 15% reduction can be obtained using our method.
在极深亚微米设计中,交叉耦合电容成为总母线负载的主导因素,并对功耗产生重大影响。本文提出了两种基于位线相关性的可重构总线编码方案,以降低指令总线交叉耦合电容的功耗。该指令在编译期间通过翻转和重新排序位线来编码,以减小总开关电容。在发送到指令解码器之前,使用交叉条将数据映射回原始指令代码。通过在交叉栏中使用不同的配置,可以在运行期间重新配置重新排序。我们提出两种类型的重新配置,静态和动态。静态编码在相应的程序编译后使用固定翻转和重新配置模式。动态编码允许在程序执行期间使用不同的重新配置模式。实验结果表明,采用该方案可实现17-23%的显著节能。并与现有的位线重排序编码方案进行了比较,结果表明,该方法平均可使编码降低15%以上。
{"title":"Re-configurable bus encoding scheme for reducing power consumption of the cross coupling capacitance for deep sub-micron instruction bus","authors":"S. Wong, C. Tsui","doi":"10.1109/DATE.2004.1268838","DOIUrl":"https://doi.org/10.1109/DATE.2004.1268838","url":null,"abstract":"In very deep sub-micron designs, cross coupling capacitances become the dominant factor of the total bus loading and have a significant impact on the power consumption. In this paper, we propose two reconfigurable bus encoding schemes, which are based on the correlation among the bit lines, to reduce the power consumption at the cross coupling capacitances of the instruction buses. The instruction is encoded by flipping and reordering the bit lines during compilation time to reduce the total switching capacitances. A crossbar is used to map back the data to the original instruction code before sending to the instruction decoder. The reordering can be re-configured during run-time by using different configurations in the crossbar. We propose two types of re-configuration, static and dynamic. Static coding uses a fix flipping and re-configuring pattern after the corresponding program is compiled. Dynamic coding allows different re-configuring patterns during program execution. Experimental results show that by using the proposed schemes, significant energy reduction, 17-23%, can be achieved. Comparisons with existing bit lines reordering encoding scheme have also been made and on average more than 15% reduction can be obtained using our method.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130834465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Mapping multi-million gate SoCs on FPGAs: industrial methodology and experience 在fpga上映射数百万门soc:工业方法和经验
Pub Date : 2004-02-16 DOI: 10.1109/DATE.2004.1269065
H. Krupnova
Today, having a fast hardware platform for SoC software development prior to silicon is an important challenge to gain the time-to-market. The FPGAs offer an excellent prototyping basis for building hardware platforms since more than ten years. However, as the circuit complexity increases and project timeframes shrink, building a multi-FPGA prototype represents a real challenge from the complexity viewpoint. The paper describes the state-of-the-art mapping methodology, prototyping tools and flows, shows the most difficult mapping problems and the ways to overcome them. The paper is issued from the experience of mapping on FPGA platform of four latest highly complex ST Microelectronics SoCs ranging from 1.5 to 4 million real ASIC gates mapped to up to 9 highest capacity FPGAs.
今天,在硅之前为SoC软件开发提供快速硬件平台是获得上市时间的重要挑战。十多年来,fpga为构建硬件平台提供了良好的原型基础。然而,随着电路复杂性的增加和项目时间的缩短,从复杂性的角度来看,构建多fpga原型是一个真正的挑战。本文描述了最先进的映射方法,原型工具和流程,展示了最困难的映射问题和克服它们的方法。本文是根据在FPGA平台上映射四个最新的高度复杂的ST微电子soc的经验发布的,范围从150万到400万真正的ASIC门映射到多达9个最高容量的FPGA。
{"title":"Mapping multi-million gate SoCs on FPGAs: industrial methodology and experience","authors":"H. Krupnova","doi":"10.1109/DATE.2004.1269065","DOIUrl":"https://doi.org/10.1109/DATE.2004.1269065","url":null,"abstract":"Today, having a fast hardware platform for SoC software development prior to silicon is an important challenge to gain the time-to-market. The FPGAs offer an excellent prototyping basis for building hardware platforms since more than ten years. However, as the circuit complexity increases and project timeframes shrink, building a multi-FPGA prototype represents a real challenge from the complexity viewpoint. The paper describes the state-of-the-art mapping methodology, prototyping tools and flows, shows the most difficult mapping problems and the ways to overcome them. The paper is issued from the experience of mapping on FPGA platform of four latest highly complex ST Microelectronics SoCs ranging from 1.5 to 4 million real ASIC gates mapped to up to 9 highest capacity FPGAs.","PeriodicalId":335658,"journal":{"name":"Proceedings Design, Automation and Test in Europe Conference and Exhibition","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126583623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
期刊
Proceedings Design, Automation and Test in Europe Conference and Exhibition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1