首页 > 最新文献

2008 Asia and South Pacific Design Automation Conference最新文献

英文 中文
A delay model for interconnect trees based on ABCD matrix 基于ABCD矩阵的互连树延迟模型
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484004
Guofei Zhou, L. Su, Depeng Jin, Lieguang Zeng
The accuracy of interconnect delay estimations can be improved by the method presented in this paper, in which the first two moments are obtained with ABCD matrix and a stable model to incorporate effects of transport delay into the delay estimate is developed. Simulation results show that the method share the same accuracy with traditional methods when rise time delay is much longer than transport delay and more accurate when the two are of the same order.
本文提出的方法利用ABCD矩阵求出互连时延的前两个矩,并建立了一个将传输时延影响纳入时延估计的稳定模型,提高了互连时延估计的精度。仿真结果表明,当上升时延大于传输时延时,该方法与传统方法具有相同的精度,当上升时延大于传输时延时,该方法的精度更高。
{"title":"A delay model for interconnect trees based on ABCD matrix","authors":"Guofei Zhou, L. Su, Depeng Jin, Lieguang Zeng","doi":"10.1109/ASPDAC.2008.4484004","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4484004","url":null,"abstract":"The accuracy of interconnect delay estimations can be improved by the method presented in this paper, in which the first two moments are obtained with ABCD matrix and a stable model to incorporate effects of transport delay into the delay estimate is developed. Simulation results show that the method share the same accuracy with traditional methods when rise time delay is much longer than transport delay and more accurate when the two are of the same order.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"383 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115300556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Robust test generation for power supply noise induced path delay faults 电源噪声引起的路径延迟故障鲁棒性测试生成
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484033
Xiang Fu, Huawei Li, Yu Hu, Xiaowei Li
In deep sub-micron designs, the delay caused by power supply noise (PSN) can no longer be ignored. A PSN-induced path delay fault (PSNPDF) model is proposed in this paper, and should be tested to enhance chip quality. Based on precise timing analysis, we also propose a robust test generation technique for PSNPDF. Concept of timing window is introduced into the PSNPDF model. If two devices in the same feed region simultaneously switch in the same direction, the current waveform of the two devices will have an overlap and excessive PSN will be produced. Experimental results on ISCAS'89 circuits showed test generation can be finished in a few seconds.
在深亚微米设计中,由电源噪声(PSN)引起的延迟不能再被忽视。本文提出了一种psn诱导路径延迟故障(PSNPDF)模型,该模型需要进行测试以提高芯片质量。在精确时序分析的基础上,提出了一种鲁棒的PSNPDF测试生成技术。在PSNPDF模型中引入了定时窗口的概念。如果同一馈电区域的两个器件同时向同一方向切换,则两个器件的电流波形会有重叠,产生过多的PSN。在ISCAS'89电路上的实验结果表明,测试生成可以在几秒钟内完成。
{"title":"Robust test generation for power supply noise induced path delay faults","authors":"Xiang Fu, Huawei Li, Yu Hu, Xiaowei Li","doi":"10.1109/ASPDAC.2008.4484033","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4484033","url":null,"abstract":"In deep sub-micron designs, the delay caused by power supply noise (PSN) can no longer be ignored. A PSN-induced path delay fault (PSNPDF) model is proposed in this paper, and should be tested to enhance chip quality. Based on precise timing analysis, we also propose a robust test generation technique for PSNPDF. Concept of timing window is introduced into the PSNPDF model. If two devices in the same feed region simultaneously switch in the same direction, the current waveform of the two devices will have an overlap and excessive PSN will be produced. Experimental results on ISCAS'89 circuits showed test generation can be finished in a few seconds.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"354 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122796360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Synergistic physical synthesis for manufacturability and variability in 45nm designs and beyond 协同物理合成可制造性和可变性在45纳米及以上的设计
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483945
D. Pan, Minsik Cho
Nanometer IC designs are increasingly challenged by manufacturing closure, i.e., being fabricated with high product yield, mainly due to aggressive technology scaling and increasing process/environmental variations. Realizing the criticality of addressing manufacturability for higher yield and tolerance to variations during design, there has been a surge of research activities recently from both academia and industry. In this paper, we will survey the key activities in synergistic physical synthesis and shed lights on some of the future research directions.
纳米IC设计越来越多地受到制造封闭的挑战,即高产品产量的制造,主要是由于积极的技术缩放和不断增加的工艺/环境变化。意识到在设计过程中解决可制造性问题以获得更高的良率和对变化的容忍度的重要性,学术界和工业界最近都开展了大量的研究活动。本文将对协同物理合成中的关键活动进行综述,并对未来的研究方向进行展望。
{"title":"Synergistic physical synthesis for manufacturability and variability in 45nm designs and beyond","authors":"D. Pan, Minsik Cho","doi":"10.1109/ASPDAC.2008.4483945","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483945","url":null,"abstract":"Nanometer IC designs are increasingly challenged by manufacturing closure, i.e., being fabricated with high product yield, mainly due to aggressive technology scaling and increasing process/environmental variations. Realizing the criticality of addressing manufacturability for higher yield and tolerance to variations during design, there has been a surge of research activities recently from both academia and industry. In this paper, we will survey the key activities in synergistic physical synthesis and shed lights on some of the future research directions.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125286847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Duo-binary circular turbo decoder based on border metric encoding for WiMAX 基于边界度量编码的WiMAX双二进制圆形turbo解码器
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483918
Ji-Hoon Kim, I. Park
This paper presents a duo-binary circular turbo decoder based on border metric encoding. With the proposed method, the memory size for branch memory is reduced by half and the dummy calculation is removed at the cost of the small-sized memory which holds the encoded border metrics. Based on the proposed SISO decoder and the dedicated hardware interleaver, a duo-binary circular turbo decoder is designed for the WiMAX standard using a 0.13 mum CMOS process, which can support 24.26 Mbps at 200 MHz.
提出了一种基于边界度量编码的双二进制圆形turbo解码器。该方法将分支存储器的内存大小减少了一半,并且以保留编码边界度量的小内存为代价消除了虚拟计算。基于所提出的SISO解码器和专用硬件交织器,设计了一种适用于WiMAX标准的双二进制圆形turbo解码器,该解码器采用0.13 μ m CMOS工艺,在200 MHz下可支持24.26 Mbps。
{"title":"Duo-binary circular turbo decoder based on border metric encoding for WiMAX","authors":"Ji-Hoon Kim, I. Park","doi":"10.1109/ASPDAC.2008.4483918","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483918","url":null,"abstract":"This paper presents a duo-binary circular turbo decoder based on border metric encoding. With the proposed method, the memory size for branch memory is reduced by half and the dummy calculation is removed at the cost of the small-sized memory which holds the encoded border metrics. Based on the proposed SISO decoder and the dedicated hardware interleaver, a duo-binary circular turbo decoder is designed for the WiMAX standard using a 0.13 mum CMOS process, which can support 24.26 Mbps at 200 MHz.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126267727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Optimal allocation and placement of thermal sensors for reconfigurable systems and its practical extension 可重构系统中热传感器的优化配置与放置及其实用推广
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484042
Byunghyun Lee, Taewhan Kim
A dynamic monitoring of thermal behavior of hardware resources using thermal sensors is very important to maintain the operation of systems safe and reliable. This work proposes an effective solution to the problem of thermal sensor allocation and placement for reconfigurable systems at the post-manufacturing stage. Specifically, we define the sensor allocation and placement problem (SAPP), and propose a solution which formulates SAPP into the unate-covering problem (UCP) and solves it optimally. We then provide an extended solution to handle a practical design issue where the hardware resources for the sensor implementation on specific array locations have already been used up by the application logic. Experimental results using MCNC benchmarks show that our proposed technique uses 19.7% less number of sensors to monitor hotspots on the average than that used by the bisection based (Mukherjee et al., 2006) approaches.
利用热传感器对硬件资源的热行为进行动态监测,对维护系统的安全可靠运行具有重要意义。这项工作提出了一个有效的解决方案,热传感器的分配和安置在可重构系统的后制造阶段的问题。具体来说,我们定义了传感器的分配与放置问题(SAPP),并提出了一种将SAPP转化为覆盖问题(UCP)的解决方案。然后,我们提供了一个扩展的解决方案来处理一个实际的设计问题,其中用于在特定阵列位置上实现传感器的硬件资源已经被应用程序逻辑用完。使用MCNC基准的实验结果表明,我们提出的技术使用的传感器数量平均比基于对分法(Mukherjee et al., 2006)的方法少19.7%。
{"title":"Optimal allocation and placement of thermal sensors for reconfigurable systems and its practical extension","authors":"Byunghyun Lee, Taewhan Kim","doi":"10.1109/ASPDAC.2008.4484042","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4484042","url":null,"abstract":"A dynamic monitoring of thermal behavior of hardware resources using thermal sensors is very important to maintain the operation of systems safe and reliable. This work proposes an effective solution to the problem of thermal sensor allocation and placement for reconfigurable systems at the post-manufacturing stage. Specifically, we define the sensor allocation and placement problem (SAPP), and propose a solution which formulates SAPP into the unate-covering problem (UCP) and solves it optimally. We then provide an extended solution to handle a practical design issue where the hardware resources for the sensor implementation on specific array locations have already been used up by the application logic. Experimental results using MCNC benchmarks show that our proposed technique uses 19.7% less number of sensors to monitor hotspots on the average than that used by the bisection based (Mukherjee et al., 2006) approaches.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"443 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125763856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Investigation of diffusion rounding for post-lithography analysis 光刻后分析中扩散舍入的研究
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483998
Puneet Gupta, A. Kahng, Youngmin Kim, Saumil Shah, D. Sylvester
Due to aggressive scaling of device feature size to improve circuit performance in the sub-wavelength lithography regime, both diffusion and poly gate shapes are no longer rectilinear. Diffusion rounding occurs most notably where the diffusion shapes are not perfectly rectangular, including common L and T-shaped diffusion layouts to connect to power rails. This paper investigates the impact of the non-rectilinear shape of diffusion (i.e., sloped diffusion or diffusion rounding) on circuit performance (delay and leakage). Simple weighting function models for Ionmiddot and Ioff to account for the diffusion rounding effects are proposed, and compared with TCAD simulation. Our experiments show that diffusion rounding has an asymmetric characteristic for Ioff due to the differing significance of source/drain junctions on device threshold voltage. Therefore, we can model Ionmiddot and Ioff as a function of slope angle and direction. The proposed models match well with TCAD simulation results, with less than 2% and 6% error in Ionmiddot and Ioff, respectively.
由于在亚波长光刻系统中,为了提高电路性能,器件特征尺寸的积极缩放,扩散和多栅极的形状都不再是直线的。扩散舍入最明显地发生在扩散形状不是完美矩形的地方,包括连接到电源轨道的常见L形和t形扩散布局。本文研究了扩散的非直线形状(即斜扩散或扩散舍入)对电路性能(延迟和漏电)的影响。提出了考虑扩散舍入效应的Ionmiddot和Ioff的简单加权函数模型,并与TCAD仿真进行了比较。我们的实验表明,由于源极/漏极结对器件阈值电压的不同意义,扩散舍入具有不对称的off特性。因此,我们可以将Ionmiddot和Ioff建模为斜率和方向的函数。所提出的模型与TCAD仿真结果吻合良好,Ionmiddot和Ioff的误差分别小于2%和6%。
{"title":"Investigation of diffusion rounding for post-lithography analysis","authors":"Puneet Gupta, A. Kahng, Youngmin Kim, Saumil Shah, D. Sylvester","doi":"10.1109/ASPDAC.2008.4483998","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483998","url":null,"abstract":"Due to aggressive scaling of device feature size to improve circuit performance in the sub-wavelength lithography regime, both diffusion and poly gate shapes are no longer rectilinear. Diffusion rounding occurs most notably where the diffusion shapes are not perfectly rectangular, including common L and T-shaped diffusion layouts to connect to power rails. This paper investigates the impact of the non-rectilinear shape of diffusion (i.e., sloped diffusion or diffusion rounding) on circuit performance (delay and leakage). Simple weighting function models for Ionmiddot and Ioff to account for the diffusion rounding effects are proposed, and compared with TCAD simulation. Our experiments show that diffusion rounding has an asymmetric characteristic for Ioff due to the differing significance of source/drain junctions on device threshold voltage. Therefore, we can model Ionmiddot and Ioff as a function of slope angle and direction. The proposed models match well with TCAD simulation results, with less than 2% and 6% error in Ionmiddot and Ioff, respectively.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129598381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
A 1.2GHz delayed clock generator for high-speed microprocessors 用于高速微处理器的1.2GHz延迟时钟发生器
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484068
I. Jung, Moo-young Kim, Chulwoo Kim
A 1.2GHz delayed clock generator capable of adjusting its clock phase according to input clock frequencies has been developed. It consists of a full-digital CMOS circuit that leads to a simple, robust, and portable IP. One-cycle lock time enables clock-on-demand circuit structures. The implemented delayed clock generator tile in 0.13 um CMOS technology occupies only 0.004 mm and operates at variable input frequencies ranging from 625 MHz to 1.2GHz.
研制了一种能够根据输入时钟频率调整时钟相位的1.2GHz延时时钟发生器。它由一个全数字CMOS电路组成,导致一个简单,强大和便携的IP。一个周期的锁定时间使时钟按需电路结构。所实现的延迟时钟发生器采用0.13 um CMOS技术,占地仅为0.004 mm,可在625 MHz至1.2GHz的可变输入频率下工作。
{"title":"A 1.2GHz delayed clock generator for high-speed microprocessors","authors":"I. Jung, Moo-young Kim, Chulwoo Kim","doi":"10.1109/ASPDAC.2008.4484068","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4484068","url":null,"abstract":"A 1.2GHz delayed clock generator capable of adjusting its clock phase according to input clock frequencies has been developed. It consists of a full-digital CMOS circuit that leads to a simple, robust, and portable IP. One-cycle lock time enables clock-on-demand circuit structures. The implemented delayed clock generator tile in 0.13 um CMOS technology occupies only 0.004 mm and operates at variable input frequencies ranging from 625 MHz to 1.2GHz.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124653570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Buffered clock tree synthesis for 3D ICs under thermal variations 热变化下三维集成电路的缓冲时钟树合成
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484003
J. Minz, Xin Zhao, S. Lim
In this paper, we study the buffered clock tree synthesis problem under thermal variations for 3D IC technology. Our major contribution is the Balanced Skew Theorem, which provides a theoretical background to efficiently construct a buffered 3D clock tree that minimizes and balances the skew values under two distinct non-uniform thermal profiles. Our clock tree synthesis algorithm named BURITO (buffered clock tree with thermal optimization) first constructs a 3D abstract tree under the wirelength vs via-congestion tradeoff. This abstract tree is then embedded, buffered, and refined under the given non-uniform thermal profiles so that the temperature-dependent skews are minimized and balanced. Experimental results show that our algorithms significantly reduce and perfectly balance clock skew values with minimal wirelength overhead.
本文研究了三维集成电路技术在热变化条件下的缓冲时钟树合成问题。我们的主要贡献是平衡偏态定理,它提供了一个理论背景,有效地构建一个缓冲的3D时钟树,最小化和平衡两个不同的非均匀热剖面下的偏态值。我们的时钟树合成算法BURITO(缓冲时钟树与热优化)首先构建了一个三维抽象树在无线与通过拥塞权衡。然后在给定的非均匀热剖面下嵌入、缓冲和精炼这棵抽象树,以使与温度相关的倾斜最小化和平衡。实验结果表明,我们的算法在最小的带宽开销下显著降低并完美平衡了时钟偏差值。
{"title":"Buffered clock tree synthesis for 3D ICs under thermal variations","authors":"J. Minz, Xin Zhao, S. Lim","doi":"10.1109/ASPDAC.2008.4484003","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4484003","url":null,"abstract":"In this paper, we study the buffered clock tree synthesis problem under thermal variations for 3D IC technology. Our major contribution is the Balanced Skew Theorem, which provides a theoretical background to efficiently construct a buffered 3D clock tree that minimizes and balances the skew values under two distinct non-uniform thermal profiles. Our clock tree synthesis algorithm named BURITO (buffered clock tree with thermal optimization) first constructs a 3D abstract tree under the wirelength vs via-congestion tradeoff. This abstract tree is then embedded, buffered, and refined under the given non-uniform thermal profiles so that the temperature-dependent skews are minimized and balanced. Experimental results show that our algorithms significantly reduce and perfectly balance clock skew values with minimal wirelength overhead.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130923798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 70
NoCOUT : NoC topology generation with mixed packet-switched and point-to-point networks noout:混合分组交换和点对点网络的NoC拓扑生成
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483953
J. Chan, S. Parameswaran
Networks-on-chip (NoC) have been widely proposed as the future communication paradigm for use in next-generation system-on-chip. In this paper, we present NoCOUT, a methodology for generating an energy optimized application specific NoC topology which supports both point-to-point and packet-switched networks. The algorithm uses a prohibitive greedy iterative improvement strategy to explore the design space efficiently. A system-level floorplanner is used to evaluate the iterative design improvements and provide feedback on the effects of the topology on wire length. The algorithm is integrated within a NoC synthesis framework with characterized NoC power and area models to allow accurate exploration for a NoC router library. We apply the topology generation algorithm to several test cases including real-world and synthetic communication graphs with both regular and irregular traffic patterns, and varying core sizes. Since the method is iterative, it is possible to start with a known design to search for improvements. Experimental results show that many different applications benefit from a mix of ";on chip networks"; and ";point-to-point networks";. With such a hybrid network, we achieve approximately 25% lower energy consumption (with a maximum of 37%) than a state of the art min-cut partition based topology generator for a variety of benchmarks. In addition, the average hop count is reduced by 0.75 hops, which would significantly reduce the network latency.
片上网络(NoC)已被广泛提出作为下一代片上系统的未来通信范式。在本文中,我们提出了noout,一种用于生成能量优化的应用特定NoC拓扑的方法,该拓扑支持点对点和分组交换网络。该算法采用禁止贪婪迭代改进策略,有效地探索设计空间。系统级地板规划器用于评估迭代设计改进,并提供拓扑对导线长度影响的反馈。该算法集成在具有特征NoC功率和面积模型的NoC综合框架中,可以精确地探索NoC路由器库。我们将拓扑生成算法应用于几个测试用例,包括具有规则和不规则流量模式以及不同核心大小的真实和合成通信图。由于该方法是迭代的,因此可以从已知的设计开始寻找改进。实验结果表明,许多不同的应用都受益于“片上网络”的混合;和“点对点网络”;使用这样的混合网络,我们实现了大约25%的能耗降低(最大37%),而不是最先进的基于最小分割的拓扑生成器,用于各种基准测试。此外,平均跳数减少了0.75跳,这将大大减少网络延迟。
{"title":"NoCOUT : NoC topology generation with mixed packet-switched and point-to-point networks","authors":"J. Chan, S. Parameswaran","doi":"10.1109/ASPDAC.2008.4483953","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483953","url":null,"abstract":"Networks-on-chip (NoC) have been widely proposed as the future communication paradigm for use in next-generation system-on-chip. In this paper, we present NoCOUT, a methodology for generating an energy optimized application specific NoC topology which supports both point-to-point and packet-switched networks. The algorithm uses a prohibitive greedy iterative improvement strategy to explore the design space efficiently. A system-level floorplanner is used to evaluate the iterative design improvements and provide feedback on the effects of the topology on wire length. The algorithm is integrated within a NoC synthesis framework with characterized NoC power and area models to allow accurate exploration for a NoC router library. We apply the topology generation algorithm to several test cases including real-world and synthetic communication graphs with both regular and irregular traffic patterns, and varying core sizes. Since the method is iterative, it is possible to start with a known design to search for improvements. Experimental results show that many different applications benefit from a mix of \";on chip networks\"; and \";point-to-point networks\";. With such a hybrid network, we achieve approximately 25% lower energy consumption (with a maximum of 37%) than a state of the art min-cut partition based topology generator for a variety of benchmarks. In addition, the average hop count is reduced by 0.75 hops, which would significantly reduce the network latency.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"24 12","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120877537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 52
Automated techniques for energy efficient scheduling on homogeneous and heterogeneous chip multi-processor architectures 同质和异构芯片多处理器体系结构上的节能调度自动化技术
Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484026
Sushu Zhang, Karam S. Chatha
We address performance maximization of independent task sets under energy constraint on chip multi-processor (CMP) architectures that support multiple voltage/frequency operating states for each core. We prove that the problem is strongly NP-hard. We propose polynomial time 2-approximation algorithms for homogeneous and heterogeneous CMPs. To the best of our knowledge, our techniques offer the tightest bounds for energy constrained design on CMP architectures. Experimental results demonstrate that our techniques are effective and efficient under various workloads on several CMP architectures.
我们在芯片多处理器(CMP)架构上解决了在能量约束下独立任务集的性能最大化问题,该架构支持每个核心的多个电压/频率工作状态。我们证明了这个问题是强np困难的。我们提出了多项式时间2逼近算法的齐次和异构cmp。据我们所知,我们的技术为CMP架构上的能源约束设计提供了最严格的限制。实验结果表明,我们的技术在各种CMP架构下都是有效的。
{"title":"Automated techniques for energy efficient scheduling on homogeneous and heterogeneous chip multi-processor architectures","authors":"Sushu Zhang, Karam S. Chatha","doi":"10.1109/ASPDAC.2008.4484026","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4484026","url":null,"abstract":"We address performance maximization of independent task sets under energy constraint on chip multi-processor (CMP) architectures that support multiple voltage/frequency operating states for each core. We prove that the problem is strongly NP-hard. We propose polynomial time 2-approximation algorithms for homogeneous and heterogeneous CMPs. To the best of our knowledge, our techniques offer the tightest bounds for energy constrained design on CMP architectures. Experimental results demonstrate that our techniques are effective and efficient under various workloads on several CMP architectures.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"94 7","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120980200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
期刊
2008 Asia and South Pacific Design Automation Conference
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1