2008 Asia and South Pacific Design Automation Conference最新文献

英文中文

Efficient numerical modeling of random rough surface effects for interconnect internal impedance extraction 随机粗糙表面效应在互连内部阻抗提取中的高效数值模拟

2008 Asia and South Pacific Design Automation Conference

Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483930

Quan Chen, N. Wong

This paper proposes an efficient model for numerically evaluating the impact of random surface roughness on the internal impedance for large-scale interconnect structures. The effective resistivity (ER) and effective permeability (EP) are numerically formulated to avoid the computationally prohibitive global discretization, while maintaining the model accuracy and flexibility. A modified stochastic integral equation (SIE) method is proposed to significantly speed up the computation for the mean values of ER and EP under the assumption of random surface roughness. Numerical experiments then verify the efficacy of our approach.

本文提出了一种有效的模型，用于数值评估随机表面粗糙度对大型互连结构内部阻抗的影响。有效电阻率(ER)和有效渗透率(EP)在保持模型精度和灵活性的同时，避免了计算上难以实现的全局离散化。提出了一种改进的随机积分方程(SIE)方法，在随机表面粗糙度假设下，显著加快了ER和EP均值的计算速度。数值实验验证了该方法的有效性。

引用次数: 3

A fast two-pass HDL simulation with on-demand dump 一个快速的双通道HDL仿真与按需转储

2008 Asia and South Pacific Design Automation Conference

Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483987

Kyuho Shim, Young-Rae Cho, Namdo Kim, Hyuncheol Baik, Kyung-Seok Kim, Dusung Kim, Jae-Beom Kim, Byeong Min, Kyumyung Choi, M. Ciesielski, Seiyang Yang

Simulation-based functional verification is characterized by two inherently conflicting targets: the signal visibility and simulation performance. Achieving a proper trade-off between these two targets is of paramount importance. Even though HDL simulators are the most widely used verification platform at the RTL and gate level, their major drawback is the low performance in verifying complex SOCs, especially when the high visibility over the design under verification is required. This paper presents a new, fast simulation method as an effective way to achieve both high simulation speed and full signal visibility. It is based on an original two-pass simulation approach. During the 1st pass, with the simulation running at full speed, a set of design states is saved periodically at predetermined checkpoints. During the 2nd pass, another simulation is performed, using any of saved checkpoints and providing 100% signal visibility for debugging. Our method differs from the traditional simulation snapshot approach in the amount and the way the design state is saved. Experimental results show significant speed-up compared to existing traditional simulation methods while maintaining 100% visibility.

基于仿真的功能验证具有两个内在冲突的目标:信号可见性和仿真性能。在这两个目标之间实现适当的权衡是至关重要的。尽管HDL模拟器是RTL和栅极级使用最广泛的验证平台，但其主要缺点是在验证复杂soc时性能较低，特别是当需要对验证设计进行高可见性时。本文提出了一种新的、快速的仿真方法，是实现高仿真速度和全信号可见性的有效途径。它是基于一种原始的两步仿真方法。在第一次通过期间，随着仿真全速运行，在预定的检查点定期保存一组设计状态。在第二次通过期间，执行另一个模拟，使用任何保存的检查点并为调试提供100%的信号可见性。我们的方法与传统的仿真快照方法在设计状态保存的数量和方式上有所不同。实验结果表明，与现有的传统仿真方法相比，在保持100%可见性的情况下，该方法的速度有显著提高。

{"title":"A fast two-pass HDL simulation with on-demand dump","authors":"Kyuho Shim, Young-Rae Cho, Namdo Kim, Hyuncheol Baik, Kyung-Seok Kim, Dusung Kim, Jae-Beom Kim, Byeong Min, Kyumyung Choi, M. Ciesielski, Seiyang Yang","doi":"10.1109/ASPDAC.2008.4483987","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483987","url":null,"abstract":"Simulation-based functional verification is characterized by two inherently conflicting targets: the signal visibility and simulation performance. Achieving a proper trade-off between these two targets is of paramount importance. Even though HDL simulators are the most widely used verification platform at the RTL and gate level, their major drawback is the low performance in verifying complex SOCs, especially when the high visibility over the design under verification is required. This paper presents a new, fast simulation method as an effective way to achieve both high simulation speed and full signal visibility. It is based on an original two-pass simulation approach. During the 1st pass, with the simulation running at full speed, a set of design states is saved periodically at predetermined checkpoints. During the 2nd pass, another simulation is performed, using any of saved checkpoints and providing 100% signal visibility for debugging. Our method differs from the traditional simulation snapshot approach in the amount and the way the design state is saved. Experimental results show significant speed-up compared to existing traditional simulation methods while maintaining 100% visibility.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133705265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Area recovery under depth constraint by Cut Substitution for technology mapping for LUT-based FPGAs 基于lut的fpga的切割替代技术映射在深度约束下的面积恢复

2008 Asia and South Pacific Design Automation Conference

Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483928

Taiga Takata, Y. Matsunaga

In this paper we present the post-processing algorithm, cut substitution, for technology mapping for LUT-based FPGAs to minimize the area under depth minimum constraint. The problem to generate a LUT's network whose area is minimum under depth minimum constraint seems to be as difficult as NP-hard class problem. Cut substitution is the process to generate a local optimum solution by eliminating redundant LUTs while the depth of network is maintained. The experiments shows that the proposed method derives the solutions whose area are 9% smaller than the solutions of a previous state-of-the-art, DAOmap on average.

在本文中，我们提出了基于lut的fpga技术映射的后处理算法，即切割替换，以最小化深度最小约束下的面积。在深度最小约束下生成面积最小的LUT网络的问题似乎与NP-hard类问题一样困难。切替换是在保持网络深度的前提下，通过消除冗余lut来产生局部最优解的过程。实验表明，该方法得到的解的面积平均比以前最先进的DAOmap解的面积小9%。

引用次数: 3

Exploring power management in multi-core systems 探索多核系统中的电源管理

2008 Asia and South Pacific Design Automation Conference

Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484043

R. Bergamaschi, Guoling Han, A. Buyuktosunoglu, Hiren D. Patel, I. Nair, G. Dittmann, G. Janssen, N. Dhanwada, Zhigang Hu, P. Bose, J. Darringer

Power dissipation has become a critical design metric in microprocessor-based system design. In a multi-core system, running multiple applications, power and performance can be dynamically traded off using an integrated power management (PM) unit. This PM unit monitors the performance and power of each core and dynamically adjusts the individual voltages and frequencies in order to maximize system performance under a given power budget (usually set by the operating system). This paper presents a performance and power analysis methodology, featuring a simulation model for multi-core systems that can be easily reconfigured for different scenarios and a PM infrastructure for the exploration and analysis of PM algorithms. Two algorithms have been implemented: one for discrete and one for continuous power modes based on non-linear programming. Extensive experiments are reported, illustrating the effect of power management both at the core and the chip level.

在基于微处理器的系统设计中，功耗已成为一个重要的设计指标。在多核系统中，运行多个应用程序，可以使用集成电源管理(PM)单元动态地权衡功耗和性能。该PM单元监视每个核心的性能和功率，并动态调整单个电压和频率，以便在给定的功率预算(通常由操作系统设置)下最大化系统性能。本文提出了一种性能和功率分析方法，其特点是为多核系统的仿真模型，可以轻松地为不同的场景重新配置，以及用于探索和分析PM算法的PM基础设施。实现了两种算法:一种用于离散功率模式，另一种用于基于非线性规划的连续功率模式。大量的实验报告，说明电源管理的影响，在核心和芯片水平。

引用次数: 51

Within-die process variations: How accurately can they be statistically modeled? 模具内工艺变化:统计建模的准确性如何?

2008 Asia and South Pacific Design Automation Conference

Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484007

Brendan Hargreaves, Henrik Hult, S. Reda

Within-die process variations arise during integrated circuit (IC) fabrication in the sub-100nm regime. These variations are of paramount concern as they deviate the performance of ICs from their designers' original intent. These deviations reduce the parametric yield and revenues from integrated circuit fabrication. In this paper we provide a complete treatment to the subject of within-die variations. We propose a scan-chain based system, vMeter, to extract within-die variations in an automated fashion. We implement our system in a sample of 90 nm chips, and collect the within-die variations data. Then we propose a number of novel statistical analysis techniques that accurately model the within-die variation trends and capture the spatial correlations. We propose the use of maximum-likelihood techniques to find the required parameters to fit the model to the data. The accuracy of our models is statistically verified through residual analysis and variograms. Using our successful modeling technique, we propose a procedure to generate synthetic within-die variation patterns that mimic, or imitate, real silicon data.

在亚100nm制程的集成电路(IC)制造过程中，会出现模内工艺变化。这些变化是最重要的问题，因为它们使ic的性能偏离了设计者的初衷。这些偏差降低了集成电路制造的参数良率和收益。在本文中，我们提供了一个完整的处理的主题，模具内的变化。我们提出了一个基于扫描链的系统，vMeter，以自动化的方式提取模具内的变化。我们在90纳米芯片样品中实现了我们的系统，并收集了芯片内的变化数据。然后，我们提出了一些新的统计分析技术，可以准确地模拟模内变化趋势并捕获空间相关性。我们建议使用最大似然技术来找到所需的参数来拟合模型与数据。通过残差分析和方差分析，对模型的准确性进行了统计验证。利用我们成功的建模技术，我们提出了一种程序来生成模拟或模仿真实硅数据的合成模内变化模式。

{"title":"Within-die process variations: How accurately can they be statistically modeled?","authors":"Brendan Hargreaves, Henrik Hult, S. Reda","doi":"10.1109/ASPDAC.2008.4484007","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4484007","url":null,"abstract":"Within-die process variations arise during integrated circuit (IC) fabrication in the sub-100nm regime. These variations are of paramount concern as they deviate the performance of ICs from their designers' original intent. These deviations reduce the parametric yield and revenues from integrated circuit fabrication. In this paper we provide a complete treatment to the subject of within-die variations. We propose a scan-chain based system, vMeter, to extract within-die variations in an automated fashion. We implement our system in a sample of 90 nm chips, and collect the within-die variations data. Then we propose a number of novel statistical analysis techniques that accurately model the within-die variation trends and capture the spatial correlations. We propose the use of maximum-likelihood techniques to find the required parameters to fit the model to the data. The accuracy of our models is statistically verified through residual analysis and variograms. Using our successful modeling technique, we propose a procedure to generate synthetic within-die variation patterns that mimic, or imitate, real silicon data.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"172 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117327589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 59

A multicycle communication architecture and synthesis flow for Global interconnect Resource Sharing 面向全局互连资源共享的多周期通信体系结构和综合流程

2008 Asia and South Pacific Design Automation Conference

Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483933

Wei-Sheng Huang, Y. Hong, Juinn-Dar Huang, Ya-Shih Huang

In deep submicron technology, wire delay is no longer negligible and is gradually dominating the system latency. Some state-of-the-art architectural synthesis flows adopt the distributed register (DR) architecture to cope with this increasing latency. The DR architecture, though allows multicycle communication, introduces extra overhead on interconnect resource. In this paper, we propose the regular distributed register - global resource sharing (RDR-GRS) architecture to enable global sharing of interconnects and registers. Based on the RDR-GRS architecture, we further define the channel and register allocation problem as a path scheduling problem of data transfers. A formal and flexible formulation of this problem is then presented and optimally solved by Integer Linear Programming (ILP). Experimental results show that RDR-GRS/ILP can averagely reduce 58% wires and 35% registers compared to the previous work.

在深亚微米技术中，线延迟不再是可以忽略不计的，并逐渐主导着系统延迟。一些最先进的体系结构合成流采用分布式寄存器(DR)体系结构来处理这种不断增加的延迟。DR架构虽然允许多周期通信，但在互连资源上引入了额外的开销。为了实现互连和寄存器的全局共享，本文提出了规则分布式寄存器全局资源共享(RDR-GRS)架构。在RDR-GRS架构的基础上，进一步将信道和寄存器分配问题定义为数据传输的路径调度问题。然后给出了该问题的一种形式和灵活的表述，并用整数线性规划(ILP)进行了最优求解。实验结果表明，RDR-GRS/ILP与以前的工作相比，平均减少58%的导线和35%的寄存器。

引用次数: 8

A delay model for interconnect trees based on ABCD matrix 基于ABCD矩阵的互连树延迟模型

2008 Asia and South Pacific Design Automation Conference

Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484004

Guofei Zhou, L. Su, Depeng Jin, Lieguang Zeng

The accuracy of interconnect delay estimations can be improved by the method presented in this paper, in which the first two moments are obtained with ABCD matrix and a stable model to incorporate effects of transport delay into the delay estimate is developed. Simulation results show that the method share the same accuracy with traditional methods when rise time delay is much longer than transport delay and more accurate when the two are of the same order.

本文提出的方法利用ABCD矩阵求出互连时延的前两个矩，并建立了一个将传输时延影响纳入时延估计的稳定模型，提高了互连时延估计的精度。仿真结果表明，当上升时延大于传输时延时，该方法与传统方法具有相同的精度，当上升时延大于传输时延时，该方法的精度更高。

引用次数: 1

Synergistic physical synthesis for manufacturability and variability in 45nm designs and beyond 协同物理合成可制造性和可变性在45纳米及以上的设计

2008 Asia and South Pacific Design Automation Conference

Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483945

D. Pan, Minsik Cho

Nanometer IC designs are increasingly challenged by manufacturing closure, i.e., being fabricated with high product yield, mainly due to aggressive technology scaling and increasing process/environmental variations. Realizing the criticality of addressing manufacturability for higher yield and tolerance to variations during design, there has been a surge of research activities recently from both academia and industry. In this paper, we will survey the key activities in synergistic physical synthesis and shed lights on some of the future research directions.

纳米IC设计越来越多地受到制造封闭的挑战，即高产品产量的制造，主要是由于积极的技术缩放和不断增加的工艺/环境变化。意识到在设计过程中解决可制造性问题以获得更高的良率和对变化的容忍度的重要性，学术界和工业界最近都开展了大量的研究活动。本文将对协同物理合成中的关键活动进行综述，并对未来的研究方向进行展望。

引用次数: 6

A slew-rate controlled output driver with one-cycle tuning time 具有单周期调谐时间的回转速率控制输出驱动器

2008 Asia and South Pacific Design Automation Conference

Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4484070

Young-Ho Kwak, I. Jung, Chulwoo Kim

A low-power slew-rate controlled output driver with open loop digital scheme, one-cycle lock time is presented. Proposed output driver maintains slew rate in the range of 2.1 V/ns to 3.6 V/ns in a one cycle after the enable clock is inserted. It is implemented in 0.18 um CMOS process, and the control block consumes 13.7 mW at 1 Gbps.

提出了一种开环数字方案的低功耗回转速率控制输出驱动器，锁紧时间为一个周期。建议的输出驱动器在使能时钟插入后的一个周期内保持在2.1 V/ns至3.6 V/ns的压转率。它采用0.18 um CMOS工艺实现，控制块在1gbps下消耗13.7 mW。

引用次数: 0

NoCOUT : NoC topology generation with mixed packet-switched and point-to-point networks noout:混合分组交换和点对点网络的NoC拓扑生成

2008 Asia and South Pacific Design Automation Conference

Pub Date : 2008-01-21 DOI: 10.1109/ASPDAC.2008.4483953

J. Chan, S. Parameswaran

Networks-on-chip (NoC) have been widely proposed as the future communication paradigm for use in next-generation system-on-chip. In this paper, we present NoCOUT, a methodology for generating an energy optimized application specific NoC topology which supports both point-to-point and packet-switched networks. The algorithm uses a prohibitive greedy iterative improvement strategy to explore the design space efficiently. A system-level floorplanner is used to evaluate the iterative design improvements and provide feedback on the effects of the topology on wire length. The algorithm is integrated within a NoC synthesis framework with characterized NoC power and area models to allow accurate exploration for a NoC router library. We apply the topology generation algorithm to several test cases including real-world and synthetic communication graphs with both regular and irregular traffic patterns, and varying core sizes. Since the method is iterative, it is possible to start with a known design to search for improvements. Experimental results show that many different applications benefit from a mix of ";on chip networks"; and ";point-to-point networks";. With such a hybrid network, we achieve approximately 25% lower energy consumption (with a maximum of 37%) than a state of the art min-cut partition based topology generator for a variety of benchmarks. In addition, the average hop count is reduced by 0.75 hops, which would significantly reduce the network latency.

片上网络(NoC)已被广泛提出作为下一代片上系统的未来通信范式。在本文中，我们提出了noout，一种用于生成能量优化的应用特定NoC拓扑的方法，该拓扑支持点对点和分组交换网络。该算法采用禁止贪婪迭代改进策略，有效地探索设计空间。系统级地板规划器用于评估迭代设计改进，并提供拓扑对导线长度影响的反馈。该算法集成在具有特征NoC功率和面积模型的NoC综合框架中，可以精确地探索NoC路由器库。我们将拓扑生成算法应用于几个测试用例，包括具有规则和不规则流量模式以及不同核心大小的真实和合成通信图。由于该方法是迭代的，因此可以从已知的设计开始寻找改进。实验结果表明，许多不同的应用都受益于“片上网络”的混合;和“点对点网络”;使用这样的混合网络，我们实现了大约25%的能耗降低(最大37%)，而不是最先进的基于最小分割的拓扑生成器，用于各种基准测试。此外，平均跳数减少了0.75跳，这将大大减少网络延迟。

{"title":"NoCOUT : NoC topology generation with mixed packet-switched and point-to-point networks","authors":"J. Chan, S. Parameswaran","doi":"10.1109/ASPDAC.2008.4483953","DOIUrl":"https://doi.org/10.1109/ASPDAC.2008.4483953","url":null,"abstract":"Networks-on-chip (NoC) have been widely proposed as the future communication paradigm for use in next-generation system-on-chip. In this paper, we present NoCOUT, a methodology for generating an energy optimized application specific NoC topology which supports both point-to-point and packet-switched networks. The algorithm uses a prohibitive greedy iterative improvement strategy to explore the design space efficiently. A system-level floorplanner is used to evaluate the iterative design improvements and provide feedback on the effects of the topology on wire length. The algorithm is integrated within a NoC synthesis framework with characterized NoC power and area models to allow accurate exploration for a NoC router library. We apply the topology generation algorithm to several test cases including real-world and synthetic communication graphs with both regular and irregular traffic patterns, and varying core sizes. Since the method is iterative, it is possible to start with a known design to search for improvements. Experimental results show that many different applications benefit from a mix of \";on chip networks\"; and \";point-to-point networks\";. With such a hybrid network, we achieve approximately 25% lower energy consumption (with a maximum of 37%) than a state of the art min-cut partition based topology generator for a variety of benchmarks. In addition, the average hop count is reduced by 0.75 hops, which would significantly reduce the network latency.","PeriodicalId":277556,"journal":{"name":"2008 Asia and South Pacific Design Automation Conference","volume":"24 12","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120877537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 52

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2008 Asia and South Pacific Design Automation Conference

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀