16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)最新文献

英文中文

Managing complexity in design debugging with sequential abstraction and refinement 通过顺序抽象和细化来管理设计调试中的复杂性

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)

Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722237

Brian Keng, A. Veneris

Design debugging is becoming an increasingly difficult task in the VLSI design flow with the growing size of modern designs and their error traces. In this work, a novel abstraction and refinement technique for design debugging is presented that addresses two key components of the debugging complexity, the design size and the error trace length. The abstraction technique works by under-approximating the debugging problem by removing modules of the original design and replacing them with simulated values of the erroneous circuit. After each abstract problem is solved, the refinement strategy uses the resulting UNSAT core to direct which modules should be refined. This refinement strategy is extended by allowing refinement of across time-frames in addition to modules. Experimental results show that the proposed algorithm is able to return solutions for all instances compared to only 41% without the technique demonstrating the viability of this approach in tackling real-world debugging problems.

随着现代设计规模和误差轨迹的不断扩大，设计调试成为VLSI设计流程中越来越困难的一项任务。本文提出了一种新的设计调试的抽象和细化技术，解决了调试复杂性的两个关键组成部分，即设计尺寸和错误跟踪长度。抽象技术的工作原理是将原设计的模块去掉，代之以错误电路的模拟值，从而对调试问题进行低逼近。在每个抽象问题解决后，细化策略使用由此产生的UNSAT核心来指导应该细化哪些模块。除了模块之外，还允许跨时间框架进行精化，从而扩展了此精化策略。实验结果表明，所提出的算法能够返回所有实例的解决方案，而没有技术证明这种方法在解决实际调试问题时的可行性，只有41%。

引用次数: 8

A frequent-value based PRAM memory architecture 基于频率值的PRAM存储器结构

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)

Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722186

Guangyu Sun, Dimin Niu, J. Ouyang, Yuan Xie

Phase Change Random Access Memory (PRAM) has great potential as the replacement of DRAM as main memory, due to its advantages of high density, non-volatility, fast read speed, and excellent scalability. However, poor endurance and high write energy appear to be the challenges to be tackled before PRAM can be adopted as main memory. In order to mitigate these limitations, prior research focuses on reducing write intensity at the bit level. In this work, we study the data pattern of memory write operations, and explore the frequent-value locality in data written back to main memory. Based on the fact that many data are written to memory repeatedly, an architecture of frequent-value storage is proposed for PRAM memory. It can significantly reduce the write intensity to PRAM memory so that the lifetime is improved and the write energy is reduced. The trade-off between endurance and capacity of PRAM memory is explored for different configurations. After using the frequent-value storage, the endurance of PRAM is improved to about 1.6X on average, and the write energy is reduced by 20%.

相变随机存取存储器(Phase Change Random Access Memory, PRAM)具有高密度、非易失性、读取速度快、可扩展性好等优点，具有取代DRAM作主存的巨大潜力。然而，在PRAM被用作主存之前，较差的续航能力和较高的写入能量似乎是需要解决的挑战。为了减轻这些限制，先前的研究主要集中在降低比特级别的写入强度。在这项工作中，我们研究了内存写操作的数据模式，并探讨了写回主存的数据中的频率值局部性。针对大量数据被重复写入存储器的特点，提出了一种基于频率值存储的PRAM存储器结构。它可以显著降低对PRAM内存的写入强度，从而提高寿命，减少写入能量。探讨了不同配置下PRAM存储器的持久性能和容量之间的权衡。采用频率值存储后，PRAM的持久时间平均提高到1.6倍左右，写入能量降低20%。

引用次数: 52

A low-power management technique for high-performance domino circuits 高性能多米诺电路的低功耗管理技术

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)

Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722313

Yu-Tzu Tsai, Cheng-Chih Tsai, Cheng-An Chien, Ching-Hwa Cheng, Jiun-In Guo

Exploiting a charge sharing method enables a performance power management design for domino circuits. The domino circuits have both high performance and low power consumption. A test chip has been successfully validated using TSMC 0.13um CMOS technology. Reductions in dynamic power consumption of 68% and static power consumption of 15% are achieved.

利用电荷共享方法实现了多米诺电路的性能电源管理设计。该多米诺电路具有高性能和低功耗的特点。采用台积电0.13um CMOS技术的测试芯片已成功验证。动态功耗降低68%，静态功耗降低15%。

引用次数: 0

Network-on-Chip router design with Buffer-Stealing 带Buffer-Stealing的片上网络路由器设计

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)

Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722177

Wan-Ting Su, Jih-Sheng Shen, Pao-Ann Hsiung

Communication in a Network-on-Chip (NoC) can be made more efficient by designing faster routers, using larger buffers, larger number of ports and channels, and adaptive routing, all of which incur significant overheads in hardware costs. As a more economic solution, we try to improve communication efficiency without increasing the buffer size. A Buffer-Stealing (BS) mechanism is proposed, which enables the input channels that have insufficient buffer space to utilize at runtime the unused input buffers from other input channels. Implementation results of the proposed BS design for a 64-bit 5-input-buffer router show a reduction of the average packet transmission latency by up to 10.17% and an increase of the average throughput by up to 23.47%, at an overhead of 22% more hardware resources.

通过设计更快的路由器、使用更大的缓冲区、更多的端口和通道以及自适应路由，可以提高片上网络(NoC)中的通信效率，所有这些都会导致硬件成本方面的重大开销。作为一种更经济的解决方案，我们试图在不增加缓冲区大小的情况下提高通信效率。提出了一种缓冲窃取(buffer - stealing, BS)机制，使缓冲区空间不足的输入通道能够在运行时利用来自其他输入通道的未使用的输入缓冲区。在64位5输入缓冲路由器上的实现结果表明，在硬件资源增加22%的情况下，平均分组传输延迟减少了10.17%，平均吞吐量增加了23.47%。

引用次数: 11

A thermal-aware application specific routing algorithm for Network-on-Chip design 片上网络设计的一种热感知应用专用路由算法

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)

Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722232

Zhiliang Qian, C. Tsui

In this work, we propose an application specific routing algorithm to reduce the hot-spot temperature for Network-on-chip (NoC). Using the traffic information of applications, we develop a routing scheme which can achieve a higher adaptivity than the generic ones and at the same time distribute the traffic more uniformly. A set of deadlock-free admissible paths for all the communications is first obtained. To reduce the hot-spot temperature, we find the optimal distribution ratio of the communication traffic among the set of candidate paths. The problem of finding this optimal distribution ratio is formulated as a linear programming (LP) problem and is solved offline. A router microarchitecture which supports our ratio-based selection policy is also proposed. From the simulation results, the peak energy reduction considering the energy consumption of both the processors and routers can be as high as 16.6% for synthetic traffic and real benchmarks.

在这项工作中，我们提出了一种特定于应用的路由算法来降低片上网络(NoC)的热点温度。利用应用程序的流量信息，我们开发了一种路由方案，该方案比一般的路由方案具有更高的自适应能力，同时使流量分布更加均匀。首先为所有通信获得一组无死锁的可接受路径。为了降低热点温度，我们在候选路径集中找到通信流量的最优分布比例。寻找这种最优分布比的问题被表述为线性规划问题，并且是离线解决的。提出了一种支持基于比例选择策略的路由器微架构。从仿真结果来看，在综合流量和真实基准测试中，考虑处理器和路由器能耗的峰值能耗降低可高达16.6%。

引用次数: 23

Clock tree optimization for Electromagnetic Compatibility (EMC) 电磁兼容性(EMC)时钟树优化

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)

Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722181

Xuchu Hu, Matthew R. Guthaus

Electromagnetic Interference (EMI) generated by electronic systems is increasing with operating frequency and shrinking process technologies. The clock distribution network is one of the major causes of on-chip EMI. In this paper, we discuss the EMI problem in clock tree design. Spectrum analysis shows that slew rate of clock signal is the main parameter determining the high-frequency spectral content distribution. This is the first work to consider maximum and minimum buffer slew rates in clock tree synthesis to reduce EMI. In this paper, we propose a dynamic programming algorithm to optimize the clock tree considering both traditional metrics and Electromagnetic Compatibility (EMC). Our experimental results show that slew can be controlled in a feasible range and high-frequency spectrum contents can be reduced without sacrificing the traditional metrics such as power and skew. With the efficient optimization and pruning method, the biggest benchmark is able to complete in four minutes.

电子系统产生的电磁干扰(EMI)随着工作频率的增加和工艺技术的缩小而增加。时钟分配网络是造成片上电磁干扰的主要原因之一。本文讨论了时钟树设计中的电磁干扰问题。频谱分析表明，时钟信号的摆幅率是决定高频频谱含量分布的主要参数。这是第一个考虑时钟树合成中最大和最小缓冲摆率以减少EMI的工作。在本文中，我们提出了一种动态规划算法来优化时钟树，同时考虑了传统指标和电磁兼容性。实验结果表明，在不牺牲功率和偏度等传统指标的情况下，可以将摆幅控制在可行范围内，减少高频频谱含量。通过高效的优化和修剪方法，最大的基准可以在4分钟内完成。

引用次数: 4

Hierarchical exact symbolic analysis of large analog integrated circuits by symbolic stamps 用符号印章对大型模拟集成电路进行层次精确符号分析

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)

Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722183

Hui Xu, G. Shi, Xiaopeng Li

Linearized small-signal transistor models share the common circuit structure but may take different parameter values in the ac analysis of an analog circuit simulator. This property can be utilized for symbolic circuit analysis. This paper proposes to use a symbolic stamp for all device models in the same circuit for hierarchical symbolic analysis. Two levels of binary decision diagrams (BDDs) are used for maximum data sharing, one for the symbolic device stamp and the other for modified nodal analysis. The symbolic transadmittances of the device stamp share one BDD for storage saving. The modified nodal analysis (MNA) matrix formulated using symbolic stamp is of much lower dimension, hence it can be solved by a determinant decision diagram (DDD) with significantly reduced complexity. A circuit simulator is implemented based on the proposed partitioning architecture. It is able to analyze an op-amp circuit containing 44 MOS transistors exactly for the first time.

线性化的小信号晶体管模型具有相同的电路结构，但在模拟电路模拟器的交流分析中可能采用不同的参数值。这一性质可用于符号电路分析。本文提出对同一电路中的所有器件模型使用符号戳进行分层符号分析。两层二进制决策图(bdd)用于最大限度地共享数据，一层用于符号设备戳，另一层用于修改节点分析。为了节省存储，设备戳的符号trans导纳共享一个BDD。使用符号戳表示的改进节点分析矩阵具有较低的维数，因此可以用行列式决策图(DDD)求解，大大降低了复杂度。在此基础上实现了一个电路模拟器。首次对包含44个MOS晶体管的运放电路进行了精确分析。

引用次数: 17

An optimal algorithm for allocation, placement, and delay assignment of adjustable delay buffers for clock skew minimization in multi-voltage mode designs 多电压模式设计中可调延迟缓冲器的分配、放置和延迟分配的最优算法

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)

Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722242

Kyoung-Hwan Lim, Taewhan Kim

Satisfying clock skew constraint is one of the most important tasks in the clock tree synthesis. Moreover, the task becomes much harder to solve as the clock tree is designed under multiple power mode environment, in which the voltage applied to some design module varies as the power mode changes. Recently, it is shown that adjustable delay buffer (ADB) whose delay can be tuned dynamically can be used to solve the clock skew problem effectively under multiple power modes. However, due to the area/control overhead by ADBs it is very important to minimize the number of ADBs. This work provides a complete solution to the problem of clock skew minimization using ADBs under multiple power modes. We propose a linear-time optimal algorithm that simultaneously solves the problems of computing (1) the minimum number of ADBs to be used, (2) the location at which each ADB is to be placed, and (3) the delay value of each ADB to be assigned to each power mode. Experimental results show that in comparison with the previous work [8] which iteratively performs the ADB allocation, placement, and delay assignment, our integrated algorithm produces consistently better designs for all tested benchmarks under four power modes, reducing the number of ADBs by 9.27% further on average at skew bound of 30ps∼50ps even with shorter clock latencies.

满足时钟偏差约束是时钟树综合中最重要的任务之一。此外，由于时钟树是在多种功率模式环境下设计的，在这种环境下，某些设计模块所施加的电压会随着功率模式的变化而变化，因此解决这个问题变得更加困难。近年来，研究表明，可调延迟缓冲器(ADB)的延迟可动态调整，可以有效地解决多种功率模式下的时钟偏差问题。然而，由于adb的面积/控制开销，减少adb的数量非常重要。这项工作提供了一个完整的解决方案，以时钟倾斜最小化的问题，在多种功率模式下使用ADBs。我们提出了一种线性时间最优算法，该算法同时解决了计算(1)要使用的ADB的最小数量，(2)每个ADB放置的位置，以及(3)每个ADB分配给每种功率模式的延迟值的问题。实验结果表明，与之前迭代执行ADB分配、放置和延迟分配的工作[8]相比，我们的集成算法在四种功率模式下为所有测试基准提供了一致的更好设计，即使在更短的时钟延迟下，在30ps ~ 50ps的斜界下，平均将ADB数量进一步减少9.27%。

{"title":"An optimal algorithm for allocation, placement, and delay assignment of adjustable delay buffers for clock skew minimization in multi-voltage mode designs","authors":"Kyoung-Hwan Lim, Taewhan Kim","doi":"10.1109/ASPDAC.2011.5722242","DOIUrl":"https://doi.org/10.1109/ASPDAC.2011.5722242","url":null,"abstract":"Satisfying clock skew constraint is one of the most important tasks in the clock tree synthesis. Moreover, the task becomes much harder to solve as the clock tree is designed under multiple power mode environment, in which the voltage applied to some design module varies as the power mode changes. Recently, it is shown that adjustable delay buffer (ADB) whose delay can be tuned dynamically can be used to solve the clock skew problem effectively under multiple power modes. However, due to the area/control overhead by ADBs it is very important to minimize the number of ADBs. This work provides a complete solution to the problem of clock skew minimization using ADBs under multiple power modes. We propose a linear-time optimal algorithm that simultaneously solves the problems of computing (1) the minimum number of ADBs to be used, (2) the location at which each ADB is to be placed, and (3) the delay value of each ADB to be assigned to each power mode. Experimental results show that in comparison with the previous work [8] which iteratively performs the ADB allocation, placement, and delay assignment, our integrated algorithm produces consistently better designs for all tested benchmarks under four power modes, reducing the number of ADBs by 9.27% further on average at skew bound of 30ps∼50ps even with shorter clock latencies.","PeriodicalId":316253,"journal":{"name":"16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)","volume":"218 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134397280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16

Handling dynamic frequency changes in statically scheduled cycle-accurate simulation 处理动态频率变化的静态调度周期精确模拟

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)

Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722224

M. Gligor, F. Pétrot

Although high level simulation models are being increasingly used for digital electronic system validation, cycle accuracy is still required in some cases, such as hardware protocol validation or accurate power/energy estimation. Cycle-accurate simulation is however slow and acceleration approaches make the assumption of a single constant clock, which is not true anymore with the generalization of dynamic voltage and frequency scaling techniques. Fast cycle-accurate simulators supporting several clocks whose frequencies can change at run time are thus needed. This paper presents two algorithms we designed for this purpose and details their properties and implementations.

虽然高级仿真模型越来越多地用于数字电子系统验证，但在某些情况下仍然需要周期精度，例如硬件协议验证或准确的功率/能量估计。然而，周期精确的仿真速度很慢，加速方法假设一个恒定的时钟，随着动态电压和频率标度技术的推广，这种假设不再成立。因此需要支持多个时钟的快速周期精确模拟器，这些时钟的频率可以在运行时改变。本文介绍了我们为此目的设计的两种算法，并详细介绍了它们的特性和实现。

引用次数: 2

Ultra-low power microcontrollers for portable, wearable, and implantable medical electronics 用于便携式，可穿戴和植入式医疗电子产品的超低功耗微控制器

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)

Pub Date : 2011-01-25 DOI: 10.1109/ASPDAC.2011.5722252

S. Sridhara

An aging population, coupled with choices on diet and lifestyle, is causing an increased demand for portable, wearable, and implantable medical devices that enable chronic disease management and wellness assessment. Battery life specifications drive the power consumption requirements of integrated circuits in these devices. Microcontrollers provide the right combination of programmability, cost, performance, and power consumption needed to realize such devices. In this paper, we describe microcontrollers that are enabling today's medical applications and discuss innovations necessary for enabling future applications with sophisticated signal processing needs. As an example, we present the design of an embedded microcontroller system-on-chip that achieves the first sub-microwatt per channel electroencephalograph (EEG) seizure detection.

人口老龄化，加上饮食和生活方式的选择，导致对便携式、可穿戴和植入式医疗设备的需求增加，这些设备可以进行慢性疾病管理和健康评估。电池寿命规格驱动了这些设备中集成电路的功耗要求。微控制器提供了实现此类器件所需的可编程性、成本、性能和功耗的正确组合。在本文中，我们描述了实现当今医疗应用的微控制器，并讨论了实现具有复杂信号处理需求的未来应用所需的创新。作为一个例子，我们提出了一个嵌入式微控制器片上系统的设计，实现了第一个亚微瓦每通道脑电图(EEG)癫痫检测。

引用次数: 19

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀