2009 International Symposium on VLSI Design, Automation and Test最新文献

英文中文

A detailed router for hierarchical FPGAs based on simulated evolution 一种基于模拟进化的分层fpga路由器

2009 International Symposium on VLSI Design, Automation and Test

Pub Date : 2009-04-28 DOI: 10.1109/VDAT.2009.5158108

Kewei Zhu, Yici Cai, Qiang Zhou, Xianlong Hong

This paper presents a new detialed router for the hierarchical field programmable gate arrays (H-FPGAs). The optimal objectives of proposed routing algorithm are improving the time consumption of routing procedure (minimizing the running time of algorithm), and at the same time make great effort to decrease the wire length and critical path delay. Initially, nets are routed sequentially according to their criticalities. Then, to achieve optimization targets, the nets violating routablity constrains are resolved iteratively by a rip-up and rerouting router using the simulated evolution optimization technique, where each net will be evaluated via a rip-up priority function consisting of the timing part and the congestion part, and then compared to a random number to decide if it will be ripped and rerouted. An experimental result under commercial H-FPGA shows that our router can have about 26% improvement on the time-consumption and 0.45% reduction on total wire length when compared with a modified VPR.

本文提出了一种用于分层现场可编程门阵列(h - fpga)的新型详细路由器。所提出的路由算法的最优目标是提高路由过程的时间消耗(使算法的运行时间最小化)，同时努力减小线长和关键路径延迟。最初，网络是按临界顺序路由的。然后，为了实现优化目标，使用模拟进化优化技术，通过撕裂和重路由路由器迭代解决违反可达性约束的网络，其中每个网络将通过由时序部分和拥塞部分组成的撕裂优先级函数进行评估，然后将其与随机数进行比较，以确定是否将被撕裂和重路由。在商用H-FPGA上的实验结果表明，与改进后的VPR相比，该路由器的耗时提高了26%，总线长减少了0.45%。

引用次数: 2

Prefetching for array data in embedded Java hardware accelerator 嵌入式Java硬件加速器中数组数据的预取

2009 International Symposium on VLSI Design, Automation and Test

Pub Date : 2009-04-28 DOI: 10.1109/VDAT.2009.5158170

Yi-Ruei Wu, Yu-Sheng Chen, J. Shann

For improving the efficiency of a program, it is important to reduce stalls caused by memory access. Traditional programs usually spend much time on memory stalls when accessing lower-level memory, and so do Java programs. In order to reduce memory stall time, prefetching is a feasible solution. We observed some obvious properties of array access so that we could prefetch array data by taking advantage of these properties. We analyzed these properties and proposed a suitable array prefetching mechanism for embedded Java hardware accelerators so as to reduce the time spent on memory stalls. Our approach eliminates about 25% of array stall time on average and even up to 50% for some array-based programs.

为了提高程序的效率，减少由内存访问引起的停顿是很重要的。传统程序在访问底层内存时通常会花费大量时间在内存中断上，Java程序也是如此。为了减少内存失速时间，预取是一种可行的解决方案。我们观察了数组访问的一些明显属性，因此我们可以利用这些属性预取数组数据。我们分析了这些特性，并提出了一种适合嵌入式Java硬件加速器的数组预取机制，以减少内存占用时间。我们的方法平均减少了大约25%的数组停顿时间，对于一些基于数组的程序甚至可以减少50%。

引用次数: 0

Hierarchical architecture for network-on-chip platform 片上网络平台的分层结构

2009 International Symposium on VLSI Design, Automation and Test

Pub Date : 2009-04-28 DOI: 10.1109/VDAT.2009.5158165

Lianq-Yu Lin, Huang-Kai Lin, Cheng-Yeh Wang, Lan-Da Van, Jing-Yang Jou

In this paper, we propose one hierarchical 2-D mesh Network-on-Chip (NoC) platform to support applications with the complexity of several hundreds of tasks or with huge amount of transmission data. Moreover, applying the task binding method by considering communication amount, communication data contention and bandwidth penalty to enhance the system overall performance of the new architecture. Modeling the NoC system data transmission behavior at system level is applied to predict system overall performance and an automatic NoC system performance simulation tonl is also built. Therefore, architecture and designers can predict the system performance and obtain all parameters of the designed platform at system abstraction level. The experimental results show that the overall system throughput, the latency, and the saving of redundant transactions are improved by 27%, 14.4% and 21.8% respectively under the communication dominated situation.

在本文中，我们提出了一种分层二维网格片上网络(NoC)平台，以支持具有数百个任务复杂性或传输数据量巨大的应用。此外，在考虑通信量、通信数据争用和带宽损失的情况下，应用任务绑定方法提高了新架构的系统整体性能。在系统层面对NoC系统数据传输行为进行建模，预测系统整体性能，并建立了NoC系统性能自动仿真模型。因此，架构师和设计人员可以在系统抽象层预测系统性能并获得所设计平台的所有参数。实验结果表明，在通信占主导地位的情况下，系统的整体吞吐量、延迟和冗余事务的节省分别提高了27%、14.4%和21.8%。

引用次数: 1

A 6-GS/s, 6-bit, at-speed testable ADC and DAC pair in 0.13µm CMOS 一个6-GS/s, 6位，高速可测试的ADC和DAC对在0.13µm CMOS

2009 International Symposium on VLSI Design, Automation and Test

Pub Date : 2009-04-28 DOI: 10.1109/VDAT.2009.5158131

Chen-Kang Ho, Hao-Chiao Hong

This paper demonstrates a 6-GS/s 6-bit flash ADC and current-steering DAC pair in 0.13µm CMOS. Averaging and interpolating techniques are applied to reduce the offsets and to save the power of the ADC. Current mode logics are used to achieve a high speed and to overcome the severe power bouncing issue. Design-for-testability circuits are added to conduct the at-speed tests by internally cascading the ADC and DAC. The cascaded ADC and DAC pair clocked at 6GHz achieves a 37.0 dB signal-to-noise ratio and a 26.0 dBc spurious-free dynamic range with the −1 dBFS, 502 MHz stimulus. The ADC and DAC consumes 655 mW and 115 mW from a 1.2-V supply, respectively.

本文演示了一个6-GS/s的6位闪存ADC和电流导向DAC对在0.13µm CMOS。采用平均和插值技术来减少偏移量并节省ADC的功率。电流模式逻辑用于实现高速和克服严重的功率反弹问题。增加了可测试性设计电路，通过内部级联ADC和DAC来进行高速测试。频率为6GHz的级联ADC和DAC对在- 1 dBFS, 502 MHz激励下实现了37.0 dB的信噪比和26.0 dBc的无杂散动态范围。ADC和DAC分别从1.2 v电源消耗655mw和115mw。

引用次数: 2

Implementation of 6-Port 3D transformer in injection-locked frequency divider 注入锁定分频器中6端口3D变压器的实现

2009 International Symposium on VLSI Design, Automation and Test

Pub Date : 2009-04-28 DOI: 10.1109/VDAT.2009.5158135

S. Jang, Chia‐Wei Tai, Cheng-Chen Liu

This paper proposes a 6-port 3-dimensional (3-D) transformer used to improve the performance of injection-locked frequency divider (ILFD). The aim of the 3-D transformer is to reduce chip size and to reduce power consumption. The CMOS LC-tank ILFD is implemented using the direct injection nMOS between the differential outputs of an nMOS-core cross-coupled VCO. At the supply voltage of 0.6 V, the free-running frequency of ILFD is tunable from 4.81 GHz to 5.3 GHz. At the incident power of 0 dBm and VDD = 0.6 V, the total locking range is about 3 GHz, from the incident frequency 8.9 to 11.9 GHz for the ILFD in the divide-by-2 mode. The core power consumption is 1.02 mW. The die area is 0.394 × 0.623 mm2.

为了提高注入锁定分频器(ILFD)的性能，提出了一种6端口三维变压器。3d变压器的目标是减小芯片尺寸并降低功耗。CMOS LC-tank ILFD是在nMOS核心交叉耦合VCO的差分输出之间使用直接注入nMOS实现的。在电源电压为0.6 V时，ILFD的自由工作频率在4.81 GHz到5.3 GHz之间可调。当入射功率为0 dBm, VDD = 0.6 V时，除以2模式下ILFD的总锁定范围约为3 GHz，从入射频率8.9到11.9 GHz。核心功耗为1.02 mW。模具面积0.394 × 0.623 mm2。

引用次数: 5

Static and dynamic test power reduction in scan-based testing 静态和动态测试功率降低扫描测试

2009 International Symposium on VLSI Design, Automation and Test

Pub Date : 2009-04-28 DOI: 10.1109/VDAT.2009.5158094

Sying-Jyan Wang, Shun-Jie Huang, Katherine Shu-Min Li

Static power due to leakage current will become a major source of power consumption in the nanometer technology era. In this paper, we propose a simple yet effective technique to reduce both static and dynamic power consumption in the scan test process. The leakage current is restrained by selecting a good primary input vector to control the paths of leakage current during the scan shift process, and this vector can also be used to reduce dynamic power. However, the reverse is not always true. A heuristic algorithm is presented to find such vectors. The proposed method is simulated by SPICE with BPTM 22nm transistor model, and the results show that on the average 15% total power reduction is achievable by the proposed method.

泄漏电流产生的静电将成为纳米技术时代的主要功耗来源。在本文中，我们提出了一种简单而有效的技术来降低扫描测试过程中的静态和动态功耗。通过选择合适的一次输入矢量来控制扫描移位过程中漏电流的路径，从而抑制漏电流的产生，并利用该矢量来降低动态功率。然而，反过来并不总是正确的。提出了一种寻找此类向量的启发式算法。采用BPTM 22nm晶体管模型对该方法进行了SPICE仿真，结果表明，该方法平均可实现15%的总功耗降低。

引用次数: 0

An area efficient shared synapse cellular neural network for low power image processing 面向低功耗图像处理的区域高效共享突触细胞神经网络

2009 International Symposium on VLSI Design, Automation and Test

Pub Date : 2009-04-28 DOI: 10.1109/VDAT.2009.5158148

Jinwook Oh, Seungjin Lee, Joo-Young Kim, H. Yoo

This paper presents an area and power efficient cellular neural network (CNN) that enables real-time image processing. The proposed shared synapse architecture halves the number of required synapse multipliers, which are the main contributor to area and power consumption of CNNs. For this, a current holder circuit is used to sample and hold the currents of non-changing synaptic circuit outputs. Compared to the conventional architecture of CNNs, power and area are reduced by 46% and 41%, respectively.

本文提出了一种面积和功率效率高的细胞神经网络(CNN)，可以实现实时图像处理。所提出的共享突触架构将所需的突触乘法器数量减半，而突触乘法器是影响cnn面积和功耗的主要因素。为此，电流保持电路用于采样和保持不变的突触电路输出的电流。与传统的cnn架构相比，功耗和面积分别降低了46%和41%。

引用次数: 0

Incremental physical design method for flat SOC design 平面SOC设计的增量物理设计方法

2009 International Symposium on VLSI Design, Automation and Test

Pub Date : 2009-04-28 DOI: 10.1109/VDAT.2009.5158167

Li-Yi Lin, Hsin-chang Lin, Shih-Arn Hwang

SOC designs for consumer electronics often evolve generation by generation in a very short time. Besides the needs for merging more functionality, more and more enhancements are for the purpose of interface upgrading for new standards and better or faster signal processing hardware engines for video/audio encoding and decoding. Physical designs for these kinds of enhanced chips can reuse large potions of the previous layout and do not need to be re-implemented from the ground up to shorten the time to market. However, traditional physical incremental design method is becoming impractical, especially for the flat design, which usually can has the advantage of the smaller die size compared with the hierarchical design. In this paper, we propose an incremental physical design method to take the advantages of the hieratical design while maintaining the cost strength in the flat design. Our proposed method has been successfully applied to our next generation multimedia chip and the results show that no design iteration is needed and the run time is at least 5 times faster compared with the traditional method.

消费类电子产品的SOC设计通常在很短的时间内一代又一代地发展。除了合并更多功能的需求外，越来越多的增强是为了升级接口以适应新标准，以及更好或更快的信号处理硬件引擎，用于视频/音频编码和解码。这类增强芯片的物理设计可以重用之前布局的大部分内容，而不需要从头开始重新实现，从而缩短上市时间。然而，传统的物理增量设计方法正变得越来越不实用，特别是对于平面设计而言，与分层设计相比，这种方法通常具有更小的模具尺寸的优点。在本文中，我们提出了一种增量物理设计方法，在保持平面设计成本强度的同时，利用分层设计的优势。该方法已成功应用于我们的下一代多媒体芯片，结果表明，该方法无需重复设计，运行时间比传统方法快至少5倍。

引用次数: 2

Transforming RF and mm-Wave CMOS circuits 变换射频和毫米波CMOS电路

2009 International Symposium on VLSI Design, Automation and Test

Pub Date : 2009-04-28 DOI: 10.1109/VDAT.2009.5158114

A. Niknejad, D. Chowdhury

This papers highlights the design of RF and mm-wave building blocks which achieve high performance despite implementation in low-voltage digital CMOS technology. It is shown that integrated circuit transformers can greatly aid the design of such circuits by serving many roles, such as impedance matching, AC coupling, voltage combining, and converting signals from single-ended to differential form. Design, layout, and simulation strategies will be reviewed and several key power amplifier (PA) building blocks will be demonstrated.

本文重点介绍了射频和毫米波构建模块的设计，尽管在低压数字CMOS技术中实现，但仍能实现高性能。研究表明，集成电路变压器可以通过阻抗匹配、交流耦合、电压组合以及将单端信号转换为差分形式来极大地帮助设计这种电路。设计，布局和仿真策略将进行审查，并将演示几个关键的功率放大器(PA)构建模块。

引用次数: 3

A frequency synthesizer for Mode-1 MB-OFDM UWB applications 频率合成器的模式1 MB-OFDM UWB应用

2009 International Symposium on VLSI Design, Automation and Test

Pub Date : 2009-04-28 DOI: 10.1109/VDAT.2009.5158134

Jung-Yu Chang, Che-Wei Fan, Shen-Iuan Liu

A frequency synthesizer for Mode-1 MB-OFDM UWB applications is realized in 65nm CMOS. By using a delay-locked loop (DLL) and the proposed multiply-by-two circuit, the frequency synthesizer achieves the in-band spur of −40dBc for the three-band operation. The proposed multiply-by-2 circuit realizes the quadrature signals, and its input signals do not need the 50% duty cycle. A modified current-starving cell in a DLL is also proposed to reduce the supply noise sensitivity. The measured switching time from 3.342GHz to 4.488GHz is around 1.1ns. The area is 1.25×1.175mm2 with pads and the power is 19.2mW for 1.2V supply.

在65nm CMOS上实现了一种适用于Mode-1 MB-OFDM UWB应用的频率合成器。该频率合成器采用锁延环(DLL)和乘二电路，实现了−40dBc的带内杂散。所提出的乘2电路实现了正交信号，其输入信号不需要50%占空比。为了降低电源噪声的灵敏度，还提出了一种改进的缺流单元。从3.342GHz到4.488GHz的切换时间约为1.1ns。面积为1.25×1.175mm2，带衬垫，1.2V供电，功率为19.2mW。

引用次数: 5

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2009 International Symposium on VLSI Design, Automation and Test

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀