首页 > 最新文献

2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)最新文献

英文 中文
Self-Aligned Double and Quadruple Patterning-aware grid routing with hotspots control 具有热点控制的自对齐双和四重模式感知网格路由
Pub Date : 2013-04-29 DOI: 10.1109/ASPDAC.2013.6509607
C. Kodama, H. Ichikawa, Koichi Nakayama, T. Kotani, S. Nojima, S. Mimotogi, S. Miyamoto, A. Takahashi
Although Self-Aligned Double and Quadruple Patterning (SADP, SAQP) have become the most promising processes for sub-20 nm and sub-14 nm node advanced technologies, not all wafer images are realized by them. In advanced technologies, feasible wafer images should be generated effectively by utilizing SADP and SAQP where a wafer image is uniquely determined by a selected mandrel pattern. However, predicting the wafer image of a mandrel pattern is not easy. In this paper, we propose a routing method of generating a feasible wafer image satisfying the connection requirements. Routing algorithms comprising simple connecting and cutting rules are performed on a new grid structure where two (SADP) or three colors (SAQP) are assigned alternately to grid-nodes. Then a mandrel pattern is selected without complex coloring or decomposition methods. Also, hotspot reduction by dummy pattern flipping is proposed. In experiments, feasible wafer images meeting the connection requirements are generated and the effectiveness of the proposed framework is confirmed.
虽然自对准双和四重模式(SADP, SAQP)已经成为亚20纳米和亚14纳米节点先进技术中最有前途的工艺,但并非所有的晶圆图像都是由它们实现的。在先进的技术中,可行的晶圆图像应利用SADP和SAQP有效地生成,其中晶圆图像是由选定的芯轴图案唯一确定的。然而,预测芯轴图案的晶圆图像并不容易。在本文中,我们提出了一种路由方法来产生满足连接要求的可行晶圆图像。在网格节点交替分配两种颜色(SADP)或三种颜色(SAQP)的新网格结构上执行包含简单连接和切割规则的路由算法。然后选择一个芯棒图案,没有复杂的着色或分解方法。此外,还提出了用虚拟模式翻转来减少热点的方法。通过实验,生成了符合连接要求的可行的晶圆图像,验证了该框架的有效性。
{"title":"Self-Aligned Double and Quadruple Patterning-aware grid routing with hotspots control","authors":"C. Kodama, H. Ichikawa, Koichi Nakayama, T. Kotani, S. Nojima, S. Mimotogi, S. Miyamoto, A. Takahashi","doi":"10.1109/ASPDAC.2013.6509607","DOIUrl":"https://doi.org/10.1109/ASPDAC.2013.6509607","url":null,"abstract":"Although Self-Aligned Double and Quadruple Patterning (SADP, SAQP) have become the most promising processes for sub-20 nm and sub-14 nm node advanced technologies, not all wafer images are realized by them. In advanced technologies, feasible wafer images should be generated effectively by utilizing SADP and SAQP where a wafer image is uniquely determined by a selected mandrel pattern. However, predicting the wafer image of a mandrel pattern is not easy. In this paper, we propose a routing method of generating a feasible wafer image satisfying the connection requirements. Routing algorithms comprising simple connecting and cutting rules are performed on a new grid structure where two (SADP) or three colors (SAQP) are assigned alternately to grid-nodes. Then a mandrel pattern is selected without complex coloring or decomposition methods. Also, hotspot reduction by dummy pattern flipping is proposed. In experiments, feasible wafer images meeting the connection requirements are generated and the effectiveness of the proposed framework is confirmed.","PeriodicalId":297528,"journal":{"name":"2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126158459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 38
Simplification of C-RTL equivalent checking for fused multiply add unit using intermediate models 用中间模型简化融合乘加单元的C-RTL等效校核
Pub Date : 2013-04-29 DOI: 10.1109/ASPDAC.2013.6509686
Bin Xue, Prosenjit Chatterjee, S. Shukla
The functionality of Fused multiply add (FMA) design can be formally verified by comparing its register transition level (RTL) implementation against its system level specification often modeled by C/C++ language using sequential equivalent checking (SEC). However, C-RTL SEC does not scale for FMA because of the huge discrepancy existed between the two models. This paper analyzes the dissimilarities and proposes two intermediate models, one abstract RTL and one rewritten C model to bridge the gap. The original SEC proof are partitioned into three sub-proofs among intermediate models where a variety of simplification techniques are applied to further reduce the complexity. Experiments from an industry project show that with the two intermediate models, the SEC proof is complete and scalable for FMA design.
融合乘加(FMA)设计的功能可以通过比较其寄存器转换级(RTL)实现与系统级规范(通常由C/ c++语言使用顺序等效检查(SEC)建模)进行形式化验证。然而,由于两种模型之间存在巨大的差异,C-RTL SEC不适合FMA。本文分析了两者之间的差异,提出了两种中间模型,一种是抽象的RTL模型,另一种是重写的C模型。原始证交会证明在中间模型中被划分为三个子证明,并应用了各种简化技术来进一步降低复杂性。一个工业项目的实验表明,使用这两个中间模型,SEC证明是完整的,并且可扩展用于FMA设计。
{"title":"Simplification of C-RTL equivalent checking for fused multiply add unit using intermediate models","authors":"Bin Xue, Prosenjit Chatterjee, S. Shukla","doi":"10.1109/ASPDAC.2013.6509686","DOIUrl":"https://doi.org/10.1109/ASPDAC.2013.6509686","url":null,"abstract":"The functionality of Fused multiply add (FMA) design can be formally verified by comparing its register transition level (RTL) implementation against its system level specification often modeled by C/C++ language using sequential equivalent checking (SEC). However, C-RTL SEC does not scale for FMA because of the huge discrepancy existed between the two models. This paper analyzes the dissimilarities and proposes two intermediate models, one abstract RTL and one rewritten C model to bridge the gap. The original SEC proof are partitioned into three sub-proofs among intermediate models where a variety of simplification techniques are applied to further reduce the complexity. Experiments from an industry project show that with the two intermediate models, the SEC proof is complete and scalable for FMA design.","PeriodicalId":297528,"journal":{"name":"2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"0510 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125823962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A dynamic stream link for efficient data flow control in NoC based heterogeneous MPSoC 基于NoC的异构MPSoC中高效数据流控制的动态流链路
Pub Date : 2013-04-29 DOI: 10.1109/ASPDAC.2013.6509556
C. Helmstetter, Sylvain Basset, R. Lemaire, F. Clermidy, P. Vivet, M. Langevin, Chuck Pilkington, P. Paulin, D. Fuin
As Systems-on-Chip size increase, the communication costs become critical and Networks-on-Chip (NoC) bring innovative solutions. Efficient stream-based protocols over NoC have been widely studied to address dataflow communications. They are usually controlled by a set of static parameters. However, new applications, such as high-resolution video decoders, present more data-dependent behaviors forcing communication protocols to support higher dynamicity. For this purpose, we present in this paper dynamic stream links for stream-based end-to-end NoC communications by introducing two link protocols, both independent of the transfer size, allowing to improve the hardware/software control flexibility. The proposed protocols have been modeled in a MPSoC virtual platform and the hardware cost evaluated. Based on simulations, we provide guidelines to exploit these protocols according to application needs.
随着片上系统尺寸的增加,通信成本变得至关重要,而片上网络(NoC)带来了创新的解决方案。基于NoC的高效流协议已被广泛研究以解决数据流通信问题。它们通常由一组静态参数控制。然而,新的应用,如高分辨率视频解码器,呈现出更多的数据依赖行为,迫使通信协议支持更高的动态性。为此,我们在本文中通过引入两个独立于传输大小的链路协议,提出了基于流的端到端NoC通信的动态流链路,从而提高了硬件/软件控制的灵活性。在MPSoC虚拟平台上对所提出的协议进行了建模,并对硬件成本进行了评估。在仿真的基础上,我们提供了根据应用需要利用这些协议的指南。
{"title":"A dynamic stream link for efficient data flow control in NoC based heterogeneous MPSoC","authors":"C. Helmstetter, Sylvain Basset, R. Lemaire, F. Clermidy, P. Vivet, M. Langevin, Chuck Pilkington, P. Paulin, D. Fuin","doi":"10.1109/ASPDAC.2013.6509556","DOIUrl":"https://doi.org/10.1109/ASPDAC.2013.6509556","url":null,"abstract":"As Systems-on-Chip size increase, the communication costs become critical and Networks-on-Chip (NoC) bring innovative solutions. Efficient stream-based protocols over NoC have been widely studied to address dataflow communications. They are usually controlled by a set of static parameters. However, new applications, such as high-resolution video decoders, present more data-dependent behaviors forcing communication protocols to support higher dynamicity. For this purpose, we present in this paper dynamic stream links for stream-based end-to-end NoC communications by introducing two link protocols, both independent of the transfer size, allowing to improve the hardware/software control flexibility. The proposed protocols have been modeled in a MPSoC virtual platform and the hardware cost evaluated. Based on simulations, we provide guidelines to exploit these protocols according to application needs.","PeriodicalId":297528,"journal":{"name":"2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121943846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Support tools for porting legacy applications to multicore 支持将遗留应用程序移植到多核的工具
Pub Date : 2013-04-29 DOI: 10.1109/ASPDAC.2013.6509658
Yuri Ardila, Natsuki Kawai, Takashi Nakamura, Yosuke Tamura
This paper presents PEMAP, an automated performance estimation tool to project performance of hand-parallelized programs from sequential programs and BEMAP, a benchmark suite to measure an auto-parallelizer or even a machine's performance. BEMAP is an open-source project, and the documentations on code explanations and experimental results are also provided. Our experiments on PEMAP shows we can estimate performance of hand-parallelized programs in an error of 0.44% of sequential program's performance on average, while using BEMAP shows that the ability of an auto-parallelizer can be measured by comparing the compiled code to the handtuned parallelized OpenCL code, and therefore assisting the development of the auto-parallelizer tool.
本文介绍了PEMAP,一个自动性能评估工具,用于从顺序程序中预测手动并行程序的性能,BEMAP是一个基准套件,用于测量自动并行化甚至机器的性能。BEMAP是一个开源项目,并提供了有关代码解释和实验结果的文档。我们在PEMAP上的实验表明,我们可以估计手动并行化程序的性能,平均误差为顺序程序性能的0.44%,而使用BEMAP表明,自动并行化的能力可以通过将编译代码与手动调优的并行化OpenCL代码进行比较来衡量,从而帮助开发自动并行化工具。
{"title":"Support tools for porting legacy applications to multicore","authors":"Yuri Ardila, Natsuki Kawai, Takashi Nakamura, Yosuke Tamura","doi":"10.1109/ASPDAC.2013.6509658","DOIUrl":"https://doi.org/10.1109/ASPDAC.2013.6509658","url":null,"abstract":"This paper presents PEMAP, an automated performance estimation tool to project performance of hand-parallelized programs from sequential programs and BEMAP, a benchmark suite to measure an auto-parallelizer or even a machine's performance. BEMAP is an open-source project, and the documentations on code explanations and experimental results are also provided. Our experiments on PEMAP shows we can estimate performance of hand-parallelized programs in an error of 0.44% of sequential program's performance on average, while using BEMAP shows that the ability of an auto-parallelizer can be measured by comparing the compiled code to the handtuned parallelized OpenCL code, and therefore assisting the development of the auto-parallelizer tool.","PeriodicalId":297528,"journal":{"name":"2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116217313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
An adaptive filtering mechanism for energy efficient data prefetching 一种节能数据预取的自适应滤波机制
Pub Date : 2013-04-29 DOI: 10.1109/ASPDAC.2013.6509617
Xianglei Dang, Xiaoyin Wang, Dong Tong, Zichao Xie, Lingda Li, Keyi Wang
As data prefetching is used in embedded processors, it is crucial to reduce the wasted energy for improving the energy efficiency. In this paper, we propose an adaptive prefetch filtering (APF) mechanism to reduce the wasted bandwidth and energy as well as the cache pollution caused by useless prefetches. APF records the prefetch-victim address pairs of issued prefetches and collects information about which address in each pair is first accessed by the processor to guide the filtering of new generated useless prefetches. Meanwhile, filtered prefetches are recorded for building the feedback mechanism to avoid filtering useful prefetches. Experimental results demonstrate that APF reduces useless prefetches by an average of 53.81% with a mere 5.28% reduction of useful prefetches, thus reducing the memory access bandwidth consumption by 59.92% and the L2 cache energy by 6.19%. APF also improves the performance of several programs by reducing the cache pollution incurred by useless prefetches, thus gaining an average performance improvement of 2.12%.
数据预取应用于嵌入式处理器中,减少数据预取的能量浪费是提高处理器能效的关键。本文提出了一种自适应预取滤波(APF)机制,以减少无用预取造成的带宽和能量浪费以及缓存污染。APF记录发出预取的预取受害者地址对,并收集每个地址对中哪个地址首先被处理器访问的信息,以指导过滤新生成的无用预取。同时,记录过滤后的预取,建立反馈机制,避免过滤有用的预取。实验结果表明,APF平均减少了53.81%的无用预取,减少了5.28%的有用预取,从而减少了59.92%的内存访问带宽消耗和6.19%的L2缓存能量。APF还通过减少无用的预取带来的缓存污染来提高几个程序的性能,从而获得2.12%的平均性能提升。
{"title":"An adaptive filtering mechanism for energy efficient data prefetching","authors":"Xianglei Dang, Xiaoyin Wang, Dong Tong, Zichao Xie, Lingda Li, Keyi Wang","doi":"10.1109/ASPDAC.2013.6509617","DOIUrl":"https://doi.org/10.1109/ASPDAC.2013.6509617","url":null,"abstract":"As data prefetching is used in embedded processors, it is crucial to reduce the wasted energy for improving the energy efficiency. In this paper, we propose an adaptive prefetch filtering (APF) mechanism to reduce the wasted bandwidth and energy as well as the cache pollution caused by useless prefetches. APF records the prefetch-victim address pairs of issued prefetches and collects information about which address in each pair is first accessed by the processor to guide the filtering of new generated useless prefetches. Meanwhile, filtered prefetches are recorded for building the feedback mechanism to avoid filtering useful prefetches. Experimental results demonstrate that APF reduces useless prefetches by an average of 53.81% with a mere 5.28% reduction of useful prefetches, thus reducing the memory access bandwidth consumption by 59.92% and the L2 cache energy by 6.19%. APF also improves the performance of several programs by reducing the cache pollution incurred by useless prefetches, thus gaining an average performance improvement of 2.12%.","PeriodicalId":297528,"journal":{"name":"2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121335233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Piecewise-polynomial associated transform macromodeling algorithm for fast nonlinear circuit simulation 快速非线性电路仿真的分段多项式关联变换宏建模算法
Pub Date : 2013-04-29 DOI: 10.1109/ASPDAC.2013.6509648
Yang Zhang, N. Fong, N. Wong
We present a piecewise-polynomial based associated transform algorithm (PWPAT) for macromodeling nonlinear circuits in system-level circuit design. The generated reduced model can provide both global and local accuracies with the most compact dimension. Numerical examples compare it with existing algorithms and verify its superior accuracy in higher order harmonics simulation over traditional Trajectory Piecewise-Linear (TPWL) approach.
提出了一种基于分段多项式的关联变换算法(PWPAT),用于系统级电路设计中非线性电路的宏建模。生成的约简模型既能提供全局精度,又能提供局部精度,且维数最紧凑。数值算例将其与现有算法进行比较,验证了其在高次谐波仿真中的精度优于传统的轨迹分段线性(TPWL)方法。
{"title":"Piecewise-polynomial associated transform macromodeling algorithm for fast nonlinear circuit simulation","authors":"Yang Zhang, N. Fong, N. Wong","doi":"10.1109/ASPDAC.2013.6509648","DOIUrl":"https://doi.org/10.1109/ASPDAC.2013.6509648","url":null,"abstract":"We present a piecewise-polynomial based associated transform algorithm (PWPAT) for macromodeling nonlinear circuits in system-level circuit design. The generated reduced model can provide both global and local accuracies with the most compact dimension. Numerical examples compare it with existing algorithms and verify its superior accuracy in higher order harmonics simulation over traditional Trajectory Piecewise-Linear (TPWL) approach.","PeriodicalId":297528,"journal":{"name":"2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126811411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Local approximation improvement of trajectory piecewise linear macromodels through Chebyshev interpolating polynomials 用切比雪夫插值多项式改进轨迹分段线性宏模型的局部逼近
Pub Date : 2013-04-29 DOI: 10.1109/ASPDAC.2013.6509693
M. Farooq, L. Xia
We introduce the concept of two dimensional (2D) scalability of trajectory piecewise linear (TPWL) through the exploitation of Chebyshev interpolating polynomials in each piecewise region. The goal of 2D scalability is to improve the local approximation properties of TPWL macromodels. Horizontal scalability is achieved through the reduction of number of linearization points along the trajectory; vertical scalability is obtained by extending the scope of macromodel to predict the response of a nonlinear system for inputs far from training trajectory. In this way more efficient macromodels are obtained in terms of simulation speed up of complex nonlinear systems. The methodology developed is to predict the nonlinear responses generated by faults introduced in Micro Electro-Mechanical Systems (MEMS) accelerometer during fabrication, that are used to obtain the seismic images for oil and gas discovery. We provide the implementation details and illustrate the 2D scalability concept with an example using nonlinear transmission line.
通过利用切比雪夫插值多项式,引入了轨迹分段线性(TPWL)的二维可扩展性概念。二维可扩展性的目标是改善TPWL宏模型的局部逼近特性。通过减少沿轨迹的线性化点数量来实现水平可扩展性;通过扩展宏模型的范围来预测远离训练轨迹的非线性系统的响应,从而获得垂直可扩展性。这种方法在提高复杂非线性系统的仿真速度方面得到了更有效的宏观模型。所开发的方法是预测微机电系统(MEMS)加速度计在制造过程中引入的故障产生的非线性响应,用于获得石油和天然气发现的地震图像。我们提供了实现细节,并以非线性传输线为例说明了二维可扩展性的概念。
{"title":"Local approximation improvement of trajectory piecewise linear macromodels through Chebyshev interpolating polynomials","authors":"M. Farooq, L. Xia","doi":"10.1109/ASPDAC.2013.6509693","DOIUrl":"https://doi.org/10.1109/ASPDAC.2013.6509693","url":null,"abstract":"We introduce the concept of two dimensional (2D) scalability of trajectory piecewise linear (TPWL) through the exploitation of Chebyshev interpolating polynomials in each piecewise region. The goal of 2D scalability is to improve the local approximation properties of TPWL macromodels. Horizontal scalability is achieved through the reduction of number of linearization points along the trajectory; vertical scalability is obtained by extending the scope of macromodel to predict the response of a nonlinear system for inputs far from training trajectory. In this way more efficient macromodels are obtained in terms of simulation speed up of complex nonlinear systems. The methodology developed is to predict the nonlinear responses generated by faults introduced in Micro Electro-Mechanical Systems (MEMS) accelerometer during fabrication, that are used to obtain the seismic images for oil and gas discovery. We provide the implementation details and illustrate the 2D scalability concept with an example using nonlinear transmission line.","PeriodicalId":297528,"journal":{"name":"2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132772178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A 6.72-Gb/s, 8pJ/bit/iteration WPAN LDPC decoder in 65nm CMOS 基于65nm CMOS的6.72 gb /s, 8pJ/bit/迭代WPAN LDPC解码器
Pub Date : 2013-04-29 DOI: 10.1109/ASPDAC.2013.6509569
Zhixiang Chen, Xiao Peng, Xiongxin Zhao, Leona Okamura, Dajiang Zhou, S. Goto
An LDPC decoder in 65nm CMOS targeting WPAN (IEEE 802.15.3c) is presented with measurement results. A modified-PCM based message permutation strategy with compatible data flow is proposed to solve the network problem raised by high parallelism LDPC decoding. Compared to the state-of-art, decoder chip achieves 17.7%, 33.5% and 49% improvements in chip density, gate count and energy efficiency, respectively.
提出了一种针对WPAN (IEEE 802.15.3c)的65nm CMOS LDPC解码器,并给出了测量结果。针对LDPC高并行解码带来的网络问题,提出了一种基于改进pcm的兼容数据流的消息排列策略。与目前的技术水平相比,译码芯片在芯片密度、栅极数和能效方面分别提高了17.7%、33.5%和49%。
{"title":"A 6.72-Gb/s, 8pJ/bit/iteration WPAN LDPC decoder in 65nm CMOS","authors":"Zhixiang Chen, Xiao Peng, Xiongxin Zhao, Leona Okamura, Dajiang Zhou, S. Goto","doi":"10.1109/ASPDAC.2013.6509569","DOIUrl":"https://doi.org/10.1109/ASPDAC.2013.6509569","url":null,"abstract":"An LDPC decoder in 65nm CMOS targeting WPAN (IEEE 802.15.3c) is presented with measurement results. A modified-PCM based message permutation strategy with compatible data flow is proposed to solve the network problem raised by high parallelism LDPC decoding. Compared to the state-of-art, decoder chip achieves 17.7%, 33.5% and 49% improvements in chip density, gate count and energy efficiency, respectively.","PeriodicalId":297528,"journal":{"name":"2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"31 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116940000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Power optimization for application-specific 3D network-on-chip with multiple supply voltages 针对具有多个电源电压的特定应用的3D片上网络的电源优化
Pub Date : 2013-04-29 DOI: 10.1109/ASPDAC.2013.6509622
Kan Wang, Sheqin Dong
In this paper, a MSV-driven power optimization method is proposed for application-specific 3D NoC (MSV-3DNoC). A unified modeling method is presented for considering both layer assignment and voltage assignment, which achieves the best trade-off between core power and communication power. A 3D NoC synthesis is proposed to assign network components onto each layer and generate inter-layer interconnection. A global redistribution is applied to further reduce communication power. Experimental results show that compared to MSV-driven 2D NoC, the proposed method can improve total chip power greatly.
本文提出了一种面向特定应用的msv驱动的3DNoC功率优化方法(MSV-3DNoC)。提出了一种同时考虑层分配和电压分配的统一建模方法,实现了核心功率和通信功率的最佳权衡。提出了一种三维NoC合成方法,将网络组件分配到每一层,并产生层间互连。采用全局重分配,进一步降低通信功率。实验结果表明,与msv驱动的二维NoC相比,该方法可显著提高芯片总功耗。
{"title":"Power optimization for application-specific 3D network-on-chip with multiple supply voltages","authors":"Kan Wang, Sheqin Dong","doi":"10.1109/ASPDAC.2013.6509622","DOIUrl":"https://doi.org/10.1109/ASPDAC.2013.6509622","url":null,"abstract":"In this paper, a MSV-driven power optimization method is proposed for application-specific 3D NoC (MSV-3DNoC). A unified modeling method is presented for considering both layer assignment and voltage assignment, which achieves the best trade-off between core power and communication power. A 3D NoC synthesis is proposed to assign network components onto each layer and generate inter-layer interconnection. A global redistribution is applied to further reduce communication power. Experimental results show that compared to MSV-driven 2D NoC, the proposed method can improve total chip power greatly.","PeriodicalId":297528,"journal":{"name":"2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132000675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Statistical analysis of BTI in the presence of process-induced voltage and temperature variations 过程感应电压和温度变化时BTI的统计分析
Pub Date : 2013-04-29 DOI: 10.1109/ASPDAC.2013.6509663
F. Firouzi, S. Kiamehr, M. Tahoori
In nano-scale regime, there are various sources of uncertainty and unpredictability of VLSI designs such as transistor aging mainly due to Bias Temperature Instability (BTI) as well as Process-Voltage-Temperature (PVT) variations. BTI exponentially varies by temperature and the actual supply voltage seen by the transistors within the chip which are functions of leakage power. Leakage power is strongly impacted by PVT and BTI which in turn results in thermal-voltage variations. Hence, neglecting one or some of these aspects can lead to a considerable inaccuracy in the estimated BTI-induced delay degradation. However, a holistic approach to tackle all these issues and their interdependence is missing. In this paper, we develop an analytical model to predict the probability density function and covariance of temperatures and voltage droops of a die in the presence of the BTI and process variation. Based on this model, we propose a statistical method that characterizes the life-time of the circuit affected by BTI in the presence of process-induced temperature-voltage variations. We observe that for benchmark circuits, treating each aspect independently and ignoring their intrinsic interactions results in 16% over-design, translating to unnecessary yield and performance loss.
在纳米尺度下,超大规模集成电路设计存在各种不确定性和不可预测性,例如晶体管老化,主要是由于偏置温度不稳定性(BTI)以及工艺电压温度(PVT)变化。BTI随温度和芯片内晶体管看到的实际电源电压呈指数变化,这是泄漏功率的函数。泄漏功率受到PVT和BTI的强烈影响,进而导致热电压的变化。因此,忽略这些方面中的一个或某些方面可能导致估计bti引起的延迟退化相当不准确。然而,目前还缺乏一种全面的方法来解决所有这些问题及其相互依存关系。在本文中,我们建立了一个分析模型来预测存在BTI和工艺变化的模具温度和电压降的概率密度函数和协方差。基于该模型,我们提出了一种统计方法来表征存在过程引起的温度电压变化时受BTI影响的电路寿命。我们观察到,对于基准电路,独立处理每个方面并忽略其内在相互作用导致16%的过度设计,转化为不必要的产量和性能损失。
{"title":"Statistical analysis of BTI in the presence of process-induced voltage and temperature variations","authors":"F. Firouzi, S. Kiamehr, M. Tahoori","doi":"10.1109/ASPDAC.2013.6509663","DOIUrl":"https://doi.org/10.1109/ASPDAC.2013.6509663","url":null,"abstract":"In nano-scale regime, there are various sources of uncertainty and unpredictability of VLSI designs such as transistor aging mainly due to Bias Temperature Instability (BTI) as well as Process-Voltage-Temperature (PVT) variations. BTI exponentially varies by temperature and the actual supply voltage seen by the transistors within the chip which are functions of leakage power. Leakage power is strongly impacted by PVT and BTI which in turn results in thermal-voltage variations. Hence, neglecting one or some of these aspects can lead to a considerable inaccuracy in the estimated BTI-induced delay degradation. However, a holistic approach to tackle all these issues and their interdependence is missing. In this paper, we develop an analytical model to predict the probability density function and covariance of temperatures and voltage droops of a die in the presence of the BTI and process variation. Based on this model, we propose a statistical method that characterizes the life-time of the circuit affected by BTI in the presence of process-induced temperature-voltage variations. We observe that for benchmark circuits, treating each aspect independently and ignoring their intrinsic interactions results in 16% over-design, translating to unnecessary yield and performance loss.","PeriodicalId":297528,"journal":{"name":"2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134398194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
期刊
2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1