首页 > 最新文献

International Symposium on Quality Electronic Design (ISQED)最新文献

英文 中文
Input-aware statistical timing analysis-based delay test pattern generation 基于输入感知统计时序分析的延迟测试模式生成
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523651
Bao Liu, Lu Wang
Delay test pattern generation has emerged as an increasingly critical problem in high performance VLSI designs. Existing techniques find timing critical paths by STA or SSTA, apply a traditional ATPG algorithm subsequently and find the test patterns. In this paper, we propose a new delay test pattern generation method, which finds timing critical paths by more accurate input-aware statistical timing analysis, achieves input patterns by back-tracing, and verifies the estimated timing critical paths under the input patterns by logic simulation. Our experimental results based on 9 ISCAS'89 benchmark circuits show that the state-of-the-art SSTA-TQM-BnB technique achieves an average of 57.83%, 54.50%, and 69.91% delay fault coverage, while our SPSTA-DTPG technique achieves an average of 67.83%, 71.39%, and 77.53% delay fault coverage for a test size of 50, 100, and 200, respectively.
延迟测试模式的生成已成为高性能VLSI设计中日益重要的问题。现有的技术通过STA或SSTA找到定时关键路径,然后应用传统的ATPG算法找到测试模式。本文提出了一种新的延迟测试模式生成方法,该方法通过更精确的输入感知统计时序分析找到时序关键路径,通过反向跟踪获得输入模式,并通过逻辑仿真验证输入模式下估计的时序关键路径。基于9个ISCAS’89基准电路的实验结果表明,在测试规模为50、100和200的情况下,最先进的SSTA-TQM-BnB技术的延迟故障覆盖率平均为57.83%、54.50%和69.91%,而SPSTA-DTPG技术的延迟故障覆盖率平均为67.83%、71.39%和77.53%。
{"title":"Input-aware statistical timing analysis-based delay test pattern generation","authors":"Bao Liu, Lu Wang","doi":"10.1109/ISQED.2013.6523651","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523651","url":null,"abstract":"Delay test pattern generation has emerged as an increasingly critical problem in high performance VLSI designs. Existing techniques find timing critical paths by STA or SSTA, apply a traditional ATPG algorithm subsequently and find the test patterns. In this paper, we propose a new delay test pattern generation method, which finds timing critical paths by more accurate input-aware statistical timing analysis, achieves input patterns by back-tracing, and verifies the estimated timing critical paths under the input patterns by logic simulation. Our experimental results based on 9 ISCAS'89 benchmark circuits show that the state-of-the-art SSTA-TQM-BnB technique achieves an average of 57.83%, 54.50%, and 69.91% delay fault coverage, while our SPSTA-DTPG technique achieves an average of 67.83%, 71.39%, and 77.53% delay fault coverage for a test size of 50, 100, and 200, respectively.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133079686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Performance and cache access time of SRAM-eDRAM hybrid caches considering wire delay 考虑线延迟的SRAM-eDRAM混合缓存的性能和缓存访问时间
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523661
Young-Ho Gong, H. Jang, S. Chung
Most modern microprocessors have multi-level on-chip caches with multi-megabyte shared last-level cache (LLC). By using multi-level cache hierarchy, the whole size of on-chip caches becomes larger. The increased cache size causes the leakage power and area of the on-chip caches to increase. Recently, to reduce the leakage power and area of the SRAM based cache, the SRAM-eDRAM hybrid cache was proposed. For SRAM-eDRAM hybrid caches, however, there has not been any study to analyze the effects of the reduced area on wire delay, cache access time, and performance. By replacing half (or three fourth) of SRAM cells by small eDRAM cells for the SRAM-eDRAM hybrid caches, wire length is shortened, which eventually results in the reduction of wire delay and cache access time. In this paper, we evaluate the SRAM-eDRAM hybrid caches in terms of the energy, area, wire delay, access time, and performance. We show that the SRAM-eDRAM hybrid cache reduces the energy consumption, area, wire delay, and SRAM array access time by up to 53.9%, 49.9%, 50.4%, and 38.7%, respectively, compared to the SRAM based cache.
大多数现代微处理器都具有多兆字节的共享最后一级缓存(LLC)的多级片上缓存。通过使用多级缓存层次结构,片上缓存的整体大小变得更大。缓存大小的增加导致片上缓存的泄漏功率和面积增加。近年来,为了降低SRAM高速缓存的泄漏功率和泄漏面积,提出了SRAM- edram混合高速缓存。然而,对于SRAM-eDRAM混合缓存,还没有任何研究分析减小的面积对线延迟、缓存访问时间和性能的影响。通过将SRAM-eDRAM混合缓存的一半(或四分之三)SRAM单元替换为小型eDRAM单元,缩短了线长度,最终减少了线延迟和缓存访问时间。在本文中,我们从能量、面积、线延迟、存取时间和性能等方面评估了SRAM-eDRAM混合高速缓存。研究表明,与基于SRAM的高速缓存相比,SRAM- edram混合高速缓存的能耗、面积、线延迟和SRAM阵列访问时间分别降低了53.9%、49.9%、50.4%和38.7%。
{"title":"Performance and cache access time of SRAM-eDRAM hybrid caches considering wire delay","authors":"Young-Ho Gong, H. Jang, S. Chung","doi":"10.1109/ISQED.2013.6523661","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523661","url":null,"abstract":"Most modern microprocessors have multi-level on-chip caches with multi-megabyte shared last-level cache (LLC). By using multi-level cache hierarchy, the whole size of on-chip caches becomes larger. The increased cache size causes the leakage power and area of the on-chip caches to increase. Recently, to reduce the leakage power and area of the SRAM based cache, the SRAM-eDRAM hybrid cache was proposed. For SRAM-eDRAM hybrid caches, however, there has not been any study to analyze the effects of the reduced area on wire delay, cache access time, and performance. By replacing half (or three fourth) of SRAM cells by small eDRAM cells for the SRAM-eDRAM hybrid caches, wire length is shortened, which eventually results in the reduction of wire delay and cache access time. In this paper, we evaluate the SRAM-eDRAM hybrid caches in terms of the energy, area, wire delay, access time, and performance. We show that the SRAM-eDRAM hybrid cache reduces the energy consumption, area, wire delay, and SRAM array access time by up to 53.9%, 49.9%, 50.4%, and 38.7%, respectively, compared to the SRAM based cache.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133890918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Reliable Express-Virtual-Channel-based network-on-chip under the impact of technology scaling 技术规模化影响下基于可靠快速虚拟通道的片上网络
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523690
Xin Fu, Tao Li, J. Fortes
Packet-switched on-chip interconnection networks are emerging as pervasive communication fabrics to connect different processing elements in multi/many-core chips. As a preferred NoC flow-control mechanism, Express Virtual Channel (EVC) allows packets to virtually bypass intermediate nodes to minimize communication delay. Technology scaling results in process variation and Negative Biased Temperature Instability (NBTI) which can significantly affect the reliability and lifetime of NoC fabricated using nano-meter transistors. In this paper, we propose a technique that significantly improves the reliability of EVC-based NoCs by reducing the simultaneous impact of process variation and NBTI. Our evaluation results using a detailed cycle-accurate simulator on a wide range of synthetic traffics and parallel benchmark traces show up to 75.5% guardband improvement over the conventional EVC-based NoCs.
片上分组交换互连网络作为一种普及的通信结构正在兴起,用于连接多核/多核芯片中的不同处理元件。快速虚拟通道(Express Virtual Channel, EVC)是一种首选的NoC流量控制机制,它允许数据包虚拟地绕过中间节点,以最大限度地减少通信延迟。工艺缩放导致的工艺变化和负偏温不稳定性(NBTI)会显著影响纳米晶体管NoC的可靠性和寿命。在本文中,我们提出了一种技术,通过减少工艺变化和NBTI的同时影响,显著提高了基于evc的noc的可靠性。我们使用详细的周期精确模拟器对广泛的合成流量和并行基准轨迹进行评估,结果显示,与传统的基于evc的noc相比,其防护带提高了75.5%。
{"title":"Reliable Express-Virtual-Channel-based network-on-chip under the impact of technology scaling","authors":"Xin Fu, Tao Li, J. Fortes","doi":"10.1109/ISQED.2013.6523690","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523690","url":null,"abstract":"Packet-switched on-chip interconnection networks are emerging as pervasive communication fabrics to connect different processing elements in multi/many-core chips. As a preferred NoC flow-control mechanism, Express Virtual Channel (EVC) allows packets to virtually bypass intermediate nodes to minimize communication delay. Technology scaling results in process variation and Negative Biased Temperature Instability (NBTI) which can significantly affect the reliability and lifetime of NoC fabricated using nano-meter transistors. In this paper, we propose a technique that significantly improves the reliability of EVC-based NoCs by reducing the simultaneous impact of process variation and NBTI. Our evaluation results using a detailed cycle-accurate simulator on a wide range of synthetic traffics and parallel benchmark traces show up to 75.5% guardband improvement over the conventional EVC-based NoCs.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128798730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Sustainable dual-level DVFS-enabled NoC with on-chip wireless links 具有片上无线链路的可持续双电平dvfs NoC
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523601
Jacob Murray, Rajath Hegde, Teng Lu, P. Pande, B. Shirazi
Wireless Network-on-Chip (WiNoC) has emerged as an enabling technology to design low power and high bandwidth massive multi-core chips. The performance advantages mainly stem from using the wireless links as long-range shortcuts between far apart cores. This performance gain can be enhanced further if the characteristics of the wireline links and the processing cores of the WiNoC are optimized according to the traffic patterns and workloads. In this work, we demonstrate that by incorporating both processor- and network-level dynamic voltage and frequency scaling (DVFS) in a WiNoC, the power and thermal profiles can be enhanced without a significant impact on the overall execution time. We also show that depending on the benchmark applications, temperature hotspots can be formed either in the processing core or in the network infrastructure. The proposed dual-level DVFS is capable of addressing both.
无线片上网络(WiNoC)已成为设计低功耗、高带宽、大规模多核芯片的一种技术。性能优势主要来自于使用无线链路作为相隔很远的核心之间的远程捷径。如果根据流量模式和工作负载优化有线链路的特性和WiNoC的处理核心,则可以进一步增强这种性能增益。在这项工作中,我们证明了通过在WiNoC中结合处理器级和网络级动态电压和频率缩放(DVFS),可以增强功率和热分布,而不会对总体执行时间产生重大影响。我们还表明,根据基准测试应用程序,温度热点可以在处理核心或网络基础设施中形成。所提出的双电平DVFS能够解决这两个问题。
{"title":"Sustainable dual-level DVFS-enabled NoC with on-chip wireless links","authors":"Jacob Murray, Rajath Hegde, Teng Lu, P. Pande, B. Shirazi","doi":"10.1109/ISQED.2013.6523601","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523601","url":null,"abstract":"Wireless Network-on-Chip (WiNoC) has emerged as an enabling technology to design low power and high bandwidth massive multi-core chips. The performance advantages mainly stem from using the wireless links as long-range shortcuts between far apart cores. This performance gain can be enhanced further if the characteristics of the wireline links and the processing cores of the WiNoC are optimized according to the traffic patterns and workloads. In this work, we demonstrate that by incorporating both processor- and network-level dynamic voltage and frequency scaling (DVFS) in a WiNoC, the power and thermal profiles can be enhanced without a significant impact on the overall execution time. We also show that depending on the benchmark applications, temperature hotspots can be formed either in the processing core or in the network infrastructure. The proposed dual-level DVFS is capable of addressing both.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"464 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125824717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Electrical and thermal analysis for design exchange formats in three dimensional integrated circuits 三维集成电路中设计交换格式的电气和热分析
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523627
R. Bazaz, Jianyong Xie, M. Swaminathan
Three dimensional integrated circuits (3D ICs) built with through-silicon vias (TSVs) have smaller planar dimensions, shorter wire length, and better performance than 2D ICs. Many 3D stacks will combine digital and analog/RF circuitry, requiring a strong analog/mixed-signal capability. Because of the unique packaging requirements of stacked die, an IC/package co-design capability is required. Hence there must be some standards to facilitate smooth and effective design of 3D ICs. Thus the main purpose of an exchange format (EF) between dies is to permit sharing of information necessary for design by external parties without disclosing their intellectual property (IP). The requirements of the standards should be the minimum necessary to produce satisfactory answers. Producing such models is a customer support function, not a core engineering function, i.e. it is not generated by the chip design scheme. The role of the standards is to facilitate the transfer of information through a compact model, not necessarily to build one. This paper presents initial efforts in designing such standards or EF. Steady state electrical and thermal simulations are performed in this paper to demonstrate the necessary information that needs to be exchanged between the dies to ensure adequate co-design.
采用硅通孔(tsv)构建的三维集成电路(3D ic)具有比二维集成电路更小的平面尺寸、更短的导线长度和更好的性能。许多3D堆栈将结合数字和模拟/RF电路,需要强大的模拟/混合信号能力。由于堆叠式封装的独特封装要求,需要集成电路/封装协同设计能力。因此,必须有一些标准来促进3D集成电路的顺利和有效的设计。因此,模具之间交换格式(EF)的主要目的是允许外部各方在不泄露其知识产权(IP)的情况下共享设计所需的信息。标准的要求应该是产生满意答案的最低要求。生成这样的模型是客户支持功能,而不是核心工程功能,即它不是由芯片设计方案生成的。标准的作用是通过一个紧凑的模型来促进信息的传递,而不是建立一个模型。本文介绍了设计这种EF标准的初步努力。本文进行了稳态电和热模拟,以演示在模具之间需要交换的必要信息,以确保充分的协同设计。
{"title":"Electrical and thermal analysis for design exchange formats in three dimensional integrated circuits","authors":"R. Bazaz, Jianyong Xie, M. Swaminathan","doi":"10.1109/ISQED.2013.6523627","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523627","url":null,"abstract":"Three dimensional integrated circuits (3D ICs) built with through-silicon vias (TSVs) have smaller planar dimensions, shorter wire length, and better performance than 2D ICs. Many 3D stacks will combine digital and analog/RF circuitry, requiring a strong analog/mixed-signal capability. Because of the unique packaging requirements of stacked die, an IC/package co-design capability is required. Hence there must be some standards to facilitate smooth and effective design of 3D ICs. Thus the main purpose of an exchange format (EF) between dies is to permit sharing of information necessary for design by external parties without disclosing their intellectual property (IP). The requirements of the standards should be the minimum necessary to produce satisfactory answers. Producing such models is a customer support function, not a core engineering function, i.e. it is not generated by the chip design scheme. The role of the standards is to facilitate the transfer of information through a compact model, not necessarily to build one. This paper presents initial efforts in designing such standards or EF. Steady state electrical and thermal simulations are performed in this paper to demonstrate the necessary information that needs to be exchanged between the dies to ensure adequate co-design.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126685408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Impact of process parameter and supply voltage fluctuations on multi-threshold-voltage seven-transistor static memory cells 工艺参数和电源电压波动对多阈值电压七晶体管静态存储单元的影响
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523650
Hong Zhu, V. Kursun
Conventional Static Random Access Memory (SRAM) cells suffer from an intrinsic data instability problem due to directly-accessed data storage nodes during a read operation. Noise margins of memory cells further shrink with increasing variability and decreasing supply voltage in scaled CMOS technologies. A selected set of novel seven-transistor (7T) and conventional six-transistor (6T) multi-threshold-voltage memory circuits are characterized for data stability, write margin, and idle mode leakage currents with an equal area constraint under parameter variations in this paper. The mean of the statistical read static noise margin distribution is enhanced by up to 2.4X and the mean of the statistical array leakage power consumption distribution is reduced by up to 82% with the triple-threshold-voltage 7T SRAM cells as compared to the traditional 6T SRAM cells in a UMC 80nm CMOS technology.
传统的静态随机存取存储器(SRAM)单元由于在读取操作中直接访问数据存储节点而存在固有的数据不稳定性问题。在微缩型CMOS技术中,随着可变性的增加和电源电压的降低,存储单元的噪声裕度进一步缩小。本文选择了一组新颖的七晶体管(7T)和传统的六晶体管(6T)多阈值电压存储电路,对其在参数变化下的数据稳定性、写入裕度和等面积约束下的空闲模式漏电流进行了表征。与UMC 80nm CMOS技术中的传统6T SRAM单元相比,采用三阈值电压7T SRAM单元的统计读取静态噪声边际分布均值提高了2.4倍,统计阵列泄漏功耗分布均值降低了82%。
{"title":"Impact of process parameter and supply voltage fluctuations on multi-threshold-voltage seven-transistor static memory cells","authors":"Hong Zhu, V. Kursun","doi":"10.1109/ISQED.2013.6523650","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523650","url":null,"abstract":"Conventional Static Random Access Memory (SRAM) cells suffer from an intrinsic data instability problem due to directly-accessed data storage nodes during a read operation. Noise margins of memory cells further shrink with increasing variability and decreasing supply voltage in scaled CMOS technologies. A selected set of novel seven-transistor (7T) and conventional six-transistor (6T) multi-threshold-voltage memory circuits are characterized for data stability, write margin, and idle mode leakage currents with an equal area constraint under parameter variations in this paper. The mean of the statistical read static noise margin distribution is enhanced by up to 2.4X and the mean of the statistical array leakage power consumption distribution is reduced by up to 82% with the triple-threshold-voltage 7T SRAM cells as compared to the traditional 6T SRAM cells in a UMC 80nm CMOS technology.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121792761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Relocatable and resizable SRAM synthesis for via configurable structured ASIC 通过可配置的结构化ASIC合成可重新定位和可调整大小的SRAM
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523657
Hsin-Hung Liu, Rung-Bin Lin, I. Tseng
Memory blocks in a structured ASIC are normally pre-customized with fixed sizes and placed at predefined locations. The number of memory blocks is also pre-determined. This imposes a stringent limitation on the use of memory blocks, often creating a situation of either insufficient capacity or considerable waste. To remove this limitation, in this paper we propose a method to create relocatable and resizable SRAM blocks using the same via-configurable logic block to implement both logic gates and 6T SRAM cells. We develop an SRAM compiler to synthesize SRAM blocks of this sort. Our single-port SRAM array uses only 1/3 the area taken by a flip-flop based SRAM array. For dual-port SRAM arrays, this ratio is 2/3. We demonstrate first time the feasibility of deploying a varying number of relocatable and resizable SRAM blocks on a structured ASIC.
结构化ASIC中的内存块通常是预先定制的,具有固定的大小并放置在预定义的位置。内存块的数量也是预先确定的。这对内存块的使用施加了严格的限制,经常造成容量不足或大量浪费的情况。为了消除这一限制,在本文中,我们提出了一种方法来创建可重新定位和可调整大小的SRAM块,使用相同的可配置逻辑块来实现逻辑门和6T SRAM单元。我们开发了一个SRAM编译器来合成这种类型的SRAM块。我们的单端口SRAM阵列使用的面积仅为基于触发器的SRAM阵列的1/3。对于双端口SRAM阵列,该比率为2/3。我们首次展示了在结构化ASIC上部署不同数量的可重新定位和可调整大小的SRAM块的可行性。
{"title":"Relocatable and resizable SRAM synthesis for via configurable structured ASIC","authors":"Hsin-Hung Liu, Rung-Bin Lin, I. Tseng","doi":"10.1109/ISQED.2013.6523657","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523657","url":null,"abstract":"Memory blocks in a structured ASIC are normally pre-customized with fixed sizes and placed at predefined locations. The number of memory blocks is also pre-determined. This imposes a stringent limitation on the use of memory blocks, often creating a situation of either insufficient capacity or considerable waste. To remove this limitation, in this paper we propose a method to create relocatable and resizable SRAM blocks using the same via-configurable logic block to implement both logic gates and 6T SRAM cells. We develop an SRAM compiler to synthesize SRAM blocks of this sort. Our single-port SRAM array uses only 1/3 the area taken by a flip-flop based SRAM array. For dual-port SRAM arrays, this ratio is 2/3. We demonstrate first time the feasibility of deploying a varying number of relocatable and resizable SRAM blocks on a structured ASIC.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123348899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Low-energy digital filter design based on controlled timing error acceptance 基于可控时序误差接受的低功耗数字滤波器设计
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523603
Ku He, A. Gerstlauer, M. Orshansky
In signal processing applications, large energy gains can be obtained by accepting some degradation in the output signal quality. Filters are at the core of many such systems. In this paper, we demonstrate the potential of a new paradigm for achieving favorable quality-energy trade-offs in digital filter design that is based on directly accepting timing errors in the datapath under aggressively scaled VDD. In an unmodified design, such scaling leads to rapid onset of timing errors and, consequently, quality loss. In a modified filter implementation, the onset of large errors is delayed, permitting significant energy reduction while maintaining high quality. Specifically, the innovations in the design include techniques for: 1) run-time adjustment of datapath bitwidth, and 2) design-time reordering of filter taps. We tested the new design strategy on several audio and image processing applications. The designs were synthesized using a 45nm standard cell library. Results of SPICE simulations on the entire designs show that up to 70% energy savings can be achieved while maintaining excellent perceived signal-to-noise ratios (SNRs). Compared to a traditional filter design, the area overhead of our architecture is about 2%.
在信号处理应用中,可以通过接受输出信号质量的一些退化来获得较大的能量增益。过滤器是许多这类系统的核心。在本文中,我们展示了在数字滤波器设计中实现有利的质量-能量权衡的新范例的潜力,该范例基于在积极缩放VDD下直接接受数据路径中的时序误差。在未经修改的设计中,这种缩放导致计时误差的快速发生,从而导致质量损失。在改进的滤波器实现中,大错误的开始被延迟,允许在保持高质量的同时显著降低能量。具体来说,设计中的创新包括以下技术:1)运行时数据路径位宽的调整,以及2)设计时滤波器分岔的重新排序。我们在几个音频和图像处理应用程序中测试了新的设计策略。设计采用45nm标准细胞库合成。SPICE对整个设计的模拟结果表明,在保持良好的感知信噪比(SNRs)的同时,可以实现高达70%的节能。与传统的过滤器设计相比,我们的架构的面积开销约为2%。
{"title":"Low-energy digital filter design based on controlled timing error acceptance","authors":"Ku He, A. Gerstlauer, M. Orshansky","doi":"10.1109/ISQED.2013.6523603","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523603","url":null,"abstract":"In signal processing applications, large energy gains can be obtained by accepting some degradation in the output signal quality. Filters are at the core of many such systems. In this paper, we demonstrate the potential of a new paradigm for achieving favorable quality-energy trade-offs in digital filter design that is based on directly accepting timing errors in the datapath under aggressively scaled VDD. In an unmodified design, such scaling leads to rapid onset of timing errors and, consequently, quality loss. In a modified filter implementation, the onset of large errors is delayed, permitting significant energy reduction while maintaining high quality. Specifically, the innovations in the design include techniques for: 1) run-time adjustment of datapath bitwidth, and 2) design-time reordering of filter taps. We tested the new design strategy on several audio and image processing applications. The designs were synthesized using a 45nm standard cell library. Results of SPICE simulations on the entire designs show that up to 70% energy savings can be achieved while maintaining excellent perceived signal-to-noise ratios (SNRs). Compared to a traditional filter design, the area overhead of our architecture is about 2%.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116730179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis, modeling and silicon correlation of low-voltage flop data retention in 28nm process technology 28nm制程技术中低压触发器数据保留的分析、建模和硅相关
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523669
A. Datta, M. Abu-Rahma, S. Dasnurkar, Hadi Rasouli, Sean Tamjidi, M. Cai, S. Sengupta, P. Chidambaram, Raghavan Thirumala, Nikhil Kulkarni, Prasanna Seeram, Prasad Bhadri, P. Patel, S. Yoon, E. Terzioglu
Mobile devices spend most of the time in standby mode. Supported features and functionalities are increasing in each newer model. With the wide spread adaptation of multi-tasking in mobile devices, retaining current status and data for all active tasks is critical for user satisfaction. Extending battery life in portable mobile devices necessitates the use of minimum possible energy in standby mode while retaining present states for all active tasks. This paper for the first time, explains the low-voltage data-retention failure mechanism in ops. It analyzes the impact of design and process parameters on the data-retention failure. Statistical nature of data retention failure is established and validated with extensive Monte-Carlo simulations across various process corners. Finally, silicon measurement from several 28nm industrial mobile chips is presented showing good correlation of retention failure prediction from simulation.
移动设备大部分时间处于待机状态。支持的特性和功能在每个新模型中都在增加。随着多任务在移动设备上的广泛应用,保留所有活动任务的当前状态和数据对于用户满意度至关重要。延长便携式移动设备的电池寿命需要在待机模式下使用尽可能少的能量,同时保持所有活动任务的当前状态。本文首次解释了操作系统中低压数据保留失效的机理。分析了设计参数和工艺参数对数据保留失效的影响。数据保留失败的统计性质是建立和验证广泛的蒙特卡罗模拟跨各个过程的角落。最后,给出了几种28nm工业移动芯片的硅测量结果,显示了从模拟中预测保留失效的良好相关性。
{"title":"Analysis, modeling and silicon correlation of low-voltage flop data retention in 28nm process technology","authors":"A. Datta, M. Abu-Rahma, S. Dasnurkar, Hadi Rasouli, Sean Tamjidi, M. Cai, S. Sengupta, P. Chidambaram, Raghavan Thirumala, Nikhil Kulkarni, Prasanna Seeram, Prasad Bhadri, P. Patel, S. Yoon, E. Terzioglu","doi":"10.1109/ISQED.2013.6523669","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523669","url":null,"abstract":"Mobile devices spend most of the time in standby mode. Supported features and functionalities are increasing in each newer model. With the wide spread adaptation of multi-tasking in mobile devices, retaining current status and data for all active tasks is critical for user satisfaction. Extending battery life in portable mobile devices necessitates the use of minimum possible energy in standby mode while retaining present states for all active tasks. This paper for the first time, explains the low-voltage data-retention failure mechanism in ops. It analyzes the impact of design and process parameters on the data-retention failure. Statistical nature of data retention failure is established and validated with extensive Monte-Carlo simulations across various process corners. Finally, silicon measurement from several 28nm industrial mobile chips is presented showing good correlation of retention failure prediction from simulation.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"496 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133273352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Temperature aware thread migration in 3D architecture with stacked DRAM 基于堆叠DRAM的3D架构温度感知线程迁移
Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523594
Dali Zhao, H. Homayoun, A. Veidenbaum
A 3D architecture with DRAM memory stacked on a multi-core processor has many benefits for the embedded system. Compared with a conventional 2D design, it reduces memory access latency, increases memory bandwidth and reduces energy consumption. However it poses a thermal challenge as the heat generated by the processor cannot dissipate efficiently through the DRAM memory layer. Due to the fact that DRAM is very sensitive to high temperature as well as temperature variance, 3D stacking causes more failures to occur because DRAM thermal variance is higher than the conventional 2D architecture. To address this thermal challenge we propose to reduce temperature variance and peak temperature of a 3D multi-core processor and stacked DRAM by thermally aware thread migration among processor cores. This method has very limited impact on processor performance. Using migration-based policy we reduce peak steady-state temperature in the processor by up to 8.3 degrees Celsius, with the average of 4.7 degrees.
将DRAM内存堆叠在多核处理器上的3D架构对嵌入式系统有许多好处。与传统的2D设计相比,它减少了存储器访问延迟,增加了存储器带宽并降低了能耗。然而,由于处理器产生的热量不能通过DRAM存储层有效地消散,因此它提出了一个热挑战。由于DRAM对高温和温度变化非常敏感,3D堆叠会导致更多的故障发生,因为DRAM的热变化高于传统的2D架构。为了解决这一热挑战,我们提出通过在处理器内核之间热感知线程迁移来减少3D多核处理器和堆叠DRAM的温度差异和峰值温度。这种方法对处理器性能的影响非常有限。使用基于迁移的策略,我们将处理器的峰值稳态温度降低了8.3摄氏度,平均降低了4.7摄氏度。
{"title":"Temperature aware thread migration in 3D architecture with stacked DRAM","authors":"Dali Zhao, H. Homayoun, A. Veidenbaum","doi":"10.1109/ISQED.2013.6523594","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523594","url":null,"abstract":"A 3D architecture with DRAM memory stacked on a multi-core processor has many benefits for the embedded system. Compared with a conventional 2D design, it reduces memory access latency, increases memory bandwidth and reduces energy consumption. However it poses a thermal challenge as the heat generated by the processor cannot dissipate efficiently through the DRAM memory layer. Due to the fact that DRAM is very sensitive to high temperature as well as temperature variance, 3D stacking causes more failures to occur because DRAM thermal variance is higher than the conventional 2D architecture. To address this thermal challenge we propose to reduce temperature variance and peak temperature of a 3D multi-core processor and stacked DRAM by thermally aware thread migration among processor cores. This method has very limited impact on processor performance. Using migration-based policy we reduce peak steady-state temperature in the processor by up to 8.3 degrees Celsius, with the average of 4.7 degrees.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123796994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
期刊
International Symposium on Quality Electronic Design (ISQED)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1