International Symposium on Quality Electronic Design (ISQED)最新文献

英文中文

Thermal-aware semi-dynamic power management for multicore systems with energy harvesting 具有能量收集的多核系统的热感知半动态电源管理

International Symposium on Quality Electronic Design (ISQED)

Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523675

Yi Xiang, S. Pasricha

In this paper, we focus on power and thermal management for multicore embedded systems with solar energy harvesting as the power source and a periodic hard real-time task set as the workload. We design a novel semi-dynamic scheme, which reschedules tasks at the beginning of specified time epochs. By rejecting job instances of certain tasks until the next rescheduling point, our scheduler dispatches a subset of tasks that comply with the predicted energy budget and thermal conditions. Our approach reacts to run-time energy harvesting power variation without losing the consistency of the periodic task set, which helps to scale processor speed evenly by utilizing slack time efficiently without the need for complex slack reclamation algorithms, as in prior work. When applied to a multicore platform, our approach offers a chance to shut down cores and reassign tasks for superior energy efficiency. As a result, experimental results show up to 70% miss rate reduction compared to prior work. Unlike any prior work, our approach also integrates thermal management to reduce peak temperature while minimizing miss rate for energy harvesting embedded systems.

本文主要研究以太阳能收集为电源，以定期硬实时任务集为工作负载的多核嵌入式系统的电源和热管理。我们设计了一种新颖的半动态调度方案，在指定的时间点开始重新调度任务。通过拒绝某些任务的作业实例直到下一个重调度点，我们的调度器调度符合预测的能量预算和热条件的任务子集。我们的方法对运行时能量收集功率的变化做出反应，而不会失去周期性任务集的一致性，这有助于通过有效地利用空闲时间来均匀地扩展处理器速度，而不需要像之前的工作那样使用复杂的空闲回收算法。当应用于多核平台时，我们的方法提供了关闭核心并重新分配任务的机会，以获得更高的能源效率。实验结果表明，与之前的工作相比，脱靶率降低了70%。与以往的工作不同，我们的方法还集成了热管理，以降低峰值温度，同时最大限度地减少能量收集嵌入式系统的失误率。

{"title":"Thermal-aware semi-dynamic power management for multicore systems with energy harvesting","authors":"Yi Xiang, S. Pasricha","doi":"10.1109/ISQED.2013.6523675","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523675","url":null,"abstract":"In this paper, we focus on power and thermal management for multicore embedded systems with solar energy harvesting as the power source and a periodic hard real-time task set as the workload. We design a novel semi-dynamic scheme, which reschedules tasks at the beginning of specified time epochs. By rejecting job instances of certain tasks until the next rescheduling point, our scheduler dispatches a subset of tasks that comply with the predicted energy budget and thermal conditions. Our approach reacts to run-time energy harvesting power variation without losing the consistency of the periodic task set, which helps to scale processor speed evenly by utilizing slack time efficiently without the need for complex slack reclamation algorithms, as in prior work. When applied to a multicore platform, our approach offers a chance to shut down cores and reassign tasks for superior energy efficiency. As a result, experimental results show up to 70% miss rate reduction compared to prior work. Unlike any prior work, our approach also integrates thermal management to reduce peak temperature while minimizing miss rate for energy harvesting embedded systems.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129377917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Impact of conventional and emerging interconnects on the circuit performance of various post-CMOS devices 传统互连和新兴互连对各种后cmos器件电路性能的影响

International Symposium on Quality Electronic Design (ISQED)

Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523611

A. Ceyhan, A. Naeemi

The trade-offs between the technology parameters of various interconnect technologies are investigated on the basis of their impacts on the circuit performances of emerging post-CMOS devices. In this paper, carbon nanotube field-effect transistor (CNFET), nanowire-based gate-all-around (GAA) tunneling field-effect transistor (TFET), FinFET and sub-threshold CMOS circuits are studied. Each of these devices are paired with the conventional Cu/low-k interconnect, single-wall carbon nanotube (SWNT) interconnect manufactured in horizontal bundles or in a single layer, and multi-layer graphene nanoribbon (GNR) interconnect. The relative performances of all these interconnect technologies with each type of device are evaluated. The interconnect technology option that gives the best performance in terms of circuit delay, energy-per-bit and energy-delay product (EDP) is reported for each of the device technologies.

基于对新兴后cmos器件电路性能的影响，研究了各种互连技术参数之间的权衡。本文研究了碳纳米管场效应晶体管(CNFET)、基于纳米线的栅极全能(GAA)隧道场效应晶体管(TFET)、FinFET和亚阈值CMOS电路。每种器件都与传统的Cu/低k互连、水平束或单层制造的单壁碳纳米管(SWNT)互连和多层石墨烯纳米带(GNR)互连配对。评估了所有这些互连技术与每种类型器件的相对性能。对于每种器件技术，报告了在电路延迟、每比特能量和能量延迟产品(EDP)方面提供最佳性能的互连技术选项。

引用次数: 11

System-level optimization and benchmarking for InAs nanowire based gate-all-around tunneling FETs 基于InAs纳米线的栅极全隧穿场效应管的系统级优化与基准测试

International Symposium on Quality Electronic Design (ISQED)

Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523610

C. Pan, A. Ceyhan, A. Naeemi

The ON/OFF current and input capacitance of InAs nanowire based gate-all-around (GAA) tunnel FETs are modeled. Based on the device- and system-level models, optimization has been done and comparison has been made between TFETs and CMOS devices under different constraints for both single- and multi-core processors. Several performance metrics have been analyzed, which shows that optimal numbers of cores, power density and die size area exist for maximizing various design targets.

建立了基于InAs纳米线的栅极全能(GAA)隧道场效应管的通/关电流和输入电容模型。基于器件级和系统级模型，对tfet和CMOS器件在不同约束条件下的单核和多核处理器性能进行了优化和比较。分析了几种性能指标，表明存在最优的核数、功率密度和芯片尺寸面积，以最大化各种设计目标。

引用次数: 3

A co-synthesis methodology for power delivery and data interconnection networks in 3D ICs 三维集成电路中电力传输和数据互连网络的协同综合方法

International Symposium on Quality Electronic Design (ISQED)

Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523593

N. Kapadia, S. Pasricha

A stable voltage supply is critical for multiprocessor system-on-chips (MPSoCs) to operate at near-optimal performance levels. The problem of IR drops in a Power Delivery Network (PDN) is very severe in 3D MPSoCs with network-on-chip (NoC) fabrics where the current in the PDN increases proportionally with the number of device layers. At the same time, with the increasing core counts in today's power-hungry MPSoCs, the already hard problem of voltage island-aware Network-on-Chip (NoC) design has become even more challenging. Even though the PDN and NoC design goals are non-overlapping, both the optimizations are interdependent. Unfortunately, designers today seldom consider design of the PDN while synthesizing NoCs. In this work, for the first time, we propose a novel system-level co-synthesis methodology that minimizes 3D NoC energy while meeting performance goals; and simultaneously optimizes the 3D PDN design while satisfying IR-drop constraints. Our experimental results show that the proposed co-synthesis methodology meets IR-drop constraints while minimizing energy consumption for several real-world applications, improving upon results from traditional system-level methodologies that perform PDN design and NoC synthesis separately.

稳定的电压供应对于多处理器片上系统(mpsoc)在接近最佳性能水平下工作至关重要。在采用片上网络(NoC)结构的3D mpsoc中，功率传输网络(PDN)中的IR下降问题非常严重，其中PDN中的电流随着器件层数的增加而成比例地增加。与此同时，随着当今耗电的mpsoc中核心数量的增加，电压岛感知片上网络(NoC)设计的难题变得更加具有挑战性。尽管PDN和NoC的设计目标不重叠，但这两种优化是相互依赖的。不幸的是，今天的设计人员在合成noc时很少考虑PDN的设计。在这项工作中，我们首次提出了一种新的系统级协同合成方法，可以在满足性能目标的同时最大限度地减少3D NoC能量;并在满足IR-drop约束的同时优化了3D PDN设计。我们的实验结果表明，所提出的协同合成方法满足ir下降约束，同时最大限度地减少了几种实际应用的能耗，改进了分别执行PDN设计和NoC合成的传统系统级方法的结果。

{"title":"A co-synthesis methodology for power delivery and data interconnection networks in 3D ICs","authors":"N. Kapadia, S. Pasricha","doi":"10.1109/ISQED.2013.6523593","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523593","url":null,"abstract":"A stable voltage supply is critical for multiprocessor system-on-chips (MPSoCs) to operate at near-optimal performance levels. The problem of IR drops in a Power Delivery Network (PDN) is very severe in 3D MPSoCs with network-on-chip (NoC) fabrics where the current in the PDN increases proportionally with the number of device layers. At the same time, with the increasing core counts in today's power-hungry MPSoCs, the already hard problem of voltage island-aware Network-on-Chip (NoC) design has become even more challenging. Even though the PDN and NoC design goals are non-overlapping, both the optimizations are interdependent. Unfortunately, designers today seldom consider design of the PDN while synthesizing NoCs. In this work, for the first time, we propose a novel system-level co-synthesis methodology that minimizes 3D NoC energy while meeting performance goals; and simultaneously optimizes the 3D PDN design while satisfying IR-drop constraints. Our experimental results show that the proposed co-synthesis methodology meets IR-drop constraints while minimizing energy consumption for several real-world applications, improving upon results from traditional system-level methodologies that perform PDN design and NoC synthesis separately.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126321375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

A versatile rail to rail current mode instrumentation amplifier with an embedded band-pass filter for bio-potential signal conditioning 一种多用途轨对轨电流模式仪表放大器，带有嵌入式带通滤波器，用于生物电位信号调理

International Symposium on Quality Electronic Design (ISQED)

Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523685

A. Anvesha, M. Baghini

A novel ultra low power, area-efficient, current-mode instrumentation amplifier (CMIA) with embedded bandpass filter, for acquisition of bio-potential signals is presented. A novel bias front-end is also presented, which achieves CMRR of 195dB at 1Hz in the presence of ±5% component mismatch. The CMIA has a tunable bandwidth, from 160Hz to 7.2kHz with input referred rms noise of 2.24 μV in frequency band of 5mHz to 160Hz. The CMIA is designed in 180nm mixed-mode CMOS technology and provides rail to rail output voltage of 1.65V(p-p) while dissipating 39 μW, at 1.8V supply voltage. The output of CMIA doesn't need any further signal conditioning.

提出了一种新型的超低功耗、面积高效、内置带通滤波器的电流型仪表放大器(CMIA)，用于采集生物电位信号。提出了一种新的偏置前端，在±5%分量失配的情况下，在1Hz时实现了195dB的CMRR。CMIA具有160Hz ~ 7.2kHz的可调带宽，在5mHz ~ 160Hz频段内输入参考有效值噪声为2.24 μV。CMIA采用180nm混合模CMOS技术设计，在1.8V电源电压下，输出电压为1.65V(p-p)，功耗为39 μW。CMIA的输出不需要进一步的信号调理。

引用次数: 2

Resource allocation and consolidation in a multi-core server cluster using a Markov decision process model 使用马尔可夫决策过程模型的多核服务器集群中的资源分配和整合

International Symposium on Quality Electronic Design (ISQED)

Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523677

Yanzhi Wang, Shuang Chen, H. Goudarzi, Massoud Pedram

Distributed computing systems have attracted a lot of attention due to increasing demand for high performance computing and storage. Resource allocation is one of the most important challenges in the distributed systems especially when the clients have some Service Level Agreements (SLAs) and the total profit depends on how the system can meet these SLAs. In this paper, an SLA-based resource allocation problem in a server cluster is considered. The objective is to maximize the total profit, which is the total price gained from serving the clients subtracted by the operation cost of the server cluster. The total price depends on the average request response time for each client as defined in their utility functions, while the operating cost is related to the total energy consumption. A joint optimization framework is proposed, comprised of request dispatching, dynamic voltage and frequency scaling (DVFS) for individual cores, as well as server-level and core-level consolidations. Each core in the cluster is modeled using a continuous-time Markov decision process (CTMDP). A near-optimal hierarchical solution is proposed, consisting of a central manager and distributed local agents. Each local agent employs linear programming-based CTMDP solving method to solve the DVFS problem for the corresponding core. The central manager solves the request dispatching problem and finds the optimal number of turned on cores and servers for request processing, thereby achieving a desirable tradeoff between service request response time and power consumption. Experimental results demonstrate that the proposed near-optimal resource allocation and consolidation algorithm consistently outperforms baseline algorithms.

由于对高性能计算和存储的需求日益增长，分布式计算系统引起了人们的广泛关注。资源分配是分布式系统中最重要的挑战之一，特别是当客户端有一些服务水平协议(sla)，而总利润取决于系统如何满足这些sla时。本文研究了服务器集群中基于sla的资源分配问题。目标是使总利润最大化，即服务客户获得的总价格减去服务器集群的运营成本。总价格取决于每个客户的平均请求响应时间(在其效用函数中定义)，而运营成本与总能耗相关。提出了一个联合优化框架，包括请求调度、单个核的动态电压和频率缩放(DVFS)以及服务器级和核心级合并。集群中的每个核心都使用连续时间马尔可夫决策过程(CTMDP)建模。提出了一种近似最优的分层解决方案，该方案由一个中央管理器和分布式本地代理组成。每个局部代理采用基于线性规划的CTMDP求解方法求解相应核心的DVFS问题。中央管理器解决请求调度问题，并找到用于处理请求的核心和服务器的最优数量，从而在服务请求响应时间和功耗之间实现理想的权衡。实验结果表明，本文提出的近最优资源分配和整合算法始终优于基线算法。

{"title":"Resource allocation and consolidation in a multi-core server cluster using a Markov decision process model","authors":"Yanzhi Wang, Shuang Chen, H. Goudarzi, Massoud Pedram","doi":"10.1109/ISQED.2013.6523677","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523677","url":null,"abstract":"Distributed computing systems have attracted a lot of attention due to increasing demand for high performance computing and storage. Resource allocation is one of the most important challenges in the distributed systems especially when the clients have some Service Level Agreements (SLAs) and the total profit depends on how the system can meet these SLAs. In this paper, an SLA-based resource allocation problem in a server cluster is considered. The objective is to maximize the total profit, which is the total price gained from serving the clients subtracted by the operation cost of the server cluster. The total price depends on the average request response time for each client as defined in their utility functions, while the operating cost is related to the total energy consumption. A joint optimization framework is proposed, comprised of request dispatching, dynamic voltage and frequency scaling (DVFS) for individual cores, as well as server-level and core-level consolidations. Each core in the cluster is modeled using a continuous-time Markov decision process (CTMDP). A near-optimal hierarchical solution is proposed, consisting of a central manager and distributed local agents. Each local agent employs linear programming-based CTMDP solving method to solve the DVFS problem for the corresponding core. The central manager solves the request dispatching problem and finds the optimal number of turned on cores and servers for request processing, thereby achieving a desirable tradeoff between service request response time and power consumption. Experimental results demonstrate that the proposed near-optimal resource allocation and consolidation algorithm consistently outperforms baseline algorithms.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134261586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 29

Reliability-constrained die stacking order in 3DICs under manufacturing variability 制造可变性下的三维集成电路可靠性约束的模具堆叠顺序

International Symposium on Quality Electronic Design (ISQED)

Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523584

T. Chan, A. Kahng, Jiajia Li

3D integrated circuits (3DICs) with through-silicon vias (TSVs) are an important direction for semiconductor-based products and “More than Moore” scaling. However, 3DICs bring simultaneous challenges of reliability (power and temperature in stacks of thinned die) as well as variability (performance and power) in advanced technology nodes. In this paper, we study variability-reliability interactions and optimizations in 3DICs. Initial motivating studies show that in the presence of manufacturing variability, different die stacking orders can lead to as much as 2 years (~44%) difference in MTTF of a 3DIC stack. We study MTTF-driven die-stacking optimization with consideration of variability, and propose a “rule-of-thumb” guideline for stacking optimization to improve peak temperature as well as reliability in 3DICs. We also propose integer-linear programming (ILP) methods for reliability-driven die-stacking optimization. Our methods can achieve ~7% and ~28% improvement in average and minimum MTTF, respectively, of 3DICs; we also achieve ~3% improvement in performance under fixed reliability constraints. Our stacking optimizations can help improve 3DIC product yields under reliability requirements. Our research also yields the notable observation that a limited amount of manufacturing variation can “help” improve 3DIC product reliability when die-stacking optimization is applied.

具有硅通孔(tsv)的三维集成电路(3dic)是半导体产品和“超越摩尔”缩放的重要方向。然而，3dic同时带来了可靠性(薄芯片堆栈中的功率和温度)以及先进技术节点中的可变性(性能和功率)的挑战。在本文中，我们研究了3dic中的变异性-可靠性交互和优化问题。最初的激励研究表明，在存在制造可变性的情况下，不同的模具堆叠顺序可能导致3DIC堆叠的MTTF差异长达2年(~44%)。我们研究了考虑可变性的mttf驱动的层叠优化，并提出了一个“经验法则”的层叠优化准则，以提高3dic的峰值温度和可靠性。我们还提出了可靠性驱动的模堆优化的整数线性规划(ILP)方法。我们的方法可以使3dic的平均MTTF和最小MTTF分别提高7%和28%;在固定的可靠性约束下，我们也实现了约3%的性能改进。我们的堆叠优化可以帮助提高可靠性要求下的3DIC产品良率。我们的研究还得出了一个值得注意的观察结果，即当应用模堆优化时，有限的制造变化可以“帮助”提高3DIC产品的可靠性。

{"title":"Reliability-constrained die stacking order in 3DICs under manufacturing variability","authors":"T. Chan, A. Kahng, Jiajia Li","doi":"10.1109/ISQED.2013.6523584","DOIUrl":"https://doi.org/10.1109/ISQED.2013.6523584","url":null,"abstract":"3D integrated circuits (3DICs) with through-silicon vias (TSVs) are an important direction for semiconductor-based products and “More than Moore” scaling. However, 3DICs bring simultaneous challenges of reliability (power and temperature in stacks of thinned die) as well as variability (performance and power) in advanced technology nodes. In this paper, we study variability-reliability interactions and optimizations in 3DICs. Initial motivating studies show that in the presence of manufacturing variability, different die stacking orders can lead to as much as 2 years (~44%) difference in MTTF of a 3DIC stack. We study MTTF-driven die-stacking optimization with consideration of variability, and propose a “rule-of-thumb” guideline for stacking optimization to improve peak temperature as well as reliability in 3DICs. We also propose integer-linear programming (ILP) methods for reliability-driven die-stacking optimization. Our methods can achieve ~7% and ~28% improvement in average and minimum MTTF, respectively, of 3DICs; we also achieve ~3% improvement in performance under fixed reliability constraints. Our stacking optimizations can help improve 3DIC product yields under reliability requirements. Our research also yields the notable observation that a limited amount of manufacturing variation can “help” improve 3DIC product reliability when die-stacking optimization is applied.","PeriodicalId":127115,"journal":{"name":"International Symposium on Quality Electronic Design (ISQED)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122186548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

A novel 6T SRAM cell with asymmetrically gate underlap engineered FinFETs for enhanced read data stability and write ability 一种新型的6T SRAM单元，具有非对称栅极underlap设计的finfet，可增强读取数据的稳定性和写入能力

International Symposium on Quality Electronic Design (ISQED)

Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523634

S. Salahuddin, Hailong Jiao, V. Kursun

A new FinFET memory circuit technique based on asymmetrically gate underlap engineered bitline access transistors is proposed in this paper. The strengths of the asymmetrical bitline access transistors are weakened during read operations while enhanced during write operations as the direction of current flow is reversed. With the proposed asymmetrical six-FinFET SRAM cell, the read data stability and write ability are both enhanced by up to 6.12x and 58%, respectively, without causing any area overhead as compared to the standard symmetrical six-FinFET SRAM cells in a 15nm FinFET technology. The leakage power consumption is also reduced by up to 96.5% with the proposed asymmetrical FinFET SRAM cell as compared to the standard six-FinFET SRAM cells with symmetrical bitline access transistors.

本文提出了一种基于非对称门搭接工程位线存取晶体管的FinFET存储电路新技术。非对称位线访问晶体管的强度在读操作期间减弱，而在写操作期间由于电流方向相反而增强。与15nm FinFET技术中的标准对称6 -FinFET SRAM单元相比，所提出的非对称6 -FinFET SRAM单元的读取数据稳定性和写入能力分别提高了6.12倍和58%，而不会造成任何面积开销。与具有对称位线访问晶体管的标准6 -FinFET SRAM单元相比，所提出的非对称FinFET SRAM单元的泄漏功耗也降低了高达96.5%。

引用次数: 48

Performance and cache access time of SRAM-eDRAM hybrid caches considering wire delay 考虑线延迟的SRAM-eDRAM混合缓存的性能和缓存访问时间

International Symposium on Quality Electronic Design (ISQED)

Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523661

Young-Ho Gong, H. Jang, S. Chung

Most modern microprocessors have multi-level on-chip caches with multi-megabyte shared last-level cache (LLC). By using multi-level cache hierarchy, the whole size of on-chip caches becomes larger. The increased cache size causes the leakage power and area of the on-chip caches to increase. Recently, to reduce the leakage power and area of the SRAM based cache, the SRAM-eDRAM hybrid cache was proposed. For SRAM-eDRAM hybrid caches, however, there has not been any study to analyze the effects of the reduced area on wire delay, cache access time, and performance. By replacing half (or three fourth) of SRAM cells by small eDRAM cells for the SRAM-eDRAM hybrid caches, wire length is shortened, which eventually results in the reduction of wire delay and cache access time. In this paper, we evaluate the SRAM-eDRAM hybrid caches in terms of the energy, area, wire delay, access time, and performance. We show that the SRAM-eDRAM hybrid cache reduces the energy consumption, area, wire delay, and SRAM array access time by up to 53.9%, 49.9%, 50.4%, and 38.7%, respectively, compared to the SRAM based cache.

大多数现代微处理器都具有多兆字节的共享最后一级缓存(LLC)的多级片上缓存。通过使用多级缓存层次结构，片上缓存的整体大小变得更大。缓存大小的增加导致片上缓存的泄漏功率和面积增加。近年来，为了降低SRAM高速缓存的泄漏功率和泄漏面积，提出了SRAM- edram混合高速缓存。然而，对于SRAM-eDRAM混合缓存，还没有任何研究分析减小的面积对线延迟、缓存访问时间和性能的影响。通过将SRAM-eDRAM混合缓存的一半(或四分之三)SRAM单元替换为小型eDRAM单元，缩短了线长度，最终减少了线延迟和缓存访问时间。在本文中，我们从能量、面积、线延迟、存取时间和性能等方面评估了SRAM-eDRAM混合高速缓存。研究表明，与基于SRAM的高速缓存相比，SRAM- edram混合高速缓存的能耗、面积、线延迟和SRAM阵列访问时间分别降低了53.9%、49.9%、50.4%和38.7%。

引用次数: 3

Runtime 3-D stacked cache management for chip-multiprocessors 芯片多处理器的运行时3-D堆叠缓存管理

International Symposium on Quality Electronic Design (ISQED)

Pub Date : 2013-03-04 DOI: 10.1109/ISQED.2013.6523592

Jongpil Jung, K. Kang, G. Micheli, C. Kyung

Three-dimensional (3-D) memory stacking is one of the most promising solutions to tackle memory bandwidth problems in chip multiprocessors. In this work, we propose an efficient runtime 3-D cache management technique which not only takes advantage of the low memory access latency through vertical interconnections, but also exploits runtime memory access demand of applications which varies dynamically with time. Experimental results show that the proposed method offers performance improvement by up to 26.7% and on average 13.1% compared with a configuration of private stacked cache.

三维(3-D)内存堆叠是解决芯片多处理器内存带宽问题最有前途的解决方案之一。在这项工作中，我们提出了一种高效的运行时三维缓存管理技术，该技术不仅利用了垂直互连的低内存访问延迟，而且利用了应用程序随时间动态变化的运行时内存访问需求。实验结果表明，与私有堆叠缓存配置相比，该方法的性能提高了26.7%，平均提高了13.1%。

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

International Symposium on Quality Electronic Design (ISQED)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀