首页 > 最新文献

2013 Third Berkeley Symposium on Energy Efficient Electronic Systems (E3S)最新文献

英文 中文
Tunnel FETs with tunneling normal to the gate 隧穿法向栅极的隧道场效应管
Pub Date : 2013-10-01 DOI: 10.1109/E3S.2013.6705871
H. Xing, Guangle Zhou, Mingda Li, Yiqing Lu, Rui Li, M. Wistey, P. Fay, D. Jena, A. Seabaugh
Summary form only given. In this talk, I will review some of the recent development of tunnel field effect transistors (TFETs) at Notre Dame [1-8]. Tunnel FETs are promising replacements of Si-MOSFETs beyond 2020 due to their promise to achieve Ion/Ioff > 103 with Ion > 100 uA/um at low supply voltages (up to 0.5 V). To date we have demonstrated Ion/Ioff ~ 106, Ion ~ 180 uA/um, separately, based on III-V heterostructures. Challenges ahead include electrostatic control, defect-assisted tunneling and interface state density and parasitics. More recently, we have started to investigate 2D crystal based TFETs for their promises to realize ultrascaled electronic switches.
只提供摘要形式。在这次演讲中,我将回顾一些在巴黎圣母院隧道场效应晶体管(tfet)的最新发展[1-8]。隧道fet有望在2020年以后取代si - mosfet,因为它们有望在低电源电压(高达0.5 V)下实现离子> 100 uA/um的离子/ off > 103。到目前为止,我们已经分别展示了基于III-V异质结构的离子/ off ~ 106,离子~ 180 uA/um。未来的挑战包括静电控制、缺陷辅助隧道、界面态密度和寄生。最近,我们开始研究基于二维晶体的tfet,因为它们有望实现超尺度电子开关。
{"title":"Tunnel FETs with tunneling normal to the gate","authors":"H. Xing, Guangle Zhou, Mingda Li, Yiqing Lu, Rui Li, M. Wistey, P. Fay, D. Jena, A. Seabaugh","doi":"10.1109/E3S.2013.6705871","DOIUrl":"https://doi.org/10.1109/E3S.2013.6705871","url":null,"abstract":"Summary form only given. In this talk, I will review some of the recent development of tunnel field effect transistors (TFETs) at Notre Dame [1-8]. Tunnel FETs are promising replacements of Si-MOSFETs beyond 2020 due to their promise to achieve I<sub>on</sub>/I<sub>off</sub> > 10<sup>3</sup> with I<sub>on</sub> > 100 uA/um at low supply voltages (up to 0.5 V). To date we have demonstrated I<sub>on</sub>/I<sub>off</sub> ~ 10<sup>6</sup>, I<sub>on</sub> ~ 180 uA/um, separately, based on III-V heterostructures. Challenges ahead include electrostatic control, defect-assisted tunneling and interface state density and parasitics. More recently, we have started to investigate 2D crystal based TFETs for their promises to realize ultrascaled electronic switches.","PeriodicalId":231837,"journal":{"name":"2013 Third Berkeley Symposium on Energy Efficient Electronic Systems (E3S)","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124790128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Sub-Boltzmann transistors with piezoelectric gate barriers 压电栅极阻挡的亚玻尔兹曼晶体管
Pub Date : 2013-10-01 DOI: 10.1109/E3S.2013.6705877
R. Jana, G. Snider, D. Jena
Scaling of field-effect transistors (FETs) is limited by the high power dissipation density, and the resulting heat generation in ICs [1]. This is due to the non-scalability of subthreshold slope (SS), i.e. the gate voltage required to change the drain current by an order of magnitude; the value is limited to SS=kTln(10)~60 mV/dec in a classical Boltzmann FET switch [2-3]. Tunneling FETs are being investigated for sub-Boltzmann switching. But even a conventional FET can potentially achieve sub-Boltzmann switching taking advantage of ferroelectric gates materials [3]. It is possible to amplify the internal channel surface potential, Ψs over the applied gate bias voltage, Vg; using negative differential capacitance (NDC) in the gate insulator. The “body factor” then reduces below unity i.e. m = ∂Vg / ∂Ψs <; 1, and hence the subthreshold slope (SS=m×60 mV/dec) can be lowered below 60 mV/dec. In this work, we show that such internal gain mechanism can also exist in piezoelectric gate materials, such as in AIN/GaN heterostructures.
场效应晶体管(fet)的缩放受到高功耗密度和ic中产生的热量的限制[1]。这是由于亚阈值斜率(SS)的不可扩展性,即将漏极电流改变一个数量级所需的栅极电压;在经典玻尔兹曼FET开关中,该值被限制在SS=kTln(10)~ 60mv /dec[2-3]。隧道场效应管正在研究亚玻尔兹曼开关。但即使是传统的场效应管也有可能利用铁电栅极材料实现亚玻尔兹曼开关[3]。可以在施加的栅极偏置电压Vg上放大内部通道表面电位Ψs;栅极绝缘子采用负差分电容(NDC)。然后“体因子”降低到单位以下,即m =∂Vg /∂Ψs <;因此,阈下斜率(SS=m×60 mV/dec)可以降低到60 mV/dec以下。在这项工作中,我们表明这种内部增益机制也可以存在于压电栅材料中,例如AIN/GaN异质结构中。
{"title":"Sub-Boltzmann transistors with piezoelectric gate barriers","authors":"R. Jana, G. Snider, D. Jena","doi":"10.1109/E3S.2013.6705877","DOIUrl":"https://doi.org/10.1109/E3S.2013.6705877","url":null,"abstract":"Scaling of field-effect transistors (FETs) is limited by the high power dissipation density, and the resulting heat generation in ICs [1]. This is due to the non-scalability of subthreshold slope (SS), i.e. the gate voltage required to change the drain current by an order of magnitude; the value is limited to SS=kTln(10)~60 mV/dec in a classical Boltzmann FET switch [2-3]. Tunneling FETs are being investigated for sub-Boltzmann switching. But even a conventional FET can potentially achieve sub-Boltzmann switching taking advantage of ferroelectric gates materials [3]. It is possible to amplify the internal channel surface potential, Ψs over the applied gate bias voltage, Vg; using negative differential capacitance (NDC) in the gate insulator. The “body factor” then reduces below unity i.e. m = ∂Vg / ∂Ψs <; 1, and hence the subthreshold slope (SS=m×60 mV/dec) can be lowered below 60 mV/dec. In this work, we show that such internal gain mechanism can also exist in piezoelectric gate materials, such as in AIN/GaN heterostructures.","PeriodicalId":231837,"journal":{"name":"2013 Third Berkeley Symposium on Energy Efficient Electronic Systems (E3S)","volume":"329 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134105470","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Efficient optical interconnections for data-center computing systems 数据中心计算系统的高效光互连
Pub Date : 2013-10-01 DOI: 10.1109/E3S.2013.6705884
A. Krishnamoorthy
Within the past decade, the semiconductor computing industry has developed multicore and multithreaded core processors to overcome the challenges and shrinking benefits of traditional technology scaling. Multichip systems built using these components will require immense amount of off-chip bandwidth and low latency chip-to-chip links at the lowest energy cost possible. Wavelength-division multiplexed (WDM) silicon photonics have the potential to provide a solution for this immense interconnect problem. At Oracle, we are aggressively building a portfolio of active and passive nanophotonic devices, circuits, and multichip packaging with the aim to achieve sub-picojoule per bit communication links between computing elements in a large array “Macrochip”. To achieve ultralow energy consumption will certainly require integrating best-in-breed photonic devices with electronic circuits. While the juxtaposition of silicon photonic devices and VLSI circuits on the same silicon substrate represents the most intimate integration of electronic and photonic technologies, achieving this will require an immense amount of sophistication in design and process integration. Instead, hybrid integration of aggressive components, built on individually optimized technology platforms, is a pragmatic approach to achieving peak performance. Such “photonic bridge” chip components may be used as units in a larger transmitter or receiver array or as drop-in communication physical layer elements in a multi-chip computing node.
在过去的十年中,半导体计算行业已经开发了多核和多线程核心处理器,以克服传统技术扩展的挑战和缩小的好处。使用这些组件构建的多芯片系统将需要大量的片外带宽和低延迟的芯片到芯片链路,以尽可能低的能源成本。波分复用(WDM)硅光子学有潜力为这一巨大的互连问题提供解决方案。在Oracle,我们正在积极地构建一个有源和无源纳米光子器件、电路和多芯片封装的组合,目标是在大型阵列“Macrochip”的计算元件之间实现每比特亚皮焦耳的通信链接。要实现超低能耗,必然需要将同类最佳的光子器件与电子电路集成在一起。虽然在同一硅衬底上并置硅光子器件和超大规模集成电路代表了电子和光子技术的最密切集成,但实现这一目标将需要大量复杂的设计和工艺集成。相反,在单独优化的技术平台上构建侵略性组件的混合集成是实现最佳性能的实用方法。这种“光子桥”芯片组件可以用作较大的发射器或接收器阵列中的单元,或用作多芯片计算节点中的插入式通信物理层元件。
{"title":"Efficient optical interconnections for data-center computing systems","authors":"A. Krishnamoorthy","doi":"10.1109/E3S.2013.6705884","DOIUrl":"https://doi.org/10.1109/E3S.2013.6705884","url":null,"abstract":"Within the past decade, the semiconductor computing industry has developed multicore and multithreaded core processors to overcome the challenges and shrinking benefits of traditional technology scaling. Multichip systems built using these components will require immense amount of off-chip bandwidth and low latency chip-to-chip links at the lowest energy cost possible. Wavelength-division multiplexed (WDM) silicon photonics have the potential to provide a solution for this immense interconnect problem. At Oracle, we are aggressively building a portfolio of active and passive nanophotonic devices, circuits, and multichip packaging with the aim to achieve sub-picojoule per bit communication links between computing elements in a large array “Macrochip”. To achieve ultralow energy consumption will certainly require integrating best-in-breed photonic devices with electronic circuits. While the juxtaposition of silicon photonic devices and VLSI circuits on the same silicon substrate represents the most intimate integration of electronic and photonic technologies, achieving this will require an immense amount of sophistication in design and process integration. Instead, hybrid integration of aggressive components, built on individually optimized technology platforms, is a pragmatic approach to achieving peak performance. Such “photonic bridge” chip components may be used as units in a larger transmitter or receiver array or as drop-in communication physical layer elements in a multi-chip computing node.","PeriodicalId":231837,"journal":{"name":"2013 Third Berkeley Symposium on Energy Efficient Electronic Systems (E3S)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134601936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prospects for high-aspect-ratio FinFETs in low-power logic 低功耗逻辑中高宽高比finfet的前景
Pub Date : 2013-10-01 DOI: 10.1109/E3S.2013.6705878
M. Rodwell, D. Elias
As we reduce transistor capacitances, node capacitances are limited by wiring, setting a minimum power dissipation determined by the number of gates, the mean wire length, the mean switching rate, and the supply voltage VDD. With thermally-activated, FETs, the off-state leakage Ioff and target on-current Ion then determine the minimum feasible VDD, and the IC clock frequency can then be increased only at the expense of increased power consumption. Tunnel transistors [1] offer subthreshold characteristics steeper than 60mV/decade, but achieving high Ion at low Ioff and low VDD is challenging. Subthreshold logic [2] operates at lowVDD, but is slow because of low Ion. Here we propose low-power logic using high-aspect-ratio finFETs, devices we have fabricated with few-nm body thicknesses and 180nm height [3]. If these can fabricated at ~20nm pitch, then the fin surface area can exceed its footprint area - i.e. the area the transistor occupies on the IC - by ~10:1. IC performance can be then improved by maintaining fixed VDD, but with reduced FET footprint area hence reduced die size and therefore reduced wiring capacitance, or can be improved by reducing VDD to ~300mV while maintaining large Ion per unit IC die area.
当我们减小晶体管的电容时,节点的电容受到布线的限制,设置由栅极数、平均导线长度、平均开关速率和电源电压VDD决定的最小功耗。对于热激活的fet,关断状态漏关和目标导通电流离子确定最小可行VDD,然后IC时钟频率只能以增加功耗为代价来增加。隧道晶体管[1]提供比60mV/decade更陡的亚阈值特性,但在低关断和低VDD下实现高离子是具有挑战性的。亚阈值逻辑[2]在低vdd下工作,但由于低离子而缓慢。在这里,我们提出了使用高宽高比finfet的低功耗逻辑,我们制造的器件具有几纳米的体厚和180nm的高度[3]。如果这些可以以~20nm的间距制造,那么翅片表面积可以超过其占地面积-即晶体管在IC上占据的面积-约10:1。然后可以通过保持固定的VDD来提高IC性能,但由于FET占地面积减少,因此减少了芯片尺寸,从而减少了布线电容,或者可以通过将VDD降低到~300mV同时保持单位IC芯片面积的大离子来提高IC性能。
{"title":"Prospects for high-aspect-ratio FinFETs in low-power logic","authors":"M. Rodwell, D. Elias","doi":"10.1109/E3S.2013.6705878","DOIUrl":"https://doi.org/10.1109/E3S.2013.6705878","url":null,"abstract":"As we reduce transistor capacitances, node capacitances are limited by wiring, setting a minimum power dissipation determined by the number of gates, the mean wire length, the mean switching rate, and the supply voltage VDD. With thermally-activated, FETs, the off-state leakage Ioff and target on-current Ion then determine the minimum feasible VDD, and the IC clock frequency can then be increased only at the expense of increased power consumption. Tunnel transistors [1] offer subthreshold characteristics steeper than 60mV/decade, but achieving high Ion at low Ioff and low VDD is challenging. Subthreshold logic [2] operates at lowVDD, but is slow because of low Ion. Here we propose low-power logic using high-aspect-ratio finFETs, devices we have fabricated with few-nm body thicknesses and 180nm height [3]. If these can fabricated at ~20nm pitch, then the fin surface area can exceed its footprint area - i.e. the area the transistor occupies on the IC - by ~10:1. IC performance can be then improved by maintaining fixed VDD, but with reduced FET footprint area hence reduced die size and therefore reduced wiring capacitance, or can be improved by reducing VDD to ~300mV while maintaining large Ion per unit IC die area.","PeriodicalId":231837,"journal":{"name":"2013 Third Berkeley Symposium on Energy Efficient Electronic Systems (E3S)","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116680922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
2.5 GB/s germanium gate photoMOSFET integrated to silicon photonics 集成到硅光子学的2.5 GB/s锗栅photoMOSFET
Pub Date : 2013-10-01 DOI: 10.1109/E3S.2013.6705885
R. Going, Jodi Loo, Tsu-Jae King-Liu, Ming C. Wu
We demonstrate a monocrystalline 1×8 μm germanium gate photoMOSFET integrated with silicon photonic waveguides and grating coupler operating at over 2.5 GB/s at 1550nm.
我们展示了一个集成了硅光子波导和光栅耦合器的单晶1×8 μm锗栅极photoMOSFET,在1550nm处工作速度超过2.5 GB/s。
{"title":"2.5 GB/s germanium gate photoMOSFET integrated to silicon photonics","authors":"R. Going, Jodi Loo, Tsu-Jae King-Liu, Ming C. Wu","doi":"10.1109/E3S.2013.6705885","DOIUrl":"https://doi.org/10.1109/E3S.2013.6705885","url":null,"abstract":"We demonstrate a monocrystalline 1×8 μm germanium gate photoMOSFET integrated with silicon photonic waveguides and grating coupler operating at over 2.5 GB/s at 1550nm.","PeriodicalId":231837,"journal":{"name":"2013 Third Berkeley Symposium on Energy Efficient Electronic Systems (E3S)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116702346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Why tunneling FETs don't work, and how to fix it 为什么隧道效应晶体管不工作,如何解决它
Pub Date : 2013-10-01 DOI: 10.1109/E3S.2013.6705868
S. Agarwal, E. Yablonovitch
To date, TFET results have been unsatisfying. The best reported subthreshold swings have been measured at a current density of around a nA/um and get significantly worse as the current increases. In order to achieve a better performance, there are fundamental design issues that need to be engineered. We can understand these issues by analyzing the three types of devices shown in Fig 1. The voltage required to operate a TFET can be given by: VDD = VSS × Log(Ion /Ioff)+ VOV. VSS is the subthreshold swing and VOV is the overdrive voltage needed to achieve the desired on-current after threshold. VOV will be determined by the device geometry as shown in Fig 2 [1]. Introducing quantum confinement in the direction of tunneling increases the conductance by 1-2 orders of magnitude at low voltage. VSS is given by the following model [2]: SS = 1/ ηel × (1/SBarrier + ηconf/SDOS)-1 (1) ηel is the electrostatic gate efficiency. ηconf is the quantum confinement efficiency and comes from energy level shifts that occur when the quantum well shape changes with bias. SBarrier represents the steepness in mV/decade that comes from changing the thickness of the tunneling barrier. SDOS is the steepness of the joint density of states (DOS) and represents the rate at which the joint DOS fall off as the band edges are misaligned.
迄今为止,TFET的结果并不令人满意。最好的亚阈值波动是在电流密度约为1 nA/um时测量到的,并且随着电流的增加而变得明显更糟。为了获得更好的性能,需要设计一些基本的设计问题。我们可以通过分析图1所示的三种类型的设备来理解这些问题。工作TFET所需的电压可以由:VDD = VSS × Log(Ion /Ioff)+ VOV给出。VSS是亚阈值摆幅,VOV是达到阈值后所需导通电流所需的超速电压。VOV将由器件几何形状决定,如图2[1]所示。在低电压下,在隧穿方向上引入量子约束使电导率提高了1-2个数量级。VSS由以下模型给出[2]:SS = 1/ ηel × (1/SBarrier + ηconf/SDOS)-1 (1) ηel为静电栅极效率。ηconf是量子约束效率,来自于量子阱形状随偏置变化时发生的能级位移。SBarrier表示陡峭度,单位为mV/ 10年,它来自于隧道势垒厚度的改变。SDOS是关节状态密度(DOS)的陡度,表示当带边缘不对齐时关节状态密度下降的速率。
{"title":"Why tunneling FETs don't work, and how to fix it","authors":"S. Agarwal, E. Yablonovitch","doi":"10.1109/E3S.2013.6705868","DOIUrl":"https://doi.org/10.1109/E3S.2013.6705868","url":null,"abstract":"To date, TFET results have been unsatisfying. The best reported subthreshold swings have been measured at a current density of around a nA/um and get significantly worse as the current increases. In order to achieve a better performance, there are fundamental design issues that need to be engineered. We can understand these issues by analyzing the three types of devices shown in Fig 1. The voltage required to operate a TFET can be given by: V<sub>DD</sub> = V<sub>SS</sub> × Log(I<sub>on</sub> /I<sub>off</sub>)+ V<sub>OV</sub>. V<sub>SS</sub> is the subthreshold swing and V<sub>OV</sub> is the overdrive voltage needed to achieve the desired on-current after threshold. V<sub>OV</sub> will be determined by the device geometry as shown in Fig 2 [1]. Introducing quantum confinement in the direction of tunneling increases the conductance by 1-2 orders of magnitude at low voltage. V<sub>SS</sub> is given by the following model [2]: SS = 1/ η<sub>el</sub> × (1/S<sub>Barrier</sub> + η<sub>conf</sub>/S<sub>DOS</sub>)<sup>-1</sup> (1) η<sub>el</sub> is the electrostatic gate efficiency. η<sub>conf</sub> is the quantum confinement efficiency and comes from energy level shifts that occur when the quantum well shape changes with bias. S<sub>Barrier</sub> represents the steepness in mV/decade that comes from changing the thickness of the tunneling barrier. S<sub>DOS</sub> is the steepness of the joint density of states (DOS) and represents the rate at which the joint DOS fall off as the band edges are misaligned.","PeriodicalId":231837,"journal":{"name":"2013 Third Berkeley Symposium on Energy Efficient Electronic Systems (E3S)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134080595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Exascale computer architecture adjusting to the “New normal” for computing 百亿亿级计算机架构适应计算的“新常态”
Pub Date : 2013-10-01 DOI: 10.1109/E3S.2013.6705854
J. Shalf
The current MPI+Fortran ecosystem has sustained HPC application software development for the past decade, but was architected for coarse-grained concurrency largely dominated by bulk-synchronous algorithms. The trends in computer architecture have turned our model for how to get good performance from computing systems upside-down, and will require rethinking our entire programming environment and algorithm design to be better aligned with the new cost metrics for these emerging hardware architectures. There are already promising avenues of exploration underway to mitigate these effects. Future hardware constraints on bandwidth and memory capacity, together with exponential growth in explicit on-chip parallelism will likely require a mass migration to new algorithms and software architecture that is as broad and disruptive as the migration from vector to parallel computing systems that occurred 15 years go. The challenge is to efficiently express massive parallelism and hierarchical data locality without subjecting the programmer to overwhelming complexity. The author covers how changes in hardware (governed by the fundamental physics of Silicon based CMOS technology) are breaking our existing abstract machine models, and DOE's program to overcome these obstacles to continued performance improvements. He examines potential approaches that range from revolutionary asynchronous and dataflow models of computation to evolutionary extensions to existing APIs and OpenMP directives.
当前的MPI+Fortran生态系统在过去的十年里一直支持着HPC应用软件的开发,但它的架构主要是为大容量同步算法所主导的粗粒度并发。计算机体系结构的趋势已经颠覆了我们如何从计算系统中获得良好性能的模型,并且需要重新思考我们的整个编程环境和算法设计,以便更好地与这些新兴硬件体系结构的新成本指标保持一致。已经有一些很有希望的探索途径正在进行中,以减轻这些影响。未来硬件对带宽和内存容量的限制,以及显式芯片上并行性的指数级增长,可能需要大规模迁移到新的算法和软件架构,这就像15年前从矢量计算系统迁移到并行计算系统一样广泛和具有破坏性。挑战在于如何有效地表达大规模并行性和分层数据局部性,同时又不让程序员承受压倒性的复杂性。作者介绍了硬件的变化(由基于硅的CMOS技术的基本物理控制)如何打破我们现有的抽象机器模型,以及能源部克服这些障碍以持续提高性能的计划。他考察了各种可能的方法,从革命性的异步和数据流计算模型到对现有api和OpenMP指令的进化扩展。
{"title":"Exascale computer architecture adjusting to the “New normal” for computing","authors":"J. Shalf","doi":"10.1109/E3S.2013.6705854","DOIUrl":"https://doi.org/10.1109/E3S.2013.6705854","url":null,"abstract":"The current MPI+Fortran ecosystem has sustained HPC application software development for the past decade, but was architected for coarse-grained concurrency largely dominated by bulk-synchronous algorithms. The trends in computer architecture have turned our model for how to get good performance from computing systems upside-down, and will require rethinking our entire programming environment and algorithm design to be better aligned with the new cost metrics for these emerging hardware architectures. There are already promising avenues of exploration underway to mitigate these effects. Future hardware constraints on bandwidth and memory capacity, together with exponential growth in explicit on-chip parallelism will likely require a mass migration to new algorithms and software architecture that is as broad and disruptive as the migration from vector to parallel computing systems that occurred 15 years go. The challenge is to efficiently express massive parallelism and hierarchical data locality without subjecting the programmer to overwhelming complexity. The author covers how changes in hardware (governed by the fundamental physics of Silicon based CMOS technology) are breaking our existing abstract machine models, and DOE's program to overcome these obstacles to continued performance improvements. He examines potential approaches that range from revolutionary asynchronous and dataflow models of computation to evolutionary extensions to existing APIs and OpenMP directives.","PeriodicalId":231837,"journal":{"name":"2013 Third Berkeley Symposium on Energy Efficient Electronic Systems (E3S)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130084026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Waveguide-integrated optical antenna nanoLEDs for on-chip communication 用于片上通信的波导集成光学天线纳米oled
Pub Date : 2013-10-01 DOI: 10.1109/E3S.2013.6705886
M. Eggleston, K. Messer, S. Fortuna, E. Yablonovitch, Ming C. Wu
We present on an optical antenna based nanoLED that is fabricated directly on top of an InP waveguide. Waveguide coupling efficiency of 70% and directional emission is achieved with a Yagi-Uda antenna structure. By using an epitaxial lift-off process, we show that this device could be integrated directly onto a Silicon-photonics substrate.
我们提出了一种基于纳米oled的光学天线,它是直接在InP波导上制造的。采用Yagi-Uda天线结构实现了70%的波导耦合效率和定向发射。通过使用外延提升工艺,我们表明该器件可以直接集成到硅光子衬底上。
{"title":"Waveguide-integrated optical antenna nanoLEDs for on-chip communication","authors":"M. Eggleston, K. Messer, S. Fortuna, E. Yablonovitch, Ming C. Wu","doi":"10.1109/E3S.2013.6705886","DOIUrl":"https://doi.org/10.1109/E3S.2013.6705886","url":null,"abstract":"We present on an optical antenna based nanoLED that is fabricated directly on top of an InP waveguide. Waveguide coupling efficiency of 70% and directional emission is achieved with a Yagi-Uda antenna structure. By using an epitaxial lift-off process, we show that this device could be integrated directly onto a Silicon-photonics substrate.","PeriodicalId":231837,"journal":{"name":"2013 Third Berkeley Symposium on Energy Efficient Electronic Systems (E3S)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133948236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nanoelectromechanical switching devices: Scaling toward ultimate energy efficiency and longevity 纳米机电开关器件:向终极能源效率和寿命扩展
Pub Date : 2013-10-01 DOI: 10.1109/E3S.2013.6705881
P. Feng
The active search for candidates of an ideal switching device for low-voltage logic and ultralow-power applications has stimulated focused explorations of contact-mode switches (relays) based on micro/nanoelectromechanical systems (MEMS/NEMS) [1-7]. This has been driven by the fundamental advantages that mechanical devices offer, such as ideally abrupt switching with zero off-state leakage, suitable for harsh and extreme environments, and very small footprints (e.g., particularly with NEMS). In pursuing and realizing these advantages, however, significant challenges still remain today: (i) All the high-performance mechanical switches recently demonstrated are still in the MEMS domain [2-5] and are orders of magnitude larger in size or volume (>103 to 104) than the nanoscale devices presented in this work. (ii) Most truly nanoscale contact-mode NEMS switches known to date (often based on various nanowires, cantilevers and nanotubes) still suffer from very short lifetimes.nanoscale contact-mode NEMS switches
积极寻找低压逻辑和超低功耗应用的理想开关器件,刺激了基于微/纳米机电系统(MEMS/NEMS)的接触模式开关(继电器)的重点探索[1-7]。这是由机械设备提供的基本优势所驱动的,例如理想的突然开关,零断开状态泄漏,适用于恶劣和极端环境,占地面积非常小(例如,特别是NEMS)。然而,在追求和实现这些优势的过程中,今天仍然存在重大挑战:(i)最近展示的所有高性能机械开关仍然处于MEMS领域[2-5],并且在尺寸或体积上比本工作中提出的纳米级器件大几个数量级(>103到104)。(ii)目前已知的大多数真正的纳米级接触型NEMS开关(通常基于各种纳米线、悬臂和纳米管)的寿命仍然很短。纳米级接触型NEMS开关
{"title":"Nanoelectromechanical switching devices: Scaling toward ultimate energy efficiency and longevity","authors":"P. Feng","doi":"10.1109/E3S.2013.6705881","DOIUrl":"https://doi.org/10.1109/E3S.2013.6705881","url":null,"abstract":"The active search for candidates of an ideal switching device for low-voltage logic and ultralow-power applications has stimulated focused explorations of contact-mode switches (relays) based on micro/nanoelectromechanical systems (MEMS/NEMS) [1-7]. This has been driven by the fundamental advantages that mechanical devices offer, such as ideally abrupt switching with zero off-state leakage, suitable for harsh and extreme environments, and very small footprints (e.g., particularly with NEMS). In pursuing and realizing these advantages, however, significant challenges still remain today: (i) All the high-performance mechanical switches recently demonstrated are still in the MEMS domain [2-5] and are orders of magnitude larger in size or volume (>103 to 104) than the nanoscale devices presented in this work. (ii) Most truly nanoscale contact-mode NEMS switches known to date (often based on various nanowires, cantilevers and nanotubes) still suffer from very short lifetimes.nanoscale contact-mode NEMS switches","PeriodicalId":231837,"journal":{"name":"2013 Third Berkeley Symposium on Energy Efficient Electronic Systems (E3S)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133682800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Achieving energy efficiency by HW/SW co-design 通过硬件/软件协同设计实现能源效率
Pub Date : 2013-10-01 DOI: 10.1109/E3S.2013.6705856
S. Borkar
Summary form only given. Compute performance increased by orders of magnitude in the last few decades, made possible by continued technology scaling, increasing frequency, providing integration capacity to realize novel architectures, and reducing energy to keep power dissipation within limit. The technology treadmill will continue, and one would expect to reach Exascale level performance this decade; however, it's the same Physics that helped you in the past will now pose some barriers-Business as usual will not be an option. The energy and power will pose as a major challenge- an Exascale machine would consume in excess of a Giga-watt! Memory & communication bandwidth with conventional technology would be prohibitive. Orders of magnitude increased parallelism, let alone extreme parallelism due to energy saving techniques, would increase unreliability. And programming system will be posed with even severe challenge of harnessing the performance with concurrency. We will discuss potential solutions in all disciplines, such as circuit design, system architecture, system software, programming system, and resiliency to pave the road towards Exascale performance.
仅提供摘要形式。在过去的几十年里,计算性能提高了几个数量级,这得益于持续的技术扩展、频率提高、提供集成能力以实现新型架构,以及降低能耗以将功率耗散控制在极限范围内。技术发展的脚步仍将继续,人们有望在本十年内达到 Exascale 级别的性能;然而,过去曾帮助过你的物理学现在也将构成一些障碍--一切照旧将不再是一种选择。能源和电力将是一大挑战--一台超大规模机器的能耗将超过千兆瓦!传统技术的内存和通信带宽将令人望而却步。数量级的并行性提高,更不用说节能技术带来的极端并行性,会增加不稳定性。编程系统在利用并发性能方面将面临更加严峻的挑战。我们将讨论电路设计、系统架构、系统软件、编程系统和弹性等所有学科的潜在解决方案,为实现超大规模性能铺平道路。
{"title":"Achieving energy efficiency by HW/SW co-design","authors":"S. Borkar","doi":"10.1109/E3S.2013.6705856","DOIUrl":"https://doi.org/10.1109/E3S.2013.6705856","url":null,"abstract":"Summary form only given. Compute performance increased by orders of magnitude in the last few decades, made possible by continued technology scaling, increasing frequency, providing integration capacity to realize novel architectures, and reducing energy to keep power dissipation within limit. The technology treadmill will continue, and one would expect to reach Exascale level performance this decade; however, it's the same Physics that helped you in the past will now pose some barriers-Business as usual will not be an option. The energy and power will pose as a major challenge- an Exascale machine would consume in excess of a Giga-watt! Memory & communication bandwidth with conventional technology would be prohibitive. Orders of magnitude increased parallelism, let alone extreme parallelism due to energy saving techniques, would increase unreliability. And programming system will be posed with even severe challenge of harnessing the performance with concurrency. We will discuss potential solutions in all disciplines, such as circuit design, system architecture, system software, programming system, and resiliency to pave the road towards Exascale performance.","PeriodicalId":231837,"journal":{"name":"2013 Third Berkeley Symposium on Energy Efficient Electronic Systems (E3S)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126445247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2013 Third Berkeley Symposium on Energy Efficient Electronic Systems (E3S)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1