首页 > 最新文献

2019 IEEE International Conference on Rebooting Computing (ICRC)最新文献

英文 中文
Message from the 2019 ICRC Program Co-Chairs 2019年红十字国际委员会项目联合主席致辞
Pub Date : 2019-11-01 DOI: 10.1109/icrc.2019.8914714
{"title":"Message from the 2019 ICRC Program Co-Chairs","authors":"","doi":"10.1109/icrc.2019.8914714","DOIUrl":"https://doi.org/10.1109/icrc.2019.8914714","url":null,"abstract":"","PeriodicalId":297574,"journal":{"name":"2019 IEEE International Conference on Rebooting Computing (ICRC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122304072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Limits of Stochastic Computing 论随机计算的极限
Pub Date : 2019-11-01 DOI: 10.1109/ICRC.2019.8914706
Florian Neugebauer, I. Polian, J. Hayes
Stochastic computing (SC) provides large benefits in area and power consumption at the cost of limited computational accuracy. For this reason, it has been proposed for computation intensive applications that can tolerate approximate results such as neural networks (NNs) and digital filters. Most system implementations employing SC are referred to as stochastic circuits, even though they can have vastly different properties and limitations. In this work, we propose a distinction between strongly and weakly stochastic circuits, which provide different options and trade-offs for implementing SC operations. On this basis, we investigate some fundamental theoretical and practical limits of SC that have not been considered before. In particular, we analyze the limits of stochastic addition and show via the example of a convolutional NN that these limits can restrict the viability of strongly stochastic systems. We further show that theoretically all non-affine functions do not have exact SC implementations and investigate the practical implications of this discovery.
随机计算(SC)以有限的计算精度为代价,在面积和功耗方面提供了巨大的优势。由于这个原因,它已被提出用于计算密集型应用,可以容忍近似结果,如神经网络(nn)和数字滤波器。大多数采用SC的系统实现被称为随机电路,尽管它们可能具有截然不同的特性和限制。在这项工作中,我们提出了强随机电路和弱随机电路之间的区别,它们为实现SC操作提供了不同的选择和权衡。在此基础上,我们研究了一些以前没有考虑到的基本理论和实践限制。特别地,我们分析了随机加法的极限,并通过卷积神经网络的例子表明,这些极限可以限制强随机系统的生存能力。我们进一步表明,理论上所有的非仿射函数都没有精确的SC实现,并研究了这一发现的实际意义。
{"title":"On the Limits of Stochastic Computing","authors":"Florian Neugebauer, I. Polian, J. Hayes","doi":"10.1109/ICRC.2019.8914706","DOIUrl":"https://doi.org/10.1109/ICRC.2019.8914706","url":null,"abstract":"Stochastic computing (SC) provides large benefits in area and power consumption at the cost of limited computational accuracy. For this reason, it has been proposed for computation intensive applications that can tolerate approximate results such as neural networks (NNs) and digital filters. Most system implementations employing SC are referred to as stochastic circuits, even though they can have vastly different properties and limitations. In this work, we propose a distinction between strongly and weakly stochastic circuits, which provide different options and trade-offs for implementing SC operations. On this basis, we investigate some fundamental theoretical and practical limits of SC that have not been considered before. In particular, we analyze the limits of stochastic addition and show via the example of a convolutional NN that these limits can restrict the viability of strongly stochastic systems. We further show that theoretically all non-affine functions do not have exact SC implementations and investigate the practical implications of this discovery.","PeriodicalId":297574,"journal":{"name":"2019 IEEE International Conference on Rebooting Computing (ICRC)","volume":"11 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120928429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Integrated Photonics Architectures for Residue Number System Computations 残数系统计算的集成光子体系结构
Pub Date : 2019-11-01 DOI: 10.1109/ICRC.2019.8914700
Jiaxin Peng, Y. Alkabani, Shuai Sun, V. Sorger, T. El-Ghazawi
Residue number system (RNS) can represent large numbers as sets of relatively smaller prime numbers. Architectures for such systems can be inherently parallel, as arithmetic operations on large numbers can then be performed on elements of those sets individually. As RNS arithmetic is based on modulo operations, an RNS computational unit is usually constructed as a network of switches that are controlled to perform a specific computation, giving rise to the processing in network (PIN) paradigm. In this work, we explore using integrated photonics switches to build different high-speed architectures of RNS computational units based on multistage interconnection networks. The inherent parallelism of RNS, as well as very low energy of integrated phontonics are two primary reasons for the promise of this direction. We study the trade-offs between the area and the control complexity of five different architectures. We show that our newly proposed architecture, which is based on arbitrary size Benes (AS-Benes) networks, saves up to 90% of the area and is up to 16 times faster than the other architectures.
残数系统(RNS)可以将大数表示为相对较小的素数集合。这类系统的体系结构本质上是并行的,因为对大数的算术运算可以分别在这些集合的元素上执行。由于RNS算法基于模运算,一个RNS计算单元通常被构造为一个由交换机组成的网络,这些交换机被控制来执行特定的计算,从而产生了网络处理(PIN)范式。在这项工作中,我们探索了使用集成光子开关来构建基于多级互连网络的RNS计算单元的不同高速架构。RNS固有的并行性,以及集成声子的低能量是这个方向有希望的两个主要原因。我们研究了五种不同架构的面积和控制复杂性之间的权衡。我们表明,我们新提出的基于任意大小的Benes (AS-Benes)网络的架构节省了高达90%的面积,并且比其他架构快16倍。
{"title":"Integrated Photonics Architectures for Residue Number System Computations","authors":"Jiaxin Peng, Y. Alkabani, Shuai Sun, V. Sorger, T. El-Ghazawi","doi":"10.1109/ICRC.2019.8914700","DOIUrl":"https://doi.org/10.1109/ICRC.2019.8914700","url":null,"abstract":"Residue number system (RNS) can represent large numbers as sets of relatively smaller prime numbers. Architectures for such systems can be inherently parallel, as arithmetic operations on large numbers can then be performed on elements of those sets individually. As RNS arithmetic is based on modulo operations, an RNS computational unit is usually constructed as a network of switches that are controlled to perform a specific computation, giving rise to the processing in network (PIN) paradigm. In this work, we explore using integrated photonics switches to build different high-speed architectures of RNS computational units based on multistage interconnection networks. The inherent parallelism of RNS, as well as very low energy of integrated phontonics are two primary reasons for the promise of this direction. We study the trade-offs between the area and the control complexity of five different architectures. We show that our newly proposed architecture, which is based on arbitrary size Benes (AS-Benes) networks, saves up to 90% of the area and is up to 16 times faster than the other architectures.","PeriodicalId":297574,"journal":{"name":"2019 IEEE International Conference on Rebooting Computing (ICRC)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114624138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Entangled State Preparation for Non-Binary Quantum Computing 非二进制量子计算的纠缠态制备
Pub Date : 2019-11-01 DOI: 10.1109/ICRC.2019.8914717
Kaitlin N. Smith, M. Thornton
A common model of quantum computing is the gate model with binary basis states. Here, we consider the gate model of quantum computing with a non-binary radix resulting in more than two basis states to represent a quantum digit, or qudit. Quantum entanglement is an important phenomenon that is a critical component of quantum computation and communications algorithms. The generation and use of entanglement among radix-2 qubits is well-known and used often in quantum computing algorithms. Quantum entanglement exists in higher-radix systems as well although little is written regarding the generation of higher-radix entangled states. We provide background describing the feasibility of multiple-valued logic quantum systems and describe a new systematic method for generating maximally entangled states in quantum systems of dimension greater than two. This method is implemented in a synthesis algorithm that is described. Experimental results are included that demonstrate the transformations needed to create specific forms of maximally entangled quantum states.
量子计算的一个常用模型是二元基态的门模型。在这里,我们考虑量子计算的门模型,其非二进制基数导致两个以上的基态来表示量子数字或qudit。量子纠缠是一种重要现象,是量子计算和通信算法的重要组成部分。基2量子位之间纠缠的产生和使用是众所周知的,并且经常用于量子计算算法。量子纠缠也存在于高基数系统中,尽管关于高基数纠缠态的产生的文献很少。我们提供了描述多值逻辑量子系统可行性的背景,并描述了在大于2维的量子系统中产生最大纠缠态的一种新的系统方法。本文描述了一种综合算法来实现该方法。实验结果包括,证明需要创建最大纠缠量子态的特定形式的转换。
{"title":"Entangled State Preparation for Non-Binary Quantum Computing","authors":"Kaitlin N. Smith, M. Thornton","doi":"10.1109/ICRC.2019.8914717","DOIUrl":"https://doi.org/10.1109/ICRC.2019.8914717","url":null,"abstract":"A common model of quantum computing is the gate model with binary basis states. Here, we consider the gate model of quantum computing with a non-binary radix resulting in more than two basis states to represent a quantum digit, or qudit. Quantum entanglement is an important phenomenon that is a critical component of quantum computation and communications algorithms. The generation and use of entanglement among radix-2 qubits is well-known and used often in quantum computing algorithms. Quantum entanglement exists in higher-radix systems as well although little is written regarding the generation of higher-radix entangled states. We provide background describing the feasibility of multiple-valued logic quantum systems and describe a new systematic method for generating maximally entangled states in quantum systems of dimension greater than two. This method is implemented in a synthesis algorithm that is described. Experimental results are included that demonstrate the transformations needed to create specific forms of maximally entangled quantum states.","PeriodicalId":297574,"journal":{"name":"2019 IEEE International Conference on Rebooting Computing (ICRC)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114512708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Non-Volatile Memory Array Based Quantization- and Noise-Resilient LSTM Neural Networks 基于非易失性存储器阵列的量化和抗噪声LSTM神经网络
Pub Date : 2019-11-01 DOI: 10.1109/ICRC.2019.8914713
Wen Ma, P. Chiu, Won Ho Choi, Minghai Qin, D. Bedau, Martin Lueker-Boden
In cloud and edge computing models, it is important that compute devices at the edge be as power efficient as possible. Long short-term memory (LSTM) neural networks have been widely used for natural language processing, time series prediction and many other sequential data tasks. Thus, for these applications there is increasing need for low-power accelerators for LSTM model inference at the edge. In order to reduce power dissipation due to data transfers within inference devices, there has been significant interest in accelerating vector-matrix multiplication (VMM) operations using non-volatile memory (NVM) weight arrays. In NVM array-based hardware, reduced bit-widths also significantly increases the power efficiency. In this paper, we focus on the application of quantization-aware training algorithm to LSTM models, and the benefits these models bring in terms of resilience against both quantization error and analog device noise. We have shown that only 4-bit NVM weights and 4-bit ADC/DACs are needed to produce equivalent LSTM network performance as floating-point baseline. Reasonable levels of ADC quantization noise and weight noise can be naturally tolerated within our NVM-based quantized LSTM network. Benchmark analysis of our proposed LSTM accelerator for inference has shown at least 2.4× better computing efficiency and 40× higher area efficiency than traditional digital approaches (GPU, FPGA, and ASIC). Some other novel approaches based on NVM promise to deliver higher computing efficiency (up to ×4.7) but require larger arrays with potential higher error rates.
在云和边缘计算模型中,重要的是使边缘计算设备尽可能地节能。长短期记忆(LSTM)神经网络在自然语言处理、时间序列预测和许多其他序列数据任务中得到了广泛的应用。因此,对于这些应用,越来越需要在边缘进行LSTM模型推理的低功率加速器。为了减少由于推理设备内数据传输造成的功耗,人们对使用非易失性存储器(NVM)权重数组加速向量矩阵乘法(VMM)运算非常感兴趣。在基于NVM阵列的硬件中,减小的位宽度也显著提高了功耗效率。在本文中,我们重点研究了量化感知训练算法在LSTM模型中的应用,以及这些模型在抵御量化误差和模拟设备噪声方面带来的好处。我们已经证明,仅需要4位NVM权重和4位ADC/ dac就可以产生与浮点基准相当的LSTM网络性能。在我们基于nvm的量化LSTM网络中,合理水平的ADC量化噪声和权值噪声是可以自然容忍的。我们提出的用于推理的LSTM加速器的基准分析表明,与传统的数字方法(GPU, FPGA和ASIC)相比,LSTM加速器的计算效率至少提高2.4倍,面积效率提高40倍。其他一些基于NVM的新方法承诺提供更高的计算效率(高达×4.7),但需要更大的阵列,潜在的错误率更高。
{"title":"Non-Volatile Memory Array Based Quantization- and Noise-Resilient LSTM Neural Networks","authors":"Wen Ma, P. Chiu, Won Ho Choi, Minghai Qin, D. Bedau, Martin Lueker-Boden","doi":"10.1109/ICRC.2019.8914713","DOIUrl":"https://doi.org/10.1109/ICRC.2019.8914713","url":null,"abstract":"In cloud and edge computing models, it is important that compute devices at the edge be as power efficient as possible. Long short-term memory (LSTM) neural networks have been widely used for natural language processing, time series prediction and many other sequential data tasks. Thus, for these applications there is increasing need for low-power accelerators for LSTM model inference at the edge. In order to reduce power dissipation due to data transfers within inference devices, there has been significant interest in accelerating vector-matrix multiplication (VMM) operations using non-volatile memory (NVM) weight arrays. In NVM array-based hardware, reduced bit-widths also significantly increases the power efficiency. In this paper, we focus on the application of quantization-aware training algorithm to LSTM models, and the benefits these models bring in terms of resilience against both quantization error and analog device noise. We have shown that only 4-bit NVM weights and 4-bit ADC/DACs are needed to produce equivalent LSTM network performance as floating-point baseline. Reasonable levels of ADC quantization noise and weight noise can be naturally tolerated within our NVM-based quantized LSTM network. Benchmark analysis of our proposed LSTM accelerator for inference has shown at least 2.4× better computing efficiency and 40× higher area efficiency than traditional digital approaches (GPU, FPGA, and ASIC). Some other novel approaches based on NVM promise to deliver higher computing efficiency (up to ×4.7) but require larger arrays with potential higher error rates.","PeriodicalId":297574,"journal":{"name":"2019 IEEE International Conference on Rebooting Computing (ICRC)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116990145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
ICRC 2019 Sponsor 红十字国际委员会2019
Pub Date : 2019-11-01 DOI: 10.1109/icrc.2019.8914690
{"title":"ICRC 2019 Sponsor","authors":"","doi":"10.1109/icrc.2019.8914690","DOIUrl":"https://doi.org/10.1109/icrc.2019.8914690","url":null,"abstract":"","PeriodicalId":297574,"journal":{"name":"2019 IEEE International Conference on Rebooting Computing (ICRC)","volume":"109 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122854572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Future Computing Systems (FCS) to Support "Understanding" Capability 支持“理解”能力的未来计算系统(FCS)
Pub Date : 2019-11-01 DOI: 10.1109/ICRC.2019.8914712
R. Beausoleil, T. Vaerenbergh, Kirk M. Bresniker, Catherine E. Graves, Kimberly Keeton, Suhas Kumar, Can Li, D. Milojicic, S. Serebryakov, J. Strachan
The massive explosion in data acquisition, processing, and archiving, accelerated by the end of Moore's Law, creates a challenge and an opportunity for a complete redesign of technology, devices, hardware architecture, software stack and AI stack to enable future computing systems with "understanding" capability. We propose a Future Computing System (FCS) based on a memory driven computing AI architecture, that leverages different types of next generation accelerators (e.g., Ising and Hopfield Machines), connected over an intelligent successor of the Gen-Z interconnect. On top of this architecture we propose a software stack and subsequently, an AI stack built on top of the software stack. While intelligence characteristics (learning, training, self-awareness, etc.) permeate all layers, we also separate AI-specific components into a separate layer for clear design. There are two aspects of AI in FCSs: a) AI embedded in the system to make the system better: better performing, more robust, self-healing, maintainable, repairable, and energy efficient. b) AI as the level of reasoning over the information contained within the system: the supervised and unsupervised techniques finding relationships over the data placed into the system. Developing the software and AI stack will require adapting to each redundant component. At least initially, specialization will be required. For this reason, starting with an interoperable, memory driven computing architecture and associated interconnect is essential for subsequent generalization. Our architecture is composable, i.e., it could be pursued in: a) its entirety, b) per-layer c) per component inside of the layer (e.g., only one of the accelerators, use cases, etc.); or d) exploring specific characteristics across the layers.
摩尔定律的终结加速了数据采集、处理和存档的大规模爆炸,这为彻底重新设计技术、设备、硬件架构、软件堆栈和人工智能堆栈创造了挑战和机遇,从而使未来的计算系统具有“理解”能力。我们提出了一个基于内存驱动计算人工智能架构的未来计算系统(FCS),它利用不同类型的下一代加速器(例如,Ising和Hopfield Machines),通过Gen-Z互连的智能继任者连接。在这个架构之上,我们提出了一个软件堆栈,随后,在软件堆栈之上构建了一个人工智能堆栈。虽然智能特征(学习,训练,自我意识等)渗透到所有层,但我们也将ai特定组件分离到单独的层中以进行清晰的设计。fcs中的人工智能有两个方面:a)将人工智能嵌入系统以使系统更好:性能更好、更健壮、自愈、可维护、可修复和节能。b) AI作为对系统中包含的信息的推理水平:有监督和无监督的技术在系统中发现数据之间的关系。开发软件和人工智能堆栈将需要适应每个冗余组件。至少在开始阶段,专业化是必需的。出于这个原因,从一个可互操作的、内存驱动的计算体系结构和相关的互连开始,对于后续的推广是必不可少的。我们的架构是可组合的,也就是说,它可以在:a)整体,b)每层,c)层内的每个组件(例如,只有一个加速器,用例等)中进行;或者d)探索跨层的特定特征。
{"title":"Future Computing Systems (FCS) to Support \"Understanding\" Capability","authors":"R. Beausoleil, T. Vaerenbergh, Kirk M. Bresniker, Catherine E. Graves, Kimberly Keeton, Suhas Kumar, Can Li, D. Milojicic, S. Serebryakov, J. Strachan","doi":"10.1109/ICRC.2019.8914712","DOIUrl":"https://doi.org/10.1109/ICRC.2019.8914712","url":null,"abstract":"The massive explosion in data acquisition, processing, and archiving, accelerated by the end of Moore's Law, creates a challenge and an opportunity for a complete redesign of technology, devices, hardware architecture, software stack and AI stack to enable future computing systems with \"understanding\" capability. We propose a Future Computing System (FCS) based on a memory driven computing AI architecture, that leverages different types of next generation accelerators (e.g., Ising and Hopfield Machines), connected over an intelligent successor of the Gen-Z interconnect. On top of this architecture we propose a software stack and subsequently, an AI stack built on top of the software stack. While intelligence characteristics (learning, training, self-awareness, etc.) permeate all layers, we also separate AI-specific components into a separate layer for clear design. There are two aspects of AI in FCSs: a) AI embedded in the system to make the system better: better performing, more robust, self-healing, maintainable, repairable, and energy efficient. b) AI as the level of reasoning over the information contained within the system: the supervised and unsupervised techniques finding relationships over the data placed into the system. Developing the software and AI stack will require adapting to each redundant component. At least initially, specialization will be required. For this reason, starting with an interoperable, memory driven computing architecture and associated interconnect is essential for subsequent generalization. Our architecture is composable, i.e., it could be pursued in: a) its entirety, b) per-layer c) per component inside of the layer (e.g., only one of the accelerators, use cases, etc.); or d) exploring specific characteristics across the layers.","PeriodicalId":297574,"journal":{"name":"2019 IEEE International Conference on Rebooting Computing (ICRC)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127584222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Experimental Insights from the Rogues Gallery 来自Rogues画廊的实验见解
Pub Date : 2019-11-01 DOI: 10.1109/ICRC.2019.8914707
Jeffrey S. Young, E. J. Riedy, T. Conte, Vivek Sarkar, Prasanth Chatarasi, S. Srikanth
The Rogues Gallery is a new deployment for understanding next-generation hardware with a focus on unorthodox and uncommon technologies. This testbed project was initiated in 2017 in response to Rebooting Computing efforts and initiatives. The Gallery's focus is to acquire new and unique hardware (the rogues) from vendors, research labs, and start-ups and to make this hardware widely available to students, faculty, and industry collaborators within a managed data center environment. By exposing students and researchers to this set of unique hardware, we hope to foster cross-cutting discussions about hardware designs that will drive future performance improvements in computing long after the Moore's Law era of cheap transistors ends. We have defined an initial vision of the infrastructure and driving engineering challenges for such a testbed in a separate document, so here we present highlights of the first one to two years of post-Moore era research with the Rogues Gallery and give an indication of where we see future growth for this testbed and related efforts.
Rogues Gallery是理解下一代硬件的新部署,专注于非正统和不寻常的技术。该测试平台项目于2017年启动,以响应重新启动计算的工作和倡议。该画廊的重点是从供应商、研究实验室和初创企业那里获得新的、独特的硬件,并在托管数据中心环境中向学生、教师和行业合作者广泛提供这些硬件。通过让学生和研究人员接触这组独特的硬件,我们希望促进硬件设计的跨领域讨论,这将在摩尔定律时代的廉价晶体管结束后很久推动未来计算性能的提高。我们已经在一份单独的文件中定义了这样一个测试平台的基础设施和驱动工程挑战的初步愿景,因此我们在这里介绍了Rogues Gallery在后摩尔时代研究的前一到两年的亮点,并指出了我们对该测试平台和相关工作的未来发展方向。
{"title":"Experimental Insights from the Rogues Gallery","authors":"Jeffrey S. Young, E. J. Riedy, T. Conte, Vivek Sarkar, Prasanth Chatarasi, S. Srikanth","doi":"10.1109/ICRC.2019.8914707","DOIUrl":"https://doi.org/10.1109/ICRC.2019.8914707","url":null,"abstract":"The Rogues Gallery is a new deployment for understanding next-generation hardware with a focus on unorthodox and uncommon technologies. This testbed project was initiated in 2017 in response to Rebooting Computing efforts and initiatives. The Gallery's focus is to acquire new and unique hardware (the rogues) from vendors, research labs, and start-ups and to make this hardware widely available to students, faculty, and industry collaborators within a managed data center environment. By exposing students and researchers to this set of unique hardware, we hope to foster cross-cutting discussions about hardware designs that will drive future performance improvements in computing long after the Moore's Law era of cheap transistors ends. We have defined an initial vision of the infrastructure and driving engineering challenges for such a testbed in a separate document, so here we present highlights of the first one to two years of post-Moore era research with the Rogues Gallery and give an indication of where we see future growth for this testbed and related efforts.","PeriodicalId":297574,"journal":{"name":"2019 IEEE International Conference on Rebooting Computing (ICRC)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114914007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
ICRC 2019 Technical Program 红十字国际委员会2019年技术方案
Pub Date : 2019-11-01 DOI: 10.1109/icrc.2019.8914695
Sachin S. Bhat, Csaba Andras, Moritz, M. Inuiguchi, Hirosato, Seki, S. Serebryakov, D. Milojicic, Natalia, Vassilieva, S. Fleischman, D. Bedau, Craig Warner, C. Brueggen, Charles, Williams, Nathaniel Jansen, J. Strachan, Amit, Sharma
Session 1 Machine Learning Systems Reconfigurable Probabilistic AI Architecture for Personalized Cancer Treatment Sourabh Kulkarni, Sachin Bhat, Csaba Andras Moritz (University of Massachusetts Amherst) On a Learning Method of the SIC Fuzzy Inference Model with Consequent Fuzzy Sets Genki Ohashi, Masahiro Inuiguchi, Hirosato Seki (Osaka University) Deep Learning Cookbook: Recipes for your AI Infrastructure and Applications Sergey Serebryakov, Dejan Milojicic, Natalia Vassilieva, Stephen Fleischman, Robert Clark (Cerebras, Hewlett Packard Enterprise)
面向个性化癌症治疗的机器学习系统可重新配置概率AI架构Sourabh Kulkarni, Sachin Bhat, Csaba Andras Moritz(美国马萨诸塞大学阿默斯特分校)Sergey Serebryakov, Dejan Milojicic, Natalia Vassilieva, Stephen Fleischman, Robert Clark (Cerebras, Hewlett Packard Enterprise)
{"title":"ICRC 2019 Technical Program","authors":"Sachin S. Bhat, Csaba Andras, Moritz, M. Inuiguchi, Hirosato, Seki, S. Serebryakov, D. Milojicic, Natalia, Vassilieva, S. Fleischman, D. Bedau, Craig Warner, C. Brueggen, Charles, Williams, Nathaniel Jansen, J. Strachan, Amit, Sharma","doi":"10.1109/icrc.2019.8914695","DOIUrl":"https://doi.org/10.1109/icrc.2019.8914695","url":null,"abstract":"Session 1 Machine Learning Systems Reconfigurable Probabilistic AI Architecture for Personalized Cancer Treatment Sourabh Kulkarni, Sachin Bhat, Csaba Andras Moritz (University of Massachusetts Amherst) On a Learning Method of the SIC Fuzzy Inference Model with Consequent Fuzzy Sets Genki Ohashi, Masahiro Inuiguchi, Hirosato Seki (Osaka University) Deep Learning Cookbook: Recipes for your AI Infrastructure and Applications Sergey Serebryakov, Dejan Milojicic, Natalia Vassilieva, Stephen Fleischman, Robert Clark (Cerebras, Hewlett Packard Enterprise)","PeriodicalId":297574,"journal":{"name":"2019 IEEE International Conference on Rebooting Computing (ICRC)","volume":"110 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127097916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of a 16-Bit Adiabatic Microprocessor 16位绝热微处理器的设计
Pub Date : 2019-11-01 DOI: 10.1109/ICRC.2019.8914699
Rene Celis-Cordova, A. Orlov, Tian Lu, J. Kulick, G. Snider
Heat production is one of the main limiting factors in modern computing. In this paper, we explore adiabatic reversible logic which can dramatically reduce energy dissipation and is a viable implementation of future energy-efficient computing. We present a 16-bit adiabatic microprocessor with a multicycle MIPS architecture designed in 90nm technology. The adiabatic circuits are implemented using split-rail charge recovery logic, which allows the same circuit to be operated both in adiabatic mode and in standard CMOS mode. Simulations of a shift register show that energy dissipation can be much lower when operating in adiabatic mode compared to its CMOS counterpart. We present a standard cell library with all the necessary components to build adiabatic circuits and implement the subsystems of the microprocessor. The microprocessor has a proposed operating frequency of 0.5 GHz representing a useful implementation of adiabatic reversible computing.
热产生是现代计算的主要限制因素之一。在本文中,我们探索绝热可逆逻辑,它可以显著降低能量耗散,是未来节能计算的可行实现。我们提出了一种采用90nm技术设计的多周期MIPS架构的16位绝热微处理器。绝热电路采用分离轨电荷恢复逻辑实现,这使得同一电路可以在绝热模式和标准CMOS模式下工作。移位寄存器的仿真表明,与CMOS对应物相比,在绝热模式下工作时,能量耗散可以低得多。我们提出了一个标准单元库,其中包含了构建绝热电路和实现微处理器子系统所需的所有组件。该微处理器的建议工作频率为0.5 GHz,代表了绝热可逆计算的有用实现。
{"title":"Design of a 16-Bit Adiabatic Microprocessor","authors":"Rene Celis-Cordova, A. Orlov, Tian Lu, J. Kulick, G. Snider","doi":"10.1109/ICRC.2019.8914699","DOIUrl":"https://doi.org/10.1109/ICRC.2019.8914699","url":null,"abstract":"Heat production is one of the main limiting factors in modern computing. In this paper, we explore adiabatic reversible logic which can dramatically reduce energy dissipation and is a viable implementation of future energy-efficient computing. We present a 16-bit adiabatic microprocessor with a multicycle MIPS architecture designed in 90nm technology. The adiabatic circuits are implemented using split-rail charge recovery logic, which allows the same circuit to be operated both in adiabatic mode and in standard CMOS mode. Simulations of a shift register show that energy dissipation can be much lower when operating in adiabatic mode compared to its CMOS counterpart. We present a standard cell library with all the necessary components to build adiabatic circuits and implement the subsystems of the microprocessor. The microprocessor has a proposed operating frequency of 0.5 GHz representing a useful implementation of adiabatic reversible computing.","PeriodicalId":297574,"journal":{"name":"2019 IEEE International Conference on Rebooting Computing (ICRC)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123460287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
期刊
2019 IEEE International Conference on Rebooting Computing (ICRC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1