C. Bengel, Kaihua Zhang, J. Mohr, Tobias Ziegler, S. Wiefels, R. Waser, D. Wouters, S. Menzel
{"title":"Tailor-made synaptic dynamics based on memristive devices","authors":"C. Bengel, Kaihua Zhang, J. Mohr, Tobias Ziegler, S. Wiefels, R. Waser, D. Wouters, S. Menzel","doi":"10.3389/femat.2023.1061269","DOIUrl":null,"url":null,"abstract":"The proliferation of machine learning algorithms in everyday applications such as image recognition or language translation has increased the pressure to adapt underlying computing architectures towards these algorithms. Application specific integrated circuits (ASICs) such as the Tensor Processing Units by Google, Hanguang by Alibaba or Inferentia by Amazon Web Services were designed specifically for machine learning algorithms and have been able to outperform CPU based solutions by great margins during training and inference. As newer generations of chips allow handling of and computation on more and more data, the size of neural networks has dramatically increased, while the challenges they are trying to solve have become more complex. Neuromorphic computing tries to take inspiration from biological information processing systems, aiming to further improve the efficiency with which these networks can be trained or the inference can be performed. Enhancing neuromorphic computing architectures with memristive devices as non-volatile storage elements could potentially allow for even higher energy efficiencies. Their ability to mimic synaptic plasticity dynamics brings neuromorphic architectures closer to the biological role models. So far, memristive devices are mainly investigated for the emulation of the weights of neural networks during training and inference as their non-volatility would enable both processes in the same location without data transfer. In this paper, we explore realisations of different synapses build from memristive ReRAM devices, based on the Valence Change Mechanism. These synapses are the 1R synapse, the NR synapse and the 1T1R synapse. For the 1R synapse, we propose three dynamical regimes and explore their performance through different synapse criteria. For the NR synapse, we discuss how the same dynamical regimes can be addressed in a more reliable way. We also show experimental results measured on ZrOx devices to support our simulation based claims. For the 1T1R synapse, we explore the trade offs between the connection direction of the ReRAM device and the transistor. For all three synapse concepts we discuss the impact of device-to-device and cycle-to-cycle variability. Additionally, the impact of the stimulation mode on the observed behavior is discussed.","PeriodicalId":119676,"journal":{"name":"Frontiers in Electronic Materials","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Electronic Materials","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/femat.2023.1061269","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The proliferation of machine learning algorithms in everyday applications such as image recognition or language translation has increased the pressure to adapt underlying computing architectures towards these algorithms. Application specific integrated circuits (ASICs) such as the Tensor Processing Units by Google, Hanguang by Alibaba or Inferentia by Amazon Web Services were designed specifically for machine learning algorithms and have been able to outperform CPU based solutions by great margins during training and inference. As newer generations of chips allow handling of and computation on more and more data, the size of neural networks has dramatically increased, while the challenges they are trying to solve have become more complex. Neuromorphic computing tries to take inspiration from biological information processing systems, aiming to further improve the efficiency with which these networks can be trained or the inference can be performed. Enhancing neuromorphic computing architectures with memristive devices as non-volatile storage elements could potentially allow for even higher energy efficiencies. Their ability to mimic synaptic plasticity dynamics brings neuromorphic architectures closer to the biological role models. So far, memristive devices are mainly investigated for the emulation of the weights of neural networks during training and inference as their non-volatility would enable both processes in the same location without data transfer. In this paper, we explore realisations of different synapses build from memristive ReRAM devices, based on the Valence Change Mechanism. These synapses are the 1R synapse, the NR synapse and the 1T1R synapse. For the 1R synapse, we propose three dynamical regimes and explore their performance through different synapse criteria. For the NR synapse, we discuss how the same dynamical regimes can be addressed in a more reliable way. We also show experimental results measured on ZrOx devices to support our simulation based claims. For the 1T1R synapse, we explore the trade offs between the connection direction of the ReRAM device and the transistor. For all three synapse concepts we discuss the impact of device-to-device and cycle-to-cycle variability. Additionally, the impact of the stimulation mode on the observed behavior is discussed.
机器学习算法在日常应用(如图像识别或语言翻译)中的激增,增加了使底层计算架构适应这些算法的压力。专用集成电路(asic),如谷歌的张量处理单元(Tensor Processing Units)、阿里巴巴的汉光(hanang)或亚马逊网络服务(Amazon Web Services)的interentia,都是专门为机器学习算法设计的,在训练和推理过程中,它们的性能大大超过了基于CPU的解决方案。随着新一代芯片允许处理和计算越来越多的数据,神经网络的规模急剧增加,而它们试图解决的挑战也变得更加复杂。神经形态计算试图从生物信息处理系统中获得灵感,旨在进一步提高这些网络的训练效率或执行推理的效率。使用忆阻器件作为非易失性存储元件来增强神经形态计算架构可能会带来更高的能源效率。它们模仿突触可塑性动态的能力使神经形态结构更接近生物学的角色模型。到目前为止,记忆装置主要用于神经网络在训练和推理过程中的权重仿真,因为记忆装置的非易失性可以使两个过程在同一位置进行,而无需传输数据。在本文中,我们探索了基于价变机制的记忆性ReRAM器件构建不同突触的实现。这些突触是1R突触,NR突触和1T1R突触。对于1R突触,我们提出了三种动态机制,并通过不同的突触标准探讨了它们的性能。对于NR突触,我们讨论了如何以更可靠的方式处理相同的动态机制。我们还展示了在ZrOx设备上测量的实验结果,以支持我们基于仿真的主张。对于1T1R突触,我们探索了ReRAM器件和晶体管连接方向之间的权衡。对于所有三个突触概念,我们将讨论设备对设备和周期对周期可变性的影响。此外,还讨论了刺激模式对观察行为的影响。