首页 > 最新文献

npj Unconventional Computing最新文献

英文 中文
Thermodynamic linear algebra 热力学线性代数
Pub Date : 2024-11-05 DOI: 10.1038/s44335-024-00014-0
Maxwell Aifer, Kaelan Donatella, Max Hunter Gordon, Samuel Duffield, Thomas Ahle, Daniel Simpson, Gavin Crooks, Patrick J. Coles
Linear algebra is central to many algorithms in engineering, science, and machine learning; hence, accelerating it would have tremendous economic impact. Quantum computing has been proposed for this purpose, although the resource requirements are far beyond current technological capabilities. We consider an alternative physics-based computing paradigm based on classical thermodynamics, to provide a near-term approach to accelerating linear algebra. At first sight, thermodynamics and linear algebra seem to be unrelated fields. Here, we connect solving linear algebra problems to sampling from the thermodynamic equilibrium distribution of a system of coupled harmonic oscillators. We present simple thermodynamic algorithms for solving linear systems of equations, computing matrix inverses, and computing matrix determinants. Under reasonable assumptions, we rigorously establish asymptotic speedups for our algorithms, relative to digital methods, that scale linearly in matrix dimension. Our algorithms exploit thermodynamic principles like ergodicity, entropy, and equilibration, highlighting the deep connection between these two seemingly distinct fields, and opening up algebraic applications for thermodynamic computers.
线性代数是工程、科学和机器学习中许多算法的核心;因此,加速线性代数将产生巨大的经济影响。为此,有人提出了量子计算,但其资源需求远远超出了目前的技术能力。我们考虑基于经典热力学的另一种物理计算范式,为加速线性代数提供一种近期方法。乍一看,热力学和线性代数似乎是互不相关的领域。在这里,我们将解决线性代数问题与从耦合谐振子系统的热力学平衡分布中采样联系起来。我们介绍了求解线性方程组、计算矩阵倒数和计算矩阵行列式的简单热力学算法。在合理的假设条件下,我们严格地确定了我们算法相对于数字方法的渐进提速,这种提速在矩阵维度上呈线性扩展。我们的算法利用了热力学原理,如遍历性、熵和平衡,突出了这两个看似不同的领域之间的深刻联系,并为热力学计算机开辟了代数应用领域。
{"title":"Thermodynamic linear algebra","authors":"Maxwell Aifer, Kaelan Donatella, Max Hunter Gordon, Samuel Duffield, Thomas Ahle, Daniel Simpson, Gavin Crooks, Patrick J. Coles","doi":"10.1038/s44335-024-00014-0","DOIUrl":"10.1038/s44335-024-00014-0","url":null,"abstract":"Linear algebra is central to many algorithms in engineering, science, and machine learning; hence, accelerating it would have tremendous economic impact. Quantum computing has been proposed for this purpose, although the resource requirements are far beyond current technological capabilities. We consider an alternative physics-based computing paradigm based on classical thermodynamics, to provide a near-term approach to accelerating linear algebra. At first sight, thermodynamics and linear algebra seem to be unrelated fields. Here, we connect solving linear algebra problems to sampling from the thermodynamic equilibrium distribution of a system of coupled harmonic oscillators. We present simple thermodynamic algorithms for solving linear systems of equations, computing matrix inverses, and computing matrix determinants. Under reasonable assumptions, we rigorously establish asymptotic speedups for our algorithms, relative to digital methods, that scale linearly in matrix dimension. Our algorithms exploit thermodynamic principles like ergodicity, entropy, and equilibration, highlighting the deep connection between these two seemingly distinct fields, and opening up algebraic applications for thermodynamic computers.","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-11"},"PeriodicalIF":0.0,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s44335-024-00014-0.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142579797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient generation of grids and traversal graphs in compositional spaces towards exploration and path planning 在组合空间中高效生成网格和遍历图,实现探索和路径规划
Pub Date : 2024-11-05 DOI: 10.1038/s44335-024-00012-2
Adam M. Krajewski, Allison M. Beese, Wesley F. Reinhart, Zi-Kui Liu
Diverse disciplines across science and engineering deal with problems related to compositions, which exist in non-Euclidean simplex spaces, rendering many standard tools inaccurate or inefficient. This work explores such spaces conceptually in the context of materials discovery, quantifies their computational feasibility, and implements several essential methods specific to simplex spaces through a new high-performance open-source library nimplex. Most significantly, we derive and implement an algorithm for constructing a novel n-dimensional simplex graph data structure, containing all discretized compositions and possible neighbor-to-neighbor transitions. Critically, no distance or neighborhood calculations are performed, instead leveraging pure combinatorics and order in procedurally generated simplex grids, keeping the algorithm $${mathcal{O}}(N)$$ , with minimal memory, enabling rapid construction of graphs with billions of transitions in seconds. Additionally, we demonstrate how such graph representations can be combined to homogeneously express complex path-planning problems, while facilitating efficient deployment of existing high-performance gradient descent, graph traversal, and other optimization algorithms.
科学和工程领域的多个学科都在处理与组合相关的问题,这些问题存在于非欧几里得单纯形空间中,导致许多标准工具不准确或效率低下。这项研究从概念上探索了材料发现背景下的这类空间,量化了其计算可行性,并通过一个新的高性能开源库 nimplex 实现了几种针对单纯形空间的基本方法。最重要的是,我们推导并实现了一种构建新颖的 n 维单纯形图数据结构的算法,其中包含所有离散化组合和可能的邻域到邻域转换。重要的是,该算法不进行距离或邻域计算,而是利用纯粹的组合学和程序化生成的单纯形网格中的秩序,保持算法 $${mathcal{O}}(N)$$ 的最小内存,从而能够在数秒内快速构建具有数十亿次转换的图形。此外,我们还展示了如何将这种图表示法结合起来,同质地表达复杂的路径规划问题,同时促进现有高性能梯度下降、图遍历和其他优化算法的高效部署。
{"title":"Efficient generation of grids and traversal graphs in compositional spaces towards exploration and path planning","authors":"Adam M. Krajewski, Allison M. Beese, Wesley F. Reinhart, Zi-Kui Liu","doi":"10.1038/s44335-024-00012-2","DOIUrl":"10.1038/s44335-024-00012-2","url":null,"abstract":"Diverse disciplines across science and engineering deal with problems related to compositions, which exist in non-Euclidean simplex spaces, rendering many standard tools inaccurate or inefficient. This work explores such spaces conceptually in the context of materials discovery, quantifies their computational feasibility, and implements several essential methods specific to simplex spaces through a new high-performance open-source library nimplex. Most significantly, we derive and implement an algorithm for constructing a novel n-dimensional simplex graph data structure, containing all discretized compositions and possible neighbor-to-neighbor transitions. Critically, no distance or neighborhood calculations are performed, instead leveraging pure combinatorics and order in procedurally generated simplex grids, keeping the algorithm $${mathcal{O}}(N)$$ , with minimal memory, enabling rapid construction of graphs with billions of transitions in seconds. Additionally, we demonstrate how such graph representations can be combined to homogeneously express complex path-planning problems, while facilitating efficient deployment of existing high-performance gradient descent, graph traversal, and other optimization algorithms.","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-12"},"PeriodicalIF":0.0,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s44335-024-00012-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142579804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Demonstration of 4-quadrant analog in-memory matrix multiplication in a single modulation 单调制 4 象限模拟内存矩阵乘法演示
Pub Date : 2024-10-03 DOI: 10.1038/s44335-024-00010-4
Manuel Le Gallo, Oscar Hrynkevych, Benedikt Kersting, Geethan Karunaratne, Athanasios Vasilopoulos, Riduan Khaddam-Aljameh, Ghazi Sarwat Syed, Abu Sebastian
Analog in-memory computing (AIMC) leverages the inherent physical characteristics of resistive memory devices to execute computational operations, notably matrix-vector multiplications (MVMs). However, executing MVMs using a single-phase reading scheme to reduce latency necessitates the simultaneous application of both positive and negative voltages across resistive memory devices. This degrades the accuracy of the computation due to the dependence of the device conductance on the voltage polarity. Here, we demonstrate the realization of a 4-quadrant MVM in a single modulation by developing analog and digital calibration procedures to mitigate the conductance polarity dependence, fully implemented on a multi-core AIMC chip based on phase-change memory. With this approach, we experimentally demonstrate accurate neural network inference and similarity search tasks using one or multiple cores of the chip, at 4 times higher MVM throughput and energy efficiency than the conventional four-phase reading scheme.
模拟内存计算(AIMC)利用电阻式内存设备的固有物理特性执行计算操作,特别是矩阵向量乘法(MVM)。然而,要使用单相读取方案执行 MVM 以减少延迟,就必须在电阻式存储器件上同时施加正负电压。由于器件电导与电压极性有关,这就降低了计算的准确性。在这里,我们通过开发模拟和数字校准程序来减轻电导极性依赖性,并在基于相变存储器的多核 AIMC 芯片上全面实施,从而展示了在单调制中实现 4 象限 MVM 的方法。利用这种方法,我们在实验中演示了使用芯片的一个或多个内核进行精确的神经网络推理和相似性搜索任务,其 MVM 吞吐量和能效比传统的四相读取方案高出 4 倍。
{"title":"Demonstration of 4-quadrant analog in-memory matrix multiplication in a single modulation","authors":"Manuel Le Gallo, Oscar Hrynkevych, Benedikt Kersting, Geethan Karunaratne, Athanasios Vasilopoulos, Riduan Khaddam-Aljameh, Ghazi Sarwat Syed, Abu Sebastian","doi":"10.1038/s44335-024-00010-4","DOIUrl":"10.1038/s44335-024-00010-4","url":null,"abstract":"Analog in-memory computing (AIMC) leverages the inherent physical characteristics of resistive memory devices to execute computational operations, notably matrix-vector multiplications (MVMs). However, executing MVMs using a single-phase reading scheme to reduce latency necessitates the simultaneous application of both positive and negative voltages across resistive memory devices. This degrades the accuracy of the computation due to the dependence of the device conductance on the voltage polarity. Here, we demonstrate the realization of a 4-quadrant MVM in a single modulation by developing analog and digital calibration procedures to mitigate the conductance polarity dependence, fully implemented on a multi-core AIMC chip based on phase-change memory. With this approach, we experimentally demonstrate accurate neural network inference and similarity search tasks using one or multiple cores of the chip, at 4 times higher MVM throughput and energy efficiency than the conventional four-phase reading scheme.","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s44335-024-00010-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142368745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
In-memory search with learning to hash based on resistive memory for recommendation acceleration 基于电阻式内存的学习散列内存搜索,为推荐加速
Pub Date : 2024-10-01 DOI: 10.1038/s44335-024-00009-x
Fei Wang, Woyu Zhang, Zhi Li, Ning Lin, Rui Bao, Xiaoxin Xu, Chunmeng Dou, Zhongrui Wang, Dashan Shang
Similarity search is essential in current artificial intelligence applications and widely utilized in various fields, such as recommender systems. However, the exponential growth of data poses significant challenges in search time and energy consumption on traditional digital hardware. Here, we propose a software-hardware co-optimization to address these challenges. On the software side, we employ a learning-to-hash method for vector encoding and achieve an approximate nearest neighbor search by calculating Hamming distance, thereby reducing computational complexity. On the hardware side, we leverage the resistance random-access memory crossbar array to implement the hash encoding process and the content-addressable memory with an in-memory computing paradigm to lower the energy consumption during searches. Simulations on the MovieLens dataset demonstrate that the implementation achieves comparable accuracy to software and reduces energy consumption by 30-fold compared to traditional digital systems. These results provide insight into the development of energy-efficient in-memory search systems for edge computing.
相似性搜索在当前的人工智能应用中至关重要,并广泛应用于推荐系统等多个领域。然而,数据的指数级增长给传统数字硬件的搜索时间和能耗带来了巨大挑战。在此,我们提出了一种软硬件协同优化的方法来应对这些挑战。在软件方面,我们采用学习到哈希(learning-to-hash)方法进行向量编码,并通过计算汉明距离(Hamming distance)实现近似近邻搜索,从而降低计算复杂度。在硬件方面,我们利用电阻式随机存取存储器横条阵列来实现哈希编码过程,并利用内容可寻址存储器的内存计算模式来降低搜索过程中的能耗。在 MovieLens 数据集上进行的仿真表明,与传统数字系统相比,该实现方法达到了与软件相当的精度,并将能耗降低了 30 倍。这些结果为开发用于边缘计算的高能效内存搜索系统提供了启示。
{"title":"In-memory search with learning to hash based on resistive memory for recommendation acceleration","authors":"Fei Wang, Woyu Zhang, Zhi Li, Ning Lin, Rui Bao, Xiaoxin Xu, Chunmeng Dou, Zhongrui Wang, Dashan Shang","doi":"10.1038/s44335-024-00009-x","DOIUrl":"10.1038/s44335-024-00009-x","url":null,"abstract":"Similarity search is essential in current artificial intelligence applications and widely utilized in various fields, such as recommender systems. However, the exponential growth of data poses significant challenges in search time and energy consumption on traditional digital hardware. Here, we propose a software-hardware co-optimization to address these challenges. On the software side, we employ a learning-to-hash method for vector encoding and achieve an approximate nearest neighbor search by calculating Hamming distance, thereby reducing computational complexity. On the hardware side, we leverage the resistance random-access memory crossbar array to implement the hash encoding process and the content-addressable memory with an in-memory computing paradigm to lower the energy consumption during searches. Simulations on the MovieLens dataset demonstrate that the implementation achieves comparable accuracy to software and reduces energy consumption by 30-fold compared to traditional digital systems. These results provide insight into the development of energy-efficient in-memory search systems for edge computing.","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-9"},"PeriodicalIF":0.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s44335-024-00009-x.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142360088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A perfect storm and a new dawn for unconventional computing technologies 非传统计算技术的完美风暴和新曙光
Pub Date : 2024-09-12 DOI: 10.1038/s44335-024-00011-3
Wei D. Lu, Christof Teuscher, Stephen A. Sarles, Yuchao Yang, Aida Todri-Sanial, Xiao-Bo Zhu
{"title":"A perfect storm and a new dawn for unconventional computing technologies","authors":"Wei D. Lu, Christof Teuscher, Stephen A. Sarles, Yuchao Yang, Aida Todri-Sanial, Xiao-Bo Zhu","doi":"10.1038/s44335-024-00011-3","DOIUrl":"10.1038/s44335-024-00011-3","url":null,"abstract":"","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-2"},"PeriodicalIF":0.0,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s44335-024-00011-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142170398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
28 nm FDSOI embedded PCM exhibiting near zero drift at 12 K for cryogenic SNNs 用于低温 SNN 的 28 纳米 FDSOI 嵌入式 PCM 在 12 K 时漂移接近于零
Pub Date : 2024-09-02 DOI: 10.1038/s44335-024-00008-y
Joao Henrique Quintino Palhares, Nikhil Garg, Pierre-Antoine Mouny, Yann Beilliard, J. Sandrini, F. Arnaud, Lorena Anghel, Fabien Alibart, Dominique Drouin, Philippe Galy
Seeking to circumvent conventional computing bottlenecks, hardware alternatives, from brain-inspired designs to cryogenic quantum systems, necessitate integrating emerging non-volatile memories. Yet, the immaturity and unreliability of cryogenic-compatible memories hinder scalable computing advancements. This study characterizes 28 nm FD-SOI substrate-embedded Ge-rich Ge2Sb2Te5 phase change memories (ePCMs) down to 12 K to overcome these hurdles. It reveals that ePCMs is cryogenic compatible and can encode multiple resistance states with minimal drift, essential for advanced computing solutions. Through simulations, the ePCM’s impact on a spiking neural network (SNN) performing MNIST classification is evaluated. The SNN maintains high accuracy for extended periods of 2 years at cryogenic temperatures, while an accuracy drop of 10.8% is observed at room temperature. These results highlight the potential of multilevel ePCMs in brain-inspired cryogenic computing applications, offering a promising avenue for the evolution of unconventional computing systems.
为了规避传统计算的瓶颈,从大脑启发设计到低温量子系统等硬件替代方案都需要集成新兴的非易失性存储器。然而,低温兼容存储器的不成熟和不可靠阻碍了可扩展计算的发展。本研究对 28 纳米 FD-SOI 衬底嵌入富 Ge Ge2Sb2Te5 相变存储器(ePCMs)进行了表征,其温度可低至 12 K,从而克服了这些障碍。研究表明,ePCMs 与低温兼容,能以最小漂移编码多种电阻状态,这对先进计算解决方案至关重要。通过模拟,评估了 ePCM 对执行 MNIST 分类的尖峰神经网络 (SNN) 的影响。在低温条件下,SNN 可在长达 2 年的时间内保持较高的准确度,而在室温条件下,准确度下降了 10.8%。这些结果凸显了多级 ePCM 在大脑启发的低温计算应用中的潜力,为非常规计算系统的发展提供了一条前景广阔的途径。
{"title":"28 nm FDSOI embedded PCM exhibiting near zero drift at 12 K for cryogenic SNNs","authors":"Joao Henrique Quintino Palhares, Nikhil Garg, Pierre-Antoine Mouny, Yann Beilliard, J. Sandrini, F. Arnaud, Lorena Anghel, Fabien Alibart, Dominique Drouin, Philippe Galy","doi":"10.1038/s44335-024-00008-y","DOIUrl":"10.1038/s44335-024-00008-y","url":null,"abstract":"Seeking to circumvent conventional computing bottlenecks, hardware alternatives, from brain-inspired designs to cryogenic quantum systems, necessitate integrating emerging non-volatile memories. Yet, the immaturity and unreliability of cryogenic-compatible memories hinder scalable computing advancements. This study characterizes 28 nm FD-SOI substrate-embedded Ge-rich Ge2Sb2Te5 phase change memories (ePCMs) down to 12 K to overcome these hurdles. It reveals that ePCMs is cryogenic compatible and can encode multiple resistance states with minimal drift, essential for advanced computing solutions. Through simulations, the ePCM’s impact on a spiking neural network (SNN) performing MNIST classification is evaluated. The SNN maintains high accuracy for extended periods of 2 years at cryogenic temperatures, while an accuracy drop of 10.8% is observed at room temperature. These results highlight the potential of multilevel ePCMs in brain-inspired cryogenic computing applications, offering a promising avenue for the evolution of unconventional computing systems.","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-9"},"PeriodicalIF":0.0,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s44335-024-00008-y.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accurate compact nonlinear dynamical model for a volatile ferroelectric ZrO2 capacitor 挥发性铁电 ZrO2 电容器的精确紧凑非线性动力学模型
Pub Date : 2024-09-02 DOI: 10.1038/s44335-024-00007-z
Shiva Asapu, Taehwan Moon, Krishnamurthy Mahalingam, Kurt G. Eyink, James Nicolas Pagaduan, Ruoyu Zhao, Sabyasachi Ganguli, Reika Katsumata, Qiangfei Xia, R. Stanley Williams, J. Joshua Yang
We have measured the dynamical response of ZrO2 capacitors to applied triangular voltage waveforms with varying frequencies and amplitudes to determine the voltage and charge on the devices as a function of time. We have fit our experimental results to a Landau–Khalatnikov dynamical equation with a sixth order Landau–Ginzburg–Devonshire polynomial to represent the static charge-voltage behavior, and obtained coefficients of determination R2 > 0.99 for the fits. Analysis of the resulting quantitative model reveals an extremely small range of negative differential capacitance <16 mV. The hysteresis loops in the dynamical charge-voltage curves are found to result primarily from energy loss during the ferroelectric transitions, as represented by a frequency-dependent series resistance in the model.
我们测量了 ZrO2 电容器对不同频率和振幅的三角电压波形的动态响应,以确定器件上的电压和电荷随时间的变化。我们将实验结果与六阶 Landau-Ginzburg-Devonshire 多项式的 Landau-Khalatnikov 动力学方程进行拟合,以表示静态电荷-电压行为,并获得了拟合确定系数 R2 > 0.99。对所得定量模型的分析表明,负微分电容的范围极小,仅为 16 mV。研究发现,动态电荷-电压曲线中的滞后环主要是铁电转换过程中的能量损失造成的,在模型中表现为与频率相关的串联电阻。
{"title":"Accurate compact nonlinear dynamical model for a volatile ferroelectric ZrO2 capacitor","authors":"Shiva Asapu,&nbsp;Taehwan Moon,&nbsp;Krishnamurthy Mahalingam,&nbsp;Kurt G. Eyink,&nbsp;James Nicolas Pagaduan,&nbsp;Ruoyu Zhao,&nbsp;Sabyasachi Ganguli,&nbsp;Reika Katsumata,&nbsp;Qiangfei Xia,&nbsp;R. Stanley Williams,&nbsp;J. Joshua Yang","doi":"10.1038/s44335-024-00007-z","DOIUrl":"10.1038/s44335-024-00007-z","url":null,"abstract":"We have measured the dynamical response of ZrO2 capacitors to applied triangular voltage waveforms with varying frequencies and amplitudes to determine the voltage and charge on the devices as a function of time. We have fit our experimental results to a Landau–Khalatnikov dynamical equation with a sixth order Landau–Ginzburg–Devonshire polynomial to represent the static charge-voltage behavior, and obtained coefficients of determination R2 &gt; 0.99 for the fits. Analysis of the resulting quantitative model reveals an extremely small range of negative differential capacitance &lt;16 mV. The hysteresis loops in the dynamical charge-voltage curves are found to result primarily from energy loss during the ferroelectric transitions, as represented by a frequency-dependent series resistance in the model.","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-7"},"PeriodicalIF":0.0,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s44335-024-00007-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Random memristor-based dynamic graph CNN for efficient point cloud learning at the edge 基于随机忆阻器的动态图 CNN,实现边缘点云的高效学习
Pub Date : 2024-08-21 DOI: 10.1038/s44335-024-00006-0
Yifei Yu, Shaocong Wang, Meng Xu, Woyu Zhang, Bo Wang, Jichang Yang, Songqi Wang, Yue Zhang, Xiaoshan Wu, Hegan Chen, Dingchen Wang, Xi Chen, Ning Lin, Xiaojuan Qi, Dashan Shang, Zhongrui Wang
The broad integration of 3D sensors into devices like smartphones and AR/VR headsets has led to a surge in 3D data, with point clouds becoming a mainstream representation method. Efficient real-time learning of point cloud data on edge devices is crucial for applications such as autonomous vehicles and embodied AI. Traditional machine learning models on digital processors face limitations, with software challenges like high training complexity, and hardware challenges such as large time and energy overheads due to von Neumann bottleneck. To address this, we propose a software-hardware co-designed random memristor-based dynamic graph CNN (RDGCNN). Software-wise, we transform point cloud into graph, and propose random EdgeConv for efficient hierarchical and topological features extraction. Hardware-wise, leveraging memristor’s intrinsic stochasticity and in-memory computing capability, we achieve significant reductions in training complexity and energy consumption. RDGCNN demonstrates high accuracy and efficiency across various point cloud tasks, paving the way for future edge 3D vision.
三维传感器广泛集成到智能手机和 AR/VR 头显等设备中,导致三维数据激增,点云成为主流表示方法。在边缘设备上对点云数据进行高效的实时学习对于自动驾驶汽车和嵌入式人工智能等应用至关重要。数字处理器上的传统机器学习模型面临诸多限制,软件方面的挑战包括训练复杂度高,硬件方面的挑战包括冯-诺依曼瓶颈导致的大量时间和能源开销。为此,我们提出了一种软硬件协同设计的基于随机忆阻器的动态图 CNN(RDGCN)。在软件方面,我们将点云转换为图,并提出了随机 EdgeConv 以实现高效的层次和拓扑特征提取。在硬件方面,我们利用忆阻器固有的随机性和内存计算能力,显著降低了训练复杂度和能耗。RDGCNN 在各种点云任务中都表现出了高精度和高效率,为未来的边缘 3D 视觉铺平了道路。
{"title":"Random memristor-based dynamic graph CNN for efficient point cloud learning at the edge","authors":"Yifei Yu,&nbsp;Shaocong Wang,&nbsp;Meng Xu,&nbsp;Woyu Zhang,&nbsp;Bo Wang,&nbsp;Jichang Yang,&nbsp;Songqi Wang,&nbsp;Yue Zhang,&nbsp;Xiaoshan Wu,&nbsp;Hegan Chen,&nbsp;Dingchen Wang,&nbsp;Xi Chen,&nbsp;Ning Lin,&nbsp;Xiaojuan Qi,&nbsp;Dashan Shang,&nbsp;Zhongrui Wang","doi":"10.1038/s44335-024-00006-0","DOIUrl":"10.1038/s44335-024-00006-0","url":null,"abstract":"The broad integration of 3D sensors into devices like smartphones and AR/VR headsets has led to a surge in 3D data, with point clouds becoming a mainstream representation method. Efficient real-time learning of point cloud data on edge devices is crucial for applications such as autonomous vehicles and embodied AI. Traditional machine learning models on digital processors face limitations, with software challenges like high training complexity, and hardware challenges such as large time and energy overheads due to von Neumann bottleneck. To address this, we propose a software-hardware co-designed random memristor-based dynamic graph CNN (RDGCNN). Software-wise, we transform point cloud into graph, and propose random EdgeConv for efficient hierarchical and topological features extraction. Hardware-wise, leveraging memristor’s intrinsic stochasticity and in-memory computing capability, we achieve significant reductions in training complexity and energy consumption. RDGCNN demonstrates high accuracy and efficiency across various point cloud tasks, paving the way for future edge 3D vision.","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-9"},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s44335-024-00006-0.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142013667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Solving Boltzmann optimization problems with deep learning 用深度学习解决波尔兹曼优化问题
Pub Date : 2024-08-05 DOI: 10.1038/s44335-024-00005-1
Fiona Knoll, John Daly, Jess Meyer
Decades of exponential scaling in high-performance computing (HPC) efficiency is coming to an end. Transistor-based logic in complementary metal-oxide semiconductor (CMOS) technology is approaching physical limits beyond which further miniaturization will be impossible. Future HPC efficiency gains will necessarily rely on new technologies and paradigms of computing. The Ising model shows particular promise as a future framework for highly energy-efficient computation. Ising systems are able to operate at energies approaching thermodynamic limits for energy consumption of computation. Ising systems can function as both logic and memory. Thus, they have the potential to significantly reduce energy costs inherent to CMOS computing by eliminating costly data movement. The challenge in creating Ising-based hardware is in optimizing useful circuits that produce correct results on fundamentally nondeterministic hardware. The contribution of this paper is a novel machine learning approach, a combination of deep neural networks and random forests, for efficiently solving optimization problems that minimize sources of error in the Ising model. In addition, we provide a process to express a Boltzmann probability optimization problem as a supervised machine learning problem.
数十年来,高性能计算(HPC)效率的指数级增长即将结束。互补金属氧化物半导体(CMOS)技术中以晶体管为基础的逻辑正在接近物理极限,超过这一极限将无法进一步小型化。未来 HPC 效率的提高将必然依赖于新的计算技术和模式。作为未来的高能效计算框架,伊辛模型显示出特别的前景。伊辛系统能够在接近热力学能耗极限的能量下运行。伊辛系统既可作为逻辑系统,也可作为存储器。因此,通过消除昂贵的数据移动,它们有可能显著降低 CMOS 计算所固有的能源成本。创建基于 Ising 系统的硬件所面临的挑战在于如何优化有用的电路,使其在基本非确定性硬件上产生正确的结果。本文的贡献在于结合了深度神经网络和随机森林的新型机器学习方法,可高效解决优化问题,最大限度地减少伊辛模型中的误差源。此外,我们还提供了一种将玻尔兹曼概率优化问题表达为有监督机器学习问题的过程。
{"title":"Solving Boltzmann optimization problems with deep learning","authors":"Fiona Knoll,&nbsp;John Daly,&nbsp;Jess Meyer","doi":"10.1038/s44335-024-00005-1","DOIUrl":"10.1038/s44335-024-00005-1","url":null,"abstract":"Decades of exponential scaling in high-performance computing (HPC) efficiency is coming to an end. Transistor-based logic in complementary metal-oxide semiconductor (CMOS) technology is approaching physical limits beyond which further miniaturization will be impossible. Future HPC efficiency gains will necessarily rely on new technologies and paradigms of computing. The Ising model shows particular promise as a future framework for highly energy-efficient computation. Ising systems are able to operate at energies approaching thermodynamic limits for energy consumption of computation. Ising systems can function as both logic and memory. Thus, they have the potential to significantly reduce energy costs inherent to CMOS computing by eliminating costly data movement. The challenge in creating Ising-based hardware is in optimizing useful circuits that produce correct results on fundamentally nondeterministic hardware. The contribution of this paper is a novel machine learning approach, a combination of deep neural networks and random forests, for efficiently solving optimization problems that minimize sources of error in the Ising model. In addition, we provide a process to express a Boltzmann probability optimization problem as a supervised machine learning problem.","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-7"},"PeriodicalIF":0.0,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s44335-024-00005-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141968493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Experimental demonstration of magnetic tunnel junction-based computational random-access memory 基于磁隧道结的计算随机存取存储器的实验演示。
Pub Date : 2024-07-25 DOI: 10.1038/s44335-024-00003-3
Yang Lv, Brandon R. Zink, Robert P. Bloom, Hüsrev Cılasun, Pravin Khanal, Salonik Resch, Zamshed Chowdhury, Ali Habiboglu, Weigang Wang, Sachin S. Sapatnekar, Ulya Karpuzcu, Jian-Ping Wang
The conventional computing paradigm struggles to fulfill the rapidly growing demands from emerging applications, especially those for machine intelligence because much of the power and energy is consumed by constant data transfers between logic and memory modules. A new paradigm, called “computational random-access memory (CRAM),” has emerged to address this fundamental limitation. CRAM performs logic operations directly using the memory cells themselves, without having the data ever leave the memory. The energy and performance benefits of CRAM for both conventional and emerging applications have been well established by prior numerical studies. However, there is a lack of experimental demonstration and study of CRAM to evaluate its computational accuracy, which is a realistic and application-critical metric for its technological feasibility and competitiveness. In this work, a CRAM array based on magnetic tunnel junctions (MTJs) is experimentally demonstrated. First, basic memory operations, as well as 2-, 3-, and 5-input logic operations, are studied. Then, a 1-bit full adder with two different designs is demonstrated. Based on the experimental results, a suite of models has been developed to characterize the accuracy of CRAM computation. Scalar addition, multiplication, and matrix multiplication, which are essential building blocks for many conventional and machine intelligence applications, are evaluated and show promising accuracy performance. With the confirmation of MTJ-based CRAM’s accuracy, there is a strong case that this technology will have a significant impact on power- and energy-demanding applications of machine intelligence.
传统计算模式难以满足新兴应用(尤其是机器智能应用)快速增长的需求,因为大部分电力和能源都消耗在逻辑模块和内存模块之间的持续数据传输上。一种名为 "计算随机存取存储器(CRAM)"的新模式应运而生,以解决这一根本性限制。CRAM 直接使用内存单元本身执行逻辑运算,数据无需离开内存。先前的数值研究已经充分证明,CRAM 在传统应用和新兴应用中都具有能耗和性能优势。然而,目前还缺乏对 CRAM 的实验演示和研究,以评估其计算精度,而计算精度是衡量其技术可行性和竞争力的现实和应用关键指标。在这项工作中,对基于磁隧道结 (MTJ) 的 CRAM 阵列进行了实验演示。首先,研究了基本内存操作以及 2、3 和 5 输入逻辑运算。然后,演示了采用两种不同设计的 1 位全加法器。在实验结果的基础上,开发了一套模型来描述 CRAM 计算的准确性。标量加法、乘法和矩阵乘法是许多传统应用和机器智能应用的基本构件,对它们进行了评估,并显示出良好的精度性能。随着基于 MTJ 的 CRAM 的准确性得到确认,这项技术将对机器智能的高能耗应用产生重大影响。
{"title":"Experimental demonstration of magnetic tunnel junction-based computational random-access memory","authors":"Yang Lv,&nbsp;Brandon R. Zink,&nbsp;Robert P. Bloom,&nbsp;Hüsrev Cılasun,&nbsp;Pravin Khanal,&nbsp;Salonik Resch,&nbsp;Zamshed Chowdhury,&nbsp;Ali Habiboglu,&nbsp;Weigang Wang,&nbsp;Sachin S. Sapatnekar,&nbsp;Ulya Karpuzcu,&nbsp;Jian-Ping Wang","doi":"10.1038/s44335-024-00003-3","DOIUrl":"10.1038/s44335-024-00003-3","url":null,"abstract":"The conventional computing paradigm struggles to fulfill the rapidly growing demands from emerging applications, especially those for machine intelligence because much of the power and energy is consumed by constant data transfers between logic and memory modules. A new paradigm, called “computational random-access memory (CRAM),” has emerged to address this fundamental limitation. CRAM performs logic operations directly using the memory cells themselves, without having the data ever leave the memory. The energy and performance benefits of CRAM for both conventional and emerging applications have been well established by prior numerical studies. However, there is a lack of experimental demonstration and study of CRAM to evaluate its computational accuracy, which is a realistic and application-critical metric for its technological feasibility and competitiveness. In this work, a CRAM array based on magnetic tunnel junctions (MTJs) is experimentally demonstrated. First, basic memory operations, as well as 2-, 3-, and 5-input logic operations, are studied. Then, a 1-bit full adder with two different designs is demonstrated. Based on the experimental results, a suite of models has been developed to characterize the accuracy of CRAM computation. Scalar addition, multiplication, and matrix multiplication, which are essential building blocks for many conventional and machine intelligence applications, are evaluated and show promising accuracy performance. With the confirmation of MTJ-based CRAM’s accuracy, there is a strong case that this technology will have a significant impact on power- and energy-demanding applications of machine intelligence.","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-10"},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11287819/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141857556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
npj Unconventional Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1