首页 > 最新文献

npj Unconventional Computing最新文献

英文 中文
Negative photo conductivity triggered with visible light in wide bandgap oxide-based optoelectronic crossbar memristive array for photograph sensing and neuromorphic computing applications 用可见光触发宽带隙氧化物光电横条忆阻阵中的负光电导,用于照片传感和神经形态计算应用
Pub Date : 2024-07-25 DOI: 10.1038/s44335-024-00001-5
Dayanand Kumar, Hanrui Li, Amit Singh, Manoj Kumar Rajbhar, Abdul Momin Syed, Hoonkyung Lee, Nazek El-Atab
Photoresponsivity studies of wide-bandgap oxide-based devices have emerged as a vibrant and popular research area. Researchers have explored various material systems in their quest to develop devices capable of responding to illumination. In this study, we engineered a mature wide-bandgap oxide-based bilayer heterostructure synaptic memristor to emulate the human brain for applications in neuromorphic computing and photograph sensing. The device exhibits advanced electric and electrophotonic synaptic functions, such as long-term potentiation (LTP), long-term depression (LTD), and paired-pulse facilitation (PPF), by applying successive electric and photonic pulses. Moreover, the device exhibits exceptional electrical SET and photonic RESET endurance, maintaining its stability for a minimum of 1200 cycles without any degradation. Density functional theory calculations of the band structures provide insights into the conduction mechanism of the device. Based on this memristor array, we developed an autoencoder and convolutional neural network for noise reduction and image recognition tasks, which achieves a peak signal-to-noise ratio of 562 and high accuracy of 84.23%, while consuming lower energy by four orders of magnitude compared with the Tesla P40 GPU. This groundbreaking research not only opens doors for the integration of our device into image processing but also represents a significant advancement in the realm of in-memory computing and photograph-sensing features in a single cell.
宽带隙氧化物器件的光致发光性研究已成为一个充满活力的热门研究领域。研究人员探索了各种材料系统,力求开发出能够对光照做出响应的器件。在这项研究中,我们设计了一种成熟的基于宽带隙氧化物的双层异质结构突触忆阻器,以模拟人脑,应用于神经形态计算和照片传感。该器件通过应用连续的电脉冲和光子脉冲,表现出先进的电和光子突触功能,如长期延时(LTP)、长期抑制(LTD)和成对脉冲促进(PPF)。此外,该器件还表现出卓越的电 SET 和光子 RESET 耐久性,可在至少 1200 个周期内保持稳定而无任何退化。带状结构的密度泛函理论计算深入揭示了该器件的传导机制。基于这种忆阻器阵列,我们开发了一种用于降噪和图像识别任务的自动编码器和卷积神经网络,实现了 562 的峰值信噪比和 84.23% 的高准确率,同时能耗比 Tesla P40 GPU 低四个数量级。这项开创性的研究不仅为我们的设备集成到图像处理中打开了大门,而且标志着在单细胞内存计算和照片传感功能领域取得了重大进展。
{"title":"Negative photo conductivity triggered with visible light in wide bandgap oxide-based optoelectronic crossbar memristive array for photograph sensing and neuromorphic computing applications","authors":"Dayanand Kumar, Hanrui Li, Amit Singh, Manoj Kumar Rajbhar, Abdul Momin Syed, Hoonkyung Lee, Nazek El-Atab","doi":"10.1038/s44335-024-00001-5","DOIUrl":"10.1038/s44335-024-00001-5","url":null,"abstract":"Photoresponsivity studies of wide-bandgap oxide-based devices have emerged as a vibrant and popular research area. Researchers have explored various material systems in their quest to develop devices capable of responding to illumination. In this study, we engineered a mature wide-bandgap oxide-based bilayer heterostructure synaptic memristor to emulate the human brain for applications in neuromorphic computing and photograph sensing. The device exhibits advanced electric and electrophotonic synaptic functions, such as long-term potentiation (LTP), long-term depression (LTD), and paired-pulse facilitation (PPF), by applying successive electric and photonic pulses. Moreover, the device exhibits exceptional electrical SET and photonic RESET endurance, maintaining its stability for a minimum of 1200 cycles without any degradation. Density functional theory calculations of the band structures provide insights into the conduction mechanism of the device. Based on this memristor array, we developed an autoencoder and convolutional neural network for noise reduction and image recognition tasks, which achieves a peak signal-to-noise ratio of 562 and high accuracy of 84.23%, while consuming lower energy by four orders of magnitude compared with the Tesla P40 GPU. This groundbreaking research not only opens doors for the integration of our device into image processing but also represents a significant advancement in the realm of in-memory computing and photograph-sensing features in a single cell.","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-9"},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s44335-024-00001-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141968494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PIM GPT a hybrid process in memory accelerator for autoregressive transformers PIM GPT 自回归变压器内存混合过程加速器
Pub Date : 2024-07-25 DOI: 10.1038/s44335-024-00004-2
Yuting Wu, Ziyu Wang, Wei D. Lu
Decoder-only Transformer models such as Generative Pre-trained Transformers (GPT) have demonstrated exceptional performance in text generation by autoregressively predicting the next token. However, the efficiency of running GPT on current hardware systems is bounded by low compute-to-memory-ratio and high memory access. In this work, we propose a Process-in-memory (PIM) GPT accelerator, PIM-GPT, which achieves end-to-end acceleration of GPT inference with high performance and high energy efficiency. PIM-GPT leverages DRAM-based PIM designs for executing multiply-accumulate (MAC) operations directly in the DRAM chips, eliminating the need to move matrix data off-chip. Non-linear functions and data communication are supported by an application specific integrated chip (ASIC). At the software level, mapping schemes are designed to maximize data locality and computation parallelism. Overall, PIM-GPT achieves 41 − 137 × , 631 − 1074 × speedup and 123 − 383 × , 320 − 602 × energy efficiency over GPU and CPU baseline on 8 GPT models with up to 1.4 billion parameters.
纯解码器变换器模型,如生成式预训练变换器(GPT),通过自回归预测下一个标记,在文本生成方面表现出了卓越的性能。然而,在当前的硬件系统上运行 GPT 的效率受限于低计算内存比和高内存访问。在这项工作中,我们提出了一种内存进程(PIM)GPT 加速器 PIM-GPT,它能以高性能和高能效实现 GPT 推理的端到端加速。PIM-GPT 利用基于 DRAM 的 PIM 设计,直接在 DRAM 芯片中执行乘积 (MAC) 运算,无需将矩阵数据移至芯片外。非线性功能和数据通信由专用集成芯片(ASIC)支持。在软件层面,映射方案旨在最大限度地提高数据局部性和计算并行性。总体而言,与 GPU 和 CPU 相比,PIM-GPT 在具有多达 14 亿个参数的 8 个 GPT 模型上实现了 41 - 137 ×、631 - 1074 × 的速度提升和 123 - 383 ×、320 - 602 × 的能效。
{"title":"PIM GPT a hybrid process in memory accelerator for autoregressive transformers","authors":"Yuting Wu, Ziyu Wang, Wei D. Lu","doi":"10.1038/s44335-024-00004-2","DOIUrl":"10.1038/s44335-024-00004-2","url":null,"abstract":"Decoder-only Transformer models such as Generative Pre-trained Transformers (GPT) have demonstrated exceptional performance in text generation by autoregressively predicting the next token. However, the efficiency of running GPT on current hardware systems is bounded by low compute-to-memory-ratio and high memory access. In this work, we propose a Process-in-memory (PIM) GPT accelerator, PIM-GPT, which achieves end-to-end acceleration of GPT inference with high performance and high energy efficiency. PIM-GPT leverages DRAM-based PIM designs for executing multiply-accumulate (MAC) operations directly in the DRAM chips, eliminating the need to move matrix data off-chip. Non-linear functions and data communication are supported by an application specific integrated chip (ASIC). At the software level, mapping schemes are designed to maximize data locality and computation parallelism. Overall, PIM-GPT achieves 41 − 137 × , 631 − 1074 × speedup and 123 − 383 × , 320 − 602 × energy efficiency over GPU and CPU baseline on 8 GPT models with up to 1.4 billion parameters.","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-13"},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s44335-024-00004-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141805370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spiking neural networks for nonlinear regression of complex transient signals on sustainable neuromorphic processors 可持续神经形态处理器上用于复杂瞬态信号非线性回归的尖峰神经网络
Pub Date : 2024-07-25 DOI: 10.1038/s44335-024-00002-4
Marcus Stoffel, Saurabh Balkrishna Tandale
In recent years, spiking neural networks were introduced in science as the third generation of artificial neural networks leading to a tremendous energy saving on neuromorphic processors. This sustainable effect is due to the sparse nature of signal processing in-between spiking neurons leading to much less scalar multiplications as in second-generation networks. The spiking neuron’s efficiency is even more pronounced by their inherently recurrent nature being useful for recursive function approximations. We believe that there is a need for a general regression framework for SNNs to explore the high potential of neuromorphic computations. However, besides many classification studies with SNNs in the literature, nonlinear neuromorphic regression analysis represents a gap in research. Hence, we propose a general SNN approach for function approximation applicable for complex transient signal processing taking surrogate gradients due to the discontinuous spike representation into account. However, to pay attention to the need for high memory access during deep SNN network communications, additional spiking Legrendre Memory Units are introduced in the neuromorphic architecture. Path-dependencies and evolutions of signals can be tackled in this way. Furthermore, interfaces between real physical and binary spiking values are necessary. Following this intention, a hybrid approach is introduced, exhibiting an autoencoding strategy between dense and spiking layers. However, to verify the presented framework of nonlinear regression for a wide spectrum of scientific purposes, we see the need for obtaining realistic complex transient short-time signals by an extensive experimental set-up. Hence, a measurement technique for benchmark experiments is proposed with high-frequency oscillations measured by capacitive and piezoelectric sensors resulting in wave propagations and inelastic solid deformations to be predicted by the developed SNN regression analysis. Hence, the proposed nonlinear regression framework can be deployed to a wide range of scientific and technical applications.
近年来,尖峰神经网络作为第三代人工神经网络被引入科学领域,为神经形态处理器节省了大量能源。这种可持续效应是由于尖峰神经元之间信号处理的稀疏性,从而大大减少了第二代网络的标量乘法。尖峰神经元固有的递归性质可用于递归函数逼近,从而使其效率更加显著。我们认为,有必要为 SNN 建立一个通用回归框架,以发掘神经形态计算的巨大潜力。然而,除了文献中许多利用 SNNs 进行的分类研究外,非线性神经形态回归分析还是研究领域的空白。因此,我们提出了一种适用于复杂瞬态信号处理的通用 SNN 函数逼近方法,并将不连续尖峰表示引起的代用梯度考虑在内。然而,为了关注深度 SNN 网络通信过程中的高内存访问需求,我们在神经形态架构中引入了额外的尖峰 Legrendre 内存单元。这样就可以解决信号的路径依赖和演变问题。此外,真实物理值和二进制尖峰值之间的接口也是必要的。为此,我们引入了一种混合方法,在密集层和尖峰层之间采用自动编码策略。然而,为了验证所提出的非线性回归框架是否适用于广泛的科学用途,我们认为有必要通过广泛的实验设置来获取真实的复杂瞬态短时信号。因此,我们提出了一种用于基准实验的测量技术,通过电容式和压电式传感器测量高频振荡,从而通过开发的 SNN 回归分析预测波的传播和非弹性固体变形。因此,所提出的非线性回归框架可广泛应用于科学和技术领域。
{"title":"Spiking neural networks for nonlinear regression of complex transient signals on sustainable neuromorphic processors","authors":"Marcus Stoffel, Saurabh Balkrishna Tandale","doi":"10.1038/s44335-024-00002-4","DOIUrl":"10.1038/s44335-024-00002-4","url":null,"abstract":"In recent years, spiking neural networks were introduced in science as the third generation of artificial neural networks leading to a tremendous energy saving on neuromorphic processors. This sustainable effect is due to the sparse nature of signal processing in-between spiking neurons leading to much less scalar multiplications as in second-generation networks. The spiking neuron’s efficiency is even more pronounced by their inherently recurrent nature being useful for recursive function approximations. We believe that there is a need for a general regression framework for SNNs to explore the high potential of neuromorphic computations. However, besides many classification studies with SNNs in the literature, nonlinear neuromorphic regression analysis represents a gap in research. Hence, we propose a general SNN approach for function approximation applicable for complex transient signal processing taking surrogate gradients due to the discontinuous spike representation into account. However, to pay attention to the need for high memory access during deep SNN network communications, additional spiking Legrendre Memory Units are introduced in the neuromorphic architecture. Path-dependencies and evolutions of signals can be tackled in this way. Furthermore, interfaces between real physical and binary spiking values are necessary. Following this intention, a hybrid approach is introduced, exhibiting an autoencoding strategy between dense and spiking layers. However, to verify the presented framework of nonlinear regression for a wide spectrum of scientific purposes, we see the need for obtaining realistic complex transient short-time signals by an extensive experimental set-up. Hence, a measurement technique for benchmark experiments is proposed with high-frequency oscillations measured by capacitive and piezoelectric sensors resulting in wave propagations and inelastic solid deformations to be predicted by the developed SNN regression analysis. Hence, the proposed nonlinear regression framework can be deployed to a wide range of scientific and technical applications.","PeriodicalId":501715,"journal":{"name":"npj Unconventional Computing","volume":" ","pages":"1-15"},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s44335-024-00002-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141802813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
npj Unconventional Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1