首页 > 最新文献

Memories - Materials, Devices, Circuits and Systems最新文献

英文 中文
Improved reliability single loop single feed 7T SRAM cell for biomedical applications 用于生物医学应用的提高可靠性的单环单馈7T SRAM单元
Pub Date : 2023-07-01 DOI: 10.1016/j.memori.2023.100057
Ashish Panchal , Priyanka Sharma , Aastha Gupta , Vaibhav Neema , Nidhi Tiwari , Ravi Sindal

Portable biomedical devices are born to reach a maximum number of people at an effective cost, and because of their small size and battery operation, the impact of portable medical devices is huge. For biomedical image processing devices, it is very important to store pixel information in embedded memory, because pixel values contain critical information about the image. For this critical information storage, most embedded memories consist of static random access memory (SRAM). SRAM, which stores critical information must have a high level of stability and reliability with low power dissipation. This paper proposes a single loop single-feed 7T (SLSF7T) SRAM cell that operates in the sub-threshold region (reducing the supply voltage to reduce power dissipation) and attains a high read static margin. To evaluate the read-and-write stability of the SRAM cell, the N-curve method is adopted in this work.

The proposed SLSF7T SRAM cell design offers several improvements over existing Biomedical Transmission Gate 8T (BT8T) and 9T SRAM cells. Specifically, the SLSF7T SRAM cell design shows an increase in static voltage noise margin (SVNM) by 75.86% and 75.34%, reduction in delay by 37.86% and 58.52%, and also offers less leakage power dissipation by 72% and 23.29% as compared to the BT8T and 9T cells, respectively. Along with the low power and high stability, the other most significant feature of the proposed work is its area efficiency because the proposed memory cell only consists of 7 transistors, it requires only 1.1X area overhead compared to the conventional 6T memory cell. The calculated performance matrix of the proposed cell is the highest among the considered SRAM cells for compression. The proposed cell operates in the sub-threshold region and achieves the best performance parameters for memory design for biomedical devices and applications at a 300 mV supply voltage.

便携式生物医学设备生来就是为了以有效的成本接触到最大数量的人,而且由于其体积小和电池操作,便携式医疗设备的影响是巨大的。对于生物医学图像处理设备,将像素信息存储在嵌入式存储器中是非常重要的,因为像素值包含关于图像的关键信息。对于这种关键信息存储,大多数嵌入式存储器由静态随机存取存储器(SRAM)组成。存储关键信息的SRAM必须具有高水平的稳定性和可靠性以及低功耗。本文提出了一种单环单馈7T(SLSF7T)SRAM单元,该单元在亚阈值区域工作(降低电源电压以降低功耗),并获得高的读取静态裕度。为了评估SRAM单元的读写稳定性,本文采用了N曲线法。与现有的生物医学传输门8T(BT8T)和9T SRAM单元相比,所提出的SLSF7T SRAM电池设计提供了一些改进。具体而言,与BT8T和9T单元相比,SLSF7T SRAM单元设计显示静态电压噪声裕度(SVNM)分别增加了75.86%和75.34%,延迟减少了37.86%和58.52%,并且还提供了分别减少72%和23.29%的泄漏功率耗散。除了低功率和高稳定性之外,所提出的工作的另一个最显著的特征是其面积效率,因为所提出的存储单元仅由7个晶体管组成,与传统的6T存储单元相比,它只需要1.1X的面积开销。所提出的单元的计算性能矩阵在所考虑的用于压缩的SRAM单元中是最高的。所提出的单元在亚阈值区域中操作,并在300mV电源电压下实现用于生物医学设备和应用的存储器设计的最佳性能参数。
{"title":"Improved reliability single loop single feed 7T SRAM cell for biomedical applications","authors":"Ashish Panchal ,&nbsp;Priyanka Sharma ,&nbsp;Aastha Gupta ,&nbsp;Vaibhav Neema ,&nbsp;Nidhi Tiwari ,&nbsp;Ravi Sindal","doi":"10.1016/j.memori.2023.100057","DOIUrl":"https://doi.org/10.1016/j.memori.2023.100057","url":null,"abstract":"<div><p>Portable biomedical devices are born to reach a maximum number of people at an effective cost, and because of their small size and battery operation, the impact of portable medical devices is huge. For biomedical image processing devices, it is very important to store pixel information in embedded memory, because pixel values contain critical information about the image. For this critical information storage, most embedded memories consist of static random access memory (SRAM). SRAM, which stores critical information must have a high level of stability and reliability with low power dissipation. This paper proposes a single loop single-feed 7T (SLSF7T) SRAM cell that operates in the sub-threshold region (reducing the supply voltage to reduce power dissipation) and attains a high read static margin. To evaluate the read-and-write stability of the SRAM cell, the N-curve method is adopted in this work.</p><p>The proposed SLSF7T SRAM cell design offers several improvements over existing Biomedical Transmission Gate 8T (BT8T) and 9T SRAM cells. Specifically, the SLSF7T SRAM cell design shows an increase in static voltage noise margin (SVNM) by 75.86% and 75.34%, reduction in delay by 37.86% and 58.52%, and also offers less leakage power dissipation by 72% and 23.29% as compared to the BT8T and 9T cells, respectively. Along with the low power and high stability, the other most significant feature of the proposed work is its area efficiency because the proposed memory cell only consists of 7 transistors, it requires only 1.1X area overhead compared to the conventional 6T memory cell. The calculated performance matrix of the proposed cell is the highest among the considered SRAM cells for compression. The proposed cell operates in the sub-threshold region and achieves the best performance parameters for memory design for biomedical devices and applications at a 300 mV supply voltage.</p></div>","PeriodicalId":100915,"journal":{"name":"Memories - Materials, Devices, Circuits and Systems","volume":"4 ","pages":"Article 100057"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50199557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Stress Issue of Vertical Connections in 3D Integration for High-Bandwidth Memory Applications 高带宽存储器应用三维集成中垂直连接的应力问题
Pub Date : 2023-07-01 DOI: 10.1016/j.memori.2023.100024
Tzu-Heng Hung , Yu-Ming Pan , Kuan-Neng Chen

The stress of TSV with different dimensions under annealing condition has been investigated. Since the application of TSV and bonding technology has demonstrated a promising approach for vertical connection in HBM stacking, the stress caused by Cu TSV substrates needs to be carefully investigated. The changing in TSV size under the same TSV aspect ratio does not obviously affect the stress toward the surroundings. On the other hand, the adjustment on TSV aspect ratios results in different stress values, and the aspect ratio of 1:8 results in the largest stress in the analysis. Besides, the annealing temperature has more influence on the stress than the size of TSV. As a consequence, reduction on the annealing temperature is an effective method to achieve a low stress for TSV in HBM stacks. Therefore, several methods for low temperature hybrid bonding have also been reviewed and discussed.

研究了不同尺寸TSV在退火条件下的应力。由于TSV和键合技术的应用已经证明了在HBM堆叠中垂直连接的一种很有前途的方法,因此需要仔细研究Cu TSV衬底引起的应力。在相同的TSV纵横比下,TSV尺寸的变化不会明显影响向周围的应力。另一方面,TSV纵横比的调整导致不同的应力值,1:8的纵横比导致分析中的最大应力。此外,退火温度对应力的影响比对TSV尺寸的影响更大。因此,降低退火温度是实现HBM堆叠中TSV的低应力的有效方法。因此,还对几种低温混合键合方法进行了综述和讨论。
{"title":"Stress Issue of Vertical Connections in 3D Integration for High-Bandwidth Memory Applications","authors":"Tzu-Heng Hung ,&nbsp;Yu-Ming Pan ,&nbsp;Kuan-Neng Chen","doi":"10.1016/j.memori.2023.100024","DOIUrl":"https://doi.org/10.1016/j.memori.2023.100024","url":null,"abstract":"<div><p>The stress of TSV with different dimensions under annealing condition has been investigated. Since the application of TSV and bonding technology has demonstrated a promising approach for vertical connection in HBM stacking, the stress caused by Cu TSV substrates needs to be carefully investigated. The changing in TSV size under the same TSV aspect ratio does not obviously affect the stress toward the surroundings. On the other hand, the adjustment on TSV aspect ratios results in different stress values, and the aspect ratio of 1:8 results in the largest stress in the analysis. Besides, the annealing temperature has more influence on the stress than the size of TSV. As a consequence, reduction on the annealing temperature is an effective method to achieve a low stress for TSV in HBM stacks. Therefore, several methods for low temperature hybrid bonding have also been reviewed and discussed.</p></div>","PeriodicalId":100915,"journal":{"name":"Memories - Materials, Devices, Circuits and Systems","volume":"4 ","pages":"Article 100024"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50200137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Exploiting switching properties of non-volatile memory chips for data security applications 利用非易失性存储芯片的开关特性实现数据安全应用
Pub Date : 2023-07-01 DOI: 10.1016/j.memori.2023.100044
Supriya Chakraborty, Manan Suri

This paper presents a technique of utilizing Commercial-Off-The-Self (COTS) Non-Volatile Memory (NVM) chips for data security applications. In particular, True Random Numbers (TRNs) are generated by harnessing the latency variability observed in NVM chips. Subsequent series of mathematical operations are implemented as post-processing techniques to increase the randomness of the TRNs. The generated TRNs are then utilized as a source of random keys for One-Time Pad (OTP) cryptosystem. The proposed methodology of TRNs extraction is experimentally validated on three different types of NVM technologies. TRNG throughput in a range of 0.09 Kb/s to 0.67 Kb/s is observed for the investigated technologies. Generated TRNs pass all the tests of NIST SP 800-22 statistical test suite with significant P–values. Metrics like MSE, CC, SSIM, NPCR, UACI, PSNR, and key space are also analyzed for the OTP cryptosystem.

本文提出了一种将商用非易失性存储器(COTS)芯片用于数据安全应用的技术。特别地,真实随机数(TRN)是通过利用NVM芯片中观察到的延迟可变性来生成的。随后的一系列数学运算被实现为后处理技术,以增加TRN的随机性。生成的TRN然后被用作一次性密码(OTP)的随机密钥的源。所提出的TRNs提取方法在三种不同类型的NVM技术上进行了实验验证。对于所研究的技术,观察到在0.09Kb/s至0.67kb/s的范围内的TRNG吞吐量。生成的TRN通过了NIST SP 800-22统计测试套件的所有测试,具有显著的P值。还分析了OTP密码系统的MSE、CC、SSIM、NPCR、UACI、PSNR和密钥空间等度量。
{"title":"Exploiting switching properties of non-volatile memory chips for data security applications","authors":"Supriya Chakraborty,&nbsp;Manan Suri","doi":"10.1016/j.memori.2023.100044","DOIUrl":"https://doi.org/10.1016/j.memori.2023.100044","url":null,"abstract":"<div><p>This paper presents a technique of utilizing Commercial-Off-The-Self (COTS) Non-Volatile Memory (NVM) chips for data security applications. In particular, True Random Numbers (TRNs) are generated by harnessing the latency variability observed in NVM chips. Subsequent series of mathematical operations are implemented as post-processing techniques to increase the randomness of the TRNs. The generated TRNs are then utilized as a source of random keys for One-Time Pad (OTP) cryptosystem. The proposed methodology of TRNs extraction is experimentally validated on three different types of NVM technologies. TRNG throughput in a range of 0.09 Kb/s to 0.67 Kb/s is observed for the investigated technologies. Generated TRNs pass all the tests of NIST SP 800-22 statistical test suite with significant <span><math><mi>P</mi></math></span>–values. Metrics like MSE, CC, SSIM, NPCR, UACI, PSNR, and key space are also analyzed for the OTP cryptosystem.</p></div>","PeriodicalId":100915,"journal":{"name":"Memories - Materials, Devices, Circuits and Systems","volume":"4 ","pages":"Article 100044"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50200168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fixed charges at the HfO2/SiO2 interface: Impact on the memory window of FeFET HfO2/SiO2界面的固定电荷:对FeFET存储窗口的影响
Pub Date : 2023-07-01 DOI: 10.1016/j.memori.2023.100050
Masud Rana Sk , Shubham Pande , Franz Müller , Yannick Raffel , Maximilian Lederer , Luca Pirro , Sven Beyer , Konrad Seidel , Thomas Kämpfe , Sourav De , Bhaswar Chakrabarti

In this article, the impact of interfacial fixed charges on the memory window (MW) of HfO2-based ferroelectric field-effect transistor (FeFET) is investigated using technology computer-aided design (TCAD) device simulations. We have considered the presence of fixed charges at the interface between the ferroelectric layer (FE) and the interlayer dielectric (IL) of FeFET with metal/ferroelectric/interlayer/Si (MFIS) gate structure. Our study indicates that the presence of fixed charges affects the polarization and corresponding depolarization field in the ferroelectric. Positive and negative interface charges can align the polarization direction. The MW degradation is observed with the increase in the fixed charge concentration (Qf).

本文利用计算机辅助设计(TCAD)器件模拟技术,研究了界面固定电荷对HfO2基铁电场效应晶体管(FeFET)存储窗口(MW)的影响。我们已经考虑了在具有金属/铁电/层间/Si(MFIS)栅极结构的FeFET的铁电层(FE)和层间电介质(IL)之间的界面处存在固定电荷。我们的研究表明,固定电荷的存在会影响铁电体的极化和相应的去极化场。正界面电荷和负界面电荷可以使极化方向对齐。MW退化是随着固定电荷浓度(Qf)的增加而观察到的。
{"title":"Fixed charges at the HfO2/SiO2 interface: Impact on the memory window of FeFET","authors":"Masud Rana Sk ,&nbsp;Shubham Pande ,&nbsp;Franz Müller ,&nbsp;Yannick Raffel ,&nbsp;Maximilian Lederer ,&nbsp;Luca Pirro ,&nbsp;Sven Beyer ,&nbsp;Konrad Seidel ,&nbsp;Thomas Kämpfe ,&nbsp;Sourav De ,&nbsp;Bhaswar Chakrabarti","doi":"10.1016/j.memori.2023.100050","DOIUrl":"https://doi.org/10.1016/j.memori.2023.100050","url":null,"abstract":"<div><p>In this article, the impact of interfacial fixed charges on the memory window (MW) of HfO<span><math><msub><mrow></mrow><mrow><mn>2</mn></mrow></msub></math></span>-based ferroelectric field-effect transistor (FeFET) is investigated using technology computer-aided design (TCAD) device simulations. We have considered the presence of fixed charges at the interface between the ferroelectric layer (FE) and the interlayer dielectric (IL) of FeFET with metal/ferroelectric/interlayer/Si (MFIS) gate structure. Our study indicates that the presence of fixed charges affects the polarization and corresponding depolarization field in the ferroelectric. Positive and negative interface charges can align the polarization direction. The MW degradation is observed with the increase in the fixed charge concentration (<span><math><msub><mrow><mi>Q</mi></mrow><mrow><mi>f</mi></mrow></msub></math></span>).</p></div>","PeriodicalId":100915,"journal":{"name":"Memories - Materials, Devices, Circuits and Systems","volume":"4 ","pages":"Article 100050"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50199565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of novel hybrid - digitally controlled oscillator for ADPLL 一种用于ADPLL的新型混合数字控制振荡器的设计
Pub Date : 2023-07-01 DOI: 10.1016/j.memori.2023.100052
Mohd Ziauddin Jahangir , Paidimarry Chandra Sekhar

Digitally Controlled Oscillators (DCOs) are an integral part of All Digital Phase Locked Loops (ADPLLs). It is used to generate output frequency corresponding to the applied digital input. But, due to practical limitations in circuit design, DCOs resolution will be highly limited. Delta Sigma Modulator (DSM) is generally used in DCOs/ADPLLs to obtain a greater resolution. However, using DSM will lead to introduction of frequency spurs in the output spectrum. In this work we propose an alternate method to increase the frequency resolution of DCO without introducing frequency spurs. Novel Hybrid DCO architecture is proposed in this work to increase the resolution. Hybrid DCO proposed in this work has both digital and analog control for tuning frequency. Digital control input of the proposed DCO provides coarse frequency control and the analog control input provides fine frequency control. It has been demonstrated in this work that by integrating Hybrid DCO and DSM with an analog low pass filter (LPF), resolution can be greatly increased, without introducing spurs in the spectrum. As a proof of concept, two Hybrid Digitally controlled Ring Oscillators (DCROs) are designed in 90 nm CMOS process, and their period Jitter performance is compared.

数字控制振荡器(DCO)是全数字锁相环(ADPLL)的组成部分。它用于生成与所施加的数字输入相对应的输出频率。但是,由于电路设计中的实际限制,DCO的分辨率将受到高度限制。德尔塔-西格玛调制器(DSM)通常用于DCO/ADPLL以获得更高的分辨率。然而,使用DSM将导致在输出频谱中引入频率杂散。在这项工作中,我们提出了一种在不引入频率杂散的情况下提高DCO频率分辨率的替代方法。为了提高分辨率,本文提出了一种新的混合DCO结构。本工作中提出的混合DCO具有数字和模拟两种频率调谐控制。所提出的DCO的数字控制输入提供粗略频率控制,而模拟控制输入提供精细频率控制。这项工作已经证明,通过将混合DCO和DSM与模拟低通滤波器(LPF)集成,可以大大提高分辨率,而不会在频谱中引入杂散。作为概念验证,在90nm CMOS工艺中设计了两个混合数字控制环形振荡器,并比较了它们的周期抖动性能。
{"title":"Design of novel hybrid - digitally controlled oscillator for ADPLL","authors":"Mohd Ziauddin Jahangir ,&nbsp;Paidimarry Chandra Sekhar","doi":"10.1016/j.memori.2023.100052","DOIUrl":"https://doi.org/10.1016/j.memori.2023.100052","url":null,"abstract":"<div><p>Digitally Controlled Oscillators (DCOs) are an integral part of All Digital Phase Locked Loops (ADPLLs). It is used to generate output frequency corresponding to the applied digital input. But, due to practical limitations in circuit design, DCOs resolution will be highly limited. Delta Sigma Modulator (DSM) is generally used in DCOs/ADPLLs to obtain a greater resolution. However, using DSM will lead to introduction of frequency spurs in the output spectrum. In this work we propose an alternate method to increase the frequency resolution of DCO without introducing frequency spurs. Novel Hybrid DCO architecture is proposed in this work to increase the resolution. Hybrid DCO proposed in this work has both digital and analog control for tuning frequency. Digital control input of the proposed DCO provides coarse frequency control and the analog control input provides fine frequency control. It has been demonstrated in this work that by integrating Hybrid DCO and DSM with an analog low pass filter (LPF), resolution can be greatly increased, without introducing spurs in the spectrum. As a proof of concept, two Hybrid Digitally controlled Ring Oscillators (DCROs) are designed in 90 nm CMOS process, and their period Jitter performance is compared.</p></div>","PeriodicalId":100915,"journal":{"name":"Memories - Materials, Devices, Circuits and Systems","volume":"4 ","pages":"Article 100052"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50199567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of a nonvolatile-register-embedded RISC-V CPU with software-controlled data-retention and hardware-acceleration functions 具有软件控制数据保留和硬件加速功能的非易失性寄存器嵌入式RISC-V CPU的设计
Pub Date : 2023-07-01 DOI: 10.1016/j.memori.2023.100035
Masanori Natsui, Keisuke Sakamoto, Takahiro Hanyu

This paper describes the design of a nonvolatile CPU based on RISC-V that is an open-source and highly flexible instruction set architecture. This CPU incorporates nonvolatile registers utilizing magnetic tunnel junction (MTJ) device, as well as custom instructions specific to the control of these nonvolatile registers and an accelerator module embedded into the CPU architecture. These techniques enable efficient execution of intermittent operations suitable for energy-limited internet-of-things (IoT) applications. Through performance evaluation of the CPU designed in a 55-nm CMOS/MTJ-hybrid process technology, we show that our CPU can save up to 56.9% of power consumption compared to conventional ones, with an average power consumption of 3.91 μW/MHz.

本文描述了一种基于RISC-V的非易失性CPU的设计,RISC-V是一种开源且高度灵活的指令集架构。该CPU结合了利用磁性隧道结(MTJ)设备的非易失性寄存器,以及专用于控制这些非易失寄存器的定制指令和嵌入CPU架构中的加速器模块。这些技术能够有效执行适用于能源有限的物联网(IoT)应用的间歇性操作。通过对采用55nm CMOS/MTJ混合工艺设计的CPU的性能评估,我们发现与传统CPU相比,我们的CPU可以节省高达56.9%的功耗,平均功耗为3.91μW/MHz。
{"title":"Design of a nonvolatile-register-embedded RISC-V CPU with software-controlled data-retention and hardware-acceleration functions","authors":"Masanori Natsui,&nbsp;Keisuke Sakamoto,&nbsp;Takahiro Hanyu","doi":"10.1016/j.memori.2023.100035","DOIUrl":"https://doi.org/10.1016/j.memori.2023.100035","url":null,"abstract":"<div><p>This paper describes the design of a nonvolatile CPU based on RISC-V that is an open-source and highly flexible instruction set architecture. This CPU incorporates nonvolatile registers utilizing magnetic tunnel junction (MTJ) device, as well as custom instructions specific to the control of these nonvolatile registers and an accelerator module embedded into the CPU architecture. These techniques enable efficient execution of intermittent operations suitable for energy-limited internet-of-things (IoT) applications. Through performance evaluation of the CPU designed in a 55-nm CMOS/MTJ-hybrid process technology, we show that our CPU can save up to 56.9% of power consumption compared to conventional ones, with an average power consumption of 3.91 <span><math><mi>μ</mi></math></span>W/MHz.</p></div>","PeriodicalId":100915,"journal":{"name":"Memories - Materials, Devices, Circuits and Systems","volume":"4 ","pages":"Article 100035"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50199596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Will computing in memory become a new dawn of associative processors? 内存计算会成为联想处理器的新曙光吗?
Pub Date : 2023-07-01 DOI: 10.1016/j.memori.2023.100033
Leonid Yavits

Computer architecture faces an enormous challenge in recent years: while the demand for performance is constantly growing, the performance improvement of general-purpose CPU has almost stalled. Among the reasons are memory and power walls, due to which data transfer increasingly dominates computing. By significantly reducing data transfer, data-centric (or in-memory) computing promises to alleviate the memory and power walls. Associative processor is a non von Neumann computer invented in the 1960s but effectively cast aside until recently. It computes using associative memory in a perfect induction like fashion, using associative memory cells for both data storage and processing. Associative processor can be implemented using conventional CMOS as well as emerging memories. We show that associative processor can outperform state-of-the-art computing platforms by up to almost two orders of magnitude in a variety of data-intensive workloads.

近年来,计算机体系结构面临着巨大的挑战:在对性能需求不断增长的同时,通用CPU的性能提升几乎停滞不前。原因之一是内存和电源墙,因此数据传输越来越主导计算。通过显著减少数据传输,以数据为中心(或内存中)的计算有望缓解内存和电源墙的问题。联想处理器是20世纪60年代发明的一种非冯·诺依曼计算机,但直到最近才被有效地抛弃。它以一种完美的类似归纳的方式使用联想记忆进行计算,使用联想记忆单元进行数据存储和处理。关联处理器可以使用传统CMOS以及新兴存储器来实现。我们表明,在各种数据密集型工作负载中,关联处理器可以比最先进的计算平台高出近两个数量级。
{"title":"Will computing in memory become a new dawn of associative processors?","authors":"Leonid Yavits","doi":"10.1016/j.memori.2023.100033","DOIUrl":"https://doi.org/10.1016/j.memori.2023.100033","url":null,"abstract":"<div><p>Computer architecture faces an enormous challenge in recent years: while the demand for performance is constantly growing, the performance improvement of general-purpose CPU has almost stalled. Among the reasons are memory and power walls, due to which data transfer increasingly dominates computing. By significantly reducing data transfer, data-centric (or in-memory) computing promises to alleviate the memory and power walls. Associative processor is a non von Neumann computer invented in the 1960s but effectively cast aside until recently. It computes using associative memory in a perfect induction like fashion, using associative memory cells for both data storage and processing. Associative processor can be implemented using conventional CMOS as well as emerging memories. We show that associative processor can outperform state-of-the-art computing platforms by up to almost two orders of magnitude in a variety of data-intensive workloads.</p></div>","PeriodicalId":100915,"journal":{"name":"Memories - Materials, Devices, Circuits and Systems","volume":"4 ","pages":"Article 100033"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50199599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of fully differential fast SCL Schmitt-trigger delay element with tunable delay and hysteresis in design and run-time 具有可调延迟和滞后的全差分快速SCL-Schmitt触发延迟元件的设计和运行
Pub Date : 2023-07-01 DOI: 10.1016/j.memori.2023.100036
Saeideh Pahlavan , M.B. Ghaznavi-Ghoushchi , Mostafa Shooshtari

Tuning the delay of the circuit during the circuit performance can give a chance to a circuit to reduce Process, Voltage and Temperature (PVT) effects on delay and frequency by resetting its delay in feedback. This paper presented a full differential Schmitt-trigger (ST) with tunable delay and hysteresis. The delay-hysteresis setting is done in the design phase by tuning the biasing current, sizing, bias voltage and also during the execute phase (run time) by a digital bit and restructuring the circuit and delay route. The presented ST can have high and low delays with different frequencies using a digital bit in the circuit. This can help the band selection for multi-band applications. A Flip Voltage Follower (FVF) circuit is used for the current tail to increase the current and increase the frequency bands. In this Schmitt-trigger delay changes associated with restructuring result in a 40 % power reduction. A circuit analysis for the equivalent circuit of the presented circuit has also been done and the factors affecting the frequency and delay change have been analyzed and investigated in the simulation. Monte Carlo and PVT analysis have also been performed for circuit accuracy. Power changing with an incremental delay in CMOS is improved and almost monotonous by designing Source-Coupled-Logic (SCL) Schmitt-trigger.

在电路性能期间调整电路的延迟可以使电路有机会通过重置反馈中的延迟来减少过程、电压和温度(PVT)对延迟和频率的影响。本文提出了一种具有可调延迟和滞后特性的全差分施密特触发器。延迟滞后设置在设计阶段通过调整偏置电流、大小、偏置电压来完成,也在执行阶段(运行时间)通过数字位来完成,并重构电路和延迟路径。使用电路中的数字位,所提出的ST可以具有不同频率的高延迟和低延迟。这有助于多频段应用的频段选择。翻转电压跟随器(FVF)电路用于电流尾以增加电流并增加频带。在这种施密特触发器中,与重组相关的延迟变化导致功率降低40%。还对所提出的电路的等效电路进行了电路分析,并在仿真中分析和研究了影响频率和延迟变化的因素。还对电路精度进行了蒙特卡罗和PVT分析。通过设计源极耦合逻辑(SCL)施密特触发器,改进了CMOS中具有增量延迟的功率变化,并且几乎是单调的。
{"title":"Design of fully differential fast SCL Schmitt-trigger delay element with tunable delay and hysteresis in design and run-time","authors":"Saeideh Pahlavan ,&nbsp;M.B. Ghaznavi-Ghoushchi ,&nbsp;Mostafa Shooshtari","doi":"10.1016/j.memori.2023.100036","DOIUrl":"https://doi.org/10.1016/j.memori.2023.100036","url":null,"abstract":"<div><p>Tuning the delay of the circuit during the circuit performance can give a chance to a circuit to reduce Process, Voltage and Temperature (PVT) effects on delay and frequency by resetting its delay in feedback. This paper presented a full differential Schmitt-trigger (ST) with tunable delay and hysteresis. The delay-hysteresis setting is done in the design phase by tuning the biasing current, sizing, bias voltage and also during the execute phase (run time) by a digital bit and restructuring the circuit and delay route. The presented ST can have high and low delays with different frequencies using a digital bit in the circuit. This can help the band selection for multi-band applications. A Flip Voltage Follower (FVF) circuit is used for the current tail to increase the current and increase the frequency bands. In this Schmitt-trigger delay changes associated with restructuring result in a 40 % power reduction. A circuit analysis for the equivalent circuit of the presented circuit has also been done and the factors affecting the frequency and delay change have been analyzed and investigated in the simulation. Monte Carlo and PVT analysis have also been performed for circuit accuracy. Power changing with an incremental delay in CMOS is improved and almost monotonous by designing Source-Coupled-Logic (SCL) Schmitt-trigger.</p></div>","PeriodicalId":100915,"journal":{"name":"Memories - Materials, Devices, Circuits and Systems","volume":"4 ","pages":"Article 100036"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50199600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An efficient read approach for memristive crossbar array 一种高效的忆阻交叉阵列读取方法
Pub Date : 2023-07-01 DOI: 10.1016/j.memori.2023.100047
Pravanjan Samanta , Dev Narayan Yadav , Partha Pratim Das , Indranil Sengupta

Resistive random access memories (ReRAM) have drawn attention of researchers due to their unique properties with applications in in-memory computing, which allows storage and computation in the same unit. This mitigates one of the major limitations in current computing architectures, where for each computation we require to move data from memory to processor or vice versa, which incurs immense amount of energy overheads. Among the various technologies for implementing ReRAM, memristor is considered to be one of the most desirable candidates due to its small size, low power consumption, and high data retention. Such ReRAM systems are often fabricated in the form of crossbar for compact layout. However, they suffer from various challenges, one of the major ones being the sneak-path problem during reading of cell values. The read operation is mostly disturbed by sneak-path currents that can result in incorrect reading of the cell. This paper presents a new approach for reading the cell values in memristive crossbars, which is capable of avoiding erroneous read operations caused by sneak-paths. It also supports parallel operations whereby multiple memristor states can be read in a single cycle. A straightforward approach for reading all the cells in an n×n crossbar, where the read operation is performed sequentially, requires O(n2) cycles, whereas the proposed approach requires O(n) cycles.

电阻式随机存取存储器(ReRAM)由于其独特的特性而引起了研究人员的注意,它在内存计算中的应用允许在同一单元中存储和计算。这缓解了当前计算体系结构中的一个主要限制,即对于每次计算,我们都需要将数据从内存移动到处理器,反之亦然,这会带来巨大的能量开销。在用于实现ReRAM的各种技术中,忆阻器由于其小尺寸、低功耗和高数据保留率而被认为是最理想的候选者之一。这样的ReRAM系统通常以横杆的形式制造,以实现紧凑的布局。然而,它们面临着各种挑战,其中一个主要挑战是读取单元格值时的潜行路径问题。读取操作主要受到潜通路电流的干扰,潜通路电流可能导致单元的错误读取。本文提出了一种在忆阻交叉开关中读取单元值的新方法,该方法能够避免由潜行路径引起的错误读取操作。它还支持并行操作,从而可以在单个循环中读取多个忆阻器状态。读取n×n交叉开关中所有单元的直接方法,其中读取操作按顺序执行,需要O(n2)个周期,而所提出的方法需要O(n)个周期。
{"title":"An efficient read approach for memristive crossbar array","authors":"Pravanjan Samanta ,&nbsp;Dev Narayan Yadav ,&nbsp;Partha Pratim Das ,&nbsp;Indranil Sengupta","doi":"10.1016/j.memori.2023.100047","DOIUrl":"https://doi.org/10.1016/j.memori.2023.100047","url":null,"abstract":"<div><p>Resistive random access memories (ReRAM) have drawn attention of researchers due to their unique properties with applications in in-memory computing, which allows storage and computation in the same unit. This mitigates one of the major limitations in current computing architectures, where for each computation we require to move data from memory to processor or vice versa, which incurs immense amount of energy overheads. Among the various technologies for implementing ReRAM, memristor is considered to be one of the most desirable candidates due to its small size, low power consumption, and high data retention. Such ReRAM systems are often fabricated in the form of crossbar for compact layout. However, they suffer from various challenges, one of the major ones being the sneak-path problem during reading of cell values. The read operation is mostly disturbed by sneak-path currents that can result in incorrect reading of the cell. This paper presents a new approach for reading the cell values in memristive crossbars, which is capable of avoiding erroneous read operations caused by sneak-paths. It also supports parallel operations whereby multiple memristor states can be read in a single cycle. A straightforward approach for reading all the cells in an <span><math><mrow><mi>n</mi><mo>×</mo><mi>n</mi></mrow></math></span> crossbar, where the read operation is performed sequentially, requires <span><math><mrow><mi>O</mi><mrow><mo>(</mo><msup><mrow><mi>n</mi></mrow><mrow><mn>2</mn></mrow></msup><mo>)</mo></mrow></mrow></math></span> cycles, whereas the proposed approach requires <span><math><mrow><mi>O</mi><mrow><mo>(</mo><mi>n</mi><mo>)</mo></mrow></mrow></math></span> cycles.</p></div>","PeriodicalId":100915,"journal":{"name":"Memories - Materials, Devices, Circuits and Systems","volume":"4 ","pages":"Article 100047"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50200167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MPEG/H256 video encoder with 6T/8T hybrid memory architecture for high quality output at lower supply 具有6T/8T混合存储器架构的MPEG/H256视频编码器,可在较低电源下实现高质量输出
Pub Date : 2023-07-01 DOI: 10.1016/j.memori.2023.100028
Priyanka Sharma , Vaibhav Neema , Santosh Kumar Vishvakarma , Shailesh Singh Chouhan

The use of Multimedia video content is increased rapidly in the past decade, and most multimedia video content is used by mobile phone users. Multimedia video processing consumes significant power during video compression, and thus low power multimedia video compression is essential for battery operated devices. Moving Picture Experts Group (MPEG) Video encoding is giving a higher compression rate and low bandwidth requirement. Conventional MPEG Video encoding architecture uses the conventional 6T memory cells to store video frames for further compression processing. The failure probability of 6T cells is significantly large (0.0988 at 600 mV supply voltage), leading to a decrease in the output quality of the encoded video. From the hybrid memory matrix formulation, it is calculated that storing higher-order MSB bits in highly stable memory cells will provide high-quality video encoding processing as compared to the conventional technique because the human eye is more susceptible to higher-order luminance bits. Hence, in this research work instant of using conventional 6T memory cells during video encoding processing, a unique Hybrid 6T/8T memory architecture is proposed, where the 8-bit Luminance pixels are stored favourably in consonance with their effect on the output quality. The higher order luminance bits (MSB’s) require high stability and thus these bits are stored in the 8T bit cells and the remaining bits (LSB’s) are stored in the conventional 6T bit cells for high-quality video encoding processing. This research article also proposes a separate memory peripheral circuitry for hybrid memory architecture for video encoding techniques. In addition, this article proposes a unique architecture for parallel video processing with the use of a hybrid pixel memory array. The failure probability of 6T and 8T at the worst failure corner (FS corner for read and SF corner for write) is simulated for 30000 Monte-Carlo simulations points at 45 nm CMOS technology node using CADENCE EDA tool. For the simulation work here, a standard Common Intermediate Format/Quarter Common Intermediate Format (CIF/QCIF) Coastguard video sample is used and for output quality here average PSNR method is used and simulation work is performed using the MATLAB tool.

The worst PSNR for conventional 6T memory array and Hybrid memory array at 600 mV supply voltage shows improvement in worst minimum PSNR as 6.43 dB is calculated. 30% less power consumption to conventional memory architecture.

在过去的十年里,多媒体视频内容的使用迅速增加,大多数多媒体视频内容都被手机用户使用。多媒体视频处理在视频压缩期间消耗大量功率,因此低功率多媒体视频压缩对于电池操作的设备是必不可少的。运动图像专家组(MPEG)视频编码给出了更高的压缩率和低带宽要求。传统的MPEG视频编码体系结构使用传统的6T存储单元来存储视频帧以用于进一步的压缩处理。6T单元的故障概率非常大(在600mV电源电压下为0.0988),导致编码视频的输出质量下降。根据混合存储器矩阵公式,计算出与传统技术相比,将高阶MSB位存储在高度稳定的存储器单元中将提供高质量的视频编码处理,因为人眼更容易受到高阶亮度位的影响。因此,在视频编码处理期间使用传统6T存储器单元的研究工作中,提出了一种独特的混合6T/8T存储器架构,其中8位亮度像素根据其对输出质量的影响而被有利地存储。高阶亮度比特(MSB)需要高稳定性,因此这些比特被存储在8T比特单元中,而剩余比特(LSB)被存储在用于高质量视频编码处理的传统6T比特单元中。本文还提出了一种用于视频编码技术的混合存储器结构的独立存储器外围电路。此外,本文提出了一种独特的并行视频处理架构,使用混合像素存储器阵列。使用CADENCE EDA工具对45nm CMOS技术节点上的30000个蒙特卡罗模拟点模拟了6T和8T在最坏故障角(读取为FS角,写入为SF角)的故障概率。对于此处的模拟工作,使用标准通用中间格式/四分之一通用中间格式(CIF/QCIF)海岸警卫队视频样本,对于此处的输出质量,使用平均PSNR方法,并使用MATLAB工具进行模拟工作。传统6T存储器阵列和混合存储器阵列在600mV电源电压下的最差PSNR显示出最差最小PSNR的改善,计算出6.43dB。与传统存储器架构相比,功耗降低30%。
{"title":"MPEG/H256 video encoder with 6T/8T hybrid memory architecture for high quality output at lower supply","authors":"Priyanka Sharma ,&nbsp;Vaibhav Neema ,&nbsp;Santosh Kumar Vishvakarma ,&nbsp;Shailesh Singh Chouhan","doi":"10.1016/j.memori.2023.100028","DOIUrl":"https://doi.org/10.1016/j.memori.2023.100028","url":null,"abstract":"<div><p>The use of Multimedia video content is increased rapidly in the past decade, and most multimedia video content is used by mobile phone users. Multimedia video processing consumes significant power during video compression, and thus low power multimedia video compression is essential for battery operated devices. Moving Picture Experts Group (MPEG) Video encoding is giving a higher compression rate and low bandwidth requirement. Conventional MPEG Video encoding architecture uses the conventional 6T memory cells to store video frames for further compression processing. The failure probability of 6T cells is significantly large (0.0988 at 600 mV supply voltage), leading to a decrease in the output quality of the encoded video. From the hybrid memory matrix formulation, it is calculated that storing higher-order MSB bits in highly stable memory cells will provide high-quality video encoding processing as compared to the conventional technique because the human eye is more susceptible to higher-order luminance bits. Hence, in this research work instant of using conventional 6T memory cells during video encoding processing, a unique Hybrid 6T/8T memory architecture is proposed, where the 8-bit Luminance pixels are stored favourably in consonance with their effect on the output quality. The higher order luminance bits (MSB’s) require high stability and thus these bits are stored in the 8T bit cells and the remaining bits (LSB’s) are stored in the conventional 6T bit cells for high-quality video encoding processing. This research article also proposes a separate memory peripheral circuitry for hybrid memory architecture for video encoding techniques. In addition, this article proposes a unique architecture for parallel video processing with the use of a hybrid pixel memory array. The failure probability of 6T and 8T at the worst failure corner (FS corner for read and SF corner for write) is simulated for 30000 Monte-Carlo simulations points at 45 nm CMOS technology node using CADENCE EDA tool. For the simulation work here, a standard Common Intermediate Format/Quarter Common Intermediate Format (CIF/QCIF) Coastguard video sample is used and for output quality here average PSNR method is used and simulation work is performed using the MATLAB tool.</p><p>The worst PSNR for conventional 6T memory array and Hybrid memory array at 600 mV supply voltage shows improvement in worst minimum PSNR as 6.43 dB is calculated. 30% less power consumption to conventional memory architecture.</p></div>","PeriodicalId":100915,"journal":{"name":"Memories - Materials, Devices, Circuits and Systems","volume":"4 ","pages":"Article 100028"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50199554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Memories - Materials, Devices, Circuits and Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1