首页 > 最新文献

2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)最新文献

英文 中文
An IMPLY-based Memristive Multiplier for Computing-in-Memory Systems with Weight-Stationary CNN Acceleration 一种基于隐式记忆乘法器的权重稳定CNN加速系统
Wenhui Liang, Jiarui Xu, Yuansheng Zhao, Zixuan Shen, Guoyi Yu, Yuhui He, Chao Wang
Adders and multipliers based on memristive Material Implication (IMPLY) logic are widely used in primary building blocks of Arithmetic Logic Unit (ALU). To solve the issue that the existing IMPLY-based multipliers cannot protect the input operands, this paper presents a novel data non-destructive memristive IMPLY-based semi-parallel multiplier for Computing-in-Memory (CIM) systems, by assigning function-specific memristors for data-protection and introducing additional switches for higher parallelism. Simulation results show that the proposed multiplier can achieve 30% faster than conventional semi-parallel design and 9.1 % less memristors against the state-of-art semi-serial design for 4-bit multiplication, while preventing the input weight from destruction as required by CNN weight reuse.
基于记忆物质蕴涵(IMPLY)逻辑的加法器和乘法器被广泛应用于算术逻辑单元(ALU)的主要组成部分。为了解决现有基于impl的乘法器不能保护输入操作数的问题,本文提出了一种用于内存计算(CIM)系统的基于impl的数据非破坏性忆阻半并行乘法器,通过分配特定功能的忆阻器来保护数据,并引入额外的开关以提高并行性。仿真结果表明,该乘法器比传统的半并行设计快30%,比目前最先进的半串行设计少9.1%的忆阻器用于4位乘法,同时防止了CNN权值重用所要求的输入权值破坏。
{"title":"An IMPLY-based Memristive Multiplier for Computing-in-Memory Systems with Weight-Stationary CNN Acceleration","authors":"Wenhui Liang, Jiarui Xu, Yuansheng Zhao, Zixuan Shen, Guoyi Yu, Yuhui He, Chao Wang","doi":"10.1109/ICTA56932.2022.9962994","DOIUrl":"https://doi.org/10.1109/ICTA56932.2022.9962994","url":null,"abstract":"Adders and multipliers based on memristive Material Implication (IMPLY) logic are widely used in primary building blocks of Arithmetic Logic Unit (ALU). To solve the issue that the existing IMPLY-based multipliers cannot protect the input operands, this paper presents a novel data non-destructive memristive IMPLY-based semi-parallel multiplier for Computing-in-Memory (CIM) systems, by assigning function-specific memristors for data-protection and introducing additional switches for higher parallelism. Simulation results show that the proposed multiplier can achieve 30% faster than conventional semi-parallel design and 9.1 % less memristors against the state-of-art semi-serial design for 4-bit multiplication, while preventing the input weight from destruction as required by CNN weight reuse.","PeriodicalId":325602,"journal":{"name":"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125547469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Implementation of CNN Heterogeneous Scheme Based on Domestic FPGA with RISC-V Soft Core CPU 基于RISC-V软核CPU的国产FPGA实现CNN异构方案
Hailong Wu, Jindong Li, Xiang Chen
Field Programmable Gate Array (FPGA) has the characteristics of low power consumption, high performance and flexibility. Research on FPGA neural network acceleration is emerging, but most of the researches are based on foreign FPGA devices. In order to improve the current situation of domestic FPGA, a novel Convolutional neural networks (CNNs) accelerator for domestic FPGA equipped with lightweight RISC-V soft core is proposed. The peak performance of the proposed accelerator reaches 153.6 GOP/s, occupying only 14K LUTs (Look-Up-Table), 32 DRMs (Dedicated RAM Modules) and 208 APMs (Arithmetic Process Modules). The proposed accelerator has enough computing power for most of the Edge-AI applications and embedded systems, providing a possible AI inference acceleration solution for domestic FPGA.
现场可编程门阵列(FPGA)具有低功耗、高性能和灵活的特点。FPGA神经网络加速的研究正在兴起,但大多数研究都是基于国外的FPGA器件。为了改善国产FPGA的现状,提出了一种搭载轻量级RISC-V软核的国产FPGA卷积神经网络加速器。该加速器的峰值性能达到153.6 GOP/s,仅占用14K LUTs(查找表),32个DRMs(专用RAM模块)和208个APMs(算术处理模块)。该加速器具有足够的计算能力,适用于大多数Edge-AI应用和嵌入式系统,为国内FPGA提供了一种可能的AI推理加速解决方案。
{"title":"Implementation of CNN Heterogeneous Scheme Based on Domestic FPGA with RISC-V Soft Core CPU","authors":"Hailong Wu, Jindong Li, Xiang Chen","doi":"10.1109/ICTA56932.2022.9963056","DOIUrl":"https://doi.org/10.1109/ICTA56932.2022.9963056","url":null,"abstract":"Field Programmable Gate Array (FPGA) has the characteristics of low power consumption, high performance and flexibility. Research on FPGA neural network acceleration is emerging, but most of the researches are based on foreign FPGA devices. In order to improve the current situation of domestic FPGA, a novel Convolutional neural networks (CNNs) accelerator for domestic FPGA equipped with lightweight RISC-V soft core is proposed. The peak performance of the proposed accelerator reaches 153.6 GOP/s, occupying only 14K LUTs (Look-Up-Table), 32 DRMs (Dedicated RAM Modules) and 208 APMs (Arithmetic Process Modules). The proposed accelerator has enough computing power for most of the Edge-AI applications and embedded systems, providing a possible AI inference acceleration solution for domestic FPGA.","PeriodicalId":325602,"journal":{"name":"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129450555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CVD Monolayer tungsten-based PMOS Transistor with high performance at Vds = -1 V 在Vds = -1 V下具有高性能的CVD单层钨基PMOS晶体管
Xin Wang, Yanqing Wu
Two-dimensional (2D) semiconducting materials channels enable ultimate scaling of transistors and will help Moore's Law Scaling for decades. In this paper, we reported p-type WSe2transistors using monolayer (¬0.85 nm) channels by molten-salt-assisted chemical vapor deposition. The transfer-free back-gate devices fabricated based on 100 nm SiO2/Si substrate exhibit highest on current at Vds= -1 V among transistors of monolayer p-WSe2, and a high on/off ratio up to 108.
二维(2D)半导体材料通道能够实现晶体管的最终缩放,并将在几十年内帮助摩尔定律缩放。在本文中,我们报道了用熔盐辅助化学气相沉积的方法,使用单层(- 0.85 nm)通道制备p型wse2晶体管。在单层p-WSe2晶体管中,基于100 nm SiO2/Si衬底制备的无转移后门器件在Vds= -1 V处具有最高的导通电流,且导通比高达108。
{"title":"CVD Monolayer tungsten-based PMOS Transistor with high performance at Vds = -1 V","authors":"Xin Wang, Yanqing Wu","doi":"10.1109/ICTA56932.2022.9963068","DOIUrl":"https://doi.org/10.1109/ICTA56932.2022.9963068","url":null,"abstract":"Two-dimensional (2D) semiconducting materials channels enable ultimate scaling of transistors and will help Moore's Law Scaling for decades. In this paper, we reported p-type WSe2transistors using monolayer (¬0.85 nm) channels by molten-salt-assisted chemical vapor deposition. The transfer-free back-gate devices fabricated based on 100 nm SiO2/Si substrate exhibit highest on current at Vds= -1 V among transistors of monolayer p-WSe2, and a high on/off ratio up to 108.","PeriodicalId":325602,"journal":{"name":"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114376919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Prediction of Key Metrics of Stacked Nanosheet nFETs using Genetic Algorithm-based Neural Networks 基于遗传算法的神经网络预测堆叠纳米片非场效应管关键指标
Haoqing Xu, Weizhuo Gan, Lei Cao, H. Yin, Zhenhua Wu
In this paper, we demonstrate the prediction of important figures of merit (FoMs) including threshold voltage (Vth), subthreshold swing (SS), on-state (Ion) and off-state (Ioft) current, of vertically stacked lateral nanosheet field-effect-transistors (NSFET) using 1) an artificial neural network generated by genetic algorithm (GA) and 2) a conventional multi-layer neural network (NN). Our work shows that the trained GA-based NN has a great capability of predicting FoMs with an average of coefficients of determination at 0.992, which is better than that of the trained multi-layer neural network at 0.987. Additionally, GA-based NN has a significant reduction of calculation time by 80% compared with that of multi-layer NN under the same computing power, which indicates the possibility to reduce the computational cost by using the auto-machine learning approach for TCAD simulation.
在本文中,我们展示了使用1)遗传算法(GA)生成的人工神经网络和2)传统多层神经网络(NN)预测垂直堆叠的横向纳米片场效应晶体管(NSFET)的重要品质值(FoMs),包括阈值电压(Vth)、亚阈值摆幅(SS)、导通状态(Ion)和关断状态(Ioft)电流。我们的工作表明,训练后的基于ga的神经网络具有很好的预测FoMs的能力,其确定系数的平均值为0.992,优于训练后的多层神经网络的平均值0.987。此外,在相同的计算能力下,基于ga的神经网络的计算时间比多层神经网络的计算时间减少了80%,这表明将自动机器学习方法用于TCAD仿真可以降低计算成本。
{"title":"Prediction of Key Metrics of Stacked Nanosheet nFETs using Genetic Algorithm-based Neural Networks","authors":"Haoqing Xu, Weizhuo Gan, Lei Cao, H. Yin, Zhenhua Wu","doi":"10.1109/ICTA56932.2022.9963088","DOIUrl":"https://doi.org/10.1109/ICTA56932.2022.9963088","url":null,"abstract":"In this paper, we demonstrate the prediction of important figures of merit (FoMs) including threshold voltage (Vth), subthreshold swing (SS), on-state (Ion) and off-state (Ioft) current, of vertically stacked lateral nanosheet field-effect-transistors (NSFET) using 1) an artificial neural network generated by genetic algorithm (GA) and 2) a conventional multi-layer neural network (NN). Our work shows that the trained GA-based NN has a great capability of predicting FoMs with an average of coefficients of determination at 0.992, which is better than that of the trained multi-layer neural network at 0.987. Additionally, GA-based NN has a significant reduction of calculation time by 80% compared with that of multi-layer NN under the same computing power, which indicates the possibility to reduce the computational cost by using the auto-machine learning approach for TCAD simulation.","PeriodicalId":325602,"journal":{"name":"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131358855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A 56Gb/s De-serializer with PAM-4 CDR for Chiplet Optical-I/O 一个56Gb/s的反串行器,带有PAM-4 CDR,用于Chiplet光i /O
Yunqiang Yang, Ming Zhong, Qianli Ma, Ziyi Lin, Leliang Li, Guike Li, Liyuan Liu, Jian Liu, N. Wu, Haikun Jia, Xinghui Liu, Nan Qi
This paper presents a 56Gb/s de-serializer with PAM-4 CDR for chiplet optical-I/O in 28nm CMOS. There are two channels in this chip. Each channel consists of a high-performance analog front end (AFE) and a half-rate clock and data recovery (CDR) circuit based on a digital phase interpolator and digital loop filter. To provide 28-GHz clock signals to both channels, a clock distribution circuit is integrated. Experimental results show that the proposed de-serializer recovers a 56Gb/s PAM-4 input signal with channel loss, achieving an output swing of 1.01-Vppd and 760ps RMS jitter.
本文提出了一种基于PAM-4 CDR的56Gb/s反串行器,用于28nm CMOS芯片的光输入/输出。这个芯片有两个通道。每个通道由高性能模拟前端(AFE)和基于数字相位插值器和数字环路滤波器的半速率时钟和数据恢复(CDR)电路组成。为了向两个通道提供28ghz的时钟信号,集成了时钟分配电路。实验结果表明,该反串行器可以恢复56Gb/s的PAM-4输入信号,并具有信道损耗,输出摆幅为1.01 vppd, RMS抖动为760ps。
{"title":"A 56Gb/s De-serializer with PAM-4 CDR for Chiplet Optical-I/O","authors":"Yunqiang Yang, Ming Zhong, Qianli Ma, Ziyi Lin, Leliang Li, Guike Li, Liyuan Liu, Jian Liu, N. Wu, Haikun Jia, Xinghui Liu, Nan Qi","doi":"10.1109/ICTA56932.2022.9963101","DOIUrl":"https://doi.org/10.1109/ICTA56932.2022.9963101","url":null,"abstract":"This paper presents a 56Gb/s de-serializer with PAM-4 CDR for chiplet optical-I/O in 28nm CMOS. There are two channels in this chip. Each channel consists of a high-performance analog front end (AFE) and a half-rate clock and data recovery (CDR) circuit based on a digital phase interpolator and digital loop filter. To provide 28-GHz clock signals to both channels, a clock distribution circuit is integrated. Experimental results show that the proposed de-serializer recovers a 56Gb/s PAM-4 input signal with channel loss, achieving an output swing of 1.01-Vppd and 760ps RMS jitter.","PeriodicalId":325602,"journal":{"name":"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131386439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Tunable Monopole Antenna for 5G Communication Applications 用于5G通信应用的可调谐单极天线
Liangfan Chen, Lu Zhao, Zihao Chen
A dual-band tunable monopole antenna is designed for 5G communication applications. The devised tuner consists of RF switch and RF capacitors of 0.3 pF, 0.5 pF, 1 pF, 2 pF and 5 pF, which enables the monopole antenna to be operated in different frequency bands. The proposed antenna is fabricated and measured. The measured -10 dB input impedance bandwidths of the proposed antenna are 1.32 GHz - 1.95 GHz and 1.98 GHz - 5.02 GHz, which can fully cover the 5G frequency spectrum in China.
为5G通信应用而设计的双频可调谐单极天线。所设计的调谐器由射频开关和0.3 pF、0.5 pF、1pf、2pf和5pf的射频电容组成,使单极天线能够在不同的频段工作。对所提出的天线进行了制作和测量。该天线实测-10 dB输入阻抗带宽为1.32 GHz - 1.95 GHz和1.98 GHz - 5.02 GHz,可完全覆盖中国5G频谱。
{"title":"A Tunable Monopole Antenna for 5G Communication Applications","authors":"Liangfan Chen, Lu Zhao, Zihao Chen","doi":"10.1109/ICTA56932.2022.9962969","DOIUrl":"https://doi.org/10.1109/ICTA56932.2022.9962969","url":null,"abstract":"A dual-band tunable monopole antenna is designed for 5G communication applications. The devised tuner consists of RF switch and RF capacitors of 0.3 pF, 0.5 pF, 1 pF, 2 pF and 5 pF, which enables the monopole antenna to be operated in different frequency bands. The proposed antenna is fabricated and measured. The measured -10 dB input impedance bandwidths of the proposed antenna are 1.32 GHz - 1.95 GHz and 1.98 GHz - 5.02 GHz, which can fully cover the 5G frequency spectrum in China.","PeriodicalId":325602,"journal":{"name":"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132760444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A High PSR and Fast Transient Response Output-Capacitorless LDO using Gm-Boosting and Capacitive Bulk-Driven Feed-Forward Technique in 22nm CMOS 基于gm增强和电容体驱动前馈技术的22nm CMOS高PSR和快速瞬态响应输出无电容LDO
Heng Liu, Dongxu Li, Xian Tang
This paper presents an output-capacitorless low-dropout regulator (OCL-LDO) using capacitive bulk-driven feed-forward (CBDFF) technique and an adaptive-biasing error amplifier with gm-boosting to enhance the power supply rejection (PSR) and the transient response. The proposed OCL-LDO has been implemented in a 22nm CMOS technology. It consumes a quiescent current of 49 µA from a power supply of 1.05-1.25 V and has a dropout voltage of 200 mV. The OCL-LDO achieves -84 dB PSR at low frequency and -69 dB PSR at 1 MHz for the load current of 20 mA. It achieves a line regulation of 0.18 mV/V, a load regulation of 0.77 µV/mA, and a settling time of 135 ns.
本文提出了一种采用电容体驱动前馈(CBDFF)技术的无输出电容低差稳压器(OCL-LDO)和一种带有gm增强的自适应偏置误差放大器,以提高电源抑制(PSR)和瞬态响应。所提出的OCL-LDO已在22nm CMOS技术中实现。它从1.05-1.25 V的电源中消耗49µa的静态电流,压降电压为200mv。在负载电流为20 mA时,OCL-LDO在低频时实现-84 dB的PSR,在1 MHz时实现-69 dB的PSR。实现了0.18 mV/V的线路调节、0.77µV/mA的负载调节和135 ns的稳定时间。
{"title":"A High PSR and Fast Transient Response Output-Capacitorless LDO using Gm-Boosting and Capacitive Bulk-Driven Feed-Forward Technique in 22nm CMOS","authors":"Heng Liu, Dongxu Li, Xian Tang","doi":"10.1109/ICTA56932.2022.9963003","DOIUrl":"https://doi.org/10.1109/ICTA56932.2022.9963003","url":null,"abstract":"This paper presents an output-capacitorless low-dropout regulator (OCL-LDO) using capacitive bulk-driven feed-forward (CBDFF) technique and an adaptive-biasing error amplifier with gm-boosting to enhance the power supply rejection (PSR) and the transient response. The proposed OCL-LDO has been implemented in a 22nm CMOS technology. It consumes a quiescent current of 49 µA from a power supply of 1.05-1.25 V and has a dropout voltage of 200 mV. The OCL-LDO achieves -84 dB PSR at low frequency and -69 dB PSR at 1 MHz for the load current of 20 mA. It achieves a line regulation of 0.18 mV/V, a load regulation of 0.77 µV/mA, and a settling time of 135 ns.","PeriodicalId":325602,"journal":{"name":"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117002343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large Suppression to Lateral Charge Migration (LCM) Related Error Bits in Charge-Trap TLC 3D NAND Flash 电荷阱TLC三维NAND闪存中对横向电荷迁移(LCM)相关错误位的大抑制
Kenie Xie, Pena Guo, Fei Chen, Binglu Chen, Xiaotong Fang, Jixuan Wu, Xuepeng Zhan, Jiezhi Chen
We present a study to suppress error bits from lateral charge migration (LCM) in charge-trap (CT) 3D NAND flash memory. For the first time, a new Baking-and-Pre-read (BPR) method is proposed with combined long-time charge diffusion by baking and short-time stabilizing by Pre-read. By characterizing 96-layer Triple-level-cell (TLC) 3D NAND chips by the raw NAND chip tester, the storage stabilities, including data retention (DR) and read disturb (RD), are studied and it is found that DR/RD error bits can be reduced up to >70%, which could be explained by the large effects of suppression to LCM-related threshold voltage (Vth) down-shifts.
我们提出了一种抑制电荷阱(CT) 3D NAND闪存中横向电荷迁移(LCM)错误位的研究。首次提出了一种结合烘烤长时间电荷扩散和预读短时间稳定的烘烤预读(BPR)方法。通过对96层三电平单元(TLC) 3D NAND芯片的原始NAND芯片测试,研究了其存储稳定性,包括数据保留(DR)和读取干扰(RD),发现DR/RD错误位可以减少高达>70%,这可以解释为对lcm相关阈值电压(Vth)降移的抑制作用很大。
{"title":"Large Suppression to Lateral Charge Migration (LCM) Related Error Bits in Charge-Trap TLC 3D NAND Flash","authors":"Kenie Xie, Pena Guo, Fei Chen, Binglu Chen, Xiaotong Fang, Jixuan Wu, Xuepeng Zhan, Jiezhi Chen","doi":"10.1109/ICTA56932.2022.9962997","DOIUrl":"https://doi.org/10.1109/ICTA56932.2022.9962997","url":null,"abstract":"We present a study to suppress error bits from lateral charge migration (LCM) in charge-trap (CT) 3D NAND flash memory. For the first time, a new Baking-and-Pre-read (BPR) method is proposed with combined long-time charge diffusion by baking and short-time stabilizing by Pre-read. By characterizing 96-layer Triple-level-cell (TLC) 3D NAND chips by the raw NAND chip tester, the storage stabilities, including data retention (DR) and read disturb (RD), are studied and it is found that DR/RD error bits can be reduced up to >70%, which could be explained by the large effects of suppression to LCM-related threshold voltage (Vth) down-shifts.","PeriodicalId":325602,"journal":{"name":"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117174482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A 64Gb/s PAM-4 Digital Equalizer With Tap-Configurable FFE and Partially Unrolled DFE in 28nm CMOS 具有分接可配置FFE和部分展开DFE的64Gb/s PAM-4数字均衡器
Xinjie Feng, Yong-Nan Chen, Youzhi Gu, Jiangfeng Wu
This paper presents a high-performance digital equalizer with four-level pulse amplitude modulation (PAM-4) for 64Gb/$s$ backplane I/Os. The digital equalizer consists of a tap-configurable feed-forward equalizer (FFE) and a partially unrolled decision-feedback equalizer (DFE). The first two post-cursor is covered by DFE and then FFE follows, which can largely reduce the influence of noise and crosstalk. The configurable FFE taps enable better adaption for different kind of channels. In order to optimize the internal algorithm, the look-up table (LUT) is used in both FFE and DFE. And the DFE is unrolled for timing closing using a new architecture introduced in this paper. Fabricated in 28nm CMOS, the digital equalizer operates at 64Gb/s with only 5pJ/bit power consumption at 1V.
本文提出了一种适用于64Gb/ s /背板I/ o的高性能四电平脉冲调幅(PAM-4)数字均衡器。数字均衡器由分接可配置的前馈均衡器(FFE)和部分展开的决策反馈均衡器(DFE)组成。前两个后光标被DFE覆盖,然后是FFE,这可以很大程度上减少噪声和串扰的影响。可配置的FFE抽头能够更好地适应不同类型的通道。为了优化内部算法,在FFE和DFE中都使用了查找表(LUT)。并利用本文介绍的一种新结构展开了DFE定时闭合。该数字均衡器采用28nm CMOS制造,工作速度为64Gb/s,在1V电压下功耗仅为5pJ/bit。
{"title":"A 64Gb/s PAM-4 Digital Equalizer With Tap-Configurable FFE and Partially Unrolled DFE in 28nm CMOS","authors":"Xinjie Feng, Yong-Nan Chen, Youzhi Gu, Jiangfeng Wu","doi":"10.1109/ICTA56932.2022.9963099","DOIUrl":"https://doi.org/10.1109/ICTA56932.2022.9963099","url":null,"abstract":"This paper presents a high-performance digital equalizer with four-level pulse amplitude modulation (PAM-4) for 64Gb/$s$ backplane I/Os. The digital equalizer consists of a tap-configurable feed-forward equalizer (FFE) and a partially unrolled decision-feedback equalizer (DFE). The first two post-cursor is covered by DFE and then FFE follows, which can largely reduce the influence of noise and crosstalk. The configurable FFE taps enable better adaption for different kind of channels. In order to optimize the internal algorithm, the look-up table (LUT) is used in both FFE and DFE. And the DFE is unrolled for timing closing using a new architecture introduced in this paper. Fabricated in 28nm CMOS, the digital equalizer operates at 64Gb/s with only 5pJ/bit power consumption at 1V.","PeriodicalId":325602,"journal":{"name":"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117262143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Fully-Connected and Area-Efficient Ising Model Annealing Accelerator for Combinatorial Optimization Problems 一种用于组合优化问题的全连通和面积高效的Ising模型退火加速器
Yukang Huang, Dong Jiang, Yongkui Yang, Enyi Yao
The combinatorial optimization problem is ubiquitously in our daily life and typically inefficient for modern Von Neumann architecture-based computer. Targeting for various combinatorial optimization problems, this paper presents a 10K-bit area-efficient architecture of the domain specific accelerator based on fully-connected Ising model using an FPGA platform. The proposed system is based on simulated annealing algorithm with a spin preselection scheme to prevent the system to be trapped in the local minimum and increase the convergence efficiency, which is more easily and efficiently to be hardware implemented. Using max-cut problem as the experiment benchmark, the proposed hardware architecture achieves an acceleration of 50,000 × compared with the software simulation result.
组合优化问题在我们的日常生活中无处不在,对于基于冯·诺依曼体系结构的现代计算机来说,组合优化问题通常效率低下。针对各种组合优化问题,本文提出了一种基于全连接Ising模型的10k位区域高效领域加速器架构。该系统基于模拟退火算法,采用自旋预选方案,避免了系统陷入局部极小值,提高了收敛效率,更易于硬件实现。以最大切割问题为实验基准,与软件仿真结果相比,所提出的硬件架构实现了5万倍的加速。
{"title":"A Fully-Connected and Area-Efficient Ising Model Annealing Accelerator for Combinatorial Optimization Problems","authors":"Yukang Huang, Dong Jiang, Yongkui Yang, Enyi Yao","doi":"10.1109/ICTA56932.2022.9963022","DOIUrl":"https://doi.org/10.1109/ICTA56932.2022.9963022","url":null,"abstract":"The combinatorial optimization problem is ubiquitously in our daily life and typically inefficient for modern Von Neumann architecture-based computer. Targeting for various combinatorial optimization problems, this paper presents a 10K-bit area-efficient architecture of the domain specific accelerator based on fully-connected Ising model using an FPGA platform. The proposed system is based on simulated annealing algorithm with a spin preselection scheme to prevent the system to be trapped in the local minimum and increase the convergence efficiency, which is more easily and efficiently to be hardware implemented. Using max-cut problem as the experiment benchmark, the proposed hardware architecture achieves an acceleration of 50,000 × compared with the software simulation result.","PeriodicalId":325602,"journal":{"name":"2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123502538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1