首页 > 最新文献

IEEE Transactions on Very Large Scale Integration (VLSI) Systems最新文献

英文 中文
A 50-kHz BW, 84.6-dB SNDR Noise-Shaping SAR ADC With Capacitor-Mismatch-Error-Free Switching Scheme 一种50 khz宽,84.6 db信噪比的无电容失匹配开关降噪SAR ADC
IF 3.1 2区 工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-26 DOI: 10.1109/TVLSI.2025.3611632
Linlin Huang;Yongjia Li;Jianhui Wu
This brief presents a noise-shaping (NS) successive approximation register (SAR) analog-to-digital converter (ADC) for biomedical Internet-of-Things (IoT) applications. The proposed capacitor-mismatch-error-free (CMEF) switching scheme intrinsically eliminates MSB digital-to-analog converter (DAC) mismatch errors through identical shift in the bottom-plate reference voltage, thereby realizing 0.9- and 2.9-dB improvements in the signal-to-noise-and-distortion ratio (SNDR) and spurious-free dynamic range (SFDR), respectively, over the tri-level switching method. Fabricated in a 40-nm CMOS technology, the prototype NS-SAR ADC occupies a core area of 0.053 mm ${}^{mathbf {2}}$ and consumes $87.7~mu $ W at 1.1-V supply. With an oversampling ratio (OSR) of 12 and a 50-kHz bandwidth (BW), it achieves 84.6-dB SNDR and 95.4-dB SFDR, yielding a Schreier figure-of-merit (FoM) of 172.2 dB and a Walden FoM of 63.2 fJ/step.
本简介介绍了一种用于生物医学物联网(IoT)应用的噪声整形(NS)逐次逼近寄存器(SAR)模数转换器(ADC)。所提出的无电容失匹配(CMEF)开关方案通过底板参考电压的相同移位从本质上消除了MSB数模转换器(DAC)失配误差,从而实现了比三电平开关方法分别提高0.9 db和2.9 db的信噪比和失真比(SNDR)和无杂散动态范围(SFDR)。原型NS-SAR ADC采用40纳米CMOS技术制造,核心面积为0.053 mm ${}^{mathbf {2}}$, 1.1 v电源功耗为87.7~mu $ W。过采样比(OSR)为12,带宽为50 khz (BW), SNDR为84.6 dB, SFDR为95.4 dB, Schreier FoM为172.2 dB, Walden FoM为63.2 fJ/step。
{"title":"A 50-kHz BW, 84.6-dB SNDR Noise-Shaping SAR ADC With Capacitor-Mismatch-Error-Free Switching Scheme","authors":"Linlin Huang;Yongjia Li;Jianhui Wu","doi":"10.1109/TVLSI.2025.3611632","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3611632","url":null,"abstract":"This brief presents a noise-shaping (NS) successive approximation register (SAR) analog-to-digital converter (ADC) for biomedical Internet-of-Things (IoT) applications. The proposed capacitor-mismatch-error-free (CMEF) switching scheme intrinsically eliminates MSB digital-to-analog converter (DAC) mismatch errors through identical shift in the bottom-plate reference voltage, thereby realizing 0.9- and 2.9-dB improvements in the signal-to-noise-and-distortion ratio (SNDR) and spurious-free dynamic range (SFDR), respectively, over the tri-level switching method. Fabricated in a 40-nm CMOS technology, the prototype NS-SAR ADC occupies a core area of 0.053 mm<inline-formula> <tex-math>${}^{mathbf {2}}$ </tex-math></inline-formula> and consumes <inline-formula> <tex-math>$87.7~mu $ </tex-math></inline-formula>W at 1.1-V supply. With an oversampling ratio (OSR) of 12 and a 50-kHz bandwidth (BW), it achieves 84.6-dB SNDR and 95.4-dB SFDR, yielding a Schreier figure-of-merit (FoM) of 172.2 dB and a Walden FoM of 63.2 fJ/step.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"33 12","pages":"3540-3544"},"PeriodicalIF":3.1,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145595114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cactus: A Multicore Spiking Neural Network Accelerator With Fine-Grained Structured Weight Sparsity Cactus:具有细粒度结构权重稀疏性的多核峰值神经网络加速器
IF 3.1 2区 工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-18 DOI: 10.1109/TVLSI.2025.3608498
Zilin Wang;Zehong Ou;Yi Zhong;Yuan Wang
Spiking neural networks (SNNs) are a promising alternative to traditional artificial neural networks (ANNs) due to their biologically inspired and event-driven characteristics. Similar to ANN, the weights in SNN also exhibit significant sparsity. How to make full use of the weight sparsity in SNN and coordinate hardware design to optimize resource utilization has become a challenge. In this brief, a multicore SNN accelerator named Cactus, based on a fine-grained and programmable structured pruning strategy is proposed. It is a balanced block pruning strategy, which achieves high accuracy in image and speech classification tasks while ensuring high processing elements (PEs) utilization. To increase flexibility, the block size can be configured as $8times 8$ , $16times 16$ , $32times 32$ , $64times 64$ in Cactus. Implemented on Xilinx Kintex UltraScale XCKU115 FPGA board, Cactus can operate at 200 MHz frequency, achieving 198.59GSOP/s peak performance and 56.47GSOP/W energy efficiency at 75% weight sparsity and 0% spike sparsity.
脉冲神经网络(SNNs)由于其生物启发和事件驱动的特性,是传统人工神经网络(ann)的一个很有前途的替代品。与人工神经网络类似,SNN中的权值也表现出显著的稀疏性。如何充分利用SNN的权值稀疏性,协调硬件设计,优化资源利用成为一个挑战。本文提出了一种基于细粒度和可编程结构化修剪策略的多核SNN加速器Cactus。它是一种平衡的块剪枝策略,在保证高处理元素利用率的同时,在图像和语音分类任务中实现了较高的准确率。为了增加灵活性,块大小可以在Cactus中配置为$8乘以8$,$16乘以16$,$32乘以32$,$64乘以64$。Cactus在Xilinx Kintex UltraScale XCKU115 FPGA板上实现,可以在200mhz频率下工作,在75%的权重稀疏度和0%的峰值稀疏度下实现198.59GSOP/s的峰值性能和56.47GSOP/W的能效。
{"title":"Cactus: A Multicore Spiking Neural Network Accelerator With Fine-Grained Structured Weight Sparsity","authors":"Zilin Wang;Zehong Ou;Yi Zhong;Yuan Wang","doi":"10.1109/TVLSI.2025.3608498","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3608498","url":null,"abstract":"Spiking neural networks (SNNs) are a promising alternative to traditional artificial neural networks (ANNs) due to their biologically inspired and event-driven characteristics. Similar to ANN, the weights in SNN also exhibit significant sparsity. How to make full use of the weight sparsity in SNN and coordinate hardware design to optimize resource utilization has become a challenge. In this brief, a multicore SNN accelerator named Cactus, based on a fine-grained and programmable structured pruning strategy is proposed. It is a balanced block pruning strategy, which achieves high accuracy in image and speech classification tasks while ensuring high processing elements (PEs) utilization. To increase flexibility, the block size can be configured as <inline-formula> <tex-math>$8times 8$ </tex-math></inline-formula>, <inline-formula> <tex-math>$16times 16$ </tex-math></inline-formula>, <inline-formula> <tex-math>$32times 32$ </tex-math></inline-formula>, <inline-formula> <tex-math>$64times 64$ </tex-math></inline-formula> in Cactus. Implemented on Xilinx Kintex UltraScale XCKU115 FPGA board, Cactus can operate at 200 MHz frequency, achieving 198.59GSOP/s peak performance and 56.47GSOP/W energy efficiency at 75% weight sparsity and 0% spike sparsity.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"33 12","pages":"3535-3539"},"PeriodicalIF":3.1,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145595122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Mixed-Mode Acceleration via Sparsity-Adjustable Pruning for Balancing Computation Density in Lightweight CNNs 基于稀疏度可调剪枝的混合模式加速在轻量级cnn中平衡计算密度
IF 3.1 2区 工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-16 DOI: 10.1109/TVLSI.2025.3607319
Yishuo Meng;Qiang Fu;Jianfei Wang;Siwei Xiang;Jia Hou;Ge Li;Zhijie Lin;Chen Yang
Convolutional neural network (CNN) pruning is an effective way to reduce the computation requirement and improve the inference performance of standard convolutional layers. However, as for the low-computation-density layers in lightweight CNNs, pruning not only fails to improve their processing efficiency but also exacerbates the underutilization problem when deploying them on the convolutional engine. To efficiently execute these pruning-ineffective layers and further accelerate lightweight CNNs, a sparsity-adjustable CNN pruning method, which allows the pruning ratio to be adjusted, is proposed to prune the nonpruning-ineffective layers while shielding the pruning-ineffective layers. As a result, it achieves an additional 40% pruning ratio for nonpruning-ineffective layers with only 0.09% accuracy loss. Furthermore, a dense/sparse mixed-mode convolution computation scheme is designed to efficiently process the pruning- and nonpruning-ineffective layers using multiple acceleration techniques. Finally, a lightweight CNN accelerator is implemented on the Xilinx VCU118 FPGA platform. The comparison results with current studies present that this work can realize 1004.2 performance and 0.98 DSP efficiency while deploying MobileNetV2, achieving $1.26times $ - $6.13times $ enhancement on DSP efficiency.
卷积神经网络(CNN)的剪枝是减少标准卷积层计算量和提高其推理性能的有效方法。然而,对于轻量级cnn中的低计算密度层,当将其部署到卷积引擎上时,剪枝不仅不能提高其处理效率,还会加剧利用率不足的问题。为了有效地执行这些无效修剪层,进一步加速轻量级CNN,提出了一种稀疏可调CNN修剪方法,该方法允许调整修剪比例,在屏蔽无效修剪层的同时,对非无效修剪层进行修剪。结果,对于非修剪无效的层,它实现了额外40%的修剪比率,而精度损失仅为0.09%。此外,设计了一种密集/稀疏混合模式卷积计算方案,利用多种加速技术对无效剪枝层和无效非剪枝层进行高效处理。最后,在Xilinx VCU118 FPGA平台上实现了一个轻量级CNN加速器。与目前的研究结果对比表明,在部署MobileNetV2时,本工作可以实现1004.2的性能和0.98的DSP效率,DSP效率提高1.26 ~ 6.13倍。
{"title":"A Mixed-Mode Acceleration via Sparsity-Adjustable Pruning for Balancing Computation Density in Lightweight CNNs","authors":"Yishuo Meng;Qiang Fu;Jianfei Wang;Siwei Xiang;Jia Hou;Ge Li;Zhijie Lin;Chen Yang","doi":"10.1109/TVLSI.2025.3607319","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3607319","url":null,"abstract":"Convolutional neural network (CNN) pruning is an effective way to reduce the computation requirement and improve the inference performance of standard convolutional layers. However, as for the low-computation-density layers in lightweight CNNs, pruning not only fails to improve their processing efficiency but also exacerbates the underutilization problem when deploying them on the convolutional engine. To efficiently execute these pruning-ineffective layers and further accelerate lightweight CNNs, a sparsity-adjustable CNN pruning method, which allows the pruning ratio to be adjusted, is proposed to prune the nonpruning-ineffective layers while shielding the pruning-ineffective layers. As a result, it achieves an additional 40% pruning ratio for nonpruning-ineffective layers with only 0.09% accuracy loss. Furthermore, a dense/sparse mixed-mode convolution computation scheme is designed to efficiently process the pruning- and nonpruning-ineffective layers using multiple acceleration techniques. Finally, a lightweight CNN accelerator is implemented on the Xilinx VCU118 FPGA platform. The comparison results with current studies present that this work can realize 1004.2 performance and 0.98 DSP efficiency while deploying MobileNetV2, achieving <inline-formula> <tex-math>$1.26times $ </tex-math></inline-formula>-<inline-formula> <tex-math>$6.13times $ </tex-math></inline-formula> enhancement on DSP efficiency.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"33 12","pages":"3525-3529"},"PeriodicalIF":3.1,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145595119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sub 0.1-pJ/bit 14-Gb/s Receiver With Stack-Reduced Slicer Embedding One-Tap DFE for Low-Power Memory Interfaces Sub - 0.1 pj /bit 14gb /s接收器,堆栈减少切片器嵌入低功耗存储器接口的一接DFE
IF 3.1 2区 工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-16 DOI: 10.1109/TVLSI.2025.3608198
Ki-Soo Lee;Joo-Hyung Chae
This brief presents a single-ended receiver (RX) with a decision feedback equalizer (DFE)-embedded and stack-reduced slicer using a DFE weight selection multiplexer (MUX). The RX employs a quarter-rate clocking architecture to reduce the on-chip clock (CK) frequency and ensure reliable operation under stringent DFE timing constraints. The slicer output is fed back to the DFE weight selection MUX integrated into the second-stage slicer, achieving a short feedback loop latency. In the proposed architecture, the number of stacked transistors in the slicer is reduced to three, thereby reducing the CK-to-Q delay and overall DFE feedback loop latency. This optimized design increases the feedback speed and alleviates DFE timing constraints, ensuring stable operation even at low supply voltages. A prototype RX was fabricated using a 65-nm CMOS process and had an area of 0.004 mm2. The proposed RX achieved a measured bit error rate (BER) below $10^{-12}$ at a data rate of 14 Gb/s with an insertion loss of −12 dB and achieved a power efficiency of 0.097 pJ/bit with a supply voltage of 0.75 V.
本文介绍了一种单端接收机(RX),该接收机具有决策反馈均衡器(DFE)嵌入式和堆栈减少切片器,使用DFE权重选择多路复用器(MUX)。RX采用四分之一速率时钟架构,以降低片上时钟(CK)频率,并确保在严格的DFE时序约束下可靠运行。切片机输出反馈到集成到第二阶段切片机的DFE权重选择MUX,实现短反馈环路延迟。在提出的架构中,切片器中堆叠的晶体管数量减少到三个,从而降低了ck - q延迟和总体DFE反馈环路延迟。这种优化设计提高了反馈速度,减轻了DFE时间限制,即使在低电源电压下也能确保稳定运行。RX原型采用65纳米CMOS工艺制造,面积为0.004 mm2。在数据速率为14 Gb/s,插入损耗为-12 dB的情况下,RX的误码率(BER)低于10^{-12}$,电源电压为0.75 V时,功率效率为0.097 pJ/bit。
{"title":"Sub 0.1-pJ/bit 14-Gb/s Receiver With Stack-Reduced Slicer Embedding One-Tap DFE for Low-Power Memory Interfaces","authors":"Ki-Soo Lee;Joo-Hyung Chae","doi":"10.1109/TVLSI.2025.3608198","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3608198","url":null,"abstract":"This brief presents a single-ended receiver (RX) with a decision feedback equalizer (DFE)-embedded and stack-reduced slicer using a DFE weight selection multiplexer (MUX). The RX employs a quarter-rate clocking architecture to reduce the on-chip clock (CK) frequency and ensure reliable operation under stringent DFE timing constraints. The slicer output is fed back to the DFE weight selection MUX integrated into the second-stage slicer, achieving a short feedback loop latency. In the proposed architecture, the number of stacked transistors in the slicer is reduced to three, thereby reducing the CK-to-Q delay and overall DFE feedback loop latency. This optimized design increases the feedback speed and alleviates DFE timing constraints, ensuring stable operation even at low supply voltages. A prototype RX was fabricated using a 65-nm CMOS process and had an area of 0.004 mm<sup>2</sup>. The proposed RX achieved a measured bit error rate (BER) below <inline-formula> <tex-math>$10^{-12}$ </tex-math></inline-formula> at a data rate of 14 Gb/s with an insertion loss of −12 dB and achieved a power efficiency of 0.097 pJ/bit with a supply voltage of 0.75 V.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"33 12","pages":"3530-3534"},"PeriodicalIF":3.1,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145595128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ASiM: Modeling and Analyzing Inference Accuracy of SRAM-Based Analog CiM Circuits 基于sram的模拟CiM电路的建模与推理精度分析
IF 3.1 2区 工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-05 DOI: 10.1109/TVLSI.2025.3605286
Wenlun Zhang;Shimpei Ando;Yung-Chin Chen;Kentaro Yoshioka
Static random-access memory (SRAM)-based analog compute-in-memory (ACiM) demonstrates promising energy efficiency for deep neural network (DNN) processing. Nevertheless, efforts to optimize efficiency frequently compromise accuracy, and this trade-off remains insufficiently studied due to the difficulty of performing full-system validation. Specifically, existing simulation tools rarely target SRAM-based ACiM and exhibit inconsistent accuracy predictions, highlighting the need for a standardized, SRAM compute-in-memory (CiM) circuit-aware evaluation methodology. This article presents ASiM, a simulation framework for evaluating inference accuracy in SRAM-based ACiM systems. ASiM captures critical effects in SRAM-based analog compute in memory systems, such as analog-to-digital converter (ADC) quantization, bit-parallel encoding, and analog noise, which must be modeled with high fidelity due to their distinct behavior in charge-domain architectures compared to other memory technologies. ASiM supports a wide range of modern DNN workloads, including CNN and Transformer-based models such as ViT, and scales to large-scale tasks like ImageNet classification. Our results indicate that bit-parallel encoding can improve energy efficiency with only modest accuracy degradation; however, even 1 LSB of analog noise can significantly impair inference performance, particularly in complex tasks such as ImageNet. To address this, we explore hybrid analog-digital execution and majority voting schemes, both of which enhance robustness without negating energy savings. ASiM bridges the gap between hardware design and inference performance, offering actionable insights for energy-efficient, high-accuracy ACiM deployment. The code is available at https://github.com/Keio-CSG/ASiM
基于静态随机存取存储器(SRAM)的模拟内存计算(ACiM)在深度神经网络(DNN)处理中展示了有前途的能量效率。然而,优化效率的努力经常会损害准确性,由于执行全系统验证的困难,这种权衡仍然没有得到充分的研究。具体来说,现有的仿真工具很少针对基于SRAM的ACiM,并且表现出不一致的准确性预测,这突出了对标准化的SRAM内存计算(CiM)电路感知评估方法的需求。本文介绍了一种用于评估基于sram的ACiM系统推理精度的仿真框架ASiM。ASiM捕获了存储系统中基于sram的模拟计算的关键影响,例如模数转换器(ADC)量化、位并行编码和模拟噪声,由于与其他存储技术相比,它们在电荷域架构中的不同行为,因此必须以高保真度建模。ASiM支持广泛的现代深度神经网络工作负载,包括CNN和基于变压器的模型,如ViT,并扩展到像ImageNet分类这样的大规模任务。我们的研究结果表明,位并行编码可以提高能源效率,只有适度的精度下降;然而,即使是1 LSB的模拟噪声也会严重损害推理性能,特别是在像ImageNet这样的复杂任务中。为了解决这个问题,我们探索了混合模拟数字执行和多数投票方案,这两种方案都增强了鲁棒性,而不会抵消节能。ASiM弥补了硬件设计和推理性能之间的差距,为节能、高精度的ACiM部署提供了可行的见解。代码可在https://github.com/Keio-CSG/ASiM上获得
{"title":"ASiM: Modeling and Analyzing Inference Accuracy of SRAM-Based Analog CiM Circuits","authors":"Wenlun Zhang;Shimpei Ando;Yung-Chin Chen;Kentaro Yoshioka","doi":"10.1109/TVLSI.2025.3605286","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3605286","url":null,"abstract":"Static random-access memory (SRAM)-based analog compute-in-memory (ACiM) demonstrates promising energy efficiency for deep neural network (DNN) processing. Nevertheless, efforts to optimize efficiency frequently compromise accuracy, and this trade-off remains insufficiently studied due to the difficulty of performing full-system validation. Specifically, existing simulation tools rarely target SRAM-based ACiM and exhibit inconsistent accuracy predictions, highlighting the need for a standardized, SRAM compute-in-memory (CiM) circuit-aware evaluation methodology. This article presents ASiM, a simulation framework for evaluating inference accuracy in SRAM-based ACiM systems. ASiM captures critical effects in SRAM-based analog compute in memory systems, such as analog-to-digital converter (ADC) quantization, bit-parallel encoding, and analog noise, which must be modeled with high fidelity due to their distinct behavior in charge-domain architectures compared to other memory technologies. ASiM supports a wide range of modern DNN workloads, including CNN and Transformer-based models such as ViT, and scales to large-scale tasks like ImageNet classification. Our results indicate that bit-parallel encoding can improve energy efficiency with only modest accuracy degradation; however, even 1 LSB of analog noise can significantly impair inference performance, particularly in complex tasks such as ImageNet. To address this, we explore hybrid analog-digital execution and majority voting schemes, both of which enhance robustness without negating energy savings. ASiM bridges the gap between hardware design and inference performance, offering actionable insights for energy-efficient, high-accuracy ACiM deployment. The code is available at <uri>https://github.com/Keio-CSG/ASiM</uri>","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"33 10","pages":"2838-2851"},"PeriodicalIF":3.1,"publicationDate":"2025-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145141714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Ultralow-Energy Voltage Level Shifter With an Output-Cycle-Based Dynamic Biasing Scheme in a 130-nm CMOS Technology 一种基于输出周期动态偏置方案的130纳米CMOS超低能量电压电平移位器
IF 3.1 2区 工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-03 DOI: 10.1109/TVLSI.2025.3603557
Qun Zhou;Kang Zeng;Weiwei Yue;Qing Hua
This brief presents an ultralow-energy and low-propagation-delay balanced voltage level shifter (VLS) with a wide voltage conversion range. The proposed VLS achieves low propagation delay while operating with subthreshold voltage, utilizing a newly introduced dynamic biasing scheme (DBS). This scheme improves both turn-on and turn-off speeds by reducing the threshold voltage of the input device circularly. The biasing-related power consumption is largely reduced by well-designed structure. The function of the proposed VLS design has been validated by using a standard 130-nm CMOS technology, taking into account process, voltage, and temperature (PVT) variations. With the implementation of the DBS, the delay is reduced to 6.5 ns, dynamic energy is lowered to 21.7 fJ, the minimum supply voltage is 0.18 V, and the average static power consumption is 2.8 nW for a conversion from 0.3 to 1.2 V.
介绍了一种超低能量、低传播延迟、宽电压转换范围的平衡电压电平移位器(VLS)。该VLS利用新引入的动态偏置方案(DBS),在亚阈值电压下实现低传输延迟。该方案通过循环降低输入器件的阈值电压来提高导通和关断速度。良好的结构设计大大降低了与偏置相关的功耗。考虑到工艺、电压和温度(PVT)的变化,采用标准的130纳米CMOS技术验证了所提出的VLS设计的功能。在实现DBS后,延迟降低到6.5 ns,动态能量降低到21.7 fJ,最小电源电压为0.18 V,从0.3到1.2 V转换的平均静态功耗为2.8 nW。
{"title":"An Ultralow-Energy Voltage Level Shifter With an Output-Cycle-Based Dynamic Biasing Scheme in a 130-nm CMOS Technology","authors":"Qun Zhou;Kang Zeng;Weiwei Yue;Qing Hua","doi":"10.1109/TVLSI.2025.3603557","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3603557","url":null,"abstract":"This brief presents an ultralow-energy and low-propagation-delay balanced voltage level shifter (VLS) with a wide voltage conversion range. The proposed VLS achieves low propagation delay while operating with subthreshold voltage, utilizing a newly introduced dynamic biasing scheme (DBS). This scheme improves both turn-on and turn-off speeds by reducing the threshold voltage of the input device circularly. The biasing-related power consumption is largely reduced by well-designed structure. The function of the proposed VLS design has been validated by using a standard 130-nm CMOS technology, taking into account process, voltage, and temperature (PVT) variations. With the implementation of the DBS, the delay is reduced to 6.5 ns, dynamic energy is lowered to 21.7 fJ, the minimum supply voltage is 0.18 V, and the average static power consumption is 2.8 nW for a conversion from 0.3 to 1.2 V.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"33 11","pages":"3201-3205"},"PeriodicalIF":3.1,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145398668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Camouflaged Logic Gates Using Threshold-Voltage-Defined Memory Cells 使用阈值电压定义记忆单元的伪装逻辑门
IF 3.1 2区 工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-09-01 DOI: 10.1109/TVLSI.2025.3601192
Min-Woo Kim;Jae-Mun Oh;Byung-Do Yang
This brief proposes two types of camouflaged logic gates using threshold-voltage-defined memory cells (TVD-MCs). The proposed multiplexer-select TVD-MC (MS-TVDMC) gate consists of a target logic gate, several camouflage logic gates, a multiplexer (MUX), and TVD-MCs. All logic gates and MUX are implemented with standard threshold-voltage transistors. The TVD-MC is composed of two cross-coupled inverters with low- or high-threshold-voltage transistors. When its supply voltage increases from ground to ${V} _{text {DD}}$ , its data become “0” or “1” according to the threshold voltages of transistors in two inverters. The target logic gate is selected with the MUX by the data stored in the TVD-MCs. The data are defined by the threshold voltages of transistors, so that it is difficult to distinguish the target logic gate from the other camouflage logic gates. The proposed logic-merged TVD-MC (LM-TVDMC) gate merges all logic gates and MUX in the MS-TVDMC gate at the transistor level. The proposed camouflaged gates significantly reduce the delay, power consumption, and leakage current compared to the conventional dynamic enhanced-TVD (DE-TVD) camouflaged gate requiring the dynamic power and delay overheads and the conventional threshold-voltage-defined (TVD) switch camouflaged gate with large on-resistances in switch transistors.
本文提出了两种使用阈值电压定义存储单元(TVD-MCs)的伪装逻辑门。所提出的多路选择TVD-MC (MS-TVDMC)门由一个目标逻辑门、多个伪装逻辑门、一个多路复用器(MUX)和TVD-MC组成。所有逻辑门和MUX都是用标准阈值电压晶体管实现的。TVD-MC由两个具有低或高阈值电压晶体管的交叉耦合逆变器组成。当其电源电压从地升高到${V} _{text {DD}}$时,根据两个逆变器中晶体管的阈值电压,其数据变为“0”或“1”。目标逻辑门由存储在tvd - mc中的数据与MUX一起选择。数据由晶体管的阈值电压定义,因此很难将目标逻辑门与其他伪装逻辑门区分开来。提出的逻辑合并TVD-MC (LM-TVDMC)门在晶体管级合并MS-TVDMC门中的所有逻辑门和MUX。与需要动态功率和延迟开销的传统动态增强TVD (DE-TVD)伪装门和在开关晶体管中具有大导通电阻的传统阈值电压定义(TVD)开关伪装门相比,所提出的伪装门显著降低了延迟、功耗和泄漏电流。
{"title":"Camouflaged Logic Gates Using Threshold-Voltage-Defined Memory Cells","authors":"Min-Woo Kim;Jae-Mun Oh;Byung-Do Yang","doi":"10.1109/TVLSI.2025.3601192","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3601192","url":null,"abstract":"This brief proposes two types of camouflaged logic gates using threshold-voltage-defined memory cells (TVD-MCs). The proposed multiplexer-select TVD-MC (MS-TVDMC) gate consists of a target logic gate, several camouflage logic gates, a multiplexer (MUX), and TVD-MCs. All logic gates and MUX are implemented with standard threshold-voltage transistors. The TVD-MC is composed of two cross-coupled inverters with low- or high-threshold-voltage transistors. When its supply voltage increases from ground to <inline-formula> <tex-math>${V} _{text {DD}}$ </tex-math></inline-formula>, its data become “0” or “1” according to the threshold voltages of transistors in two inverters. The target logic gate is selected with the MUX by the data stored in the TVD-MCs. The data are defined by the threshold voltages of transistors, so that it is difficult to distinguish the target logic gate from the other camouflage logic gates. The proposed logic-merged TVD-MC (LM-TVDMC) gate merges all logic gates and MUX in the MS-TVDMC gate at the transistor level. The proposed camouflaged gates significantly reduce the delay, power consumption, and leakage current compared to the conventional dynamic enhanced-TVD (DE-TVD) camouflaged gate requiring the dynamic power and delay overheads and the conventional threshold-voltage-defined (TVD) switch camouflaged gate with large <sc>on</small>-resistances in switch transistors.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"33 12","pages":"3505-3509"},"PeriodicalIF":3.1,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145595125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Guest Editorial: Selected Papers From IEEE Computer Society Annual Symposium on VLSI (ISVLSI) 2024 嘉宾评论:IEEE计算机学会超大规模集成电路(ISVLSI)年度研讨会论文选集
IF 3.1 2区 工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-08-26 DOI: 10.1109/TVLSI.2025.3593728
Himanshu Thapliyal;Jürgen Becker;Garrett S. Rose;Tosiron Adegbija;Selçuk Köse
{"title":"Guest Editorial: Selected Papers From IEEE Computer Society Annual Symposium on VLSI (ISVLSI) 2024","authors":"Himanshu Thapliyal;Jürgen Becker;Garrett S. Rose;Tosiron Adegbija;Selçuk Köse","doi":"10.1109/TVLSI.2025.3593728","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3593728","url":null,"abstract":"","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"33 9","pages":"2354-2356"},"PeriodicalIF":3.1,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11142544","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144904831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Very Large Scale Integration (VLSI) Systems Publication Information IEEE超大规模集成电路(VLSI)系统学报
IF 3.1 2区 工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-08-26 DOI: 10.1109/TVLSI.2025.3598542
{"title":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems Publication Information","authors":"","doi":"10.1109/TVLSI.2025.3598542","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3598542","url":null,"abstract":"","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"33 9","pages":"C2-C2"},"PeriodicalIF":3.1,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11142503","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144904730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A 2.4-GHz −33-dBm Sensitivity Battery-Free RF Energy Harvesting System With 17-dB Input Power Range 一种灵敏度为2.4 ghz−33 dbm、输入功率范围为17db的无电池射频能量收集系统
IF 3.1 2区 工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-08-26 DOI: 10.1109/TVLSI.2025.3600811
Guanci Wang;Xiaguang Li;Zhiyuan Chen
This brief presents a high-sensitivity battery-free radio frequency (RF) energy harvesting system with ultralow-power auxiliary modules. The proposed design implements two-stage energy conversion based on burst charging mode, achieving ultrahigh sensitivity by using an intermittent charging method that eliminates the charge pump’s loading effect on the RF rectifier. An all-nMOS RF--dc rectifier with internal ${V}_{mathrm {TH}}$ cancellation (IVC) technique achieves an ultrawide high-power conversion efficiency (PCE) input power range for an RF energy harvesting. Furthermore, a ${V} _{mathrm {TH}}$ -based voltage reference is introduced, enabling subthreshold operation of transistors with picowatt-level power consumption, thereby simultaneously improving both PCE and sensitivity. The proposed RF energy harvesting system is implemented with a 0.18- $mu $ m standard CMOS technology. The results show that the system achieves a 55% PCE, a −33-dBm sensitivity, and a 17-dB input power range at 2.4 GHz.
本文介绍了一种具有超低功率辅助模块的高灵敏度无电池射频(RF)能量收集系统。该设计基于突发充电模式实现两级能量转换,采用间歇充电方式消除了充电泵对射频整流器的负载影响,实现了超高灵敏度。采用内部${V}_{math {TH}}$抵消(IVC)技术的全nmos RF- dc整流器实现了用于射频能量收集的超宽高功率转换效率(PCE)输入功率范围。此外,还引入了一个基于${V} _{math}}}$的基准电压,使功耗为皮瓦级的晶体管能够实现亚阈值工作,从而同时提高PCE和灵敏度。所提出的射频能量收集系统采用0.18- $mu $ m标准CMOS技术实现。结果表明,该系统在2.4 GHz下的PCE为55%,灵敏度为- 33 dbm,输入功率范围为17 db。
{"title":"A 2.4-GHz −33-dBm Sensitivity Battery-Free RF Energy Harvesting System With 17-dB Input Power Range","authors":"Guanci Wang;Xiaguang Li;Zhiyuan Chen","doi":"10.1109/TVLSI.2025.3600811","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3600811","url":null,"abstract":"This brief presents a high-sensitivity battery-free radio frequency (RF) energy harvesting system with ultralow-power auxiliary modules. The proposed design implements two-stage energy conversion based on burst charging mode, achieving ultrahigh sensitivity by using an intermittent charging method that eliminates the charge pump’s loading effect on the RF rectifier. An all-nMOS RF--dc rectifier with internal <inline-formula> <tex-math>${V}_{mathrm {TH}}$ </tex-math></inline-formula> cancellation (IVC) technique achieves an ultrawide high-power conversion efficiency (PCE) input power range for an RF energy harvesting. Furthermore, a <inline-formula> <tex-math>${V} _{mathrm {TH}}$ </tex-math></inline-formula>-based voltage reference is introduced, enabling subthreshold operation of transistors with picowatt-level power consumption, thereby simultaneously improving both PCE and sensitivity. The proposed RF energy harvesting system is implemented with a 0.18-<inline-formula> <tex-math>$mu $ </tex-math></inline-formula>m standard CMOS technology. The results show that the system achieves a 55% PCE, a −33-dBm sensitivity, and a 17-dB input power range at 2.4 GHz.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"33 12","pages":"3520-3524"},"PeriodicalIF":3.1,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145595108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1