IEEE Transactions on Very Large Scale Integration (VLSI) Systems最新文献

英文中文

EDMOT: A 25–65 ns Latency Event-Driven Multiobject Tracker for Dynamic Vision Sensors EDMOT：一种时延25 - 65ns的动态视觉传感器事件驱动多目标跟踪器

IF 3.1 2区工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Pub Date : 2025-11-25 DOI: 10.1109/TVLSI.2025.3633693

Feiqiang Li;Yujie Huang;Yaoyi Chen;Mingyu Wang;Minge Jing;Wenhong Li;Xiaoyang Zeng

Dynamic vision sensors (DVSs), renowned for their low latency and sparse event-driven output, have garnered significant attention in machine vision applications, particularly in latency-sensitive applications like object tracking. However, current DVS-based object tracking systems have not fully utilized the advantages of DVS due to their frame-based processing or the high computing intensity of their algorithms. This brief presents EDMOT, a fully event-driven multiobject tracking system for DVS, achieving 25–65 ns latency at 200 MHz. EDMOT introduces three key innovations: 1) a novel event-driven update mechanism that processes only the latest event and the expired oldest event, minimizing computational overhead; 2) a dual-threshold tracking strategy that decouples object formation and motion phases, significantly improving tracking accuracy; and 3) a row–column feature memory with flag registers, enabling object separation within eight clock cycles. The proposed EDMOT is evaluated on public datasets, demonstrating superior tracking accuracy compared to prior methods. Finally, EDMOT was implemented at the HLMC 55 nm, supporting 20–100 Me/s throughput. To the best of our knowledge, this is a multiobject tracking system with minimal delay and the highest event processing throughput.

动态视觉传感器（DVSs）以其低延迟和稀疏事件驱动输出而闻名，在机器视觉应用中引起了极大的关注，特别是在对象跟踪等延迟敏感应用中。然而，目前基于分布式交换机的目标跟踪系统由于其基于帧的处理或算法的高计算强度，并没有充分发挥分布式交换机的优势。本文介绍了EDMOT，一种完全事件驱动的分布式交换机多目标跟踪系统，在200mhz下实现25 - 65ns延迟。EDMOT引入了三个关键创新：1)一种新颖的事件驱动更新机制，只处理最新的事件和过期的最老的事件，最大限度地减少了计算开销；2)采用双阈值跟踪策略，将目标形成和运动阶段解耦，显著提高了跟踪精度；3)带有标志寄存器的行-列特征存储器，可在8个时钟周期内实现对象分离。提出的EDMOT在公共数据集上进行了评估，与之前的方法相比，显示出更高的跟踪精度。最后，EDMOT在HLMC 55 nm上实现，支持20-100 Me/s的吞吐量。据我们所知，这是一个具有最小延迟和最高事件处理吞吐量的多目标跟踪系统。

{"title":"EDMOT: A 25–65 ns Latency Event-Driven Multiobject Tracker for Dynamic Vision Sensors","authors":"Feiqiang Li;Yujie Huang;Yaoyi Chen;Mingyu Wang;Minge Jing;Wenhong Li;Xiaoyang Zeng","doi":"10.1109/TVLSI.2025.3633693","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3633693","url":null,"abstract":"Dynamic vision sensors (DVSs), renowned for their low latency and sparse event-driven output, have garnered significant attention in machine vision applications, particularly in latency-sensitive applications like object tracking. However, current DVS-based object tracking systems have not fully utilized the advantages of DVS due to their frame-based processing or the high computing intensity of their algorithms. This brief presents EDMOT, a fully event-driven multiobject tracking system for DVS, achieving 25–65 ns latency at 200 MHz. EDMOT introduces three key innovations: 1) a novel event-driven update mechanism that processes only the latest event and the expired oldest event, minimizing computational overhead; 2) a dual-threshold tracking strategy that decouples object formation and motion phases, significantly improving tracking accuracy; and 3) a row–column feature memory with flag registers, enabling object separation within eight clock cycles. The proposed EDMOT is evaluated on public datasets, demonstrating superior tracking accuracy compared to prior methods. Finally, EDMOT was implemented at the HLMC 55 nm, supporting 20–100 Me/s throughput. To the best of our knowledge, this is a multiobject tracking system with minimal delay and the highest event processing throughput.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"34 2","pages":"692-696"},"PeriodicalIF":3.1,"publicationDate":"2025-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146015919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An LSGQ-FFS Framework for Adaptive Optimization of Hybrid INT-CIM Architecture 基于LSGQ-FFS的混合INT-CIM结构自适应优化框架

IF 3.1 2区工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Pub Date : 2025-11-13 DOI: 10.1109/TVLSI.2025.3627654

Shaochen Li;Xi Chen;Yujia Xiong;Yuhan Liu;Lingyi Kong;He Wang;Tianhui Jiao;Xing Wang;Xiaomin Li;Yan Yan;Yuchen Ma;Xin Si

Hybrid computing-in-memory (CIM) has recently gained significant attention due to its ability to leverage the strengths of both digital CIM (DCIM) and analog CIM (ACIM). The multibit fusion (MF) scheme enhances energy efficiency by fusing low-bit results, which typically require multiple read-out cycles, into a single-cycle read out. However, the relationship between hybrid INT-CIM circuit design and network performance based on the MF scheme has not yet been systematically explored. In addition, we investigate how different MF configurations affect the performance of various neural networks. To address this gap, we first propose a less-significant group quantization (LSGQ) model, which defines and explores the design space of hybrid INT-CIM. Second, we develop a FastFuse-Search (FFS) algorithm, which optimizes configurations for different networks to strike a better balance between model accuracy and energy efficiency. Based on the experimental results, some key considerations on hybrid CIM design are derived. FFS yields a

$1.72times $

energy-efficiency boost with negligible accuracy loss. Finally, we fabricate a 28-nm hybrid INT-CIM test chip, achieving 59.74 TOPS/W and 0.96 TOPS/mm², with performance metrics of 23.21 perplexity for GPT-2, 68.69% accuracy for ResNet18, and 80.53% accuracy for ViT.

由于能够同时利用数字CIM （DCIM）和模拟CIM （ACIM）的优势，混合内存计算（CIM）最近获得了极大的关注。多比特融合（MF）方案通过将通常需要多个读出周期的低比特结果融合到单周期读出中来提高能源效率。然而，基于MF方案的混合INT-CIM电路设计与网络性能之间的关系尚未得到系统的探讨。此外，我们研究了不同的MF配置如何影响各种神经网络的性能。为了解决这一差距，我们首先提出了一个不太重要的群体量化（LSGQ）模型，该模型定义并探索了混合intcim的设计空间。其次，我们开发了一种快速融合搜索（FastFuse-Search， FFS）算法，该算法优化了不同网络的配置，以更好地平衡模型精度和能源效率。根据实验结果，推导了混合CIM设计的一些关键考虑因素。FFS产生1.72倍的能源效率提升，精度损失可以忽略不计。最后，我们制作了一个28纳米的混合INT-CIM测试芯片，达到59.74 TOPS/W和0.96 TOPS/mm2， GPT-2的perplexity为23.21，ResNet18的准确率为68.69%，ViT的准确率为80.53%。

{"title":"An LSGQ-FFS Framework for Adaptive Optimization of Hybrid INT-CIM Architecture","authors":"Shaochen Li;Xi Chen;Yujia Xiong;Yuhan Liu;Lingyi Kong;He Wang;Tianhui Jiao;Xing Wang;Xiaomin Li;Yan Yan;Yuchen Ma;Xin Si","doi":"10.1109/TVLSI.2025.3627654","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3627654","url":null,"abstract":"Hybrid computing-in-memory (CIM) has recently gained significant attention due to its ability to leverage the strengths of both digital CIM (DCIM) and analog CIM (ACIM). The multibit fusion (MF) scheme enhances energy efficiency by fusing low-bit results, which typically require multiple read-out cycles, into a single-cycle read out. However, the relationship between hybrid INT-CIM circuit design and network performance based on the MF scheme has not yet been systematically explored. In addition, we investigate how different MF configurations affect the performance of various neural networks. To address this gap, we first propose a less-significant group quantization (LSGQ) model, which defines and explores the design space of hybrid INT-CIM. Second, we develop a FastFuse-Search (FFS) algorithm, which optimizes configurations for different networks to strike a better balance between model accuracy and energy efficiency. Based on the experimental results, some key considerations on hybrid CIM design are derived. FFS yields a <inline-formula> <tex-math>$1.72times $ </tex-math></inline-formula> energy-efficiency boost with negligible accuracy loss. Finally, we fabricate a 28-nm hybrid INT-CIM test chip, achieving 59.74 TOPS/W and 0.96 TOPS/mm<sup>2</sup>, with performance metrics of 23.21 perplexity for GPT-2, 68.69% accuracy for ResNet18, and 80.53% accuracy for ViT.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"34 2","pages":"677-681"},"PeriodicalIF":3.1,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146015920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Wide Fault Coverage Pre-Bond TSV Test Method Based on Ring Oscillator Using Variable Discharge Path 基于变放电路径环振的大故障覆盖键前TSV测试方法

IF 3.1 2区工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Pub Date : 2025-11-11 DOI: 10.1109/TVLSI.2025.3629045

Xu Fang;Xiaofeng Zhao

Through silicon via (TSV), serving as the vertical signal path in 3-D integrated circuits (3D-ICs), must be tested prior to die bonding (pre-bond). Most conventional pre-bond TSV test methods are often limited in terms of leakage fault detection range, hybrid fault detection capability, and fault parameter diagnosability. To address these challenges, we propose a pre-bond TSV test method based on ring oscillator (RO) circuits using a variable discharge path, capable of detecting and diagnosing a wide range of TSV faults, including open, leakage, and hybrid open-leakage faults with high resolution. The entire DfT framework employs standard digital cells to construct all test structures, facilitating its application in diverse testing scenarios. The effectiveness of the proposed method is evaluated through HSPICE simulations. The simulation results demonstrate that, in the presence of PVT variation, the proposed method can effectively detect and diagnose open faults occurring within the first 96.7% tail section of TSV with a maximum diagnosing error of 3.71 fF. Additionally, it can accurately identify leakage faults exceeding 2.5 nS with a maximum diagnosing error of 13.5%.

作为3d集成电路（3d - ic）中垂直信号通路的通硅通孔（TSV）必须在晶片键合（预键合）之前进行测试。传统的焊前TSV检测方法在漏电故障检测范围、混合故障检测能力和故障参数可诊断性等方面存在一定的局限性。为了解决这些挑战，我们提出了一种基于环形振荡器（RO）电路的键前TSV测试方法，该方法采用可变放电路径，能够以高分辨率检测和诊断各种TSV故障，包括开路、漏电和混合式开路和漏电故障。整个DfT框架采用标准数字单元来构建所有测试结构，便于其在各种测试场景中的应用。通过HSPICE仿真验证了该方法的有效性。仿真结果表明，在PVT存在变化的情况下，该方法能够有效地检测和诊断TSV尾部前96.7%区间内的开路故障，最大诊断误差为3.71 fF。该方法能准确识别大于2.5 nS的泄漏故障，最大诊断误差为13.5%。

{"title":"A Wide Fault Coverage Pre-Bond TSV Test Method Based on Ring Oscillator Using Variable Discharge Path","authors":"Xu Fang;Xiaofeng Zhao","doi":"10.1109/TVLSI.2025.3629045","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3629045","url":null,"abstract":"Through silicon via (TSV), serving as the vertical signal path in 3-D integrated circuits (3D-ICs), must be tested prior to die bonding (pre-bond). Most conventional pre-bond TSV test methods are often limited in terms of leakage fault detection range, hybrid fault detection capability, and fault parameter diagnosability. To address these challenges, we propose a pre-bond TSV test method based on ring oscillator (RO) circuits using a variable discharge path, capable of detecting and diagnosing a wide range of TSV faults, including open, leakage, and hybrid open-leakage faults with high resolution. The entire DfT framework employs standard digital cells to construct all test structures, facilitating its application in diverse testing scenarios. The effectiveness of the proposed method is evaluated through HSPICE simulations. The simulation results demonstrate that, in the presence of PVT variation, the proposed method can effectively detect and diagnose open faults occurring within the first 96.7% tail section of TSV with a maximum diagnosing error of 3.71 fF. Additionally, it can accurately identify leakage faults exceeding 2.5 nS with a maximum diagnosing error of 13.5%.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"34 2","pages":"648-661"},"PeriodicalIF":3.1,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146015932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Crosstalk Suppressed Reconfigurable Fully Passive Multichannel Noise-Shaping SAR ADC 一种串扰抑制可重构全无源多通道噪声整形SAR ADC

IF 3.1 2区工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Pub Date : 2025-11-11 DOI: 10.1109/TVLSI.2025.3627560

Yu Lu;Pengfei Jiang;Qingsong Zhang;Tianyue Sun;Wenjun Gong;Yuxin Liao;Hao Min

This brief presents a reconfigurable multichannel noise-shaping (NS) SAR analog-to-digital converter (ADC) with crosstalk suppression. All nonmemory blocks are multiplexed among all channels assisted by the proposed timing control scheme, enabling reconfiguration with negligible power and hardware cost. A novel approach to determine the design parameters of nonideal loop filters is proposed, and a crosstalk suppressed multichannel fully passive loop filter is developed based on this method. A memoryless binary dynamic element matching (DEM) is also adopted to suppress the nonlinearity of each channel. The ADC achieves a typical SNDR of 77.49/72.17/66.62 dB with a 20-kHz bandwidth under one-/two-/four-channel modes, respectively, realizing typical Schreier FoMs of 169.54, 167.20, and 164.64 dB. Averaged interchannel crosstalks are −79.17 and −77.37 dB with −2.5-dBFS inputs in all input channels under two- and four-channel modes.

本文介绍了一种具有串扰抑制功能的可重构多通道噪声整形SAR模数转换器（ADC）。所有的非记忆体块都在所有的通道中进行多路复用，并辅以所提出的时序控制方案，使得重新配置的功耗和硬件成本可以忽略不计。提出了一种确定非理想环路滤波器设计参数的新方法，并在此基础上研制了一种串扰抑制多通道全无源环路滤波器。采用无记忆二进制动态单元匹配（DEM）来抑制各通道的非线性。该ADC在单通道/双通道/四通道模式下的典型SNDR分别为77.49/72.17/66.62 dB，带宽为20 khz，可实现典型的Schreier FoMs为169.54、167.20和164.64 dB。在双通道和四通道模式下，当输入为- 2.5 dbfs时，平均通道间串扰分别为- 79.17和- 77.37 dB。

引用次数: 0

A Tri-Band Two-Stage LNA With Simultaneous Linearity and Gain Enhancement for WiFi 一种用于WiFi的同时线性和增益增强的三频段两级LNA

IF 3.1 2区工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Pub Date : 2025-11-10 DOI: 10.1109/TVLSI.2025.3627973

Teng-Shen Yang;Liang-Hung Lu

This brief presents a tri-band, two-stage compact low-noise amplifier (LNA) that simultaneously enhances linearity, gain, and noise performance for WiFi applications. The second stage adopts a dual-path architecture, consisting of a main amplifier and an additional amplifier. The additional amplifier, biased in the subthreshold region, suppresses third-order nonlinearity and enhances gain without increasing power consumption. The first-stage LNA reduces the noise contribution from the second stage, improving overall noise performance. To further minimize power consumption, an inverter-based topology is employed. Fabricated in a 90-nm CMOS process, the proposed LNA achieves an

$S_{11}$

below −5 dB at 2.4, 5, and 6 GHz, covering key WiFi bands. At 6-GHz band, it delivers 13.5-dB gain, 3.2-dBm third-order input intercept point (IIP3), and 3.1-dB noise figure (NF). At 5-GHz band, it achieves 15-dB gain, 0.7-dBm IIP3, and 2.76-dB NF. At 2.4-GHz band, it provides 20.66-dB gain, −7-dBm IIP3, and 2.7-dB NF. The circuit consumes only 3 mW of dc power. Measurements at 6 GHz show that the dual-path technique in the second stage improves IIP3 by 8.6 dB, increases gain by 1.5 dB, and reduces NF by 0.6 dB, all without additional power or area overhead.

本简报介绍了一种三频段，两级紧凑型低噪声放大器（LNA），可同时增强WiFi应用的线性度，增益和噪声性能。第二级采用双路结构，由一个主放大器和一个附加放大器组成。附加放大器偏置于亚阈值区域，抑制三阶非线性并在不增加功耗的情况下提高增益。第一级LNA降低了第二级的噪声贡献，提高了整体噪声性能。为了进一步降低功耗，采用了基于逆变器的拓扑结构。该LNA采用90纳米CMOS工艺制造，在2.4 GHz、5 GHz和6 GHz下实现了低于- 5 dB的$S_{11}$，覆盖了关键的WiFi频段。在6ghz频段，它提供13.5 db增益，3.2 dbm三阶输入截距点（IIP3）和3.1 db噪声系数（NF）。在5ghz频段，它实现了15db增益，0.7 dbm IIP3和2.76 db NF。在2.4 ghz频段，增益为20.66 db， IIP3为−7 dbm， NF为2.7 db。电路只消耗3毫瓦的直流功率。在6 GHz的测量表明，双路技术在第二级提高IIP3 8.6 dB，增加增益1.5 dB，减少NF 0.6 dB，所有这些都没有额外的功率或面积开销。

{"title":"A Tri-Band Two-Stage LNA With Simultaneous Linearity and Gain Enhancement for WiFi","authors":"Teng-Shen Yang;Liang-Hung Lu","doi":"10.1109/TVLSI.2025.3627973","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3627973","url":null,"abstract":"This brief presents a tri-band, two-stage compact low-noise amplifier (LNA) that simultaneously enhances linearity, gain, and noise performance for WiFi applications. The second stage adopts a dual-path architecture, consisting of a main amplifier and an additional amplifier. The additional amplifier, biased in the subthreshold region, suppresses third-order nonlinearity and enhances gain without increasing power consumption. The first-stage LNA reduces the noise contribution from the second stage, improving overall noise performance. To further minimize power consumption, an inverter-based topology is employed. Fabricated in a 90-nm CMOS process, the proposed LNA achieves an <inline-formula> <tex-math>$S_{11}$ </tex-math></inline-formula> below −5 dB at 2.4, 5, and 6 GHz, covering key WiFi bands. At 6-GHz band, it delivers 13.5-dB gain, 3.2-dBm third-order input intercept point (IIP3), and 3.1-dB noise figure (NF). At 5-GHz band, it achieves 15-dB gain, 0.7-dBm IIP3, and 2.76-dB NF. At 2.4-GHz band, it provides 20.66-dB gain, −7-dBm IIP3, and 2.7-dB NF. The circuit consumes only 3 mW of dc power. Measurements at 6 GHz show that the dual-path technique in the second stage improves IIP3 by 8.6 dB, increases gain by 1.5 dB, and reduces NF by 0.6 dB, all without additional power or area overhead.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"34 2","pages":"687-691"},"PeriodicalIF":3.1,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146015941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Area-Time Efficient Formula-Based BCH Decoder With Trace Mechanism for WBAN Applications 基于区域时间效率公式的BCH解码器，具有WBAN应用的跟踪机制

IF 3.1 2区工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Pub Date : 2025-11-04 DOI: 10.1109/TVLSI.2025.3624380

Vinh T. Nguyen;Wei-Che Liang;Thai N. Nguyen;Shao-I Chu;Bing-Hong Liu;Chen-Yang Hong;Shao-Tong Chen

This brief presents the area-time efficient decoding algorithm and architecture for the double error correcting

$(63,51)$

Bose–Chaudhuri–Hocquenghem (BCH) code with applications to wireless body area networks (WBANs). The formula for finding the roots of an error locator polynomial (ELP) is rederived to reduce the decoding complexity. The trace constraint for this formula is also taken into consideration to avoid the performance loss of bit error rate (BER). Hardware implementation results reveal that the proposed architecture surpasses the well-known Chien search-based and searchless decoders by the improvements of at least 50.89% and 29.35%, respectively, in terms of area-time complexity.

本文简要介绍了双纠错$(63,51)$ Bose-Chaudhuri-Hocquenghem （BCH）码的区域时间高效解码算法和结构，并应用于无线体域网络（wban）。为了降低译码复杂度，重新推导了求错误定位多项式（ELP）根的公式。为了避免误码率（BER）带来的性能损失，该公式还考虑了迹约束。硬件实现结果表明，该架构在区域时间复杂度方面分别优于著名的基于搜索和无搜索的Chien解码器，分别提高了50.89%和29.35%。

引用次数: 0

A 0.31-V 16-Kb 9T SRAM With Enhanced Sensing Margin and Read Performance for Low-Power Applications 低功耗应用中具有增强传感裕度和读性能的0.31 v 16kb 9T SRAM

IF 3.1 2区工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Pub Date : 2025-11-04 DOI: 10.1109/TVLSI.2025.3624842

Pengyuan Zhao;Huidong Zhao;Linnan Li;Zhi Li;Minglong Jia;Xiang Li;Shushan Qiao

This brief presents a low-power 9T static random access memory (SRAM) with enhanced read sensing margin and read performance. The read decoupled port of the proposed 9T SRAM cell achieves the enhanced sensing margin by mitigating the read bitline (RBL) leakage and improves the read performance through using one-transistor read path. The multithreshold voltage devices are used in SRAM cell for improving the leakage power and performance of SRAM. Additionally, an interleaved write wordline (WWL) structure is implemented to address the write half-select issue. The measurement results of the test chip fabricated in the 22-nm FDSOI technology demonstrate that the designed 9T SRAM achieves a minimum operation voltage of 0.31 V at 1.05 MHz and can operate at 60.5 MHz when the supply voltage is 0.5 V. The minimum active energy of 18.56 fJ/access-bit is obtained at 0.33 V. Furthermore, the designed SRAM exhibits a minimum leakage power of 0.11 pW/bitcell in the retention mode.

本文介绍了一种具有增强读感知裕度和读性能的低功耗9T静态随机存取存储器（SRAM）。所提出的9T SRAM单元的读去耦端口通过减少读位线（RBL）泄漏实现了增强的传感裕度，并通过使用单晶体管读路径提高了读性能。为了提高SRAM的泄漏功率和性能，在SRAM单元中采用了多阈值电压器件。此外，还实现了一个交错写字行（WWL）结构来解决写半选择问题。采用22 nm FDSOI技术制作的测试芯片的测量结果表明，所设计的9T SRAM在1.05 MHz时的最小工作电压为0.31 V，在电源电压为0.5 V时可以工作在60.5 MHz。在0.33 V时获得了18.56 fJ/存取位的最小有功能量。此外，所设计的SRAM在保持模式下的最小泄漏功率为0.11 pW/位元。

引用次数: 0

ASTRA: Automated Insertion of Distributed Entropy Sources for Robust Authentication 用于鲁棒认证的分布式熵源自动插入

IF 3.1 2区工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Pub Date : 2025-10-31 DOI: 10.1109/TVLSI.2025.3624325

Atri Chatterjee;Habibur Rahaman;Swarup Bhunia

The horizontal business model of modern semiconductors—where design, fabrication, and testing are handled by separate entities across a global supply chain—exposes integrated circuits (ICs) to various security threats throughout their lifecycle. Physical unclonable functions (PUFs) have emerged as effective hardware security primitives for device identification and attestation. However, integrating PUFs into existing designs is often manual, labor-intensive, and incurs high overhead in area, power, and design time. Moreover, traditional PUFs are typically localized to small regions of a chip, limiting entropy extraction from the full design surface. To address these limitations, we propose ASTRA, an automated framework that integrates PUF-based entropy sources into digital logic circuits in a distributed and timing-aware fashion. ASTRA enhances the conventional logic synthesis flow by inserting memory-in-logic PUFs (MeLPUFs), which are constructed using standard cell elements and offer high entropy. By distributing MeLPUF primitives across the circuit, ASTRA maximizes response randomness while minimizing area and power overhead. It can also reuse existing logic elements and supports multiple MeLPUF templates. ASTRA ensures timing constraints are respected and enables validation of both functional and logic equivalence checking (LEC) between the original and PUF-inserted designs. Experimental results show that ASTRA achieves near-ideal PUF quality metrics, demonstrating its effectiveness and scalability for secure hardware design.

现代半导体的横向商业模式——设计、制造和测试由全球供应链上的独立实体处理——使集成电路（ic）在其整个生命周期中面临各种安全威胁。物理不可克隆函数（puf）已经成为设备识别和认证的有效硬件安全原语。然而，将puf集成到现有设计中通常是手工的、劳动密集型的，并且在面积、功率和设计时间上产生很高的开销。此外，传统的puf通常局限于芯片的小区域，限制了从整个设计表面提取熵。为了解决这些限制，我们提出了ASTRA，这是一个自动化框架，以分布式和时间感知的方式将基于puf的熵源集成到数字逻辑电路中。ASTRA通过插入内存-逻辑puf （melpuf）来增强传统的逻辑合成流程，melpuf是使用标准单元元素构建的，具有高熵。通过在整个电路中分布MeLPUF原语，ASTRA最大限度地提高了响应随机性，同时最小化了面积和功耗开销。它还可以重用现有的逻辑元素，并支持多个MeLPUF模板。ASTRA确保时间约束得到尊重，并允许在原始设计和插入puf的设计之间进行功能和逻辑等效检查（LEC）的验证。实验结果表明，ASTRA达到了接近理想的PUF质量指标，证明了其在安全硬件设计中的有效性和可扩展性。

{"title":"ASTRA: Automated Insertion of Distributed Entropy Sources for Robust Authentication","authors":"Atri Chatterjee;Habibur Rahaman;Swarup Bhunia","doi":"10.1109/TVLSI.2025.3624325","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3624325","url":null,"abstract":"The horizontal business model of modern semiconductors—where design, fabrication, and testing are handled by separate entities across a global supply chain—exposes integrated circuits (ICs) to various security threats throughout their lifecycle. Physical unclonable functions (PUFs) have emerged as effective hardware security primitives for device identification and attestation. However, integrating PUFs into existing designs is often manual, labor-intensive, and incurs high overhead in area, power, and design time. Moreover, traditional PUFs are typically localized to small regions of a chip, limiting entropy extraction from the full design surface. To address these limitations, we propose ASTRA, an automated framework that integrates PUF-based entropy sources into digital logic circuits in a distributed and timing-aware fashion. ASTRA enhances the conventional logic synthesis flow by inserting memory-in-logic PUFs (MeLPUFs), which are constructed using standard cell elements and offer high entropy. By distributing MeLPUF primitives across the circuit, ASTRA maximizes response randomness while minimizing area and power overhead. It can also reuse existing logic elements and supports multiple MeLPUF templates. ASTRA ensures timing constraints are respected and enables validation of both functional and logic equivalence checking (LEC) between the original and PUF-inserted designs. Experimental results show that ASTRA achieves near-ideal PUF quality metrics, demonstrating its effectiveness and scalability for secure hardware design.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"34 2","pages":"634-647"},"PeriodicalIF":3.1,"publicationDate":"2025-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146015935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

IEEE Transactions on Very Large Scale Integration (VLSI) Systems Society Information 超大规模集成电路（VLSI）系统学报

IF 3.1 2区工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Pub Date : 2025-10-30 DOI: 10.1109/TVLSI.2025.3621796

引用次数: 0

A Buffer-in-Loop NS-SAR ADC With CDAC Sharing-Error Feedback Structure and kT/C Noise Cancellation in 0.18-μm CMOS Technology 基于0.18 μm CMOS技术的CDAC共享误差反馈和kT/C噪声抑制的环内缓冲NS-SAR ADC

IF 3.1 2区工程技术 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Pub Date : 2025-10-30 DOI: 10.1109/TVLSI.2025.3619126

Zirui Cui;Yiqun Shi;Qingqing Sun;Hao Zhu

Analog-to-digital converters (ADCs) with a wide input range and power efficiency are highly desired in various applications like robust industrial sensor data acquisition and bioelectrical signal monitoring, where successive approximation register (SAR) ADCs fabricated in mature process nodes account for a large proportion. This brief presents a buffer-in-loop (BIL) noise-shaping (NS) SAR ADC with a proposed CDAC sharing (CS)-error-feedback (EF) structure and kT/

$C$

cancellation method in 180 nm. Compared with traditional EF methods, this CS-EF NS scheme takes advantage of the simple BIL structure and reduces the input signal degradation problem due to charge sharing with the input signal or parasitic capacitor of the series EF capacitor in the signal path, and enhances the system robustness. Combined with the kT/

$C$

cancellation and coarse-to-fine conversion techniques, this BIL NS-SAR reduces the noise from sampling, buffer, and pre-amp. The measured peak signal-to-noise-and-distortion ratio (SNDR) is 74.9 dB with a bandwidth of 0.6 MHz, and this ADC consumes

$218.7~mu $

W without the buffer.

具有宽输入范围和功率效率的模数转换器（adc）在鲁棒工业传感器数据采集和生物电信号监测等各种应用中都是非常需要的，其中在成熟工艺节点中制造的逐次逼近寄存器（SAR） adc占很大比例。本文介绍了一种环内缓冲（BIL）噪声整形（NS） SAR ADC，该ADC采用了CDAC共享(CS)-误差反馈（EF）结构和180 nm的kT/ $C$抵消方法。与传统EF方法相比，CS-EF NS方案利用了简单的BIL结构，减少了信号路径中由于与输入信号或串联EF电容的寄生电容共用电荷而导致的输入信号退化问题，增强了系统的鲁棒性。结合kT/ $C$对消和粗到精转换技术，该BIL NS-SAR降低了采样，缓冲和前置放大器的噪声。测量的峰值信噪比（SNDR）为74.9 dB，带宽为0.6 MHz，该ADC在不含缓冲器的情况下功耗为218.7~mu $ W。

{"title":"A Buffer-in-Loop NS-SAR ADC With CDAC Sharing-Error Feedback Structure and kT/C Noise Cancellation in 0.18-μm CMOS Technology","authors":"Zirui Cui;Yiqun Shi;Qingqing Sun;Hao Zhu","doi":"10.1109/TVLSI.2025.3619126","DOIUrl":"https://doi.org/10.1109/TVLSI.2025.3619126","url":null,"abstract":"Analog-to-digital converters (ADCs) with a wide input range and power efficiency are highly desired in various applications like robust industrial sensor data acquisition and bioelectrical signal monitoring, where successive approximation register (SAR) ADCs fabricated in mature process nodes account for a large proportion. This brief presents a buffer-in-loop (BIL) noise-shaping (NS) SAR ADC with a proposed CDAC sharing (CS)-error-feedback (EF) structure and <italic>kT</i>/<inline-formula> <tex-math>$C$ </tex-math></inline-formula> cancellation method in 180 nm. Compared with traditional EF methods, this CS-EF NS scheme takes advantage of the simple BIL structure and reduces the input signal degradation problem due to charge sharing with the input signal or parasitic capacitor of the series EF capacitor in the signal path, and enhances the system robustness. Combined with the <italic>kT</i>/<inline-formula> <tex-math>$C$ </tex-math></inline-formula> cancellation and coarse-to-fine conversion techniques, this BIL NS-SAR reduces the noise from sampling, buffer, and pre-amp. The measured peak signal-to-noise-and-distortion ratio (SNDR) is 74.9 dB with a bandwidth of 0.6 MHz, and this ADC consumes <inline-formula> <tex-math>$218.7~mu $ </tex-math></inline-formula>W without the buffer.","PeriodicalId":13425,"journal":{"name":"IEEE Transactions on Very Large Scale Integration (VLSI) Systems","volume":"34 1","pages":"327-331"},"PeriodicalIF":3.1,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145847795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀