Integration-The Vlsi Journal最新文献_第2页

True canonical third-order resonance-based oscillators and application to chaos generation 真正则三阶谐振振荡器及其在混沌产生中的应用

IF 2.5 3区工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Integration-The Vlsi Journal

Pub Date : 2026-05-01 Epub Date: 2026-01-29 DOI: 10.1016/j.vlsi.2026.102668

Ahmed S. Elwakil , Abdulrahman A. Nutfaji , Brent J. Maundy

In this work, we introduce the set of four true canonical third-order oscillators based on third-order resonance networks with two capacitors and one inductor (2C-1L). Although a number of third-order LC oscillators are readily known, including some versions of the Colpitts oscillator, the unique set of four possible oscillators that are based on true canonical third-order resonance networks have never been explicitly introduced in the literature. This is because unlike second-order resonance networks, third-order ones are not in wide use and are hardly studied in basic circuit theory textbooks. We first recall the four elementary third-order canonical resonance networks and then focus on the two networks having a single inductor. Using these two networks, the set of four possible oscillators are derived and experimentally verified. An application in chaos generation is also presented and verified experimentally and via numerical simulations.

在这项工作中，我们介绍了基于三阶谐振网络（2C-1L）的四个真正则三阶振子。虽然许多三阶LC振荡器是众所周知的，包括一些版本的Colpitts振荡器，但基于真正规范三阶共振网络的四种可能振荡器的独特集合从未在文献中明确介绍过。这是因为与二阶共振网络不同，三阶共振网络的应用并不广泛，在基础电路理论教科书中也很少有研究。我们首先回顾了四个基本的三阶正则共振网络，然后重点讨论了具有单个电感的两个网络。利用这两种网络，推导了四种可能的振子集合，并进行了实验验证。并通过实验和数值模拟验证了该方法在混沌生成中的应用。

引用次数: 0

Design of a load modulated balance doherty power amplifier using parallel coupled line (PCL) structure line coupler for 5G IoT applications 5G物联网应用中采用并联耦合线（PCL）结构的负载调制平衡多赫蒂功率放大器的设计

IF 2.5 3区工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Integration-The Vlsi Journal

Pub Date : 2026-05-01 Epub Date: 2026-01-15 DOI: 10.1016/j.vlsi.2026.102659

Rajesh Kumar , Sachin Kumar , Binod Kumar Kanaujia

This paper presents a new design of a load-modulated balance doherty power amplifier (LM-BDPA) using a parallel coupled line (PCL) structure line coupler for 5G internet of things (IoT) applications. The proposed design aims to achieve high efficiency and wide bandwidth, which are critical for 5G communication systems. The LM-BDPA utilizes a PCL structure to enhance the load modulation capability, resulting in improved power-added efficiency (PAE) and linearity. The design also incorporates advanced thermal management techniques to ensure stable operation under high-power conditions. The performance of the proposed LM-BDPA is evaluated through both simulations and measurements. The fabricated prototype achieves a measured gain of 11.2–13.4 dB and a saturated output power of ∼41 dBm. A PAE of 46.4–56.5 % and 43.2–50.3 % is achieved at 6 dB and 8 dB output power back-off, respectively, across the designed frequency band.

针对5G物联网应用，提出了一种采用平行耦合线（PCL）结构线耦合器的负载调制平衡型多谐功率放大器（LM-BDPA）的新设计。提出的设计旨在实现对5G通信系统至关重要的高效率和宽带宽。LM-BDPA采用PCL结构来增强负载调制能力，从而提高了功率附加效率（PAE）和线性度。该设计还采用了先进的热管理技术，以确保在高功率条件下稳定运行。通过仿真和测量对LM-BDPA的性能进行了评价。制作的原型实现了11.2-13.4 dB的测量增益和~ 41 dBm的饱和输出功率。在设计的频段内，在6 dB和8 dB输出功率回退时，PAE分别为46.4 - 56.5%和43.2 - 50.3%。

引用次数: 0

Design of a dynamic obfuscation-based strong PUF resistant to modeling attacks and mutual authentication protocol 设计了一种基于动态模糊的抗建模攻击的强PUF和互认证协议

IF 2.5 3区工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Integration-The Vlsi Journal

Pub Date : 2026-05-01 Epub Date: 2026-01-11 DOI: 10.1016/j.vlsi.2026.102655

Yingchun Lu , Hongliang Lu , Yujie Liu , Huaguo Liang , Zhengfeng Huang , Jinlin Chen , Xiumin Xu , Liang Yao

Strong Physical Unclonable Functions (PUFs) are vulnerable to modeling attacks using Machine Learning (ML), and PUF-based authentication protocols also face security risks. To address these issues, this paper proposes a PUF structure with resistance to modeling attacks based on Dynamic Obfuscation (DO), composed of Linear Feedback Shift Registers (LFSRs), PUFs, and several logic gates. The characteristics of DO are as follows: (1) the initial state of the LFSR is determined by the PUF's response, making it uncontrollable; (2) the updated state of the LFSR determines the obfuscated bit of each input challenge, achieving a dynamic mapping between challenges and responses. An Arbiter PUF (APUF) based on DO is implemented on Xilinx Artix-7 FPGA, and experimental results show that the structure can effectively resist modeling attacks from various ML algorithms, with prediction accuracy close to 50 %. In addition, this paper proposes a mutual authentication protocol based on PUF, suitable for Internet of Things (IoT) systems.

强物理不可克隆函数（puf）容易受到机器学习（ML）的建模攻击，基于puf的认证协议也面临安全风险。为了解决这些问题，本文提出了一种基于动态混淆（DO）的PUF结构，该结构具有抗建模攻击的能力，由线性反馈移位寄存器（LFSRs）， PUF和几个逻辑门组成。DO的特点是：(1)LFSR的初始状态由PUF的响应决定，不可控；(2) LFSR的更新状态决定了每个输入挑战的混淆位，实现了挑战与响应之间的动态映射。在Xilinx Artix-7 FPGA上实现了基于DO的Arbiter PUF (APUF)，实验结果表明，该结构能够有效抵御各种ML算法的建模攻击，预测准确率接近50%。此外，本文还提出了一种适用于物联网（IoT）系统的基于PUF的互认证协议。

{"title":"Design of a dynamic obfuscation-based strong PUF resistant to modeling attacks and mutual authentication protocol","authors":"Yingchun Lu , Hongliang Lu , Yujie Liu , Huaguo Liang , Zhengfeng Huang , Jinlin Chen , Xiumin Xu , Liang Yao","doi":"10.1016/j.vlsi.2026.102655","DOIUrl":"10.1016/j.vlsi.2026.102655","url":null,"abstract":"<div><div>Strong Physical Unclonable Functions (PUFs) are vulnerable to modeling attacks using Machine Learning (ML), and PUF-based authentication protocols also face security risks. To address these issues, this paper proposes a PUF structure with resistance to modeling attacks based on Dynamic Obfuscation (DO), composed of Linear Feedback Shift Registers (LFSRs), PUFs, and several logic gates. The characteristics of DO are as follows: (1) the initial state of the LFSR is determined by the PUF's response, making it uncontrollable; (2) the updated state of the LFSR determines the obfuscated bit of each input challenge, achieving a dynamic mapping between challenges and responses. An Arbiter PUF (APUF) based on DO is implemented on Xilinx Artix-7 FPGA, and experimental results show that the structure can effectively resist modeling attacks from various ML algorithms, with prediction accuracy close to 50 %. In addition, this paper proposes a mutual authentication protocol based on PUF, suitable for Internet of Things (IoT) systems.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"108 ","pages":"Article 102655"},"PeriodicalIF":2.5,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145980161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Hybrid algorithm based optimization strategies for analog circuit sizing in low dropout regulators 基于混合算法的低差稳压器模拟电路尺寸优化策略

IF 2.5 3区工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Integration-The Vlsi Journal

Pub Date : 2026-05-01 Epub Date: 2026-01-05 DOI: 10.1016/j.vlsi.2026.102646

S. Karipidis , A. Buzo , G. Pelz , T. Noulis

Analog and Mixed Signal circuit sizing with large-scale parameters requires a lot of simulations, especially in non-linear topology where large-signal analysis is a need. Reducing the number of simulations and in general the total design cycle time, is the main objective for optimal sizing of complicated circuits. In this work a circuit sizing automated design methodology is presented using the hybrid dual annealing and Nelder–Mead algorithm, significantly reducing the design cycle time and the required number of transient simulations. A customized hybrid algorithm environment using Dual Annealing and Nelder–Mead is developed where the optimization process is divided into different optimization sub-steps. The proposed hybrid algorithm based method achieves rapid convergence to the needed circuit performance specification. It uses combinations of direct search algorithms to separate metric evaluation accelerating the performance specifications convergence speed in a large parameter space. A complicated non-linear topology like a product level low-dropout (LDO) regulator, in 180 nm process node, with 30 parameters is used as the circuit vehicle to verify the proposed methodology. The sizing process converged with less than 1700 simulations having as input just the circuit schematic with no prior sizing knowledge. Sub optimization is also performed focused on each analysis type — DC, AC and transient, with a focus on reducing the number of transient simulations. The proposed combined algorithm method achieved 31 % faster convergence speed compared to the state-of-the-art methods and handles efficiently each simulation analysis.

具有大参数的模拟和混合信号电路需要大量的仿真，特别是在非线性拓扑中需要进行大信号分析。减少模拟次数和总体设计周期时间，是优化复杂电路尺寸的主要目标。在这项工作中，提出了一种使用混合双退火和Nelder-Mead算法的电路尺寸自动设计方法，显着减少了设计周期时间和所需的瞬态模拟次数。开发了一种基于双退火和Nelder-Mead的自定义混合算法环境，将优化过程划分为不同的优化子步骤。基于混合算法的方法能够快速收敛到所需的电路性能指标。它采用直接搜索算法的组合来分离度量评估，加快了性能指标在大参数空间中的收敛速度。采用复杂的非线性拓扑，如产品级低差（LDO）稳压器，在180 nm工艺节点上，30个参数作为电路载体来验证所提出的方法。尺寸过程融合了不到1700个模拟，只有电路原理图作为输入，没有事先的尺寸知识。还针对每种分析类型（直流、交流和瞬态）进行了子优化，重点是减少瞬态模拟的次数。该组合算法的收敛速度比现有方法快31%，并能有效地处理各种仿真分析。

{"title":"Hybrid algorithm based optimization strategies for analog circuit sizing in low dropout regulators","authors":"S. Karipidis , A. Buzo , G. Pelz , T. Noulis","doi":"10.1016/j.vlsi.2026.102646","DOIUrl":"10.1016/j.vlsi.2026.102646","url":null,"abstract":"<div><div>Analog and Mixed Signal circuit sizing with large-scale parameters requires a lot of simulations, especially in non-linear topology where large-signal analysis is a need. Reducing the number of simulations and in general the total design cycle time, is the main objective for optimal sizing of complicated circuits. In this work a circuit sizing automated design methodology is presented using the hybrid dual annealing and Nelder–Mead algorithm, significantly reducing the design cycle time and the required number of transient simulations. A customized hybrid algorithm environment using Dual Annealing and Nelder–Mead is developed where the optimization process is divided into different optimization sub-steps. The proposed hybrid algorithm based method achieves rapid convergence to the needed circuit performance specification. It uses combinations of direct search algorithms to separate metric evaluation accelerating the performance specifications convergence speed in a large parameter space. A complicated non-linear topology like a product level low-dropout (LDO) regulator, in 180 nm process node, with 30 parameters is used as the circuit vehicle to verify the proposed methodology. The sizing process converged with less than 1700 simulations having as input just the circuit schematic with no prior sizing knowledge. Sub optimization is also performed focused on each analysis type — DC, AC and transient, with a focus on reducing the number of transient simulations. The proposed combined algorithm method achieved 31 % faster convergence speed compared to the state-of-the-art methods and handles efficiently each simulation analysis.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"108 ","pages":"Article 102646"},"PeriodicalIF":2.5,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145980165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Encoding and decoding devices based on memristor-diode crossbar-array and CMOS logic for spiking neural networks 基于忆阻二极管交叉栅阵列和CMOS逻辑的尖峰神经网络编解码装置

IF 2.5 3区工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Integration-The Vlsi Journal

Pub Date : 2026-05-01 Epub Date: 2026-01-20 DOI: 10.1016/j.vlsi.2026.102670

A.N. Busygin , S.Yu. Udovichenko , A.H.A. Ebrahim

Electric circuits of the encoding and decoding devices for converting information from binary representation into a spike sequence and back for hardware spiking neural networks are proposed. The devices differ from the known ones by using fully digital circuitry and memristor-diode crossbars, which potentially reduces energy consumption, provides greater integration of elements and, accordingly, a smaller occupied area on the chip. In addition, changing the states of the memristors makes it possible to arbitrarily set the functions of direct and reverse conversion of binary numbers into spike sequences. The operability of the encoding device is confirmed by numerical simulation of the process of encoding input four-digit number to the times of the first spikes and the average spike frequency. The decoding process is verified during the simulation of the extracting of a four-digit binary number encoded in spike times of three neurons.

提出了用于硬件尖峰神经网络的将二进制表示的信息转换成尖峰序列再转换回来的编码和解码装置电路。该器件与已知器件的不同之处在于，它采用了全数字电路和忆阻二极管横条，这可能会降低能耗，提供更大的元件集成度，相应地，芯片上的占用面积也更小。另外，通过改变忆阻器的状态，可以任意设置二进制数到尖峰序列的正反转换功能。通过对输入四位数到第一尖峰次数和平均尖峰频率的编码过程进行数值模拟，验证了编码装置的可操作性。在三个神经元尖峰时间编码的四位数二进制数提取仿真过程中验证了解码过程。

引用次数: 0

MORL-IC: Multi-objective reinforcement learning approaches for analog integrated circuits optimization 模拟集成电路优化的多目标强化学习方法

IF 2.5 3区工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Integration-The Vlsi Journal

Pub Date : 2026-05-01 Epub Date: 2026-01-19 DOI: 10.1016/j.vlsi.2026.102664

Hakan Taşkıran, Engin Afacan

Analog and RF integrated circuit (IC) design requires the simultaneous optimization of multiple, conflicting objectives under highly nonlinear and tightly coupled constraints. While prior studies — including our own — have demonstrated the feasibility of applying multi-objective reinforcement learning (MORL) to analog circuit optimization, the specific impact of workflow-level design choices on convergence behavior, simulation cost, and Pareto-front characteristics has remained insufficiently explored. This paper reformulates the Multi-Objective Deep Deterministic Policy Gradient (MODDPG) approach not as a single fixed algorithm, but as a family of optimization workflows that share an identical multi-objective actor–critic learning core while systematically differing in their initialization strategy and environment evaluation mechanism. Within this unified formulation, three configurations are investigated: (i) a baseline MODDPG workflow with random initialization and direct SPICE evaluation, (ii) MODDPG-2, which employs analytically derived extreme solutions to guide early exploration, and (iii) MODDPG-3, which introduces an ANN-based pseudo-designer to generate boundary solutions directly from performance specifications. In addition, a Fully-ANN execution mode is examined, where an ANN-based pseudo-simulator replaces SPICE during policy learning to accelerate environment interaction. By preserving the same reinforcement learning architecture across all variants, the proposed framework isolates the effects of structured initialization and surrogate-based environments on optimization outcomes. The workflows are evaluated on three analog circuits (active-loaded differential amplifier, folded-cascode amplifier, and voltage comparator) and one RF circuit (CMOS cross-coupled LC oscillator), as well as on standard analytical benchmarks. Comparative results against NSGA-II and MOEA/D show that no single method universally dominates; however, the proposed workflows consistently reduce the number of required SPICE simulations by approximately 30%–75% while maintaining competitive Pareto-front quality. This efficiency gain indicates that the reinforcement-learning agent progressively acquires design intuition comparable to that of an experienced human designer—learning to avoid unpromising regions of the design space and focusing evaluations on high-value candidates. The results therefore demonstrate that, beyond the choice of learning algorithm, workflow-level design decisions critically shape how effectively RL can emulate expert design behavior, offering practical guidance for balancing solution quality and computational cost in automated analog and RF circuit design flows.

模拟和射频集成电路（IC）设计需要在高度非线性和紧密耦合约束下同时优化多个相互冲突的目标。虽然之前的研究（包括我们自己的研究）已经证明了将多目标强化学习（MORL）应用于模拟电路优化的可行性，但工作流级设计选择对收敛行为、仿真成本和帕累托前特征的具体影响仍然没有得到充分的探讨。本文将多目标深度确定性策略梯度（MODDPG）方法重新表述为一组优化工作流，而不是单一的固定算法，这些工作流共享相同的多目标行为者-批评者学习核心，但在初始化策略和环境评估机制上存在系统差异。在这个统一的公式中，研究了三种配置：(i)随机初始化和直接SPICE评估的基线MODDPG工作流，（ii） MODDPG-2，它采用解析导出的极值解来指导早期探索，以及（iii） MODDPG-3，它引入了基于人工神经网络的伪设计器，直接从性能规范生成边界解。此外，研究了全神经网络执行模式，其中基于神经网络的伪模拟器在策略学习期间取代SPICE以加速环境交互。通过在所有变量中保留相同的强化学习架构，所提出的框架隔离了结构化初始化和基于代理的环境对优化结果的影响。工作流程在三个模拟电路（有源负载差分放大器、折叠级联放大器和电压比较器）和一个射频电路（CMOS交叉耦合LC振荡器）以及标准分析基准上进行了评估。与NSGA-II和MOEA/D的比较结果表明，没有一种方法具有普遍的优势；然而，所提出的工作流程始终如一地将所需SPICE模拟的数量减少了大约30%-75%，同时保持了具有竞争力的Pareto-front质量。这种效率增益表明，强化学习代理逐渐获得了与经验丰富的人类设计师相当的设计直觉——学习避免设计空间中没有希望的区域，并将评估集中在高价值的候选对象上。因此，结果表明，除了学习算法的选择之外，工作流级设计决策对强化学习如何有效地模拟专家设计行为至关重要，为在自动化模拟和射频电路设计流程中平衡解决方案质量和计算成本提供了实用指导。

{"title":"MORL-IC: Multi-objective reinforcement learning approaches for analog integrated circuits optimization","authors":"Hakan Taşkıran, Engin Afacan","doi":"10.1016/j.vlsi.2026.102664","DOIUrl":"10.1016/j.vlsi.2026.102664","url":null,"abstract":"<div><div>Analog and RF integrated circuit (IC) design requires the simultaneous optimization of multiple, conflicting objectives under highly nonlinear and tightly coupled constraints. While prior studies — including our own — have demonstrated the feasibility of applying multi-objective reinforcement learning (MORL) to analog circuit optimization, the specific impact of workflow-level design choices on convergence behavior, simulation cost, and Pareto-front characteristics has remained insufficiently explored. This paper reformulates the Multi-Objective Deep Deterministic Policy Gradient (MODDPG) approach not as a single fixed algorithm, but as a family of optimization workflows that share an identical multi-objective actor–critic learning core while systematically differing in their initialization strategy and environment evaluation mechanism. Within this unified formulation, three configurations are investigated: (i) a baseline MODDPG workflow with random initialization and direct SPICE evaluation, (ii) MODDPG-2, which employs analytically derived extreme solutions to guide early exploration, and (iii) MODDPG-3, which introduces an ANN-based pseudo-designer to generate boundary solutions directly from performance specifications. In addition, a Fully-ANN execution mode is examined, where an ANN-based pseudo-simulator replaces SPICE during policy learning to accelerate environment interaction. By preserving the same reinforcement learning architecture across all variants, the proposed framework isolates the effects of structured initialization and surrogate-based environments on optimization outcomes. The workflows are evaluated on three analog circuits (active-loaded differential amplifier, folded-cascode amplifier, and voltage comparator) and one RF circuit (CMOS cross-coupled LC oscillator), as well as on standard analytical benchmarks. Comparative results against NSGA-II and MOEA/D show that no single method universally dominates; however, the proposed workflows consistently reduce the number of required SPICE simulations by approximately 30%–75% while maintaining competitive Pareto-front quality. This efficiency gain indicates that the reinforcement-learning agent progressively acquires design intuition comparable to that of an experienced human designer—learning to avoid unpromising regions of the design space and focusing evaluations on high-value candidates. The results therefore demonstrate that, beyond the choice of learning algorithm, workflow-level design decisions critically shape how effectively RL can emulate expert design behavior, offering practical guidance for balancing solution quality and computational cost in automated analog and RF circuit design flows.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"108 ","pages":"Article 102664"},"PeriodicalIF":2.5,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146038495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Hardware-efficient architecture of spiking neural networks based on sign-magnitude stochastic computing 基于符号量随机计算的峰值神经网络硬件高效架构

IF 2.5 3区工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Integration-The Vlsi Journal

Pub Date : 2026-05-01 Epub Date: 2026-02-02 DOI: 10.1016/j.vlsi.2026.102662

Thai N. Nguyen, Jun-Xiang Shi, Vinh T. Nguyen, Shao-I Chu , Bing-Hong Liu

Artificial neural networks (ANNs) have numerous advantages that make them a powerful tool in solving complex problems in many fields, such as image classification, object detection and natural language processing. However, ANNs are facing many challenges due to the increasing complexity of models and the demand to optimize hardware resources. To address these issues, spiking neural networks (SNNs) offers a potential solution to ANNs, which imitate the way the human brain processes information by exploiting the advantage of discrete spikes. This paper presents the sign-magnitude stochastic computing (SC)-based hardware architecture of SNN by the ANN–SNN conversion techniques. This paradigm avoids the complicated floating-point operations and has better computing accuracy as compared to conventional SC. A novel SC-based integrate-and-fire (IF) neuron is presented to construct the core architecture of SNNs, improving the hardware efficiency while reducing the area cost and computational complexity. Hardware implementation results show that the proposed architecture achieves an accuracy of 97.06% for MNIST hand-written digit recognition dataset. As compared with the state-of-the-art designs, the presented architecture has the reductions of at least 35% in chip area, 42% in power consumption, and 86% in core area. The second highest area and energy efficiency are achieved. The hardware efficiency outperforms the baseline by up to

392 \times

.

人工神经网络（ann）具有许多优点，使其成为解决图像分类、目标检测和自然语言处理等领域复杂问题的有力工具。然而，由于模型的复杂性和优化硬件资源的需求，人工神经网络面临着许多挑战。为了解决这些问题，尖峰神经网络（snn）为人工神经网络提供了一个潜在的解决方案，人工神经网络通过利用离散尖峰的优势来模仿人类大脑处理信息的方式。利用ANN-SNN转换技术，提出了基于符号幅度随机计算的SNN硬件结构。该模式避免了复杂的浮点运算，并且与传统的单节点网络相比具有更好的计算精度。提出了一种新的基于单节点网络的IF （integrated -and-fire）神经元来构建单节点网络的核心架构，在提高硬件效率的同时降低了面积成本和计算复杂度。硬件实现结果表明，该架构在MNIST手写数字识别数据集上的准确率达到97.06%。与最先进的设计相比，该架构的芯片面积减少了至少35%，功耗减少了42%，核心面积减少了86%。实现了第二高的面积和能源效率。硬件效率比基线高出392倍。

{"title":"Hardware-efficient architecture of spiking neural networks based on sign-magnitude stochastic computing","authors":"Thai N. Nguyen, Jun-Xiang Shi, Vinh T. Nguyen, Shao-I Chu , Bing-Hong Liu","doi":"10.1016/j.vlsi.2026.102662","DOIUrl":"10.1016/j.vlsi.2026.102662","url":null,"abstract":"<div><div>Artificial neural networks (ANNs) have numerous advantages that make them a powerful tool in solving complex problems in many fields, such as image classification, object detection and natural language processing. However, ANNs are facing many challenges due to the increasing complexity of models and the demand to optimize hardware resources. To address these issues, spiking neural networks (SNNs) offers a potential solution to ANNs, which imitate the way the human brain processes information by exploiting the advantage of discrete spikes. This paper presents the sign-magnitude stochastic computing (SC)-based hardware architecture of SNN by the ANN–SNN conversion techniques. This paradigm avoids the complicated floating-point operations and has better computing accuracy as compared to conventional SC. A novel SC-based integrate-and-fire (IF) neuron is presented to construct the core architecture of SNNs, improving the hardware efficiency while reducing the area cost and computational complexity. Hardware implementation results show that the proposed architecture achieves an accuracy of 97.06% for MNIST hand-written digit recognition dataset. As compared with the state-of-the-art designs, the presented architecture has the reductions of at least 35% in chip area, 42% in power consumption, and 86% in core area. The second highest area and energy efficiency are achieved. The hardware efficiency outperforms the baseline by up to <span><math><mrow><mn>392</mn><mo>×</mo></mrow></math></span>.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"108 ","pages":"Article 102662"},"PeriodicalIF":2.5,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146189107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Identifying malicious modules using deformable graph convolutional network-based security framework for reliable VLSI circuit protection 利用基于可变形图卷积网络的安全框架识别恶意模块，实现可靠的VLSI电路保护

IF 2.5 3区工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Integration-The Vlsi Journal

Pub Date : 2026-05-01 Epub Date: 2025-12-15 DOI: 10.1016/j.vlsi.2025.102633

M. Maria Rubiston , B.R. Tapas Bapu

Hardware security remains a significant concern because Very Large Scale Integration (VLSI) circuits have become increasingly complex, and industries have begun utilizing untrusted third-party Intellectual Property. Security threats from Hardware Trojans (HTs) remain particularly dangerous since these devices create unethical modifications that break circuit integrity while challenging reliability and damaging confidentiality. Current HT detection methods struggle to scale properly and maintain high accuracy rates due to malicious Trojan design strategies, as well as the constraints of functional testing, side-channel evaluation, and formal verification techniques. To address these challenges, this research introduces DGCoNet-GBOA, a Diffusion Kernel Attention Network with Deformable Graph Convolutional Network-Based Security Framework optimized using the Gooseneck Barnacle Optimization Algorithm (GBOA) for real-time and highly accurate HT detection. The proposed framework extracts structural, power, and transition probability features using Scale-aware Modulation Meet Transformer (S-ammT) and balances the dataset using Diminishing Batch Normalization (DimBN). The DGCoNet framework analyses gate-level netlists (GLNs) as graphical networks to identify HT development changes, and GBOA uses optimization methods that boost detection precision capabilities. The model displays precise Trojan detection abilities, achieving 99.87 % accuracy with just 0.12 % false positive occurrences and 99.91 % precision when testing ISCAS'85 and ISCAS'89 benchmark systems. The proposed DGCoNet-GBOA method achieves an average 0.7–4.5 % improvement in accuracy over existing state-of-the-art approaches across ISCAS'85 and ISCAS'89 benchmarks. The framework built in this research provides scalable, high-reliability HT detection capabilities to safeguard VLSI circuits from present-day hardware security threats during semiconductor design.

硬件安全仍然是一个重要的问题，因为超大规模集成电路（VLSI）已经变得越来越复杂，行业已经开始使用不可信的第三方知识产权。来自硬件木马（ht）的安全威胁仍然特别危险，因为这些设备会进行不道德的修改，破坏电路完整性，同时挑战可靠性并破坏机密性。由于恶意木马设计策略以及功能测试、侧信道评估和形式化验证技术的限制，当前的HT检测方法难以适当扩展并保持较高的准确率。为了解决这些挑战，本研究引入了DGCoNet-GBOA，这是一个基于可变形图卷积网络的扩散核注意网络，使用鹅颈藤壶优化算法（GBOA）进行优化，用于实时和高精度的高温检测。该框架使用尺度感知调制满足变压器（S-ammT）提取结构、功率和转移概率特征，并使用递减批处理归一化（DimBN）平衡数据集。DGCoNet框架将门级网络（gln）作为图形网络进行分析，以识别HT发展变化，GBOA使用优化方法提高检测精度能力。该模型显示了精确的特洛伊木马检测能力，在测试ISCAS'85和ISCAS'89基准系统时，准确率达到99.87%，假阳性发生率仅为0.12%，准确率为99.91%。拟议的DGCoNet-GBOA方法在ISCAS'85和ISCAS'89基准中，比现有的最先进方法的准确性平均提高了0.7 - 4.5%。本研究中构建的框架提供了可扩展的、高可靠性的高温检测功能，以保护VLSI电路在半导体设计期间免受当今硬件安全威胁。

{"title":"Identifying malicious modules using deformable graph convolutional network-based security framework for reliable VLSI circuit protection","authors":"M. Maria Rubiston , B.R. Tapas Bapu","doi":"10.1016/j.vlsi.2025.102633","DOIUrl":"10.1016/j.vlsi.2025.102633","url":null,"abstract":"<div><div>Hardware security remains a significant concern because Very Large Scale Integration (VLSI) circuits have become increasingly complex, and industries have begun utilizing untrusted third-party Intellectual Property. Security threats from Hardware Trojans (HTs) remain particularly dangerous since these devices create unethical modifications that break circuit integrity while challenging reliability and damaging confidentiality. Current HT detection methods struggle to scale properly and maintain high accuracy rates due to malicious Trojan design strategies, as well as the constraints of functional testing, side-channel evaluation, and formal verification techniques. To address these challenges, this research introduces DGCoNet-GBOA, a Diffusion Kernel Attention Network with Deformable Graph Convolutional Network-Based Security Framework optimized using the Gooseneck Barnacle Optimization Algorithm (GBOA) for real-time and highly accurate HT detection. The proposed framework extracts structural, power, and transition probability features using Scale-aware Modulation Meet Transformer (S-ammT) and balances the dataset using Diminishing Batch Normalization (DimBN). The DGCoNet framework analyses gate-level netlists (GLNs) as graphical networks to identify HT development changes, and GBOA uses optimization methods that boost detection precision capabilities. The model displays precise Trojan detection abilities, achieving 99.87 % accuracy with just 0.12 % false positive occurrences and 99.91 % precision when testing ISCAS'85 and ISCAS'89 benchmark systems. The proposed DGCoNet-GBOA method achieves an average 0.7–4.5 % improvement in accuracy over existing state-of-the-art approaches across ISCAS'85 and ISCAS'89 benchmarks. The framework built in this research provides scalable, high-reliability HT detection capabilities to safeguard VLSI circuits from present-day hardware security threats during semiconductor design.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"108 ","pages":"Article 102633"},"PeriodicalIF":2.5,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145928876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

High-performance FIR filter designs using Brent Kung Adder and pipelined Vedic multiplier 高性能FIR滤波器设计使用布伦特孔加法器和流水线吠陀乘法器

IF 2.5 3区工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Integration-The Vlsi Journal

Pub Date : 2026-05-01 Epub Date: 2026-01-03 DOI: 10.1016/j.vlsi.2025.102645

J. Banumathi, G. Karthy

Signal processing widely uses Finite Impulse Response (FIR) filters because of their stability and linear phase. However, traditional FIR filter designs are limited by multiplication operations that lead to high hardware utilization and delay. To address this, modified FIR filters with optimized multipliers and adders are being developed to improve hardware resource utilization and delay performance. This paper presents novel designs for 8-tap and 16-tap FIR filters, leveraging Brent-Kung Adders (BKA) and Pipelined (P) Urdhva Triyakbhyam Vedic multipliers (UTVM) to achieve minimal delay and enhanced performance. Three architectures—Pipelined FIR using UTVM with BKA (PFIR-UTVM-BKA), FIR using PUTVM with BKA (FIR-PUTVM-BKA), and Pipelined FIR using PUTVM with BKA (PFIR-PUTVM-BKA)—were implemented with different device specifications on Kintex-7, Virtex-7, and Zynq 7000 platforms using Xilinx Vivado 2022.2 and ASIC 45 nm, simulated in Verilog. In FPGA, the proposed multiplier reduces delay by 43 %, 61.55 %, 73.01 %, and 78.51 % across different bit widths, and power, delay, and (power delay Product) PDP were reduced by 82.16 %,94.34 % and 98.99 % respectively, in ASIC. Additionally, the proposed FIR filter architectures achieve significant improvements, including 51.88 % and 27.13 % delay reduction, 75.40 % slice improvement, and 92.53 % and 97.03 % enhancement in slice registers for 8-tap and 16-tap 8-bit designs in FPGA, and power, delay, and PDP (Power Delay Product) were reduced by 87.97 %,97.59 % and 99.71 % respectively, in ASIC. These advancements make the proposed FIR filters highly suitable for high-speed digital signal processing (DSP) applications, where efficient processing and minimized latency are crucial. Integrating PVM and BKA plays a pivotal role in achieving these performance enhancements, positioning these filter designs as promising solutions for next-generation signal processing systems.

有限脉冲响应（FIR）滤波器由于其稳定性和相位线性而被广泛应用于信号处理。然而，传统的FIR滤波器设计受到乘法运算的限制，导致高硬件利用率和延迟。为了解决这个问题，正在开发带有优化乘法器和加法器的改进FIR滤波器，以提高硬件资源利用率和延迟性能。本文提出了8分频和16分频FIR滤波器的新设计，利用Brent-Kung加法器（BKA）和Pipelined (P) Urdhva Triyakbhyam Vedic乘法器（UTVM）实现最小的延迟和增强的性能。采用Xilinx Vivado 2022.2和45纳米ASIC，在Kintex-7、Virtex-7和Zynq 7000平台上以不同的设备规格实现了三种架构——使用UTVM和BKA的流水线FIR （PFIR-UTVM-BKA）、使用PUTVM和BKA的流水线FIR （FIR-PUTVM-BKA）和使用PUTVM和BKA的流水线FIR (pir -PUTVM-BKA)，并在Verilog中进行了模拟。在FPGA中，该乘法器在不同比特宽度下的时延分别降低了43%、61.55%、73.01%和78.51%，在ASIC中，功率、时延和（功率延迟积）PDP分别降低了82.16%、94.34%和98.99%。此外，所提出的FIR滤波器架构取得了显著的改进，包括FPGA中8分路和16分路8位设计的延迟降低51.88%和27.13%，切片改善75.40%，切片寄存器提高92.53%和97.03%，ASIC中的功耗，延迟和PDP（功率延迟产品）分别降低87.97%，97.59%和99.71%。这些进步使得所提出的FIR滤波器非常适合高速数字信号处理（DSP）应用，其中高效处理和最小化延迟至关重要。集成PVM和BKA在实现这些性能增强方面起着关键作用，将这些滤波器设计定位为下一代信号处理系统的有前途的解决方案。

{"title":"High-performance FIR filter designs using Brent Kung Adder and pipelined Vedic multiplier","authors":"J. Banumathi, G. Karthy","doi":"10.1016/j.vlsi.2025.102645","DOIUrl":"10.1016/j.vlsi.2025.102645","url":null,"abstract":"<div><div>Signal processing widely uses Finite Impulse Response (FIR) filters because of their stability and linear phase. However, traditional FIR filter designs are limited by multiplication operations that lead to high hardware utilization and delay. To address this, modified FIR filters with optimized multipliers and adders are being developed to improve hardware resource utilization and delay performance. This paper presents novel designs for 8-tap and 16-tap FIR filters, leveraging Brent-Kung Adders (BKA) and Pipelined (P) Urdhva Triyakbhyam Vedic multipliers (UTVM) to achieve minimal delay and enhanced performance. Three architectures—Pipelined FIR using UTVM with BKA (PFIR-UTVM-BKA), FIR using PUTVM with BKA (FIR-PUTVM-BKA), and Pipelined FIR using PUTVM with BKA (PFIR-PUTVM-BKA)—were implemented with different device specifications on Kintex-7, Virtex-7, and Zynq 7000 platforms using Xilinx Vivado 2022.2 and ASIC 45 nm, simulated in Verilog. In FPGA, the proposed multiplier reduces delay by 43 %, 61.55 %, 73.01 %, and 78.51 % across different bit widths, and power, delay, and (power delay Product) PDP were reduced by 82.16 %,94.34 % and 98.99 % respectively, in ASIC. Additionally, the proposed FIR filter architectures achieve significant improvements, including 51.88 % and 27.13 % delay reduction, 75.40 % slice improvement, and 92.53 % and 97.03 % enhancement in slice registers for 8-tap and 16-tap 8-bit designs in FPGA, and power, delay, and PDP (Power Delay Product) were reduced by 87.97 %,97.59 % and 99.71 % respectively, in ASIC. These advancements make the proposed FIR filters highly suitable for high-speed digital signal processing (DSP) applications, where efficient processing and minimized latency are crucial. Integrating PVM and BKA plays a pivotal role in achieving these performance enhancements, positioning these filter designs as promising solutions for next-generation signal processing systems.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"108 ","pages":"Article 102645"},"PeriodicalIF":2.5,"publicationDate":"2026-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145928873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Design of a memristive CNN chaotic system: From chaotic behavior analysis to circuit implementation and robust control 记忆CNN混沌系统的设计：从混沌行为分析到电路实现和鲁棒控制

IF 2.5 3区工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

Integration-The Vlsi Journal

Pub Date : 2026-05-01 Epub Date: 2026-01-16 DOI: 10.1016/j.vlsi.2026.102669

Jie Zhang, Jiliang Lv, Nana Cheng, Liu Yang

Using a quadratic memristor as the connection weight between cells, a 4D memristive cellular neural network (CNN) chaotic system is constructed. Through a series of dynamical analyses, it is found that the system exhibits relatively rich dynamical characteristics. In this study, offset boosting is achieved by adding an offset parameter. Furthermore, the system’s amplitude control and attractor rotation are also realized. Based on the fundamental theory of attractor rotation, a multi-wing attractor transformation of the chaotic system is achieved. Additionally, circuit simulation software is used to design and implement a simulation circuit for the 4D memristive chaotic system, and the simulation results verified the physical feasibility of the constructed system. Finally, feedback control is implemented using the

H_{\infty}

control principle. Validation confirms that the designed controller can effectively counteract external disturbances and stabilize the system output.

利用二次型忆阻器作为单元间的连接权，构造了一个四维忆阻细胞神经网络（CNN）混沌系统。通过一系列的动力学分析，发现该系统具有较为丰富的动力学特性。在本研究中，偏移增强是通过增加偏移参数来实现的。此外，还实现了系统的幅值控制和吸引子旋转。基于吸引子旋转的基本理论，实现了混沌系统的多翼吸引子变换。利用电路仿真软件设计并实现了四维忆阻混沌系统的仿真电路，仿真结果验证了所构建系统的物理可行性。最后，利用H∞控制原理实现反馈控制。验证结果表明，所设计的控制器能够有效地抵消外部干扰，稳定系统输出。

引用次数: 0