首页 > 最新文献

Integration-The Vlsi Journal最新文献

英文 中文
An efficient XOR-free implementation of polar encoder for reconfigurable hardware 可重构硬件极性编码器的高效无 XOR 实现
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-24 DOI: 10.1016/j.vlsi.2024.102291
Navin Kumar , Deepak Kedia , Gaurav Purohit
— This paper presents a novel approach to implementing an XOR-Free architecture of the non-systematic polar encoder (NSPE) for 5G radio. The optimization of XOR logic for hardware (HW) implementation is essential to reduce delay and power consumption. The proposed architecture for NSPE replaces XOR operations with combinational logical patterns, and some redundant patterns are removed with the help of bit manipulation to make it more efficient. The design infers multiplexers (2:1 or 4:1) and inverters as its functional units, making the design adequate and effective in alleviating HW complexity. The XOR-Free encoder performs the same functionality as the XOR-based conventional encoder. We have written a MATLAB script that generates Verilog hardware description language (HDL) code for fully or partially parallel polar encoders tailored to specific code lengths (N) and degrees of parallelism (M). A comparative analysis of various fully and partially parallel encoders with the XOR-Free algorithm is presented. The implementation results show that the proposed architectures are more efficient in terms of HW cost, power consumption, throughput, and latency.
- 本文介绍了一种为 5G 射频实现无 XOR 架构的非系统极性编码器(NSPE)的新方法。优化硬件(HW)实现的 XOR 逻辑对于减少延迟和功耗至关重要。所提出的 NSPE 架构用组合逻辑模式取代了 XOR 操作,并通过位操作删除了一些冗余模式,从而提高了效率。该设计推导出多路复用器(2:1 或 4:1)和反相器作为其功能单元,使该设计能够充分有效地降低硬件复杂性。无 XOR 编码器的功能与基于 XOR 的传统编码器相同。我们编写了一个 MATLAB 脚本,可生成完全或部分并行极性编码器的 Verilog 硬件描述语言 (HDL) 代码,并根据特定的代码长度 (N) 和并行程度 (M) 进行定制。本文对采用 XOR-Free 算法的各种完全并行和部分并行编码器进行了比较分析。实施结果表明,就硬件成本、功耗、吞吐量和延迟而言,所提出的架构更为高效。
{"title":"An efficient XOR-free implementation of polar encoder for reconfigurable hardware","authors":"Navin Kumar ,&nbsp;Deepak Kedia ,&nbsp;Gaurav Purohit","doi":"10.1016/j.vlsi.2024.102291","DOIUrl":"10.1016/j.vlsi.2024.102291","url":null,"abstract":"<div><div>— This paper presents a novel approach to implementing an XOR-Free architecture of the non-systematic polar encoder (NSPE) for 5G radio. The optimization of XOR logic for hardware (HW) implementation is essential to reduce delay and power consumption. The proposed architecture for NSPE replaces XOR operations with combinational logical patterns, and some redundant patterns are removed with the help of bit manipulation to make it more efficient. The design infers multiplexers (2:1 or 4:1) and inverters as its functional units, making the design adequate and effective in alleviating HW complexity. The XOR-Free encoder performs the same functionality as the XOR-based conventional encoder. We have written a MATLAB script that generates Verilog hardware description language (HDL) code for fully or partially parallel polar encoders tailored to specific code lengths (<em>N</em>) and degrees of parallelism (<em>M</em>). A comparative analysis of various fully and partially parallel encoders with the XOR-Free algorithm is presented. The implementation results show that the proposed architectures are more efficient in terms of HW cost, power consumption, throughput, and latency.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102291"},"PeriodicalIF":2.2,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142359424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accelerating large-scale multi-scalar multiplication in Zk-SNARK through exploiting its multilevel parallelism 利用 Zk-SNARK 的多级并行性,加速 Zk-SNARK 中的大规模多标量乘法运算
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-24 DOI: 10.1016/j.vlsi.2024.102286
Ning Wang , Feng Wang , Pengcheng Hua , Xu Zhao , Zhilei Chai
In the context of zk-SNARK, MSM emerges as a major computational bottleneck, particularly due to its high computational and memory overhead. In this work, we exploit multiple levels of parallelism to effectively accelerate largescale MSM in zk-SNARKs. Firstly, a distributed parameter generation method is proposed in this paper to replace that of centralized method. Based on this methodology, the paper realizes a system with extraordinary scalability, capable of computing large-scale MSMs. Subsequently, the approach presented in this paper elevates the computation of Bellperson, the most prominent zero-knowledge proof system in real-world applications, from a single-node computation to a clustering mode, significantly enhancing its computational performance – a crucial advancement for practical applications. Finally, we implement a multi-level, fully parallelised MSM computing system by leveraging hierarchical sub-task partitioning and cross-node communication optimization, thereby thoroughly exploiting parallelism at diverse granularities. Experimental results show that in the cluster scenario, the proposed approach achieves acceleration ratios of approximately 3.60 and 6.50 times compared to the cutting-edge heterogeneous version Bellperson in dual-node and quad-node settings, respectively. On a single node, the proposed optimization approach achieves an acceleration ratio of 1.38 times compared to the current State-of-the-Art MSM calculation module of cuZK, outperforms the industry-popular Bellman by 186 times and the Bellperson by 1.96 times.
在 zk-SNARK 中,MSM 是一个主要的计算瓶颈,特别是由于其计算和内存开销较大。在这项工作中,我们利用多层并行性有效加速了 zk-SNARK 中的大规模 MSM。首先,本文提出了一种分布式参数生成方法,以取代集中式方法。基于这种方法,本文实现了一个具有超强可扩展性的系统,能够计算大规模 MSM。随后,本文提出的方法将实际应用中最重要的零知识证明系统 Bellperson 的计算从单节点计算提升到集群模式,显著提高了其计算性能--这对实际应用来说是一个至关重要的进步。最后,我们利用分层子任务分区和跨节点通信优化,实现了多层次、完全并行化的 MSM 计算系统,从而彻底利用了不同粒度的并行性。实验结果表明,在集群场景中,与双节点和四节点设置下的尖端异构版本 Bellperson 相比,所提出的方法分别实现了约 3.60 倍和 6.50 倍的加速比。在单节点上,与目前最先进的 cuZK MSM 计算模块相比,拟议的优化方法实现了 1.38 倍的加速比,比业界流行的 Bellman 高出 186 倍,比 Bellperson 高出 1.96 倍。
{"title":"Accelerating large-scale multi-scalar multiplication in Zk-SNARK through exploiting its multilevel parallelism","authors":"Ning Wang ,&nbsp;Feng Wang ,&nbsp;Pengcheng Hua ,&nbsp;Xu Zhao ,&nbsp;Zhilei Chai","doi":"10.1016/j.vlsi.2024.102286","DOIUrl":"10.1016/j.vlsi.2024.102286","url":null,"abstract":"<div><div>In the context of zk-SNARK, MSM emerges as a major computational bottleneck, particularly due to its high computational and memory overhead. In this work, we exploit multiple levels of parallelism to effectively accelerate largescale MSM in zk-SNARKs. Firstly, a distributed parameter generation method is proposed in this paper to replace that of centralized method. Based on this methodology, the paper realizes a system with extraordinary scalability, capable of computing large-scale MSMs. Subsequently, the approach presented in this paper elevates the computation of Bellperson, the most prominent zero-knowledge proof system in real-world applications, from a single-node computation to a clustering mode, significantly enhancing its computational performance – a crucial advancement for practical applications. Finally, we implement a multi-level, fully parallelised MSM computing system by leveraging hierarchical sub-task partitioning and cross-node communication optimization, thereby thoroughly exploiting parallelism at diverse granularities. Experimental results show that in the cluster scenario, the proposed approach achieves acceleration ratios of approximately 3.60 and 6.50 times compared to the cutting-edge heterogeneous version Bellperson in dual-node and quad-node settings, respectively. On a single node, the proposed optimization approach achieves an acceleration ratio of 1.38 times compared to the current State-of-the-Art MSM calculation module of cuZK, outperforms the industry-popular Bellman by 186 times and the Bellperson by 1.96 times.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102286"},"PeriodicalIF":2.2,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142419987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lorenz’s state equations as RC filters 作为 RC 滤波器的洛伦兹状态方程
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-23 DOI: 10.1016/j.vlsi.2024.102284
Isaac Campos-Cantón
In this study, the Lorenz system electronic implementation is performed using an RC low-pass filter within the xyz state equations. This electronic development shows that it is possible to know for each state the cutoff frequency. The bifurcation diagram for each state is shown as a function of cutoff frequency. Simulation results support this suggestion.
在本研究中,洛伦兹系统的电子实现是在 xyz 状态方程内使用 RC 低通滤波器进行的。这种电子开发表明,可以知道每个状态的截止频率。每个状态的分岔图都显示为截止频率的函数。模拟结果支持这一建议。
{"title":"Lorenz’s state equations as RC filters","authors":"Isaac Campos-Cantón","doi":"10.1016/j.vlsi.2024.102284","DOIUrl":"10.1016/j.vlsi.2024.102284","url":null,"abstract":"<div><div>In this study, the Lorenz system electronic implementation is performed using an <span><math><mrow><mi>R</mi><mi>C</mi></mrow></math></span> low-pass filter within the <span><math><mrow><mi>x</mi><mi>y</mi><mi>z</mi></mrow></math></span> state equations. This electronic development shows that it is possible to know for each state the cutoff frequency. The bifurcation diagram for each state is shown as a function of cutoff frequency. Simulation results support this suggestion.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102284"},"PeriodicalIF":2.2,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142318587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PDQRRFF: Poisson-distributed quantum random reversible flip flop generator for BIST PDQRRFF:用于 BIST 的泊松分布式量子随机可逆触发器发生器
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-23 DOI: 10.1016/j.vlsi.2024.102289
Kannan R , Vidhya K
Reversible logic has gained popularity recently because it allows circuits to use significantly less power. Due to the inherent reversibility of quantum operation, there is enormous interest in designing and optimizing reversible circuits. In this work, a new gate named as transvidkan gate, Quantum Random Reversible Flip Flop, Sequence generator, and Build in Self-Test (BIST) has been proposed. Quantum Random Reversible Flip Flop (QRRFF) is an emerging technology that is used in a variety of apps that use modern security and encryption systems. For creating a random string, typical approaches combine an entropy source with an elimination or bit-generation system. Quantum computing reversible logic chips and Low-power design are emerging as intriguing research topics. A classical logic-based 8-bit reversible comparator is represented using existing reversible gates. This study provides a BIST-based architecture for a comparator design that reduces the garbage outputs, quantum cost, and constant inputs. According to simulation result, the proposed approach outperforms traditional methods in terms of hardware complexity and quantum cost.
可逆逻辑最近大受欢迎,因为它使电路的功耗大大降低。由于量子操作固有的可逆性,人们对设计和优化可逆电路产生了极大的兴趣。在这项工作中,我们提出了一种新的门,名为跨维德坎门(transvidkan gate)、量子随机可逆触发器(Quantum Random Reversible Flip Flop)、序列发生器(Sequence generator)和内置自检(BIST)。量子随机可逆触发器(QRRFF)是一种新兴技术,被用于各种使用现代安全和加密系统的应用程序中。为创建随机字符串,典型的方法是将熵源与消除或比特生成系统相结合。量子计算可逆逻辑芯片和低功耗设计正成为引人入胜的研究课题。基于经典逻辑的 8 位可逆比较器使用现有的可逆门来表示。本研究为比较器设计提供了一种基于 BIST 的架构,可减少垃圾输出、量子成本和常数输入。根据仿真结果,所提出的方法在硬件复杂性和量子成本方面优于传统方法。
{"title":"PDQRRFF: Poisson-distributed quantum random reversible flip flop generator for BIST","authors":"Kannan R ,&nbsp;Vidhya K","doi":"10.1016/j.vlsi.2024.102289","DOIUrl":"10.1016/j.vlsi.2024.102289","url":null,"abstract":"<div><div>Reversible logic has gained popularity recently because it allows circuits to use significantly less power. Due to the inherent reversibility of quantum operation, there is enormous interest in designing and optimizing reversible circuits. In this work, a new gate named as transvidkan gate, Quantum Random Reversible Flip Flop, Sequence generator, and Build in Self-Test (BIST) has been proposed. Quantum Random Reversible Flip Flop (QRRFF) is an emerging technology that is used in a variety of apps that use modern security and encryption systems. For creating a random string, typical approaches combine an entropy source with an elimination or bit-generation system. Quantum computing reversible logic chips and Low-power design are emerging as intriguing research topics. A classical logic-based 8-bit reversible comparator is represented using existing reversible gates. This study provides a BIST-based architecture for a comparator design that reduces the garbage outputs, quantum cost, and constant inputs. According to simulation result, the proposed approach outperforms traditional methods in terms of hardware complexity and quantum cost.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102289"},"PeriodicalIF":2.2,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142327647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Innovative feedback approach for high-performance low-voltage current mirror 用于高性能低压电流镜的创新反馈方法
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-18 DOI: 10.1016/j.vlsi.2024.102283
Astha Dadheech, Nikhil Raj, Divyang Rawal

The paper presents an approach to increase the performance in terms of input and output resistance of a low voltage flipped voltage follower (FVF) based current mirror. The proposed technique consists of substituting the main output transistor with a network of transistors in a feedback arrangement, designed to improve the output resistance. Furthermore, a low saturation onset transistor approach is used to improve the performance. Such an approach also helped in reducing the input resistance of the current mirror, which ranges in ohms. A wide current range of up to 1 mA is achieved at a minimal current transfer error of 0.38 %. This feedback mechanism-based current mirror exhibits an output resistance of 29.61 GΩ, an input resistance of 30.45 Ω, and a bandwidth of 1.464 GHz. The proposed current mirror runs on ±0.5 V supply voltage. The robustness of the proposed circuit is evaluated through process corner analysis, temperature mismatch assessment, and Monte-Carlo simulations. The performance characteristics of the proposed current mirror have been validated and simulated using Cadence Virtuoso and Spectre simulations on 0.18 μm UMC technology. The validation process included both pre-layout and post-layout simulation results.

本文介绍了一种提高基于低电压翻转电压跟随器(FVF)的电流镜输入和输出电阻性能的方法。所提出的技术包括用一个反馈布置的晶体管网络取代主输出晶体管,以提高输出电阻。此外,还采用了低饱和起始晶体管方法来提高性能。这种方法还有助于降低电流镜的输入电阻(以欧姆为单位)。在电流传输误差最小为 0.38 % 的情况下,实现了高达 1 mA 的宽电流范围。这种基于反馈机制的电流镜的输出电阻为 29.61 GΩ,输入电阻为 30.45 Ω,带宽为 1.464 GHz。拟议的电流镜在±0.5 V 电源电压下运行。通过工艺转角分析、温度失配评估和蒙特卡罗模拟,对所提电路的稳健性进行了评估。利用 Cadence Virtuoso 和 Spectre 仿真在 0.18 μm UMC 技术上验证和模拟了拟议电流镜的性能特征。验证过程包括布局前和布局后仿真结果。
{"title":"Innovative feedback approach for high-performance low-voltage current mirror","authors":"Astha Dadheech,&nbsp;Nikhil Raj,&nbsp;Divyang Rawal","doi":"10.1016/j.vlsi.2024.102283","DOIUrl":"10.1016/j.vlsi.2024.102283","url":null,"abstract":"<div><p>The paper presents an approach to increase the performance in terms of input and output resistance of a low voltage flipped voltage follower (FVF) based current mirror. The proposed technique consists of substituting the main output transistor with a network of transistors in a feedback arrangement, designed to improve the output resistance. Furthermore, a low saturation onset transistor approach is used to improve the performance. Such an approach also helped in reducing the input resistance of the current mirror, which ranges in ohms. A wide current range of up to 1 mA is achieved at a minimal current transfer error of 0.38 %. This feedback mechanism-based current mirror exhibits an output resistance of 29.61 GΩ, an input resistance of 30.45 Ω, and a bandwidth of 1.464 GHz. The proposed current mirror runs on ±0.5 V supply voltage. The robustness of the proposed circuit is evaluated through process corner analysis, temperature mismatch assessment, and Monte-Carlo simulations. The performance characteristics of the proposed current mirror have been validated and simulated using Cadence Virtuoso and Spectre simulations on 0.18 μm UMC technology. The validation process included both pre-layout and post-layout simulation results.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102283"},"PeriodicalIF":2.2,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142272574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A local positive feedback loop-reused technique for enhancing performance of folded cascode amplifier 提高折叠式级联放大器性能的局部正反馈回路重复使用技术
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-12 DOI: 10.1016/j.vlsi.2024.102277
Xiaosong Wang, Xiao Zhao, Yu Zhang, Chen Wang, Liyuan Dong

A current-reused folded cascode operational transconductance amplifier (OTA) using a local positive feedback (LPFB) technique has been proposed in previous literature, which does not achieve maximum unity gain-bandwidth (GBW). Besides, the stability of LPFB in the LPFB-OTA is limited by local common mode feedback (LCMFB) resistors. Based on the analysis, a local positive feedback loop-reused (LPFBR) technique is proposed to improve the performance of conventional LPFB OTA. For a fair comparison, both conventional and proposed OTAs working at saturation region are designed and simulated in SMIC 0.18μm process. The simulated results demonstrate that the proposed LPFBR-OTA has almost 10.5 times the bandwidth and maintains stability compared to that of the conventional LPFB-OTA under the condition that LCMFB resistors are increased by a factor of 10.

以前的文献提出了一种采用局部正反馈(LPFB)技术的电流复用折叠级联运算跨导放大器(OTA),但这种放大器无法实现最大统一增益带宽(GBW)。此外,LPFB-OTA 中 LPFB 的稳定性受到本地共模反馈(LCMFB)电阻的限制。在分析的基础上,提出了一种本地正反馈回路重复使用(LPFBR)技术,以改善传统 LPFB OTA 的性能。为了进行公平比较,在中芯国际 0.18μm 工艺中设计并仿真了工作在饱和区的传统和拟议 OTA。仿真结果表明,在 LCMFB 电阻增加 10 倍的条件下,拟议的 LPFBR-OTA 的带宽几乎是传统 LPFB-OTA 的 10.5 倍,并且保持稳定。
{"title":"A local positive feedback loop-reused technique for enhancing performance of folded cascode amplifier","authors":"Xiaosong Wang,&nbsp;Xiao Zhao,&nbsp;Yu Zhang,&nbsp;Chen Wang,&nbsp;Liyuan Dong","doi":"10.1016/j.vlsi.2024.102277","DOIUrl":"10.1016/j.vlsi.2024.102277","url":null,"abstract":"<div><p>A current-reused folded cascode operational transconductance amplifier (OTA) using a local positive feedback (LPFB) technique has been proposed in previous literature, which does not achieve maximum unity gain-bandwidth (GBW). Besides, the stability of LPFB in the LPFB-OTA is limited by local common mode feedback (LCMFB) resistors. Based on the analysis, a local positive feedback loop-reused (LPFBR) technique is proposed to improve the performance of conventional LPFB OTA. For a fair comparison, both conventional and proposed OTAs working at saturation region are designed and simulated in SMIC <span><math><mrow><mn>0</mn><mo>.</mo><mn>18</mn><mspace></mspace><mi>μ</mi><mi>m</mi></mrow></math></span> process. The simulated results demonstrate that the proposed LPFBR-OTA has almost 10.5 times the bandwidth and maintains stability compared to that of the conventional LPFB-OTA under the condition that LCMFB resistors are increased by a factor of 10.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102277"},"PeriodicalIF":2.2,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142242289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating error correction and detection techniques in RISC-V processor microarchitecture for enhanced reliability 在 RISC-V 处理器微体系结构中集成纠错和检测技术以提高可靠性
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-12 DOI: 10.1016/j.vlsi.2024.102282
Aswin Sreekumar, Bolupadra Sai Shankar, B. Naresh Kumar Reddy

An essential consideration in processor design is ensuring reliability, particularly in demanding environments such as outer space and nuclear plants. To mitigate the effects of errors and enable error recovery, processors need to incorporate fault tolerance techniques. One common type of error is SEU (Single Event Upset), which affects various microelectronic devices including microprocessors, microcontrollers, and semiconductor memory devices. While error mitigation techniques have been developed for processors based on architectures like ARM (Advanced RISC Machine) and MIPS (Million Instructions Per Second), there is a gap in research for open-source ISAs (Instruction Set Architecture) like RISC-V, which this paper aims to address. This paper focuses on designing a fault-tolerant microarchitecture for a RISC-V processor that can correct one-bit errors, detect up to two-bit errors, and integrate lockstep and pipeline rollback features at a lower LUTs (Look Up Tables) consumption by re-using the same hardware pipeline for error mitigation and recovery through instruction mimicking. By incorporating these features, the proposed approach enhances the system’s fault tolerance by detecting and correcting errors caused by transient events and achieves a lower effective die size upon realization compared to contemporary works. The proposed microarchitecture design was simulated and synthesized using the Vivado Design Suite 2023.1 and implemented on a Zynq 7000 SoC ZC702 Evaluation Kit.

处理器设计的一个基本考虑因素是确保可靠性,尤其是在外层空间和核电厂等苛刻的环境中。为了减轻错误的影响并实现错误恢复,处理器需要采用容错技术。一种常见的错误类型是 SEU(单次事件猝发),它会影响各种微电子器件,包括微处理器、微控制器和半导体存储器件。虽然针对基于 ARM(高级 RISC 机器)和 MIPS(每秒百万条指令)架构的处理器开发了错误缓解技术,但针对 RISC-V 等开源 ISA(指令集架构)的研究还存在空白,本文旨在解决这一问题。本文的重点是为 RISC-V 处理器设计一种容错微体系结构,该体系结构可以纠正一位错误、检测多达两位的错误,并通过模仿指令重新使用相同的硬件流水线进行错误缓解和恢复,从而在较低的 LUT(查找表)消耗下集成锁步和流水线回滚功能。通过集成这些功能,所提出的方法可以检测和纠正瞬态事件引起的错误,从而增强系统的容错能力,与同类产品相比,实现了更小的有效芯片尺寸。我们使用 Vivado Design Suite 2023.1 对提出的微体系结构设计进行了仿真和综合,并在 Zynq 7000 SoC ZC702 评估套件上实现了该设计。
{"title":"Integrating error correction and detection techniques in RISC-V processor microarchitecture for enhanced reliability","authors":"Aswin Sreekumar,&nbsp;Bolupadra Sai Shankar,&nbsp;B. Naresh Kumar Reddy","doi":"10.1016/j.vlsi.2024.102282","DOIUrl":"10.1016/j.vlsi.2024.102282","url":null,"abstract":"<div><p>An essential consideration in processor design is ensuring reliability, particularly in demanding environments such as outer space and nuclear plants. To mitigate the effects of errors and enable error recovery, processors need to incorporate fault tolerance techniques. One common type of error is SEU (Single Event Upset), which affects various microelectronic devices including microprocessors, microcontrollers, and semiconductor memory devices. While error mitigation techniques have been developed for processors based on architectures like ARM (Advanced RISC Machine) and MIPS (Million Instructions Per Second), there is a gap in research for open-source ISAs (Instruction Set Architecture) like RISC-V, which this paper aims to address. This paper focuses on designing a fault-tolerant microarchitecture for a RISC-V processor that can correct one-bit errors, detect up to two-bit errors, and integrate lockstep and pipeline rollback features at a lower LUTs (Look Up Tables) consumption by re-using the same hardware pipeline for error mitigation and recovery through instruction mimicking. By incorporating these features, the proposed approach enhances the system’s fault tolerance by detecting and correcting errors caused by transient events and achieves a lower effective die size upon realization compared to contemporary works. The proposed microarchitecture design was simulated and synthesized using the Vivado Design Suite 2023.1 and implemented on a Zynq 7000 SoC ZC702 Evaluation Kit.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102282"},"PeriodicalIF":2.2,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142242434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A general and accurate pattern search method for various scenarios 适用于各种情况的通用精确模式搜索法
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-11 DOI: 10.1016/j.vlsi.2024.102281
Congyi Zhang , Xu He , Hao Sang , Hengzhou Yuan , Dawei Liu , Yang Guo

As chip designs grow in complexity, the optical proximity correction (OPC) process becomes increasingly time-consuming. As a result, pattern search technology is becoming a foundation stone of many tasks for manufacturing, such as lithography simulation, hotspot detection, mask optimization, and so on. The most difficult challenge of pattern search is to locate clips by specific patterns within the layout accurately and efficiently. In this paper, we present a generalized pattern search method capable in diverse scenarios, including patterns with hollow shapes, shifting edges, and multi-layer situations. Experimental results show that our method outperforms commercial tools in pattern locating accuracy and handling search problems involving complex patterns which is not directly support by commercial tools yet.

随着芯片设计日趋复杂,光学接近校正(OPC)过程变得越来越耗时。因此,图案搜索技术正成为光刻模拟、热点检测、光罩优化等许多制造任务的基石。图案搜索最困难的挑战是如何准确、高效地在版图中通过特定图案定位剪辑。在本文中,我们提出了一种通用的图案搜索方法,该方法可适用于多种场景,包括具有空心形状、边缘移动和多层情况的图案。实验结果表明,我们的方法在图案定位精度和处理复杂图案搜索问题方面优于商业工具,而商业工具尚不能直接支持复杂图案的搜索。
{"title":"A general and accurate pattern search method for various scenarios","authors":"Congyi Zhang ,&nbsp;Xu He ,&nbsp;Hao Sang ,&nbsp;Hengzhou Yuan ,&nbsp;Dawei Liu ,&nbsp;Yang Guo","doi":"10.1016/j.vlsi.2024.102281","DOIUrl":"10.1016/j.vlsi.2024.102281","url":null,"abstract":"<div><p>As chip designs grow in complexity, the optical proximity correction (OPC) process becomes increasingly time-consuming. As a result, pattern search technology is becoming a foundation stone of many tasks for manufacturing, such as lithography simulation, hotspot detection, mask optimization, and so on. The most difficult challenge of pattern search is to locate clips by specific patterns within the layout accurately and efficiently. In this paper, we present a generalized pattern search method capable in diverse scenarios, including patterns with hollow shapes, shifting edges, and multi-layer situations. Experimental results show that our method outperforms commercial tools in pattern locating accuracy and handling search problems involving complex patterns which is not directly support by commercial tools yet.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102281"},"PeriodicalIF":2.2,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142171740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synchronous control of memristive hindmarsh-rose neuron models with extreme multistability 具有极端多稳定性的记忆性后马什-蔷薇神经元模型的同步控制
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-10 DOI: 10.1016/j.vlsi.2024.102280
Shaohui Yan, Jialong Wang, Jincai Song

In this paper, the circuit simulation is achieved by the established Hindmarsh-Rose (HR) neuron model and the system is applied in projection synchronization. The chaotic behaviors of the neural network model are analyzed using bifurcation diagrams, Lyapunov exponents spectra, phase diagrams and time series diagrams. The dynamics analysis of the neuron model shows a variety of firing behaviors and extreme multistability behavior. The model is then simulated through circuit multisim to demonstrate the possibility in a physical sense. Finally, synchronization is induced to the memristive neural system through projection control, and the experimental results show that the model embodies a good synchronization effect in the process of projection synchronization, which helps to improve the security of signal transmission and the confidentiality of the system, and lays the foundation for the secure communication afterwards.

本文通过已建立的 Hindmarsh-Rose (HR) 神经元模型实现了电路仿真,并将该系统应用于投影同步。本文利用分岔图、Lyapunov 指数谱、相位图和时间序列图分析了神经网络模型的混沌行为。神经元模型的动力学分析显示了多种发射行为和极端多稳定性行为。然后通过电路 Multisim 对模型进行仿真,从物理意义上证明了这种可能性。最后,通过投影控制对记忆神经系统进行同步诱导,实验结果表明,该模型在投影同步过程中体现了良好的同步效果,有助于提高信号传输的安全性和系统的保密性,为之后的安全通信奠定了基础。
{"title":"Synchronous control of memristive hindmarsh-rose neuron models with extreme multistability","authors":"Shaohui Yan,&nbsp;Jialong Wang,&nbsp;Jincai Song","doi":"10.1016/j.vlsi.2024.102280","DOIUrl":"10.1016/j.vlsi.2024.102280","url":null,"abstract":"<div><p>In this paper, the circuit simulation is achieved by the established Hindmarsh-Rose (HR) neuron model and the system is applied in projection synchronization. The chaotic behaviors of the neural network model are analyzed using bifurcation diagrams, Lyapunov exponents spectra, phase diagrams and time series diagrams. The dynamics analysis of the neuron model shows a variety of firing behaviors and extreme multistability behavior. The model is then simulated through circuit multisim to demonstrate the possibility in a physical sense. Finally, synchronization is induced to the memristive neural system through projection control, and the experimental results show that the model embodies a good synchronization effect in the process of projection synchronization, which helps to improve the security of signal transmission and the confidentiality of the system, and lays the foundation for the secure communication afterwards.</p></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102280"},"PeriodicalIF":2.2,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142242435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two stage Ordered Escape Routing combined with LP and heuristic algorithm for large scaled PCB 两阶段有序逃逸路由与 LP 和启发式算法相结合,用于大型印刷电路板
IF 2.2 3区 工程技术 Q3 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2024-09-07 DOI: 10.1016/j.vlsi.2024.102270
Disi Lin , Chuandong Chen , Rongshan Wei , Qinghai Liu , Huan He , Ziran Zhu , Zhifeng Lin , Jianli Chen
The Ordered Escape Routing (OER) problem, which is an NP-hard problem, is critical to PCB design. Primary methods based on integer linear programming (ILP) work well on small-scale PCBs with fewer pins. However, when dealing with large-scale instances, traditional ILP strategies frequently cause time violations as the number of variables increases due to time-consuming preprocessing. In addition, heuristic algorithms have a time advantage when dealing with specific problems. In this paper, We propose an efficient two-stage escape routing method that employs LP for global routing and uses a heuristic algorithm to deal with the path intersection problem to minimize wiring length and runtime for large-scale PCBs. We first model the OER problem as a min-cost multi-commodity flow problem and use ILP to solve it. Then, we relax the non-crossing constraints and transform the ILP model into an LP model to reduce the runtime. we also construct a crossing graph according to the intersection of routing paths and propose a heuristic algorithm to locate congestion quickly. Finally, we reduce the local area capacity and allow global automatic congestion optimization. Compared with the state-of-the-art work, experimental results show that our method can reduce the routing time by 60% and handle larger-scale PCB escape routing problems.
有序逃逸路由(OER)问题是一个 NP 难问题,对 PCB 设计至关重要。基于整数线性规划(ILP)的主要方法在引脚较少的小规模 PCB 上运行良好。然而,在处理大规模实例时,由于预处理耗时,随着变量数量的增加,传统的 ILP 策略经常会造成时间违规。此外,启发式算法在处理特定问题时具有时间优势。在本文中,我们提出了一种高效的两阶段逃逸布线方法,该方法采用 LP 进行全局布线,并使用启发式算法处理路径交叉问题,从而最大限度地减少大规模印刷电路板的布线长度和运行时间。我们首先将 OER 问题建模为成本最小的多商品流问题,并使用 ILP 解决该问题。然后,我们放宽了非交叉约束,并将 ILP 模型转化为 LP 模型,以减少运行时间。我们还根据布线路径的交叉点构建了交叉图,并提出了一种启发式算法,以快速定位拥堵位置。最后,我们降低了局部区域容量,实现了全局拥塞自动优化。实验结果表明,与最先进的方法相比,我们的方法能缩短 60% 的路由时间,并能处理更大规模的 PCB 逃逸路由问题。
{"title":"Two stage Ordered Escape Routing combined with LP and heuristic algorithm for large scaled PCB","authors":"Disi Lin ,&nbsp;Chuandong Chen ,&nbsp;Rongshan Wei ,&nbsp;Qinghai Liu ,&nbsp;Huan He ,&nbsp;Ziran Zhu ,&nbsp;Zhifeng Lin ,&nbsp;Jianli Chen","doi":"10.1016/j.vlsi.2024.102270","DOIUrl":"10.1016/j.vlsi.2024.102270","url":null,"abstract":"<div><div>The Ordered Escape Routing (OER) problem, which is an NP-hard problem, is critical to PCB design. Primary methods based on integer linear programming (ILP) work well on small-scale PCBs with fewer pins. However, when dealing with large-scale instances, traditional ILP strategies frequently cause time violations as the number of variables increases due to time-consuming preprocessing. In addition, heuristic algorithms have a time advantage when dealing with specific problems. In this paper, We propose an efficient two-stage escape routing method that employs LP for global routing and uses a heuristic algorithm to deal with the path intersection problem to minimize wiring length and runtime for large-scale PCBs. We first model the OER problem as a min-cost multi-commodity flow problem and use ILP to solve it. Then, we relax the non-crossing constraints and transform the ILP model into an LP model to reduce the runtime. we also construct a crossing graph according to the intersection of routing paths and propose a heuristic algorithm to locate congestion quickly. Finally, we reduce the local area capacity and allow global automatic congestion optimization. Compared with the state-of-the-art work, experimental results show that our method can reduce the routing time by 60% and handle larger-scale PCB escape routing problems.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102270"},"PeriodicalIF":2.2,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142323992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Integration-The Vlsi Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1