This work presents two innovative 12T cells combining tunnel field-effect transistor (TFET) and fin field-effect transistor (FinFET) technologies. These cells address reverse bias current issues by incorporating separate paths for reading data and write enhancement cut transistors, enhancing hold/read/write static noise margin (H/R/WSNM), reducing read time, and minimizing power consumption from TFET leakage. At 0.6 V, the first (second) SRAM cell shows a WSNM improvement over O_7T, 8T, CA_10T, 12T, and HF_10T cells by 152 % (93 %), 152 % (93 %), 157.7 % (97.5 %), 95 % (50 %), and 104 % (57 %), respectively. The leakage power of the first (second) 12T TFET SRAM cell is two (four) orders of magnitude lower than O_7T and 8T SRAM cells. These hybrid SRAM cells also exhibit faster read operations across VDD voltage levels (0.3 V–1 V) and the first 12T cell demonstrates shorter write access times than 12T and CA_10T SRAM cells. These characteristics make the proposed cells particularly suitable for energy-efficient IoT devices and medical applications, where balancing power, area, performance, and data integrity is critical.
{"title":"Novel hybrid TFET-FinFET 12T SRAM cells with enhanced write margin and read performance","authors":"Seyed Arman Sabaghpour, Behzad Ebrahimi, Pooya Torkzadeh","doi":"10.1016/j.vlsi.2024.102294","DOIUrl":"10.1016/j.vlsi.2024.102294","url":null,"abstract":"<div><div>This work presents two innovative 12T cells combining tunnel field-effect transistor (TFET) and fin field-effect transistor (FinFET) technologies. These cells address reverse bias current issues by incorporating separate paths for reading data and write enhancement cut transistors, enhancing hold/read/write static noise margin (H/R/WSNM), reducing read time, and minimizing power consumption from TFET leakage. At 0.6 V, the first (second) SRAM cell shows a WSNM improvement over O_7T, 8T, CA_10T, 12T, and HF_10T cells by 152 % (93 %), 152 % (93 %), 157.7 % (97.5 %), 95 % (50 %), and 104 % (57 %), respectively. The leakage power of the first (second) 12T TFET SRAM cell is two (four) orders of magnitude lower than O_7T and 8T SRAM cells. These hybrid SRAM cells also exhibit faster read operations across <em>V</em><sub>DD</sub> voltage levels (0.3 V–1 V) and the first 12T cell demonstrates shorter write access times than 12T and CA_10T SRAM cells. These characteristics make the proposed cells particularly suitable for energy-efficient IoT devices and medical applications, where balancing power, area, performance, and data integrity is critical.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102294"},"PeriodicalIF":2.2,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142437845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper presents a novel background calibration method for pipelined analog-to-digital converters (ADCs) using a time-delay neural network (TDNN), which is optimized through genetic algorithm (GA) techniques. The proposed technique leverages TDNN to create enhanced feature sets, significantly improving the calibration of nonlinear errors exhibiting memory effects. It harnesses the GA's global optimization capabilities for feature selection, effectively reducing the feature dimension and consequently alleviating the NN's computational burden. A parallel pipeline architecture is devised for the calibration circuit, with its implementation realized on FPGA to facilitate forward inference processing. The inference circuit is synthesized using TSMC's 90 nm CMOS process, achieving a power consumption of 40.11 mW and an area of 0.45 mm2. Simulations based on MATLAB for a 14-bit Pipelined ADC demonstrate that the proposed calibration method significantly improves the SFDR from 59.77 dB to 165.52 dB, and ENOB from 8.79 bits to 19.23 bits, surpassing the target ADC's specifications. Moreover, the dimensionality of features is effectively reduced by up to 34 % without compromising the calibration performance.
{"title":"Digital background calibration algorithm for pipelined ADC based on time-delay neural network with genetic algorithm feature selection","authors":"Yongsheng Yin, Long Li, Jiashen Li, Yukun Song, Honghui Deng, Hongmei Chen, Luotian Wu, Muqi Li, Xu Meng","doi":"10.1016/j.vlsi.2024.102295","DOIUrl":"10.1016/j.vlsi.2024.102295","url":null,"abstract":"<div><div>This paper presents a novel background calibration method for pipelined analog-to-digital converters (ADCs) using a time-delay neural network (TDNN), which is optimized through genetic algorithm (GA) techniques. The proposed technique leverages TDNN to create enhanced feature sets, significantly improving the calibration of nonlinear errors exhibiting memory effects. It harnesses the GA's global optimization capabilities for feature selection, effectively reducing the feature dimension and consequently alleviating the NN's computational burden. A parallel pipeline architecture is devised for the calibration circuit, with its implementation realized on FPGA to facilitate forward inference processing. The inference circuit is synthesized using TSMC's 90 nm CMOS process, achieving a power consumption of 40.11 mW and an area of 0.45 mm<sup>2</sup>. Simulations based on MATLAB for a 14-bit Pipelined ADC demonstrate that the proposed calibration method significantly improves the SFDR from 59.77 dB to 165.52 dB, and ENOB from 8.79 bits to 19.23 bits, surpassing the target ADC's specifications. Moreover, the dimensionality of features is effectively reduced by up to 34 % without compromising the calibration performance.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102295"},"PeriodicalIF":2.2,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142432446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-10DOI: 10.1016/j.vlsi.2024.102296
Annu Dabas , Shweta Kumari , Maneesha Gupta , Richa Yadav
In this work, a novel fully differential two stage class AB Recycling Folded Cascode Operational Transconductance Amplifier (RFC OTA) using Differential Flipped Voltage Follower (DFVF) has been proposed. The DFVF and Dynamic Threshold Metal Oxide Semiconductor (DTMOS) transistors have been used as differential input stage of the proposed RFC OTA. These techniques provide enhancement in gain and bandwidth of the proposed OTA. To further improve the performance of proposed circuit, positive feedback at current mirror load along with Hybrid Cascode compensation have been implemented. A common source (CS) amplifier has been used between gate and source terminals of differential input stage which further boosts the transconductance. The proposed RFC OTA is designed and simulated using 180 nm CMOS technology with load capacitance of 10 pF. It provides an excellent dc gain of 112.61 dB and gain bandwidth product (GBW) of 25.88 MHz along with 88.140 phase margin. The proposed circuit dissipates 124.66 μW of power at ± 0.5V supply voltage. The Monte Carlo analysis against device mismatch has also been performed to prove robustness of the proposed circuit.
本研究提出了一种使用差分翻转电压跟随器(DFVF)的新型全差分两级 AB 类回收折叠级联运算跨导放大器(RFC OTA)。DFVF 和动态阈值金属氧化物半导体(DTMOS)晶体管被用作拟议 RFC OTA 的差分输入级。这些技术提高了拟议 OTA 的增益和带宽。为了进一步提高拟议电路的性能,在电流镜负载上实施了正反馈以及混合级联补偿。在差分输入级的栅极和源极之间使用了共源(CS)放大器,从而进一步提高了跨导。拟议的 RFC OTA 采用 180 nm CMOS 技术设计和仿真,负载电容为 10 pF。它的直流增益高达 112.61 dB,增益带宽积 (GBW) 为 25.88 MHz,相位裕度为 88.140。在 ± 0.5V 电源电压下,拟议电路的耗散功率为 124.66 μW。此外,还针对器件失配进行了蒙特卡罗分析,以证明所提电路的稳健性。
{"title":"A novel architecture of high performance fully differential two stage RFC OTA designed using DFVF and hybrid cascode compensation techniques","authors":"Annu Dabas , Shweta Kumari , Maneesha Gupta , Richa Yadav","doi":"10.1016/j.vlsi.2024.102296","DOIUrl":"10.1016/j.vlsi.2024.102296","url":null,"abstract":"<div><div>In this work, a novel fully differential two stage class AB Recycling Folded Cascode Operational Transconductance Amplifier (RFC OTA) using Differential Flipped Voltage Follower (DFVF) has been proposed. The DFVF and Dynamic Threshold Metal Oxide Semiconductor (DTMOS) transistors have been used as differential input stage of the proposed RFC OTA. These techniques provide enhancement in gain and bandwidth of the proposed OTA. To further improve the performance of proposed circuit, positive feedback at current mirror load along with Hybrid Cascode compensation have been implemented. A common source (CS) amplifier has been used between gate and source terminals of differential input stage which further boosts the transconductance. The proposed RFC OTA is designed and simulated using 180 nm CMOS technology with load capacitance of 10 pF. It provides an excellent dc gain of 112.61 dB and gain bandwidth product (GBW) of 25.88 MHz along with 88.14<sup>0</sup> phase margin. The proposed circuit dissipates 124.66 μW of power at ± 0.5V supply voltage. The Monte Carlo analysis against device mismatch has also been performed to prove robustness of the proposed circuit.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102296"},"PeriodicalIF":2.2,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142444707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-10DOI: 10.1016/j.vlsi.2024.102297
Abdul Aziz , Md. Asaf-uddowla Golap , Md. Rahat Ebne Alamgir Porosh , Md. Tasnimul Khair Tousif , Muhammad Sheikh Sadi
In a data stream, errors are quite likely to occur and sometimes this is much more terrible. So, data safety is very important in digital systems, especially in critical and real-time systems, microprocessors, embedded systems, computer memory, and data communication. The probability of soft error increases with the exponential rate of increasing transistor per chip, operational voltage, particle strike, condensation of bit-cell area, etc. To ensure data integrity, safety, and system reliability, error detection, and correction are fundamental components of data transmission and storage systems. Existing error correction techniques can solve several bits of error. However, these existing methods are not fully efficient, as some consume a lot of time, space, and bit overhead. An ideal approach will have the potential to minimize all of these parameters. This research paper proposes a novel error correction approach with horizontal, vertical, diagonal, and knight (HVDK) parity bits. This approach has been taken to correct 5-bit errors in 64 bits of data word using the parity-based technique with less bit overhead. Our research advances the knowledge of error correction methods and sheds light on how to pick and use parity bit schemes that are appropriate for different applications.
{"title":"Multi-bit error detection and correction technique using HVDK (Horizontal-Vertical-Diagonal-Knight) parity","authors":"Abdul Aziz , Md. Asaf-uddowla Golap , Md. Rahat Ebne Alamgir Porosh , Md. Tasnimul Khair Tousif , Muhammad Sheikh Sadi","doi":"10.1016/j.vlsi.2024.102297","DOIUrl":"10.1016/j.vlsi.2024.102297","url":null,"abstract":"<div><div>In a data stream, errors are quite likely to occur and sometimes this is much more terrible. So, data safety is very important in digital systems, especially in critical and real-time systems, microprocessors, embedded systems, computer memory, and data communication. The probability of soft error increases with the exponential rate of increasing transistor per chip, operational voltage, particle strike, condensation of bit-cell area, etc. To ensure data integrity, safety, and system reliability, error detection, and correction are fundamental components of data transmission and storage systems. Existing error correction techniques can solve several bits of error. However, these existing methods are not fully efficient, as some consume a lot of time, space, and bit overhead. An ideal approach will have the potential to minimize all of these parameters. This research paper proposes a novel error correction approach with horizontal, vertical, diagonal, and knight (HVDK) parity bits. This approach has been taken to correct 5-bit errors in 64 bits of data word using the parity-based technique with less bit overhead. Our research advances the knowledge of error correction methods and sheds light on how to pick and use parity bit schemes that are appropriate for different applications.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102297"},"PeriodicalIF":2.2,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142530089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-05DOI: 10.1016/j.vlsi.2024.102288
Nidhi Sharma , Rajesh Kumar Srivastava , Deep Sehgal , Devarshi Mrinal Das
This paper presents a low-power, high-speed dynamic comparator with a rail-to-rail input common-mode () range. The proposed comparator has high-speed performance throughout the 0-Vdd range, thus attributing common-mode insensitivity. This work introduces a merger of NMOS and PMOS dynamic pre-amplifiers with a modified latch to achieve the rail-to-rail operation. A novel activation clock logic is also proposed, activating only one pre-amplifier based on the value and ensuring low-power consumption and provides reduction of 17% in the energy per conversion as compared to the comparator without activation clock logic. The proposed comparator is designed using 65-nm CMOS technology with a 1.2 V supply voltage and is operating at 1 GHz frequency. We have presented the analytical models of the delay and offset which is verified with the rigorous post-layout simulation results. To validate the robustness of the proposed comparator, the PVT corner analysis with Monte Carlo simulation is also performed for different .
{"title":"A low-power common-mode insensitive rail-to-rail dynamic comparator for ADCs","authors":"Nidhi Sharma , Rajesh Kumar Srivastava , Deep Sehgal , Devarshi Mrinal Das","doi":"10.1016/j.vlsi.2024.102288","DOIUrl":"10.1016/j.vlsi.2024.102288","url":null,"abstract":"<div><div>This paper presents a low-power, high-speed dynamic comparator with a rail-to-rail input common-mode (<span><math><msub><mrow><mi>V</mi></mrow><mrow><mi>i</mi><mo>,</mo><mi>c</mi><mi>m</mi></mrow></msub></math></span>) range. The proposed comparator has high-speed performance throughout the 0-Vdd <span><math><msub><mrow><mi>V</mi></mrow><mrow><mi>i</mi><mo>,</mo><mi>c</mi><mi>m</mi></mrow></msub></math></span> range, thus attributing common-mode insensitivity. This work introduces a merger of NMOS and PMOS dynamic pre-amplifiers with a modified latch to achieve the rail-to-rail <span><math><msub><mrow><mi>V</mi></mrow><mrow><mi>i</mi><mo>,</mo><mi>c</mi><mi>m</mi></mrow></msub></math></span> operation. A novel activation clock logic is also proposed, activating only one pre-amplifier based on the <span><math><msub><mrow><mi>V</mi></mrow><mrow><mi>i</mi><mo>,</mo><mi>c</mi><mi>m</mi></mrow></msub></math></span> value and ensuring low-power consumption and provides reduction of 17% in the energy per conversion as compared to the comparator without activation clock logic. The proposed comparator is designed using 65-nm CMOS technology with a 1.2 V supply voltage and is operating at 1 GHz frequency. We have presented the analytical models of the delay and offset which is verified with the rigorous post-layout simulation results. To validate the robustness of the proposed comparator, the PVT corner analysis with Monte Carlo simulation is also performed for different <span><math><msub><mrow><mi>V</mi></mrow><mrow><mi>i</mi><mo>,</mo><mi>c</mi><mi>m</mi></mrow></msub></math></span>.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102288"},"PeriodicalIF":2.2,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142419985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
High-Level Synthesis (HLS) has evolved significantly due to the increasing complexity of integrated circuit design and the demand for efficient design methodologies. HLS, which raises the abstraction level of design specification, allows designers to focus on hardware functionality, thus enhancing productivity and reducing verification efforts. However, a key challenge in HLS is efficiently exploring the vast design space to find the Pareto-optimal designs. In this paper, we introduce a novel approach for multi-objective design space exploration in HLS. Our methodology decomposes the design space exploration problem into simpler sub-problems using the Multi-Objective Evolutionary Algorithm based on Decomposition (MOEA/D) framework and utilizes the Estimation of Distribution Algorithm (EDA) to build a probabilistic model for generating candidate solutions, thereby reducing the required number of expensive synthesis runs. Experimental results show that the proposed method has a faster convergence speed and reduces the number of syntheses by 24.34% to 32.01%, which significantly outperforms state-of-the-art works. Our approach achieves superior Pareto fronts with the lowest average ADRS value, outperforming Lattice-expl, -Constraint GA, and NSGA-II by 85.64%, 39.90%, and 33.31% respectively.
{"title":"Decomposition based estimation of distribution algorithm for high-level synthesis design space exploration","authors":"Yuan Yao, Huiliang Hong, Shanshan Wang, Chenglong Xiao","doi":"10.1016/j.vlsi.2024.102292","DOIUrl":"10.1016/j.vlsi.2024.102292","url":null,"abstract":"<div><div>High-Level Synthesis (HLS) has evolved significantly due to the increasing complexity of integrated circuit design and the demand for efficient design methodologies. HLS, which raises the abstraction level of design specification, allows designers to focus on hardware functionality, thus enhancing productivity and reducing verification efforts. However, a key challenge in HLS is efficiently exploring the vast design space to find the Pareto-optimal designs. In this paper, we introduce a novel approach for multi-objective design space exploration in HLS. Our methodology decomposes the design space exploration problem into simpler sub-problems using the Multi-Objective Evolutionary Algorithm based on Decomposition (MOEA/D) framework and utilizes the Estimation of Distribution Algorithm (EDA) to build a probabilistic model for generating candidate solutions, thereby reducing the required number of expensive synthesis runs. Experimental results show that the proposed method has a faster convergence speed and reduces the number of syntheses by 24.34% to 32.01%, which significantly outperforms state-of-the-art works. Our approach achieves superior Pareto fronts with the lowest average ADRS value, outperforming Lattice-expl, <span><math><mi>ϵ</mi></math></span> -Constraint GA, and NSGA-II by 85.64%, 39.90%, and 33.31% respectively.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102292"},"PeriodicalIF":2.2,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142419989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-02DOI: 10.1016/j.vlsi.2024.102293
Weiqiang Yao , Yibo Lin , Lin Li
Floorplanning is an early and essential task of physical design. Recently, there has been a surge in the application of learning-based methods to tackle floorplanning problem. A prevalent approach involves training a reinforcement learning (RL) agent to sequentially place blocks on a chip canvas. However, existing methods mainly focus on learning block placement, relying on heuristic rules for placement order determination. In contrast to previous approaches, we propose an RL-based method to determine the placement order. Based on block features and states, an agent is trained to select the block for placement. Once a block is selected, we enumerate all potential relative positions captured by sequence pairs and select the optimal placement. After establishing the layout topology, we further optimize wirelength through linear programming. Experimental results demonstrate the effectiveness of our proposed method. On the original-outline MCNC benchmarks, our method achieves a notable 25.2% average improvement in wirelength compared to a recent learning-based method. Additionally, when applied to rescaled-outline benchmarks from MCNC and GSRC, our method outperforms state-of-the-art results, resulting in an average wirelength reduction of 12.5%.
{"title":"Learning placement order for constructive floorplanning","authors":"Weiqiang Yao , Yibo Lin , Lin Li","doi":"10.1016/j.vlsi.2024.102293","DOIUrl":"10.1016/j.vlsi.2024.102293","url":null,"abstract":"<div><div>Floorplanning is an early and essential task of physical design. Recently, there has been a surge in the application of learning-based methods to tackle floorplanning problem. A prevalent approach involves training a reinforcement learning (RL) agent to sequentially place blocks on a chip canvas. However, existing methods mainly focus on learning block placement, relying on heuristic rules for placement order determination. In contrast to previous approaches, we propose an RL-based method to determine the placement order. Based on block features and states, an agent is trained to select the block for placement. Once a block is selected, we enumerate all potential relative positions captured by sequence pairs and select the optimal placement. After establishing the layout topology, we further optimize wirelength through linear programming. Experimental results demonstrate the effectiveness of our proposed method. On the original-outline MCNC benchmarks, our method achieves a notable 25.2% average improvement in wirelength compared to a recent learning-based method. Additionally, when applied to rescaled-outline benchmarks from MCNC and GSRC, our method outperforms state-of-the-art results, resulting in an average wirelength reduction of 12.5%.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102293"},"PeriodicalIF":2.2,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142432447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-27DOI: 10.1016/j.vlsi.2024.102290
Shaohui Yan, Zihao Guo, Jincai Song
Considering the shortcomings of traditional chaotic systems in weak signal detection methods, such as the high threshold sensitivity requirement and the narrow detection frequency domain. This study proposes a novel three-dimensional chaotic synchronization system, and the dynamical of the system are exhaustively characterized using equilibrium points, phase diagrams, Lyapunov exponential spectra, and bifurcation diagrams. This method involves weak signal detection by means of chaotic synchronization control. Synchronization of a chaotic system using a backstepping synchronization method is used to detect weak signals by analyzing the synchronization error after the introduction of weak signals in a strong noise background. The chaotic system is implemented by hardware circuits, and the simulation of chaotic synchronization control and detection of weak signals from the perspective of circuits is carried out by circuit simulation software. Additionally, the frequency range within which the system is capable of weak signal detection is tested through extensive simulation experiments. Finally, multi-frequency signals detection experiments are performed. The experimental results demonstrate that the system can accurately detect the frequency of weak signals address the limitations of narrow-band detection and multi-frequency signal detection is possible. Meanwhile, the circuit structure proposed in this paper is simple and has some value for engineering applications.
{"title":"Multi-frequency weak signal detection based on Liu-like chaotic synchronization system and its hardware circuit implementation","authors":"Shaohui Yan, Zihao Guo, Jincai Song","doi":"10.1016/j.vlsi.2024.102290","DOIUrl":"10.1016/j.vlsi.2024.102290","url":null,"abstract":"<div><div>Considering the shortcomings of traditional chaotic systems in weak signal detection methods, such as the high threshold sensitivity requirement and the narrow detection frequency domain. This study proposes a novel three-dimensional chaotic synchronization system, and the dynamical of the system are exhaustively characterized using equilibrium points, phase diagrams, Lyapunov exponential spectra, and bifurcation diagrams. This method involves weak signal detection by means of chaotic synchronization control. Synchronization of a chaotic system using a backstepping synchronization method is used to detect weak signals by analyzing the synchronization error after the introduction of weak signals in a strong noise background. The chaotic system is implemented by hardware circuits, and the simulation of chaotic synchronization control and detection of weak signals from the perspective of circuits is carried out by circuit simulation software. Additionally, the frequency range within which the system is capable of weak signal detection is tested through extensive simulation experiments. Finally, multi-frequency signals detection experiments are performed. The experimental results demonstrate that the system can accurately detect the frequency of weak signals address the limitations of narrow-band detection and multi-frequency signal detection is possible. Meanwhile, the circuit structure proposed in this paper is simple and has some value for engineering applications.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102290"},"PeriodicalIF":2.2,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142419988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-26DOI: 10.1016/j.vlsi.2024.102287
Mohita Jaiswal, Abhishek Sharma, Sandeep Saini
In this paper, we benchmark two automation frameworks, Vitis AI and FINN, for sign language recognition on a Field Programmable Gate Array (FPGA). We conducted an in-depth exploration of both frameworks using Tiny YOLOv2 networks by varying design parameters such as precision, parallelism ratio, etc. Further, a fair baseline comparison is made based on accuracy, speed, and hardware resources. Experimental findings demonstrate that the Vitis AI outperforms the FINN framework and traditional GPU and CPU platforms by achieving significant improvements of 1.08x, 1.7x, and 2.9x in terms of latency. Leveraging Vitis AI, our system achieved a detection speed of 32.7 frames per second (FPS) on the Kria KV260 FPGA with a power consumption rate of 5.6 W and an impressive mean Average Precision (mAP) score of 61.2% on the Hindi Indian Sign Language (ISL) dataset.
在本文中,我们在现场可编程门阵列(FPGA)上对 Vitis AI 和 FINN 这两个自动化框架进行了手语识别基准测试。我们使用 Tiny YOLOv2 网络,通过改变精度、并行比等设计参数,对这两个框架进行了深入探讨。此外,我们还根据精度、速度和硬件资源进行了公平的基线比较。实验结果表明,Vitis AI 的性能优于 FINN 框架以及传统的 GPU 和 CPU 平台,在延迟方面分别显著提高了 1.08 倍、1.7 倍和 2.9 倍。利用 Vitis AI,我们的系统在 Kria KV260 FPGA 上实现了每秒 32.7 帧 (FPS) 的检测速度,功耗仅为 5.6 W,在印地语印度手语 (ISL) 数据集上取得了 61.2% 的平均精确度 (mAP) 高分。
{"title":"Hardware acceleration of Tiny YOLO deep neural networks for sign language recognition: A comprehensive performance analysis","authors":"Mohita Jaiswal, Abhishek Sharma, Sandeep Saini","doi":"10.1016/j.vlsi.2024.102287","DOIUrl":"10.1016/j.vlsi.2024.102287","url":null,"abstract":"<div><div>In this paper, we benchmark two automation frameworks, Vitis AI and FINN, for sign language recognition on a Field Programmable Gate Array (FPGA). We conducted an in-depth exploration of both frameworks using Tiny YOLOv2 networks by varying design parameters such as precision, parallelism ratio, etc. Further, a fair baseline comparison is made based on accuracy, speed, and hardware resources. Experimental findings demonstrate that the Vitis AI outperforms the FINN framework and traditional GPU and CPU platforms by achieving significant improvements of 1.08x, 1.7x, and 2.9x in terms of latency. Leveraging Vitis AI, our system achieved a detection speed of 32.7 frames per second (FPS) on the Kria KV260 FPGA with a power consumption rate of 5.6 W and an impressive mean Average Precision (mAP) score of 61.2% on the Hindi Indian Sign Language (ISL) dataset.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102287"},"PeriodicalIF":2.2,"publicationDate":"2024-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142419986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-24DOI: 10.1016/j.vlsi.2024.102285
Jijun Ren, Ziyang Xu, Xing Wang
The power amplifier (PA), a key component of the transmitter system, operates near the saturation region, resulting in nonlinear distortion of the output signal, which affects the quality of the transmitter system. For this, a series of linearization techniques are used to compensate for distortion, one of the most effective and widely applied is the digital predistortion (DPD) technique. The traditional DPD models can be categorized into a single model or multiple models cascade or parallel. In this letter, a hybrid memory polynomial (HMP) model is proposed to further enhance the accuracy of the model, which is composed of multiple memory polynomial (MP) models by cascading and parallelizing. The experimental results show that the HMP model has better accuracy than the traditional MP model at the same complexity.
{"title":"A hybrid memory polynomial digital predistortion model for RF transmitters","authors":"Jijun Ren, Ziyang Xu, Xing Wang","doi":"10.1016/j.vlsi.2024.102285","DOIUrl":"10.1016/j.vlsi.2024.102285","url":null,"abstract":"<div><div>The power amplifier (PA), a key component of the transmitter system, operates near the saturation region, resulting in nonlinear distortion of the output signal, which affects the quality of the transmitter system. For this, a series of linearization techniques are used to compensate for distortion, one of the most effective and widely applied is the digital predistortion (DPD) technique. The traditional DPD models can be categorized into a single model or multiple models cascade or parallel. In this letter, a hybrid memory polynomial (HMP) model is proposed to further enhance the accuracy of the model, which is composed of multiple memory polynomial (MP) models by cascading and parallelizing. The experimental results show that the HMP model has better accuracy than the traditional MP model at the same complexity.</div></div>","PeriodicalId":54973,"journal":{"name":"Integration-The Vlsi Journal","volume":"100 ","pages":"Article 102285"},"PeriodicalIF":2.2,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142318588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}