首页 > 最新文献

IEEE Transactions on Circuits and Systems I: Regular Papers最新文献

英文 中文
Winograd for NTT: A Case Study on Higher-Radix and Low-Latency Implementation of NTT for Post Quantum Cryptography on FPGA Winograd for NTT:关于在 FPGA 上为后量子加密技术实现更高分辨率和低延迟 NTT 的案例研究
IF 5.2 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-09 DOI: 10.1109/TCSI.2024.3470335
Suraj Mandal;Debapriya Basu Roy
Number Theoretic Transform (NTT) plays an important role in efficiently implementing lattice-based cryptographic algorithms like CRYSTALS-Kyber, Dilithium, and FALCON. Existing implementations of NTT for these algorithms are mostly based on radix-2 or radix-4 realization of Cooley-Tukey and Gentleman-Sande architectures. In this work, we explore an alternative method of performing NTT known as Winograd’s NTT that requires fewer number of modular multipliers than the conventional Coole-Tukey/Gentleman-Sande for higher radix NTT. We have proposed three different low-latency implementations of Winograd’s NTT, applicable to CRYSTALS-Dilithium, FALCON, and CRYSTALS-Kyber, respectively. Our first implementation of Winograd NTT focuses on radix-16 NTT multiplication unit for polynomials of length 256 and can be directly used for CRYSTALS-Dilithium. The NTT of CRYSTALS-Dilithium is also benefited from our proposed K-RED modular multiplication. Our radix-16-based Winograd outperforms existing Cooley-Tukey/Gentleman-Sande based NTT multipliers of CRYSTALS-Dilithium. Our second implementation of NTT is based on radix-8 Winograd structure with a novel modular multiplication method that targets polynomials of length 512 and can be directly applied for FALCON. For CRYSTALS-Kyber, we have designed a radix-16 Winograd Butterfly Unit (BFU) that can be configured as two parallel radix-8 Winograd BFUs during mixed-radix computation. To the best of our knowledge, this is the first work that applied the Winograd technique for NTT multiplication for post-quantum secure lattice-based cryptographic algorithms.
数论变换(NTT)在有效实现基于网格的加密算法(如 CRYSTALS-Kyber、Dilithium 和 FALCON)方面发挥着重要作用。这些算法的现有 NTT 实现大多基于库利-图基(Cooley-Tukey)和绅士-桑德(Gentleman-Sande)架构的桡度 2 或桡度 4 实现。在这项工作中,我们探索了一种执行 NTT 的替代方法,即 Winograd 的 NTT,与传统的 Coole-Tukey/Gentleman-Sande 相比,这种方法在更高弧度的 NTT 中需要的模块乘法器数量更少。我们提出了三种不同的 Winograd NTT 低延迟实现方法,分别适用于 CRYSTALS-Dilithium、FALCON 和 CRYSTALS-Kyber。我们的第一个 Winograd NTT 实现侧重于长度为 256 的多项式的弧度-16 NTT 乘法单元,可直接用于 CRYSTALS-Dilithium。CRYSTALS-Dilithium 的 NTT 也受益于我们提出的 K-RED 模块化乘法。我们基于radix-16的Winograd优于现有的基于Cooley-Tukey/Gentleman-Sande的CRYSTALS-Dilithium NTT乘法器。我们的第二个 NTT 实现是基于radix-8 Winograd 结构的,采用了新颖的模块化乘法,针对长度为 512 的多项式,可直接应用于 FALCON。对于 CRYSTALS-Kyber,我们设计了一个radix-16 Winograd 蝴蝶单元(BFU),可以在混合radix计算时配置为两个并行的radix-8 Winograd 蝴蝶单元。据我们所知,这是第一项将 Winograd 技术应用于基于后量子安全晶格的加密算法的 NTT 乘法的工作。
{"title":"Winograd for NTT: A Case Study on Higher-Radix and Low-Latency Implementation of NTT for Post Quantum Cryptography on FPGA","authors":"Suraj Mandal;Debapriya Basu Roy","doi":"10.1109/TCSI.2024.3470335","DOIUrl":"https://doi.org/10.1109/TCSI.2024.3470335","url":null,"abstract":"Number Theoretic Transform (NTT) plays an important role in efficiently implementing lattice-based cryptographic algorithms like CRYSTALS-Kyber, Dilithium, and FALCON. Existing implementations of NTT for these algorithms are mostly based on radix-2 or radix-4 realization of Cooley-Tukey and Gentleman-Sande architectures. In this work, we explore an alternative method of performing NTT known as Winograd’s NTT that requires fewer number of modular multipliers than the conventional Coole-Tukey/Gentleman-Sande for higher radix NTT. We have proposed three different low-latency implementations of Winograd’s NTT, applicable to CRYSTALS-Dilithium, FALCON, and CRYSTALS-Kyber, respectively. Our first implementation of Winograd NTT focuses on radix-16 NTT multiplication unit for polynomials of length 256 and can be directly used for CRYSTALS-Dilithium. The NTT of CRYSTALS-Dilithium is also benefited from our proposed K-RED modular multiplication. Our radix-16-based Winograd outperforms existing Cooley-Tukey/Gentleman-Sande based NTT multipliers of CRYSTALS-Dilithium. Our second implementation of NTT is based on radix-8 Winograd structure with a novel modular multiplication method that targets polynomials of length 512 and can be directly applied for FALCON. For CRYSTALS-Kyber, we have designed a radix-16 Winograd Butterfly Unit (BFU) that can be configured as two parallel radix-8 Winograd BFUs during mixed-radix computation. To the best of our knowledge, this is the first work that applied the Winograd technique for NTT multiplication for post-quantum secure lattice-based cryptographic algorithms.","PeriodicalId":13039,"journal":{"name":"IEEE Transactions on Circuits and Systems I: Regular Papers","volume":"71 12","pages":"6396-6409"},"PeriodicalIF":5.2,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142736488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A High-Reliability, Non-CRP-Discard Arbiter PUF Based on Delay Difference Quantization
IF 5.2 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-08 DOI: 10.1109/TCSI.2024.3466972
Yao Wang;Guangyang Zhang;Xue Mei;Chongyan Gu
As a lightweight hardware security primitive, physical unclonable functions (PUFs) can provide reliable identity authentication for the Internet of Things (IoT) devices with limited resources. Arbiter PUF (APUF) is one of the most well-known PUF circuits. However, its hardware implementation has poor reliability on field programmable gate arrays (FPGAs). This paper proposed a highly reliable APUF that uses a delay difference quantization strategy (DDQ-APUF). By adding multiple configurable delay units to the two symmetrical paths of the conventional APUF, the delay difference between the two symmetrical paths of APUF can be obtained by collecting the output of APUF under different delay configurations. Compared to conventional APUFs, DDQ-APUF does not use the arbitration result of signal transmission in two symmetric paths as its response, but rather uses the quantified delay difference between the two paths as its response. A tolerance threshold is adopted in the authentication to accommodate the variations in delay differences due to environmental changes. Moreover, the modeling attack resistance of DDQ-APUF is evaluated, and a strategy for improving this resistance by incorporating pseudo-XOR technique is proposed. The circuit was implemented on Xilinx Artix-7 FPGAs and the experimental results show that the reliability achieves 99.95% with non-CRP-discard.
作为一种轻量级硬件安全基元,物理不可克隆函数(PUF)可以为资源有限的物联网(IoT)设备提供可靠的身份验证。仲裁器 PUF(APUF)是最著名的 PUF 电路之一。然而,其硬件实现在现场可编程门阵列(FPGA)上的可靠性较差。本文提出了一种采用延迟差量化策略(DDQ-APUF)的高可靠性 APUF。通过在传统 APUF 的两条对称路径上添加多个可配置的延迟单元,收集 APUF 在不同延迟配置下的输出,即可获得 APUF 两条对称路径之间的延迟差。与传统 APUF 相比,DDQ-APUF 不以两条对称路径的信号传输仲裁结果作为响应,而是以两条路径的量化延迟差作为响应。认证中采用了容差阈值,以适应环境变化引起的延迟差变化。此外,还评估了 DDQ-APUF 的建模抗攻击性,并提出了通过采用伪 XOR 技术提高抗攻击性的策略。电路在 Xilinx Artix-7 FPGA 上实现,实验结果表明,在不丢弃 CRP 的情况下,可靠性达到了 99.95%。
{"title":"A High-Reliability, Non-CRP-Discard Arbiter PUF Based on Delay Difference Quantization","authors":"Yao Wang;Guangyang Zhang;Xue Mei;Chongyan Gu","doi":"10.1109/TCSI.2024.3466972","DOIUrl":"https://doi.org/10.1109/TCSI.2024.3466972","url":null,"abstract":"As a lightweight hardware security primitive, physical unclonable functions (PUFs) can provide reliable identity authentication for the Internet of Things (IoT) devices with limited resources. Arbiter PUF (APUF) is one of the most well-known PUF circuits. However, its hardware implementation has poor reliability on field programmable gate arrays (FPGAs). This paper proposed a highly reliable APUF that uses a delay difference quantization strategy (DDQ-APUF). By adding multiple configurable delay units to the two symmetrical paths of the conventional APUF, the delay difference between the two symmetrical paths of APUF can be obtained by collecting the output of APUF under different delay configurations. Compared to conventional APUFs, DDQ-APUF does not use the arbitration result of signal transmission in two symmetric paths as its response, but rather uses the quantified delay difference between the two paths as its response. A tolerance threshold is adopted in the authentication to accommodate the variations in delay differences due to environmental changes. Moreover, the modeling attack resistance of DDQ-APUF is evaluated, and a strategy for improving this resistance by incorporating pseudo-XOR technique is proposed. The circuit was implemented on Xilinx Artix-7 FPGAs and the experimental results show that the reliability achieves 99.95% with non-CRP-discard.","PeriodicalId":13039,"journal":{"name":"IEEE Transactions on Circuits and Systems I: Regular Papers","volume":"72 2","pages":"573-585"},"PeriodicalIF":5.2,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143184447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bio-Inspired Energy- and Area-Efficient Sound Localization Neural Network
IF 5.2 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-08 DOI: 10.1109/TCSI.2024.3466219
Bomin Joo;Minkyu Ko;Jieun Kim;Bai-Sun Kong
This paper proposes an energy- and area-efficient sound localization neural network mimicking the auditory brainstem cognitive function. By adopting the bio-plausible Jeffress model, the proposed neural network locates the sound based on the interaural time difference (ITD) in an energy- and hardware-efficient manner. The proposed network modifies the original structure of the Jeffress model having a pair of long axon lines to provide performance gain. It can reduce power consumption and area by using a single axon line. It can further improve efficiency in terms of power and area by shortening the length of the axon line for pulse propagation. Since only the leading pulse is allowed to propagate through the shortened single axon delay line, the number of delay elements and corresponding network components are reduced. Moreover, it can accurately detect the location of the sound source thanks to the axon line composed of synchronized delay elements. A further reduction of the power consumption is achieved by eliminating redundant pulse propagation through the axon line after the output neuron fires. The proposed sound localization neural network was fabricated in a 28-nm CMOS process. The performance evaluation results indicate that the proposed sound localization neural network can detect the location of a sound source with a one-degree resolution at a given robot head size of 3.0125 cm, regardless of process corners. It also indicates that the network achieves up to 86.6% and 97.2% energy and area reduction from conventional sound localization networks, operating at 0.305-V supply voltage.
本文提出了一种模仿听觉脑干认知功能的节能、面积效率高的声音定位神经网络。通过采用生物仿真的杰弗里斯模型,所提出的神经网络可根据耳间时差(ITD)以节能和硬件高效的方式定位声音。所提出的网络修改了杰弗里斯模型的原始结构,使其具有一对长轴突线,从而提高了性能。它可以通过使用单根轴突线来降低功耗和面积。通过缩短用于脉冲传播的轴突线长度,可进一步提高功率和面积效率。由于缩短后的单轴向延迟线只允许前导脉冲传播,因此减少了延迟元件和相应网络元件的数量。此外,由于轴突线由同步延迟元件组成,它还能准确检测声源的位置。通过消除输出神经元点燃后通过轴突线传播的冗余脉冲,进一步降低了功耗。所提出的声音定位神经网络采用 28 纳米 CMOS 工艺制造。性能评估结果表明,在机器人头部尺寸为 3.0125 厘米的给定条件下,所提出的声音定位神经网络能以一度的分辨率检测声源的位置,而不受工艺角的影响。评估结果还表明,与传统的声音定位网络相比,该网络在 0.305 V 电源电压下工作时,能量和面积分别减少了 86.6% 和 97.2%。
{"title":"A Bio-Inspired Energy- and Area-Efficient Sound Localization Neural Network","authors":"Bomin Joo;Minkyu Ko;Jieun Kim;Bai-Sun Kong","doi":"10.1109/TCSI.2024.3466219","DOIUrl":"https://doi.org/10.1109/TCSI.2024.3466219","url":null,"abstract":"This paper proposes an energy- and area-efficient sound localization neural network mimicking the auditory brainstem cognitive function. By adopting the bio-plausible Jeffress model, the proposed neural network locates the sound based on the interaural time difference (ITD) in an energy- and hardware-efficient manner. The proposed network modifies the original structure of the Jeffress model having a pair of long axon lines to provide performance gain. It can reduce power consumption and area by using a single axon line. It can further improve efficiency in terms of power and area by shortening the length of the axon line for pulse propagation. Since only the leading pulse is allowed to propagate through the shortened single axon delay line, the number of delay elements and corresponding network components are reduced. Moreover, it can accurately detect the location of the sound source thanks to the axon line composed of synchronized delay elements. A further reduction of the power consumption is achieved by eliminating redundant pulse propagation through the axon line after the output neuron fires. The proposed sound localization neural network was fabricated in a 28-nm CMOS process. The performance evaluation results indicate that the proposed sound localization neural network can detect the location of a sound source with a one-degree resolution at a given robot head size of 3.0125 cm, regardless of process corners. It also indicates that the network achieves up to 86.6% and 97.2% energy and area reduction from conventional sound localization networks, operating at 0.305-V supply voltage.","PeriodicalId":13039,"journal":{"name":"IEEE Transactions on Circuits and Systems I: Regular Papers","volume":"72 2","pages":"719-729"},"PeriodicalIF":5.2,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143183841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A High-Throughput and Flexible CNN Accelerator Based on Mixed-Radix FFT Method
IF 5.2 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-04 DOI: 10.1109/TCSI.2024.3466563
Yishuo Meng;Junfeng Wu;Siwei Xiang;Jianfei Wang;Jia Hou;Zhijie Lin;Chen Yang
CNN acceleration algorithms, including Winograd, Fast Fourier Transform (FFT) and Number Theoretic transform (NTT), have demonstrated their potential in efficiently operating current Convolutional Neural Networks (CNNs). However, deploying FFT algorithm for CNN acceleration would introduce significant invalid elements, unnecessary computations and unacceptable transformation overhead. To address these issues, this paper proposes a series of improved methods along with an FFT-based architecture for efficient and simplified CNN acceleration. First, a novel mixed-radix FFT algorithm is proposed for the reduction of invalid elements. Moreover, Hermitian symmetry is utilized to further reduce the scale of FFT transformation and the number of multiplications. Furthermore, an efficient FFT-based CNN accelerator with a resource-efficient transformation component and a multiplication-reduced PE array is designed. Our proposed accelerator is implemented based on Xilinx XCVU440 with a running frequency of 238MHz, achieving actual performance of 2109-2797 GOPS and DSP efficiency of 1.37-1.82 GOPS/DSP. Compared to previous works based on Winograd, FFT and NTT, our proposed accelerator can realize up to $9.42times $ speedup on actual performance and $1.11times -6.41times $ speedup on DSP efficiency.
{"title":"A High-Throughput and Flexible CNN Accelerator Based on Mixed-Radix FFT Method","authors":"Yishuo Meng;Junfeng Wu;Siwei Xiang;Jianfei Wang;Jia Hou;Zhijie Lin;Chen Yang","doi":"10.1109/TCSI.2024.3466563","DOIUrl":"https://doi.org/10.1109/TCSI.2024.3466563","url":null,"abstract":"CNN acceleration algorithms, including Winograd, Fast Fourier Transform (FFT) and Number Theoretic transform (NTT), have demonstrated their potential in efficiently operating current Convolutional Neural Networks (CNNs). However, deploying FFT algorithm for CNN acceleration would introduce significant invalid elements, unnecessary computations and unacceptable transformation overhead. To address these issues, this paper proposes a series of improved methods along with an FFT-based architecture for efficient and simplified CNN acceleration. First, a novel mixed-radix FFT algorithm is proposed for the reduction of invalid elements. Moreover, Hermitian symmetry is utilized to further reduce the scale of FFT transformation and the number of multiplications. Furthermore, an efficient FFT-based CNN accelerator with a resource-efficient transformation component and a multiplication-reduced PE array is designed. Our proposed accelerator is implemented based on Xilinx XCVU440 with a running frequency of 238MHz, achieving actual performance of 2109-2797 GOPS and DSP efficiency of 1.37-1.82 GOPS/DSP. Compared to previous works based on Winograd, FFT and NTT, our proposed accelerator can realize up to <inline-formula> <tex-math>$9.42times $ </tex-math></inline-formula> speedup on actual performance and <inline-formula> <tex-math>$1.11times -6.41times $ </tex-math></inline-formula> speedup on DSP efficiency.","PeriodicalId":13039,"journal":{"name":"IEEE Transactions on Circuits and Systems I: Regular Papers","volume":"72 2","pages":"816-829"},"PeriodicalIF":5.2,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143106511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Side-Channel Analysis of Integrate-and-Fire Neurons Within Spiking Neural Networks
IF 5.2 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-04 DOI: 10.1109/TCSI.2024.3470135
Matthias Probst;Manuel Brosch;Georg Sigl
Spiking neural networks gain increasing attention in constraint edge devices due to event-based low-power operation and little resource usage. Such edge devices often allow physical access, opening the door for Side-Channel Analysis. In this work, we introduce a novel robust attack strategy on the neuron level to retrieve the trained parameters of an implemented spiking neural network. Utilizing horizontal correlation power analysis, we demonstrate how to recover the weights and thresholds of a feed-forward spiking neural network implementation. We verify our methodology with real-world measurements of localized electromagnetic emanations of an FPGA design. Additionally, we propose countermeasures against the introduced novel attack approach. We evaluate shuffling and masking as countermeasures to protect the implementation against our proposed attack and demonstrate their effectiveness and limitations.
由于基于事件的低功耗运行和很少的资源占用,尖峰神经网络在受限边缘设备中获得了越来越多的关注。这类边缘设备通常允许物理访问,为侧信道分析打开了大门。在这项工作中,我们在神经元层面引入了一种新颖的鲁棒攻击策略,以检索已实施的尖峰神经网络的训练参数。利用水平相关功率分析,我们演示了如何恢复前馈尖峰神经网络实现的权重和阈值。我们通过对 FPGA 设计的局部电磁辐射的实际测量来验证我们的方法。此外,我们还针对引入的新型攻击方法提出了对策。我们评估了洗牌和掩码作为保护实现免受我们提出的攻击的对策,并证明了它们的有效性和局限性。
{"title":"Side-Channel Analysis of Integrate-and-Fire Neurons Within Spiking Neural Networks","authors":"Matthias Probst;Manuel Brosch;Georg Sigl","doi":"10.1109/TCSI.2024.3470135","DOIUrl":"https://doi.org/10.1109/TCSI.2024.3470135","url":null,"abstract":"Spiking neural networks gain increasing attention in constraint edge devices due to event-based low-power operation and little resource usage. Such edge devices often allow physical access, opening the door for Side-Channel Analysis. In this work, we introduce a novel robust attack strategy on the neuron level to retrieve the trained parameters of an implemented spiking neural network. Utilizing horizontal correlation power analysis, we demonstrate how to recover the weights and thresholds of a feed-forward spiking neural network implementation. We verify our methodology with real-world measurements of localized electromagnetic emanations of an FPGA design. Additionally, we propose countermeasures against the introduced novel attack approach. We evaluate shuffling and masking as countermeasures to protect the implementation against our proposed attack and demonstrate their effectiveness and limitations.","PeriodicalId":13039,"journal":{"name":"IEEE Transactions on Circuits and Systems I: Regular Papers","volume":"72 2","pages":"548-560"},"PeriodicalIF":5.2,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10705320","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143184445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comprehensive Approach to Improving the Thermal Reliability of RTN-Based PUFs
IF 5.2 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-04 DOI: 10.1109/TCSI.2024.3458057
F. de Los Santos-Prieto;F. J. Rubio-Barbero;R. Castro-Lopez;E. Roca;F. V. Fernandez
Silicon Physical Unclonable Functions (PUFs) have emerged as a promising solution for generating cryptographic keys in low-cost resource-constrained devices. A PUF is expected to be reliable, meaning that its response bits should remain consistent each time the corresponding challenges are queried. Unfortunately, the stability of these challenge-response pairs (CRPs) can be seriously eroded by environmental factors like temperature variations and the aging of the integrated circuits implementing the PUF. Several approaches, including bit masking, bit selection techniques, and error-correcting codes, have been proposed to obtain a reliable PUF operation in the face of temperature variations. As for aging, a new kind of aging-resilient silicon PUF has been reported that uses the time-varying phenomenon known as Random Telegraph Noise (RTN) as the underlying entropy source. Although this type of PUF preserves its reliability well when aged, it is not immune to the impact of temperature variations. The work presented here shows that it is possible to improve the thermal reliability of RTN-based PUFs with a proper combination of (a) a novel optimization-based bit selection technique, that is also applicable to other types of PUFs based on differential measurements; and (b) a temperature-aware tuning of the entropy-harvesting function.
{"title":"A Comprehensive Approach to Improving the Thermal Reliability of RTN-Based PUFs","authors":"F. de Los Santos-Prieto;F. J. Rubio-Barbero;R. Castro-Lopez;E. Roca;F. V. Fernandez","doi":"10.1109/TCSI.2024.3458057","DOIUrl":"https://doi.org/10.1109/TCSI.2024.3458057","url":null,"abstract":"Silicon Physical Unclonable Functions (PUFs) have emerged as a promising solution for generating cryptographic keys in low-cost resource-constrained devices. A PUF is expected to be reliable, meaning that its response bits should remain consistent each time the corresponding challenges are queried. Unfortunately, the stability of these challenge-response pairs (CRPs) can be seriously eroded by environmental factors like temperature variations and the aging of the integrated circuits implementing the PUF. Several approaches, including bit masking, bit selection techniques, and error-correcting codes, have been proposed to obtain a reliable PUF operation in the face of temperature variations. As for aging, a new kind of aging-resilient silicon PUF has been reported that uses the time-varying phenomenon known as Random Telegraph Noise (RTN) as the underlying entropy source. Although this type of PUF preserves its reliability well when aged, it is not immune to the impact of temperature variations. The work presented here shows that it is possible to improve the thermal reliability of RTN-based PUFs with a proper combination of (a) a novel optimization-based bit selection technique, that is also applicable to other types of PUFs based on differential measurements; and (b) a temperature-aware tuning of the entropy-harvesting function.","PeriodicalId":13039,"journal":{"name":"IEEE Transactions on Circuits and Systems I: Regular Papers","volume":"72 2","pages":"661-670"},"PeriodicalIF":5.2,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143183837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Single-Switch High Step-Up DC-DC Converter With High-Voltage Conversion Ratio
IF 5.2 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-04 DOI: 10.1109/TCSI.2024.3468372
Yu-En Wu;Sin-Cheng Huang;Che-Ming Chang
This paper proposes a novel high step-up DC-DC converter comprising a single switch and a three-winding coupled inductor. By using only one switch, the proposed converter simplifies control by requiring only one set of PWM signals and eliminates the need for operating at extremely high duty cycles or high turns ratios to achieve the desired gain ratio. Moreover, the converter achieves remarkable voltage gain through a voltage multiplier cell and a three-winding coupled inductor. This paper employed a 500W high step-up converter to confirm the correctness and feasibility of the proposed converter through steady-state analysis, software simulations, and hardware implementation. The measured maximum efficiency reached 95.8% when operated under 150W.
{"title":"A Novel Single-Switch High Step-Up DC-DC Converter With High-Voltage Conversion Ratio","authors":"Yu-En Wu;Sin-Cheng Huang;Che-Ming Chang","doi":"10.1109/TCSI.2024.3468372","DOIUrl":"https://doi.org/10.1109/TCSI.2024.3468372","url":null,"abstract":"This paper proposes a novel high step-up DC-DC converter comprising a single switch and a three-winding coupled inductor. By using only one switch, the proposed converter simplifies control by requiring only one set of PWM signals and eliminates the need for operating at extremely high duty cycles or high turns ratios to achieve the desired gain ratio. Moreover, the converter achieves remarkable voltage gain through a voltage multiplier cell and a three-winding coupled inductor. This paper employed a 500W high step-up converter to confirm the correctness and feasibility of the proposed converter through steady-state analysis, software simulations, and hardware implementation. The measured maximum efficiency reached 95.8% when operated under 150W.","PeriodicalId":13039,"journal":{"name":"IEEE Transactions on Circuits and Systems I: Regular Papers","volume":"72 2","pages":"953-962"},"PeriodicalIF":5.2,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143106489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MDS-DOA: Fusing Model-Based and Data-Driven Approaches for Modular, Distributed, and Scalable Direction-of-Arrival Estimation
IF 5.2 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-04 DOI: 10.1109/TCSI.2024.3469928
Adou Sangbone Assoa;Ashwin Bhat;Sigang Ryu;Arijit Raychowdhury
Massive MIMO systems are promising for wireless communications beyond 5G, but scalable Direction-of-Arrival (DOA) estimation in these systems is challenging due to the increasing number of required antennas. Existing solutions, model-based or data-driven (typically using neural networks), face scalability issues with the growing antenna array size. To address this issue, we propose a hybrid system that makes the overall approach scalable. In the front-end, we employ a modular distributed approach namely, the method of sparse linear inverse to compute a proxy spectrum from the sampled covariance matrix of the antenna subarrays. The proxy drives a fixed lightweight back-end which consists of a 1-dimensional Convolution Neural Network (1D-CNN) and a simplified peak extraction. The input proxy dimension being independent of the antenna count makes the neural network input invariant of the array size, enabling it to handle multiple array sizes without requiring any modification of the neural network structure. To reduce the computation of the covariance matrix and proxy spectrum, we employ a system of subarrays with Nearest-Neighbor communication. The proposed approach was implemented on a Xilinx ZCU102 FPGA targeting 100 MHz frequency for 8 to 256-element arrays. We achieve below 1 ms processing time for an array of 256 antennas while requiring significantly less computation than both model-based and data-driven approaches for large antenna arrays.
{"title":"MDS-DOA: Fusing Model-Based and Data-Driven Approaches for Modular, Distributed, and Scalable Direction-of-Arrival Estimation","authors":"Adou Sangbone Assoa;Ashwin Bhat;Sigang Ryu;Arijit Raychowdhury","doi":"10.1109/TCSI.2024.3469928","DOIUrl":"https://doi.org/10.1109/TCSI.2024.3469928","url":null,"abstract":"Massive MIMO systems are promising for wireless communications beyond 5G, but scalable Direction-of-Arrival (DOA) estimation in these systems is challenging due to the increasing number of required antennas. Existing solutions, model-based or data-driven (typically using neural networks), face scalability issues with the growing antenna array size. To address this issue, we propose a hybrid system that makes the overall approach scalable. In the front-end, we employ a modular distributed approach namely, the method of sparse linear inverse to compute a proxy spectrum from the sampled covariance matrix of the antenna subarrays. The proxy drives a fixed lightweight back-end which consists of a 1-dimensional Convolution Neural Network (1D-CNN) and a simplified peak extraction. The input proxy dimension being independent of the antenna count makes the neural network input invariant of the array size, enabling it to handle multiple array sizes without requiring any modification of the neural network structure. To reduce the computation of the covariance matrix and proxy spectrum, we employ a system of subarrays with Nearest-Neighbor communication. The proposed approach was implemented on a Xilinx ZCU102 FPGA targeting 100 MHz frequency for 8 to 256-element arrays. We achieve below 1 ms processing time for an array of 256 antennas while requiring significantly less computation than both model-based and data-driven approaches for large antenna arrays.","PeriodicalId":13039,"journal":{"name":"IEEE Transactions on Circuits and Systems I: Regular Papers","volume":"72 2","pages":"941-952"},"PeriodicalIF":5.2,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143106607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sequential Bayesian Inference and Monte-Carlo Sampling Using Memristor Stochasticity 利用忆阻器随机性进行序列贝叶斯推理和蒙特卡洛采样
IF 5.2 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-03 DOI: 10.1109/TCSI.2024.3470318
Adil Malik;Christos Papavassiliou
In this paper, we study the stochastic state trajectory and conductance distributions of memristors under periodic pulse excitation. Our results, backed by experimental evidence, reveal that practical memristors exhibit a $1/f^{2}$ Brownian noise power spectrum. Based on this, we develop a Memristive Distribution Generator (MDG) circuit that produces tunable analog distributions by exploiting the physical stochasticity of memristors. By encoding the prior distributions of Bayesian problems in the physical output samples of these circuits, we demonstrate that Monte-Carlo sampling can be devised without knowledge of the analytical output distribution of the memristor. Using examples of 1-D Bayesian linear regression and a dynamic 2-D nonlinear localisation problem, we show how MDG circuits can act as a tunable source of randomness, efficiently representing distributions of interest. Our results, obtained using Pt/TiO2/Pt memristors, validate the use of memristor-based MDGs for implementing probabilistic algorithms.
本文研究了周期性脉冲激励下忆阻器的随机状态轨迹和电导分布。实验证明,我们的研究结果表明,实用的忆阻器表现出 1 美元/f^{2}美元的布朗噪声功率谱。在此基础上,我们开发了一种忆阻器分布发生器(MDG)电路,利用忆阻器的物理随机性产生可调整的模拟分布。通过将贝叶斯问题的先验分布编码到这些电路的物理输出样本中,我们证明了可以在不了解忆阻器分析输出分布的情况下设计蒙特卡洛采样。利用一维贝叶斯线性回归和动态二维非线性定位问题的例子,我们展示了 MDG 电路如何充当可调整的随机性源,有效地表示感兴趣的分布。我们使用铂/二氧化钛/铂忆阻器获得的结果验证了使用基于忆阻器的 MDG 实现概率算法的有效性。
{"title":"Sequential Bayesian Inference and Monte-Carlo Sampling Using Memristor Stochasticity","authors":"Adil Malik;Christos Papavassiliou","doi":"10.1109/TCSI.2024.3470318","DOIUrl":"https://doi.org/10.1109/TCSI.2024.3470318","url":null,"abstract":"In this paper, we study the stochastic state trajectory and conductance distributions of memristors under periodic pulse excitation. Our results, backed by experimental evidence, reveal that practical memristors exhibit a \u0000<inline-formula> <tex-math>$1/f^{2}$ </tex-math></inline-formula>\u0000 Brownian noise power spectrum. Based on this, we develop a Memristive Distribution Generator (MDG) circuit that produces tunable analog distributions by exploiting the physical stochasticity of memristors. By encoding the prior distributions of Bayesian problems in the physical output samples of these circuits, we demonstrate that Monte-Carlo sampling can be devised without knowledge of the analytical output distribution of the memristor. Using examples of 1-D Bayesian linear regression and a dynamic 2-D nonlinear localisation problem, we show how MDG circuits can act as a tunable source of randomness, efficiently representing distributions of interest. Our results, obtained using Pt/TiO2/Pt memristors, validate the use of memristor-based MDGs for implementing probabilistic algorithms.","PeriodicalId":13039,"journal":{"name":"IEEE Transactions on Circuits and Systems I: Regular Papers","volume":"71 12","pages":"5506-5518"},"PeriodicalIF":5.2,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142713970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Compact Broadband Voltage-Combined Doherty Power Amplifier With Shorted Transmission Line for 5G Millimeter-Wave 用于 5G 毫米波的带短路传输线的紧凑型宽带电压组合多赫蒂功率放大器
IF 5.2 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-10-02 DOI: 10.1109/TCSI.2024.3415010
Jiawen Chen;Haoshen Zhu;Jingye Zhang;Quan Xue
This article presents a CMOS voltage-combined Doherty power amplifier (PA) based on a broadband compact load modulation network (LMN) for fifth-generation (5G) millimeter-wave (mm-wave) mobile communication applications. By analyzing the frequency response of different types of Doherty power combiner, a novel voltage-combined Doherty LMN with shorted TL and corresponding design procedure are proposed to achieve broadband power back-off (PBO) bandwidth and compact footprint. A compact quadrature hybrid coupler without lumped capacitor is also devised to generate wideband quadrature signals. To improve PBO efficiency, an envelope detector is adopted to produce adaptive DC bias for auxiliary path. For the proof of concept, a dual-driver Doherty PA is implemented in 65-nm bulk CMOS technology with a chip size of 0.42 mm2 including all pads. The PA achieves a 3-dB small-signal $S_{21}$ bandwidth from 21.1 to 30.4 GHz and a 1-dB saturated output power ( $P_{text {sat}}$ ) bandwidth from 24 to 30 GHz. The measured $P_{text {sat}}$ , output 1-dB compression point ( $OP_{text {1dB}}$ ), peak power-added efficiency (PAE) and PAE at 6dB-PBO are 20.0 dBm, 19.1 dBm, 24.6% and 20.0% at 27 GHz, respectively. For modulation measurements, the proposed PA under 64-quadrature-amplitude-modulated (64-QAM) signal at a data rate of 0.6/2.4 Gb/s achieves average output power ( $P_{text {avg}}$ ) of 11.5/4.8 dBm and average drain efficiency of 14.1%/3.9% with −25/−24.5 dB of error vector magnitude (EVM) and −29/−25.6 dBc of adjacent channel leakage ratio (ACLR) at 28 GHz, respectively.
本文介绍了一种基于宽带紧凑型负载调制网络(LMN)的 CMOS 电压组合式 Doherty 功率放大器(PA),适用于第五代毫米波(5G)移动通信应用。通过分析不同类型 Doherty 功率合路器的频率响应,提出了一种具有短路 TL 的新型电压组合 Doherty LMN 和相应的设计程序,以实现宽带功率后置(PBO)带宽和紧凑的占地面积。此外,还设计了一种不带叠加电容器的紧凑型正交混合耦合器,以产生宽带正交信号。为提高 PBO 效率,采用了包络检测器为辅助路径产生自适应直流偏置。为了验证概念,采用 65 纳米体 CMOS 技术实现了双驱动器 Doherty 功率放大器,芯片尺寸为 0.42 平方毫米(包括所有焊盘)。该功率放大器实现了 21.1 至 30.4 GHz 的 3 分贝小信号 $S_{21}$ 带宽,以及 24 至 30 GHz 的 1 分贝饱和输出功率($P_{text{sat}}$)带宽。在 27 GHz 时,测得的 $P_{text {sat}$ 、输出 1-dB 压缩点($OP_{text {1dB}}$ )、峰值功率附加效率(PAE)和 6dB-PBO 时的 PAE 分别为 20.0 dBm、19.1 dBm、24.6% 和 20.0%。在调制测量中,在数据速率为 0.6/2.4 Gb/s 的 64-quadrature-amplitude-modulated (64-QAM) 信号下,所提出的功率放大器在 28 GHz 下的平均输出功率($P_{text {avg}}$)为 11.5/4.8 dBm,平均漏极效率为 14.1%/3.9%,误差矢量幅度(EVM)为 -25/-24.5 dB,邻道泄漏比(ACLR)为 -29/-25.6 dBc。
{"title":"A Compact Broadband Voltage-Combined Doherty Power Amplifier With Shorted Transmission Line for 5G Millimeter-Wave","authors":"Jiawen Chen;Haoshen Zhu;Jingye Zhang;Quan Xue","doi":"10.1109/TCSI.2024.3415010","DOIUrl":"https://doi.org/10.1109/TCSI.2024.3415010","url":null,"abstract":"This article presents a CMOS voltage-combined Doherty power amplifier (PA) based on a broadband compact load modulation network (LMN) for fifth-generation (5G) millimeter-wave (mm-wave) mobile communication applications. By analyzing the frequency response of different types of Doherty power combiner, a novel voltage-combined Doherty LMN with shorted TL and corresponding design procedure are proposed to achieve broadband power back-off (PBO) bandwidth and compact footprint. A compact quadrature hybrid coupler without lumped capacitor is also devised to generate wideband quadrature signals. To improve PBO efficiency, an envelope detector is adopted to produce adaptive DC bias for auxiliary path. For the proof of concept, a dual-driver Doherty PA is implemented in 65-nm bulk CMOS technology with a chip size of 0.42 mm2 including all pads. The PA achieves a 3-dB small-signal \u0000<inline-formula> <tex-math>$S_{21}$ </tex-math></inline-formula>\u0000 bandwidth from 21.1 to 30.4 GHz and a 1-dB saturated output power (\u0000<inline-formula> <tex-math>$P_{text {sat}}$ </tex-math></inline-formula>\u0000) bandwidth from 24 to 30 GHz. The measured \u0000<inline-formula> <tex-math>$P_{text {sat}}$ </tex-math></inline-formula>\u0000, output 1-dB compression point (\u0000<inline-formula> <tex-math>$OP_{text {1dB}}$ </tex-math></inline-formula>\u0000), peak power-added efficiency (PAE) and PAE at 6dB-PBO are 20.0 dBm, 19.1 dBm, 24.6% and 20.0% at 27 GHz, respectively. For modulation measurements, the proposed PA under 64-quadrature-amplitude-modulated (64-QAM) signal at a data rate of 0.6/2.4 Gb/s achieves average output power (\u0000<inline-formula> <tex-math>$P_{text {avg}}$ </tex-math></inline-formula>\u0000) of 11.5/4.8 dBm and average drain efficiency of 14.1%/3.9% with −25/−24.5 dB of error vector magnitude (EVM) and −29/−25.6 dBc of adjacent channel leakage ratio (ACLR) at 28 GHz, respectively.","PeriodicalId":13039,"journal":{"name":"IEEE Transactions on Circuits and Systems I: Regular Papers","volume":"71 12","pages":"6190-6202"},"PeriodicalIF":5.2,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142736486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Circuits and Systems I: Regular Papers
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1