首页 > 最新文献

IET Computers and Digital Techniques最新文献

英文 中文
Design topologies with dual-Vth and dual-Tox assignment in 16 nm CMOS technology 在16nm CMOS技术中设计具有双Vth和双Tox分配的拓扑结构
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2020-04-30 DOI: 10.1049/iet-cdt.2018.5211
Smita Singhal, Anu Mehra, Upendra Tripathi

This study presents different topologies for the assignment of dual threshold voltage and dual gate oxide thickness in 16 nm complementary metal-oxide-semiconductor technology. The objective is to optimise the circuit in terms of static power dissipation, delay, and power-delay-product (pdp). Topologies namely direct, grouping, and divide-by-2 are simulated for and conventional 1-bit full adder circuits. Results of the proposed topologies are compared with some of the existing techniques of leakage reduction i.e. dual-, dual- and supply switching with ground collapse (SSGC). 1-bit full adder circuit using direct topology reduces static power to 99.98, 96.71, and 95.86% as compared to static power in dual-, dual-, and SSGC techniques, respectively. The pdp of the circuit is significantly improved using proposed topologies. Thus, these topologies can be used for low power and high-performance applications with no area overhead.

本研究提出了在16nm互补金属氧化物半导体技术中分配双阈值电压和双栅极氧化物厚度的不同拓扑结构。目标是在静态功耗、延迟和功率延迟乘积(pdp)方面优化电路。对于传统的1位全加电路,模拟了拓扑结构,即直接、分组和除以2。将所提出的拓扑结构的结果与一些现有的减少泄漏的技术进行了比较,即具有接地崩溃的双开关、双开关和电源开关(SSGC)。与双、双和SSGC技术中的静态功率相比,使用直接拓扑的1位全加电路将静态功率分别降低到99.98、96.71和95.86%。使用所提出的拓扑结构,电路的pdp得到了显著改进。因此,这些拓扑结构可以用于低功耗和高性能的应用,而不会产生区域开销。
{"title":"Design topologies with dual-Vth and dual-Tox assignment in 16 nm CMOS technology","authors":"Smita Singhal,&nbsp;Anu Mehra,&nbsp;Upendra Tripathi","doi":"10.1049/iet-cdt.2018.5211","DOIUrl":"https://doi.org/10.1049/iet-cdt.2018.5211","url":null,"abstract":"<div>\u0000 <p>This study presents different topologies for the assignment of dual threshold voltage and dual gate oxide thickness in 16 nm complementary metal-oxide-semiconductor technology. The objective is to optimise the circuit in terms of static power dissipation, delay, and power-delay-product (pdp). Topologies namely direct, grouping, and divide-by-2 are simulated for and conventional 1-bit full adder circuits. Results of the proposed topologies are compared with some of the existing techniques of leakage reduction i.e. dual-, dual- and supply switching with ground collapse (SSGC). 1-bit full adder circuit using direct topology reduces static power to 99.98, 96.71, and 95.86% as compared to static power in dual-, dual-, and SSGC techniques, respectively. The pdp of the circuit is significantly improved using proposed topologies. Thus, these topologies can be used for low power and high-performance applications with no area overhead.</p>\u0000 </div>","PeriodicalId":50383,"journal":{"name":"IET Computers and Digital Techniques","volume":"14 4","pages":"176-186"},"PeriodicalIF":1.2,"publicationDate":"2020-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1049/iet-cdt.2018.5211","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72169499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-core hardware realisation of the quasi maximum likelihood PPS estimator 准最大似然PPS估计器的多核硬件实现
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2020-04-23 DOI: 10.1049/iet-cdt.2019.0114
Nevena R. Brnović, Veselin N. Ivanović, Igor Djurović, Marko Simeunović

Multi-core hardware realisation of the quasi maximum likelihood algorithm as the state-of-the-art estimator of polynomial phase signals (PPSs) is proposed in this study. Developed multiple-clock-cycle realisation is suitable for real-time implementation. To prove this, the proposed design is implemented on a field programmable gate array circuit. The hardware realisation is tested and verified on PPSs corrupted with various amounts of the Gaussian noise. Obtained results are compared with software simulations showing excellent match between the proposed system-based and the software-based outputs.

本研究提出了准最大似然算法作为最先进的多项式相位信号(PPS)估计器的多核硬件实现。开发的多时钟周期实现适用于实时实现。为了证明这一点,所提出的设计是在现场可编程门阵列电路上实现的。硬件实现在被各种高斯噪声破坏的PPS上进行了测试和验证。将获得的结果与软件仿真进行比较,表明所提出的基于系统的输出和基于软件的输出之间具有良好的匹配性。
{"title":"Multi-core hardware realisation of the quasi maximum likelihood PPS estimator","authors":"Nevena R. Brnović,&nbsp;Veselin N. Ivanović,&nbsp;Igor Djurović,&nbsp;Marko Simeunović","doi":"10.1049/iet-cdt.2019.0114","DOIUrl":"https://doi.org/10.1049/iet-cdt.2019.0114","url":null,"abstract":"<div>\u0000 <p>Multi-core hardware realisation of the quasi maximum likelihood algorithm as the state-of-the-art estimator of polynomial phase signals (PPSs) is proposed in this study. Developed multiple-clock-cycle realisation is suitable for real-time implementation. To prove this, the proposed design is implemented on a field programmable gate array circuit. The hardware realisation is tested and verified on PPSs corrupted with various amounts of the Gaussian noise. Obtained results are compared with software simulations showing excellent match between the proposed system-based and the software-based outputs.</p>\u0000 </div>","PeriodicalId":50383,"journal":{"name":"IET Computers and Digital Techniques","volume":"14 5","pages":"187-192"},"PeriodicalIF":1.2,"publicationDate":"2020-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/iet-cdt.2019.0114","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72161844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Area and power-efficient variable-length fast Fourier transform for MR-OFDM physical layer of IEEE 802.15.4-g 用于IEEE 802.15.4-g的MR-OFDM物理层的面积和功率高效可变长度快速傅立叶变换
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2020-04-23 DOI: 10.1049/iet-cdt.2018.5260
Ganjikunta Ganesh Kumar, Subhendu K. Sahoo

The authors present a novel 16/32/64/128-point single-path delay feedback pipeline fast Fourier transform (FFT) architecture targeting the multi-rate and multi-regional orthogonal frequency division multiplexing (MR-OFDM) physical layer of IEEE 802.15.4-g. The proposed FFT architecture employs a mixed-radix algorithm to significantly reduce the number of complex multipliers. It utilises a configurable complex constant multiplier structure instead of a fixed constant multiplier to efficiently conduct , , and twiddle factor multiplication. A hardware-sharing mechanism has also been formulated to reduce the memory space requirements of the proposed 16/32/64/128-point FFT computation scheme. The proposed design is implemented in Xilinx Virtex-5 and Altera's field-programmable gate array devices. For the computation of 128-point FFT, the proposed mixed-radix FFT architecture significantly reduces the hardware cost in comparison with existing FFT architecture. The proposed FFT architecture is also implemented by adopting the 90 nm complementary metal-oxide-semiconductor technology with a supply voltage of 1 V. Post-synthesis results reveal that the design is efficient in terms of gate count and power consumption, compared to earlier reported designs. The proposed variable-length FFT architecture gate count is 22.3K and consumes 3.832 mW, while the word-length is 12-bits and can be efficiently useful for the IEEE 802.15.4-g standard.

针对IEEE 802.15.4-g的多速率、多区域正交频分复用(MR-OFDM)物理层,提出了一种新颖的16/32/64/128点单路径延迟反馈流水线快速傅立叶变换(FFT)架构。该架构采用混合基数算法,显著减少了复数乘法器的数量。它利用可配置的复杂常数乘法器结构而不是固定常数乘法器来有效地进行和旋转因子乘法。还制定了硬件共享机制,以减少所提出的16/32/64/128点FFT计算方案的存储器空间需求。所提出的设计在Xilinx Virtex-5和Altera的现场可编程门阵列器件中实现。对于128点FFT的计算,与现有的FFT架构相比,所提出的混合基数FFT架构显著降低了硬件成本。所提出的FFT架构还通过采用电源电压为1V的90nm互补金属氧化物半导体技术来实现。后合成结果表明,与早期报道的设计相比,该设计在栅极计数和功耗方面是有效的。所提出的可变长度FFT架构门计数为22.3K,消耗3.832mW,而字长为12位,可以有效地用于IEEE 802.15.4-g标准。
{"title":"Area and power-efficient variable-length fast Fourier transform for MR-OFDM physical layer of IEEE 802.15.4-g","authors":"Ganjikunta Ganesh Kumar,&nbsp;Subhendu K. Sahoo","doi":"10.1049/iet-cdt.2018.5260","DOIUrl":"https://doi.org/10.1049/iet-cdt.2018.5260","url":null,"abstract":"<div>\u0000 <p>The authors present a novel 16/32/64/128-point single-path delay feedback pipeline fast Fourier transform (FFT) architecture targeting the multi-rate and multi-regional orthogonal frequency division multiplexing (MR-OFDM) physical layer of IEEE 802.15.4-g. The proposed FFT architecture employs a mixed-radix algorithm to significantly reduce the number of complex multipliers. It utilises a configurable complex constant multiplier structure instead of a fixed constant multiplier to efficiently conduct , , and twiddle factor multiplication. A hardware-sharing mechanism has also been formulated to reduce the memory space requirements of the proposed 16/32/64/128-point FFT computation scheme. The proposed design is implemented in Xilinx Virtex-5 and Altera's field-programmable gate array devices. For the computation of 128-point FFT, the proposed mixed-radix FFT architecture significantly reduces the hardware cost in comparison with existing FFT architecture. The proposed FFT architecture is also implemented by adopting the 90 nm complementary metal-oxide-semiconductor technology with a supply voltage of 1 V. Post-synthesis results reveal that the design is efficient in terms of gate count and power consumption, compared to earlier reported designs. The proposed variable-length FFT architecture gate count is 22.3K and consumes 3.832 mW, while the word-length is 12-bits and can be efficiently useful for the IEEE 802.15.4-g standard.</p>\u0000 </div>","PeriodicalId":50383,"journal":{"name":"IET Computers and Digital Techniques","volume":"14 5","pages":"193-200"},"PeriodicalIF":1.2,"publicationDate":"2020-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1049/iet-cdt.2018.5260","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72161843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Ternary DDCVSL: a combined dynamic logic style for standard ternary logic with single power source 三值DDCVSL:一种用于标准三值逻辑的单电源组合动态逻辑风格
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2020-04-17 DOI: 10.1049/iet-cdt.2019.0216
Nooshin Azimi, Reza Faghih Mirzaee, Keivan Navi, Amir Masoud Rahmani

Every logic style has certain advantages for a specific application. Therefore, it is essential to introduce and investigate different logic styles. Differential cascode voltage switch logic (DCVSL) with the inherent redundancy is known to be an ideal logic style for error detection applications. This study combines ternary static DCVSL (SDCVSL) with dynamic logic (DL) to realise ternary dynamic DCVSL (DDCVSL) by means of a single power source. At first, it is shown that why the same static-to-dynamic conversion method in binary logic fails to operate correctly in ternary logic. Then, two solutions are given. Static power dissipation and switching activity are particularly dealt with in the second proposed ternary DDCVSL to reduce power consumption. The new designs are simulated and tested by using HSPICE simulator and 32 nm Stanford carbon nanotube field effect transistor model. Simulation results and comparisons with a vast range of conventional and state-of-the-art competitors show prominence and great potential for the new ternary circuit methodology. For example, the authors second proposed ternary DDCVSL AND/NAND has 19.7, 37.4, and 60.5% higher performance than some famous static ternary logic styles such as CMOS-like, SDCVSL, and pseudo N-type, respectively, in terms of energy consumption.

每种逻辑样式对于特定的应用程序都有一定的优势。因此,有必要介绍和研究不同的逻辑风格。已知具有固有冗余的差分共源共栅电压开关逻辑(DCVSL)是用于错误检测应用的理想逻辑类型。本研究将三元静态DCVSL(SDCVSL)与动态逻辑(DL)相结合,通过单一电源实现三元动态DCVSL。首先,说明了二进制逻辑中相同的静态到动态转换方法在三进制逻辑中不能正确操作的原因。然后,给出了两种解决方案。在第二个提出的三元DDCVSL中特别处理了静态功耗和开关活动,以降低功耗。使用HSPICE模拟器和32纳米斯坦福碳纳米管场效应晶体管模型对新设计进行了模拟和测试。仿真结果以及与众多传统和最先进的竞争对手的比较表明,新的三元电路方法具有突出的地位和巨大的潜力。例如,作者第二次提出的三元DDCVSL AND/NAND在能耗方面分别比一些著名的静态三元逻辑样式(如CMOS、SDCVSL和伪N型)高19.7%、37.4%和60.5%。
{"title":"Ternary DDCVSL: a combined dynamic logic style for standard ternary logic with single power source","authors":"Nooshin Azimi,&nbsp;Reza Faghih Mirzaee,&nbsp;Keivan Navi,&nbsp;Amir Masoud Rahmani","doi":"10.1049/iet-cdt.2019.0216","DOIUrl":"https://doi.org/10.1049/iet-cdt.2019.0216","url":null,"abstract":"<div>\u0000 <p>Every logic style has certain advantages for a specific application. Therefore, it is essential to introduce and investigate different logic styles. Differential cascode voltage switch logic (DCVSL) with the inherent redundancy is known to be an ideal logic style for error detection applications. This study combines ternary static DCVSL (SDCVSL) with dynamic logic (DL) to realise ternary dynamic DCVSL (DDCVSL) by means of a single power source. At first, it is shown that why the same static-to-dynamic conversion method in binary logic fails to operate correctly in ternary logic. Then, two solutions are given. Static power dissipation and switching activity are particularly dealt with in the second proposed ternary DDCVSL to reduce power consumption. The new designs are simulated and tested by using HSPICE simulator and 32 nm Stanford carbon nanotube field effect transistor model. Simulation results and comparisons with a vast range of conventional and state-of-the-art competitors show prominence and great potential for the new ternary circuit methodology. For example, the authors second proposed ternary DDCVSL AND/NAND has 19.7, 37.4, and 60.5% higher performance than some famous static ternary logic styles such as CMOS-like, SDCVSL, and pseudo N-type, respectively, in terms of energy consumption.</p>\u0000 </div>","PeriodicalId":50383,"journal":{"name":"IET Computers and Digital Techniques","volume":"14 4","pages":"166-175"},"PeriodicalIF":1.2,"publicationDate":"2020-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1049/iet-cdt.2019.0216","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71968646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Sensitivity analysis of testability parameters for secure IC design 安全集成电路设计中可测试性参数的灵敏度分析
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2020-04-17 DOI: 10.1049/iet-cdt.2019.0217
Sreeja Rajendran, Mary Lourde Regeena

Insertion of malicious circuits commonly known as Hardware Trojans into an original integrated circuit (IC) design to alter the functionality has been a major concern in recent years. As a result, over the years multiple techniques have been suggested by researchers to combat these malicious threats. Hard to test nets in any logic circuit are the most vulnerable to insertion of Hardware Trojans. Testability analysis is the process of identification of these hard to test nets in a logic circuit. Testability analysis is achieved through the testability metrics namely controllability and observability. Testability metrics can be used as a yardstick in devising efficient Hardware Trojan detection methods. The crux of this study is a novel method for identification of susceptible nets that are prone to Hardware Trojan insertions in a logic circuit. The study also presents a comprehensive analysis of the impact on testability parameters as a result of Hardware Trojans in the identified susceptible nets. The method utilises the testability parameters of nets to define threshold values for isolating susceptible nets in a design. The study details out the impact of the number of trigger inputs as well as the distribution of trigger nets on the testability metrics of digital circuits.

近年来,将通常被称为硬件木马的恶意电路插入到原始集成电路(IC)设计中以改变功能一直是一个主要问题。因此,多年来,研究人员提出了多种技术来对抗这些恶意威胁。任何逻辑电路中难以测试的网络都最容易受到硬件木马的插入。可测试性分析是识别逻辑电路中这些难以测试的网络的过程。可测试性分析是通过可测试性度量,即可控性和可观测性来实现的。可测试性度量可以用作设计高效硬件特洛伊木马检测方法的标准。这项研究的关键是一种新的方法来识别逻辑电路中容易插入硬件特洛伊木马的易感网络。该研究还对已识别的易感网络中的硬件木马对可测试性参数的影响进行了全面分析。该方法利用网络的可测试性参数来定义用于在设计中隔离易感网络的阈值。该研究详细说明了触发输入的数量以及触发网络的分布对数字电路可测试性指标的影响。
{"title":"Sensitivity analysis of testability parameters for secure IC design","authors":"Sreeja Rajendran,&nbsp;Mary Lourde Regeena","doi":"10.1049/iet-cdt.2019.0217","DOIUrl":"https://doi.org/10.1049/iet-cdt.2019.0217","url":null,"abstract":"<div>\u0000 <p>Insertion of malicious circuits commonly known as Hardware Trojans into an original integrated circuit (IC) design to alter the functionality has been a major concern in recent years. As a result, over the years multiple techniques have been suggested by researchers to combat these malicious threats. Hard to test nets in any logic circuit are the most vulnerable to insertion of Hardware Trojans. Testability analysis is the process of identification of these hard to test nets in a logic circuit. Testability analysis is achieved through the testability metrics namely controllability and observability. Testability metrics can be used as a yardstick in devising efficient Hardware Trojan detection methods. The crux of this study is a novel method for identification of susceptible nets that are prone to Hardware Trojan insertions in a logic circuit. The study also presents a comprehensive analysis of the impact on testability parameters as a result of Hardware Trojans in the identified susceptible nets. The method utilises the testability parameters of nets to define threshold values for isolating susceptible nets in a design. The study details out the impact of the number of trigger inputs as well as the distribution of trigger nets on the testability metrics of digital circuits.</p>\u0000 </div>","PeriodicalId":50383,"journal":{"name":"IET Computers and Digital Techniques","volume":"14 4","pages":"158-165"},"PeriodicalIF":1.2,"publicationDate":"2020-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1049/iet-cdt.2019.0217","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71968645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Amdahl's law in the context of heterogeneous many-core systems – a survey 异质多核心系统背景下的Amdahl定律——综述
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2020-04-03 DOI: 10.1049/iet-cdt.2018.5220
Mohammed A. Noaman Al-hayanni, Fei Xia, Ashur Rafiev, Alexander Romanovsky, Rishad Shafik, Alex Yakovlev

For over 50 years, Amdahl's Law has been the hallmark model for reasoning about performance bounds for homogeneous parallel computing resources. As heterogeneous, many-core parallel resources continue to permeate into the modern server and embedded domains, there has been growing interest in promulgating realistic extensions and assumptions in keeping with newer use cases. This study aims to provide a comprehensive review of the purviews and insights provided by the extensive body of work related to Amdahl's law to date, focusing on computation speedup. The authors show that a significant portion of these studies has looked into analysing the scalability of the model considering both workload and system heterogeneity in real-world applications. The focus has been to improve the definition and semantic power of the two key parameters in the original model: the parallel fraction (f) and the computation capability improvement index (n). More recently, researchers have shown normal-form and multi-fraction extensions that can account for wider ranges of heterogeneity, validated on many-core systems running realistic workloads. Speedup models from Amdahl's law onwards have seen a wide range of uses, such as the optimisation of system execution, and these uses are even more important with the advent of the heterogeneous many-core era.

50多年来,Amdahl定律一直是同构并行计算资源性能边界推理的标志性模型。随着异构、许多核心并行资源不断渗透到现代服务器和嵌入式领域,人们对发布符合新用例的现实扩展和假设越来越感兴趣。本研究旨在全面回顾迄今为止与Amdahl定律相关的大量工作所提供的观点和见解,重点是计算加速。作者表明,这些研究的很大一部分着眼于分析模型的可扩展性,同时考虑到现实世界应用程序中的工作负载和系统异构性。重点是改进原始模型中两个关键参数的定义和语义能力:并行分数(f)和计算能力改进指数(n)。最近,研究人员展示了可以解释更广泛异质性的正常形式和多部分扩展,并在许多运行现实工作负载的核心系统上进行了验证。Amdahl定律以后的加速模型有着广泛的用途,例如系统执行的优化,随着异构多核心时代的到来,这些用途变得更加重要。
{"title":"Amdahl's law in the context of heterogeneous many-core systems – a survey","authors":"Mohammed A. Noaman Al-hayanni,&nbsp;Fei Xia,&nbsp;Ashur Rafiev,&nbsp;Alexander Romanovsky,&nbsp;Rishad Shafik,&nbsp;Alex Yakovlev","doi":"10.1049/iet-cdt.2018.5220","DOIUrl":"https://doi.org/10.1049/iet-cdt.2018.5220","url":null,"abstract":"<div>\u0000 <p>For over 50 years, Amdahl's Law has been the hallmark model for reasoning about performance bounds for homogeneous parallel computing resources. As heterogeneous, many-core parallel resources continue to permeate into the modern server and embedded domains, there has been growing interest in promulgating realistic extensions and assumptions in keeping with newer use cases. This study aims to provide a comprehensive review of the purviews and insights provided by the extensive body of work related to Amdahl's law to date, focusing on computation speedup. The authors show that a significant portion of these studies has looked into analysing the scalability of the model considering both workload and system heterogeneity in real-world applications. The focus has been to improve the definition and semantic power of the two key parameters in the original model: the parallel fraction (<i>f</i>) and the computation capability improvement index (<i>n</i>). More recently, researchers have shown normal-form and multi-fraction extensions that can account for wider ranges of heterogeneity, validated on many-core systems running realistic workloads. Speedup models from Amdahl's law onwards have seen a wide range of uses, such as the optimisation of system execution, and these uses are even more important with the advent of the heterogeneous many-core era.</p>\u0000 </div>","PeriodicalId":50383,"journal":{"name":"IET Computers and Digital Techniques","volume":"14 4","pages":"133-148"},"PeriodicalIF":1.2,"publicationDate":"2020-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/iet-cdt.2018.5220","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71927028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
LUT-based high-speed point multiplier for Goldilocks-Curve448 基于LUT的Goldilocks-Curve448高速点乘法器
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2020-04-03 DOI: 10.1049/iet-cdt.2019.0041
Yasir A. Shah, Khalid Javeed, Muhammad I. Shehzad, Shoaib Azmat

Recent studies have shown that existing elliptic curve-based cryptographic standards provide backdoors for manipulation and hence compromise the security. In this regard, two new elliptic curves known as Curve448 and Curve25519 are recently recommended by IETF for transport layer security future generations. Hence, cryptosystems built over these elliptic curves are expected to play a vital role in the near future for secure communications. A high-speed elliptic curve cryptographic processor (ECCP) for the Curve448 is proposed in this study. The area of the ECCP is optimised by performing different modular operations required for the elliptic curve Diffie–Hellman protocol through a unified architecture. The critical path delay of the proposed ECCP is optimised by adopting the redundant-signed-digit technique for arithmetic operations. The segmentation approach is introduced to reduce the required number of clock cycles for the ECCP. The proposed ECCP is developed using look-up-tables (LUTs) only, and hence it can be ported to any field-programmable gate array family or standard ASIC libraries. The authors' ECCP design offers higher speed without any significant area overhead to recent designs reported in the literature.

最近的研究表明,现有的基于椭圆曲线的密码标准为操作提供了后门,从而损害了安全性。在这方面,IETF最近推荐了两种新的椭圆曲线Curve448和Curve25519,用于未来几代的传输层安全。因此,建立在这些椭圆曲线上的密码系统有望在不久的将来在安全通信中发挥至关重要的作用。本文提出了一种用于Curve448的高速椭圆曲线密码处理器(ECCP)。ECCP的区域通过统一架构执行椭圆曲线Diffie–Hellman协议所需的不同模块化操作来优化。通过采用冗余带符号数字技术进行算术运算,优化了所提出的ECCP的关键路径延迟。引入分段方法以减少ECCP所需的时钟周期数。所提出的ECCP仅使用查找表(LUT)开发,因此它可以移植到任何现场可编程门阵列系列或标准ASIC库。与文献中报道的最近的设计相比,作者的ECCP设计提供了更高的速度,而没有任何显著的面积开销。
{"title":"LUT-based high-speed point multiplier for Goldilocks-Curve448","authors":"Yasir A. Shah,&nbsp;Khalid Javeed,&nbsp;Muhammad I. Shehzad,&nbsp;Shoaib Azmat","doi":"10.1049/iet-cdt.2019.0041","DOIUrl":"https://doi.org/10.1049/iet-cdt.2019.0041","url":null,"abstract":"<div>\u0000 <p>Recent studies have shown that existing elliptic curve-based cryptographic standards provide backdoors for manipulation and hence compromise the security. In this regard, two new elliptic curves known as Curve448 and Curve25519 are recently recommended by IETF for transport layer security future generations. Hence, cryptosystems built over these elliptic curves are expected to play a vital role in the near future for secure communications. A high-speed elliptic curve cryptographic processor (ECCP) for the Curve448 is proposed in this study. The area of the ECCP is optimised by performing different modular operations required for the elliptic curve Diffie–Hellman protocol through a unified architecture. The critical path delay of the proposed ECCP is optimised by adopting the redundant-signed-digit technique for arithmetic operations. The segmentation approach is introduced to reduce the required number of clock cycles for the ECCP. The proposed ECCP is developed using look-up-tables (LUTs) only, and hence it can be ported to any field-programmable gate array family or standard ASIC libraries. The authors' ECCP design offers higher speed without any significant area overhead to recent designs reported in the literature.</p>\u0000 </div>","PeriodicalId":50383,"journal":{"name":"IET Computers and Digital Techniques","volume":"14 4","pages":"149-157"},"PeriodicalIF":1.2,"publicationDate":"2020-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1049/iet-cdt.2019.0041","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71927029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
The analogy of matchline sensing techniques for content addressable memory (CAM) 内容可寻址存储器(CAM)的匹配线传感技术的相似性
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2020-04-01 DOI: 10.1049/iet-cdt.2019.0178
Sandeep Mishra, Telajala Venkata Mahendra, Sheikh Wasmir Hussain, Anup Dandapat

Performance of a memory depends on the storage stability, yield and sensing speed. Differential input and the latching time of sense amplifiers are considered as primary performance factors in static random access memory. In a content addressable memory (CAM), the sensing is carried out through the matchline (ML) and the time for evaluation is the key to decide the search speed. The density of CAM is on a rise to accommodate a higher amount of information which increases the power dissipation associated with it. Issues such as the logical threshold variation and lower noise margin between match and mismatch are critical in the operation of a CAM. A good ML sensing technique can reduce the ML power with enhanced evaluation speed. This work provides an analogy of various ML sensing techniques based on their pre-charging, evaluation and performance improvement strategies. Estimation on the power dissipation and evaluation time are made and in-depth analysis on their power-speed-overhead trade-off are carried on 64-bit CAM macros.

存储器的性能取决于存储稳定性、产量和传感速度。差分输入和读出放大器的锁存时间被认为是静态随机存取存储器的主要性能因素。在内容可寻址存储器(CAM)中,感测是通过匹配线(ML)进行的,评估时间是决定搜索速度的关键。CAM的密度正在增加,以容纳更高数量的信息,这增加了与之相关的功耗。逻辑阈值变化和匹配与失配之间的较低噪声裕度等问题在CAM的操作中至关重要。良好的ML感测技术可以降低ML功率,同时提高评估速度。这项工作基于各种ML传感技术的预充电、评估和性能改进策略,对其进行了类比。对功耗和评估时间进行了估计,并在64位CAM宏上对它们的功率-速度开销权衡进行了深入分析。
{"title":"The analogy of matchline sensing techniques for content addressable memory (CAM)","authors":"Sandeep Mishra,&nbsp;Telajala Venkata Mahendra,&nbsp;Sheikh Wasmir Hussain,&nbsp;Anup Dandapat","doi":"10.1049/iet-cdt.2019.0178","DOIUrl":"https://doi.org/10.1049/iet-cdt.2019.0178","url":null,"abstract":"<div>\u0000 <p>Performance of a memory depends on the storage stability, yield and sensing speed. Differential input and the latching time of sense amplifiers are considered as primary performance factors in static random access memory. In a content addressable memory (CAM), the sensing is carried out through the matchline (ML) and the time for evaluation is the key to decide the search speed. The density of CAM is on a rise to accommodate a higher amount of information which increases the power dissipation associated with it. Issues such as the logical threshold variation and lower noise margin between match and mismatch are critical in the operation of a CAM. A good ML sensing technique can reduce the ML power with enhanced evaluation speed. This work provides an analogy of various ML sensing techniques based on their pre-charging, evaluation and performance improvement strategies. Estimation on the power dissipation and evaluation time are made and in-depth analysis on their power-speed-overhead trade-off are carried on 64-bit CAM macros.</p>\u0000 </div>","PeriodicalId":50383,"journal":{"name":"IET Computers and Digital Techniques","volume":"14 3","pages":"87-96"},"PeriodicalIF":1.2,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1049/iet-cdt.2019.0178","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72124999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Single bit-line 11T SRAM cell for low power and improved stability 用于低功耗和提高稳定性的单比特线11T SRAM单元
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2020-03-31 DOI: 10.1049/iet-cdt.2019.0234
Rohit Lorenzo, Roy Pailly

This study aims for a new 11T static random access memory (SRAM) cell that uses power gating transistors and transmission gate for low leakage and reliable write operation. The proposed cell has a separate read and write path which successfully improves read and write abilities. Furthermore, it solves the row half select disturbance and utilises a row-based virtual ground signal to eliminate unnecessary bit-line discharge in the un-selected row, thus decreasing energy consumption. The cell also achieves low power due to the stack effect. To show the effectiveness of the cell, its design metrics are compared with other published SRAM cells, namely, conventional 6T, 10T, 9T, and power-gated 9T (PG9T). In standby mode, from 6.71 to 7.37% leakage power reduction is observed for this cell at an operating voltage of 1.2 V and 29.21 to 58.68% & 32.74 to 71.11% improvement for write & read power over other cells. The proposed cell exhibits higher write and reads static noise margins with an improvement of 13.54 and 63.28%, respectively, compared to conventional 6T SRAM cell. The cell provides write delay improvement from 29.77 to 49.40% and read delay improvement from 7 to 12% compared to 9T, 10T, and PG9T, respectively.

本研究旨在开发一种新的11T静态随机存取存储器(SRAM)单元,该单元使用功率门控晶体管和传输门来实现低泄漏和可靠的写入操作。所提出的单元具有独立的读写路径,这成功地提高了读写能力。此外,它解决了行半选择干扰,并利用基于行的虚拟接地信号来消除未选择行中不必要的位线放电,从而降低能耗。由于堆叠效应,该电池还实现了低功率。为了显示该单元的有效性,将其设计指标与其他已发表的SRAM单元进行了比较,即传统的6T、10T、9T和功率门控9T(PG9T)。在待机模式中,在1.2V的操作电压和29.21至58.68%&;写入&;读取其他单元格的功率。与传统的6T SRAM单元相比,所提出的单元表现出更高的写入和读取静态噪声裕度,分别提高了13.54%和63.28%。与9T、10T和PG9T相比,该单元分别提供从29.77%到49.40%的写入延迟改进和从7%到12%的读取延迟改进。
{"title":"Single bit-line 11T SRAM cell for low power and improved stability","authors":"Rohit Lorenzo,&nbsp;Roy Pailly","doi":"10.1049/iet-cdt.2019.0234","DOIUrl":"https://doi.org/10.1049/iet-cdt.2019.0234","url":null,"abstract":"<div>\u0000 <p>This study aims for a new 11T static random access memory (SRAM) cell that uses power gating transistors and transmission gate for low leakage and reliable write operation. The proposed cell has a separate read and write path which successfully improves read and write abilities. Furthermore, it solves the row half select disturbance and utilises a row-based virtual ground signal to eliminate unnecessary bit-line discharge in the un-selected row, thus decreasing energy consumption. The cell also achieves low power due to the stack effect. To show the effectiveness of the cell, its design metrics are compared with other published SRAM cells, namely, conventional 6T, 10T, 9T, and power-gated 9T (PG9T). In standby mode, from 6.71 to 7.37% leakage power reduction is observed for this cell at an operating voltage of 1.2 V and 29.21 to 58.68% &amp; 32.74 to 71.11% improvement for write &amp; read power over other cells. The proposed cell exhibits higher write and reads static noise margins with an improvement of 13.54 and 63.28%, respectively, compared to conventional 6T SRAM cell. The cell provides write delay improvement from 29.77 to 49.40% and read delay improvement from 7 to 12% compared to 9T, 10T, and PG9T, respectively.</p>\u0000 </div>","PeriodicalId":50383,"journal":{"name":"IET Computers and Digital Techniques","volume":"14 3","pages":"114-121"},"PeriodicalIF":1.2,"publicationDate":"2020-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1049/iet-cdt.2019.0234","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71998028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Scalable pseudo-exhaustive methodology for testing and diagnosis in flow-based microfluidic biochips 基于流动的微流控生物芯片测试和诊断的可扩展伪穷举方法
IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2020-03-31 DOI: 10.1049/iet-cdt.2018.5029
Gokulkrishnan Vadakkeveedu, Kamakoti Veezhinathan, Nitin Chandrachoodan, Seetal Potluri

Microfluidics is an upcoming field of science that is going to be used widely in many safety-critical applications including healthcare, medical research and defence. Hence, technologies for fault testing and fault diagnosis of these chips are of extreme importance. In this study, the authors propose a scalable pseudo-exhaustive testing and diagnosis methodology for flow-based microfluidic biochips. The proposed approach employs a divide-and-conquer based technique wherein, large architectures are split into smaller sub-architectures and each of these are tested and diagnosed independently.

微流体是一个即将到来的科学领域,将被广泛用于许多安全关键应用,包括医疗保健、医学研究和国防。因此,这些芯片的故障测试和故障诊断技术具有极其重要的意义。在这项研究中,作者提出了一种可扩展的基于流动的微流控生物芯片的伪穷举测试和诊断方法。所提出的方法采用了一种基于分而治之的技术,其中,将大型体系结构拆分为较小的子体系结构,并对每个子体系结构进行独立的测试和诊断。
{"title":"Scalable pseudo-exhaustive methodology for testing and diagnosis in flow-based microfluidic biochips","authors":"Gokulkrishnan Vadakkeveedu,&nbsp;Kamakoti Veezhinathan,&nbsp;Nitin Chandrachoodan,&nbsp;Seetal Potluri","doi":"10.1049/iet-cdt.2018.5029","DOIUrl":"https://doi.org/10.1049/iet-cdt.2018.5029","url":null,"abstract":"<div>\u0000 <p>Microfluidics is an upcoming field of science that is going to be used widely in many safety-critical applications including healthcare, medical research and defence. Hence, technologies for fault testing and fault diagnosis of these chips are of extreme importance. In this study, the authors propose a scalable pseudo-exhaustive testing and diagnosis methodology for flow-based microfluidic biochips. The proposed approach employs a divide-and-conquer based technique wherein, large architectures are split into smaller sub-architectures and each of these are tested and diagnosed independently.</p>\u0000 </div>","PeriodicalId":50383,"journal":{"name":"IET Computers and Digital Techniques","volume":"14 3","pages":"122-131"},"PeriodicalIF":1.2,"publicationDate":"2020-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1049/iet-cdt.2018.5029","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72170152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
IET Computers and Digital Techniques
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1