首页 > 最新文献

2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)最新文献

英文 中文
A Cost-Effective CNN Accelerator Design with Configurable PU on FPGA FPGA上可配置PU的高性价比CNN加速器设计
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00015
Chi Fung Brian Fong, Jiandong Mu, Wei Zhang
Convolutional neural networks (CNNs) are rapidly expanding and being applied to a vast range of applications. Despite its popularity, deploying CNNs on a portable system is challenging due to enormous data volume, intensive computation, and frequent memory access. Hence, many approaches have been proposed to reduce the CNN model complexity, such as model pruning and quantization. However, it also brings new challenges. For example, existing designs usually adopted channel dimension tiling which requires regular channel number. After pruning, the channel number may become highly irregular which will incur heavy zero padding and large resource waste. As for quantization, simple aggressive bit reduction usually results in large accuracy drop. In order to address these challenges, in this work, firstly we propose to use row-based tiling in the kernel dimension to adapt to different kernel sizes and channel numbers and significantly reduce the zero padding. Moreover, we developed the configurable processing units (PUs) design which can be dynamically grouped or split to support the tiling flexibility and enable efficient hardware resource sharing. As for quantization, we considered the recently proposed Incremental Network Quantization (INQ) algorithm which uses low bit representation of weights in power of 2 format, and hence is able to represent the weights with minimum computing complexity since expensive multiplication can be transferred into cheap shift operation. We further propose an approximate shifter based processing element (PE) design as the fundamental building block of the PUs to facilitate the convolution computation. At last, a case study of RTL-level implementation of INQ quantized AlexNet is realized on a standalone FPGA, Stratix V. Compared with the state-of-art designs, our accelerator achieves 1.87x higher performance, which demonstrates the efficiency of the proposed design methods.
卷积神经网络(Convolutional neural networks, cnn)正在迅速发展并被广泛应用。尽管cnn很受欢迎,但由于庞大的数据量、密集的计算和频繁的内存访问,在便携式系统上部署cnn是具有挑战性的。因此,人们提出了许多降低CNN模型复杂性的方法,如模型修剪和量化。然而,它也带来了新的挑战。例如,现有的设计通常采用通道尺寸平铺,这需要固定的通道编号。修剪后的通道数可能会变得非常不规则,这将导致大量的零填充和大量的资源浪费。在量化方面,简单的主动降位通常会导致较大的精度下降。为了解决这些问题,在这项工作中,我们首先提出在内核维度上使用基于行的平铺来适应不同的内核大小和通道数,并显著减少零填充。此外,我们还开发了可配置处理单元(pu)设计,可以动态分组或拆分,以支持平铺灵活性和实现有效的硬件资源共享。在量化方面,我们考虑了最近提出的增量网络量化(INQ)算法,该算法使用2次幂格式的低比特表示权重,因此能够以最小的计算复杂度表示权重,因为昂贵的乘法可以转换为便宜的移位操作。我们进一步提出了一种基于近似移位器的处理单元(PE)设计作为处理器的基本构建块,以促进卷积计算。最后,在独立的FPGA Stratix v上实现了INQ量化AlexNet的rtl级实现,与目前的设计相比,我们的加速器的性能提高了1.87倍,证明了所提出的设计方法的有效性。
{"title":"A Cost-Effective CNN Accelerator Design with Configurable PU on FPGA","authors":"Chi Fung Brian Fong, Jiandong Mu, Wei Zhang","doi":"10.1109/ISVLSI.2019.00015","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00015","url":null,"abstract":"Convolutional neural networks (CNNs) are rapidly expanding and being applied to a vast range of applications. Despite its popularity, deploying CNNs on a portable system is challenging due to enormous data volume, intensive computation, and frequent memory access. Hence, many approaches have been proposed to reduce the CNN model complexity, such as model pruning and quantization. However, it also brings new challenges. For example, existing designs usually adopted channel dimension tiling which requires regular channel number. After pruning, the channel number may become highly irregular which will incur heavy zero padding and large resource waste. As for quantization, simple aggressive bit reduction usually results in large accuracy drop. In order to address these challenges, in this work, firstly we propose to use row-based tiling in the kernel dimension to adapt to different kernel sizes and channel numbers and significantly reduce the zero padding. Moreover, we developed the configurable processing units (PUs) design which can be dynamically grouped or split to support the tiling flexibility and enable efficient hardware resource sharing. As for quantization, we considered the recently proposed Incremental Network Quantization (INQ) algorithm which uses low bit representation of weights in power of 2 format, and hence is able to represent the weights with minimum computing complexity since expensive multiplication can be transferred into cheap shift operation. We further propose an approximate shifter based processing element (PE) design as the fundamental building block of the PUs to facilitate the convolution computation. At last, a case study of RTL-level implementation of INQ quantized AlexNet is realized on a standalone FPGA, Stratix V. Compared with the state-of-art designs, our accelerator achieves 1.87x higher performance, which demonstrates the efficiency of the proposed design methods.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"23 1","pages":"31-36"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73041658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Hybrid Memristor-CMOS Obfuscation Against Untrusted Foundries 针对不可信代工厂的混合忆阻器- cmos混淆
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00102
Amin Rezaei, J. Gu, H. Zhou
The high cost of IC design has made chip protection one of the first priorities of the semiconductor industry. In addition, with the growing number of untrusted foundries, the possibility of inside foundry attack is escalating. However, by taking advantage of polymorphic gates, the layouts of the circuits with different functionalities look exactly identical, making it impossible even for an inside foundry attacker to distinguish the defined functionality of an IC by looking at its layout. Moreover, since memristor is compatible with CMOS structure, it is possible to efficiently design hybrid memristor-CMOS circuits. In this paper, we propose a hardware obfuscation method based on polymorphic hybrid memristor-CMOS technology. Overhead of the polymorphic designs and the time complexity of possible attacks are discussed.
集成电路设计的高成本使芯片保护成为半导体工业的首要任务之一。此外,随着越来越多的不受信任的代工厂,内部攻击的可能性正在升级。然而,通过利用多态门,具有不同功能的电路的布局看起来完全相同,即使是内部代工厂攻击者也无法通过查看其布局来区分IC的定义功能。此外,由于忆阻器与CMOS结构兼容,因此可以高效地设计忆阻器-CMOS混合电路。本文提出了一种基于多晶混合忆阻器- cmos技术的硬件混淆方法。讨论了多态设计的开销和可能的攻击的时间复杂度。
{"title":"Hybrid Memristor-CMOS Obfuscation Against Untrusted Foundries","authors":"Amin Rezaei, J. Gu, H. Zhou","doi":"10.1109/ISVLSI.2019.00102","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00102","url":null,"abstract":"The high cost of IC design has made chip protection one of the first priorities of the semiconductor industry. In addition, with the growing number of untrusted foundries, the possibility of inside foundry attack is escalating. However, by taking advantage of polymorphic gates, the layouts of the circuits with different functionalities look exactly identical, making it impossible even for an inside foundry attacker to distinguish the defined functionality of an IC by looking at its layout. Moreover, since memristor is compatible with CMOS structure, it is possible to efficiently design hybrid memristor-CMOS circuits. In this paper, we propose a hardware obfuscation method based on polymorphic hybrid memristor-CMOS technology. Overhead of the polymorphic designs and the time complexity of possible attacks are discussed.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"23 1","pages":"535-540"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75333020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Optimization of Convolutional Neural Networks on Resource Constrained Devices 资源受限设备上卷积神经网络的优化
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00013
Arish S, Sharad Sinha, S. K G
Implementation of convolutional neural networks (CNNs) on resource constrained devices like FPGA (example: Zynq) etc. is important for intelligence in edge computing. This paper presents and discusses different hardware optimization methods that were employed to design a CNN model that is amenable to such devices, in general. Adaptive processing, exploitation of parallelism etc. are employed to show the superior performance of proposed methods over state of the art.
卷积神经网络(cnn)在FPGA(例如:Zynq)等资源受限设备上的实现对于边缘计算中的智能非常重要。本文提出并讨论了不同的硬件优化方法,用于设计一般适用于此类设备的CNN模型。采用自适应处理、利用并行性等来显示所提出的方法优于现有技术的性能。
{"title":"Optimization of Convolutional Neural Networks on Resource Constrained Devices","authors":"Arish S, Sharad Sinha, S. K G","doi":"10.1109/ISVLSI.2019.00013","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00013","url":null,"abstract":"Implementation of convolutional neural networks (CNNs) on resource constrained devices like FPGA (example: Zynq) etc. is important for intelligence in edge computing. This paper presents and discusses different hardware optimization methods that were employed to design a CNN model that is amenable to such devices, in general. Adaptive processing, exploitation of parallelism etc. are employed to show the superior performance of proposed methods over state of the art.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"26 1","pages":"19-24"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78945133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Towards Hardware-Assisted Security for IoT Systems 面向物联网系统的硬件辅助安全
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00118
Yier Jin
As computing devices become more commonplace in every day life, we have seen an increase of possible attacks on commercial devices and critical infrastructure. As a result, both academia and industry have proposed solutions to mitigate or outright eliminate the ever expanding set of viable targets. Initially, this resulted in an influx of software-based defenses against these emerging threats. Unfortunately, it was found that software solutions could be bypassed with more advanced attacks and often resulted in high performance overhead. As such, hardware-assisted security defenses have been developed to provide improved security while keeping performance overhead to manageable levels, especially for IoT devices. In this paper, we will provide a survey of prominent hardware-assisted security defenses. We will enumerate the attacks these defenses aim to protect, as well as their effectiveness. We will also discuss the implications in both performance and system design. A comparison between approaches that target the same set of issues, and possible directions for future research will be presented.
随着计算设备在日常生活中变得越来越普遍,我们已经看到针对商业设备和关键基础设施的可能攻击有所增加。因此,学术界和工业界都提出了解决方案,以减轻或彻底消除不断扩大的可行目标。最初,这导致了针对这些新出现的威胁的基于软件的防御涌入。不幸的是,发现软件解决方案可以被更高级的攻击绕过,并且经常导致高性能开销。因此,硬件辅助安全防御已经开发出来,以提供更高的安全性,同时将性能开销保持在可管理的水平,特别是对于物联网设备。在本文中,我们将提供一个突出的硬件辅助安全防御的调查。我们将列举这些防御旨在保护的攻击,以及它们的有效性。我们还将讨论性能和系统设计方面的影响。将对针对同一组问题的方法进行比较,并提出未来研究的可能方向。
{"title":"Towards Hardware-Assisted Security for IoT Systems","authors":"Yier Jin","doi":"10.1109/ISVLSI.2019.00118","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00118","url":null,"abstract":"As computing devices become more commonplace in every day life, we have seen an increase of possible attacks on commercial devices and critical infrastructure. As a result, both academia and industry have proposed solutions to mitigate or outright eliminate the ever expanding set of viable targets. Initially, this resulted in an influx of software-based defenses against these emerging threats. Unfortunately, it was found that software solutions could be bypassed with more advanced attacks and often resulted in high performance overhead. As such, hardware-assisted security defenses have been developed to provide improved security while keeping performance overhead to manageable levels, especially for IoT devices. In this paper, we will provide a survey of prominent hardware-assisted security defenses. We will enumerate the attacks these defenses aim to protect, as well as their effectiveness. We will also discuss the implications in both performance and system design. A comparison between approaches that target the same set of issues, and possible directions for future research will be presented.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"196 1","pages":"632-637"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76958171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
PVTMC: An All-Digital Sub-Picosecond Timing Measurement Circuit Based on Process Variations PVTMC:基于工艺变化的全数字亚皮秒定时测量电路
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00108
Shuo Li, Xiaolin Xu, W. Burleson
Measuring timing signals of sub-picosecond has become an urgent need in today's high-speed electronics design. However, conventional timing measurement devices are either built with analog circuits that are hard to integrate with digital systems or differential delay-lines whose resolution is significantly impacted by the process variations of a circuit or environmental conditions. In this work, we present a novel all-digital timing measurement circuit: PVTMC, which for the first time, constructively leverages process variations to measure timing signals of sub-picosecond duration. We show that PVTMC achieves high resolution and robust performance against environmental fluctuations. A simple random search-based method is proposed to speed up the measurements and improve its accuracy. We also demonstrate that PVTMC is compatible with most prevalent CMOS technology nodes, as well as FPGA implementations due to the existence of process variations.
测量亚皮秒级的定时信号已成为当今高速电子设计的迫切需要。然而,传统的定时测量设备要么是用难以与数字系统集成的模拟电路构建的,要么是差分延迟线,其分辨率受到电路工艺变化或环境条件的显著影响。在这项工作中,我们提出了一种新颖的全数字定时测量电路:PVTMC,它首次建设性地利用过程变化来测量亚皮秒持续时间的定时信号。我们证明了PVTMC可以实现高分辨率和抗环境波动的鲁棒性能。为了提高测量速度和精度,提出了一种简单的基于随机搜索的方法。我们还证明,由于存在工艺变化,PVTMC与大多数流行的CMOS技术节点以及FPGA实现兼容。
{"title":"PVTMC: An All-Digital Sub-Picosecond Timing Measurement Circuit Based on Process Variations","authors":"Shuo Li, Xiaolin Xu, W. Burleson","doi":"10.1109/ISVLSI.2019.00108","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00108","url":null,"abstract":"Measuring timing signals of sub-picosecond has become an urgent need in today's high-speed electronics design. However, conventional timing measurement devices are either built with analog circuits that are hard to integrate with digital systems or differential delay-lines whose resolution is significantly impacted by the process variations of a circuit or environmental conditions. In this work, we present a novel all-digital timing measurement circuit: PVTMC, which for the first time, constructively leverages process variations to measure timing signals of sub-picosecond duration. We show that PVTMC achieves high resolution and robust performance against environmental fluctuations. A simple random search-based method is proposed to speed up the measurements and improve its accuracy. We also demonstrate that PVTMC is compatible with most prevalent CMOS technology nodes, as well as FPGA implementations due to the existence of process variations.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"10 1","pages":"574-579"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73943800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Approximate Computing Applied to Bacterial Genome Identification using Self-Organizing Maps 近似计算在细菌基因组自组织图谱鉴定中的应用
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00106
D. Stathis, Yu Yang, S. Tewari, A. Hemani, Kolin Paul, M. Grabherr, Rafi Ahmad
In this paper we explore the design space of a self-organizing map (SOM) used for rapid and accurate identification of bacterial genomes. This is an important health care problem because even in Europe, 70% of prescriptions for antibiotics is wrong. The SOM is trained on Next Generation Sequencing (NGS) data and is able to identify the exact strain of bacteria. This is in contrast to conventional methods that require genome assembly to identify the bacterial strain. SOM has been implemented as an synchoros VLSI design and shown to have 3-4 orders better computational efficiency compared to GPUs. To further lower the energy consumption, we exploit the robustness of SOM by successively lowering the resolution to gain further improvements in efficiency and lower the implementation cost without substantially sacrificing the accuracy. We do an in depth analysis of the reduction in resolution vs. loss in accuracy as the basis for designing a system with the lowest cost and acceptable accuracy using NGS data from samples containing multiple bacteria from the labs of one of the co-authors. The objective of this method is to design a bacterial recognition system for battery operated clinical use where the area, power and performance are of critical importance. We demonstrate that with 39% loss in accuracy in 12 bits and 1% in 16 bit representation can yield significant savings in energy and area.
在本文中,我们探索了一种用于快速准确鉴定细菌基因组的自组织图谱(SOM)的设计空间。这是一个重要的卫生保健问题,因为即使在欧洲,70%的抗生素处方是错误的。SOM是在下一代测序(NGS)数据上训练的,能够识别准确的细菌菌株。这与传统方法相反,传统方法需要基因组组装来识别细菌菌株。SOM已经作为同步VLSI设计实现,并且与gpu相比具有3-4个数量级的计算效率。为了进一步降低能量消耗,我们通过不断降低分辨率来利用SOM的鲁棒性,在不牺牲精度的情况下进一步提高效率和降低实现成本。我们对分辨率降低与精度损失进行了深入分析,作为设计成本最低且精度可接受的系统的基础,使用了来自合作作者之一的实验室中含有多种细菌的样品的NGS数据。该方法的目的是设计一种用于电池操作的临床应用的细菌识别系统,其中面积,功率和性能至关重要。我们证明,在12位表示精度损失39%和16位表示精度损失1%的情况下,可以显著节省能源和面积。
{"title":"Approximate Computing Applied to Bacterial Genome Identification using Self-Organizing Maps","authors":"D. Stathis, Yu Yang, S. Tewari, A. Hemani, Kolin Paul, M. Grabherr, Rafi Ahmad","doi":"10.1109/ISVLSI.2019.00106","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00106","url":null,"abstract":"In this paper we explore the design space of a self-organizing map (SOM) used for rapid and accurate identification of bacterial genomes. This is an important health care problem because even in Europe, 70% of prescriptions for antibiotics is wrong. The SOM is trained on Next Generation Sequencing (NGS) data and is able to identify the exact strain of bacteria. This is in contrast to conventional methods that require genome assembly to identify the bacterial strain. SOM has been implemented as an synchoros VLSI design and shown to have 3-4 orders better computational efficiency compared to GPUs. To further lower the energy consumption, we exploit the robustness of SOM by successively lowering the resolution to gain further improvements in efficiency and lower the implementation cost without substantially sacrificing the accuracy. We do an in depth analysis of the reduction in resolution vs. loss in accuracy as the basis for designing a system with the lowest cost and acceptable accuracy using NGS data from samples containing multiple bacteria from the labs of one of the co-authors. The objective of this method is to design a bacterial recognition system for battery operated clinical use where the area, power and performance are of critical importance. We demonstrate that with 39% loss in accuracy in 12 bits and 1% in 16 bit representation can yield significant savings in energy and area.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"23 1","pages":"560-567"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74400303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Design of a CMOS Broadband Transimpedance Amplifier with Floating Active Inductor 带浮动有源电感的CMOS宽带跨阻放大器的设计
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00050
Xiangyu Chen, Yasuhiro Takahashi
This work presents the design and performance of a transimpedance amplifier (TIA) implemented in a 180nm CMOS technology. The introduced TIA uses a floating active inductor (FAI) based on gyrator-C structure, which is why it can increase the bandwidth while occupying a smaller chip area. And we have explained the schematic and characteristics of the gyrator-C structure and FAI. In addition, the proposed TIA also uses the combination of capacitive degeneration, broadband matching network, and the regulated cascode (RGC) input stage, which turns the transimpedance amplifier (TIA) design into a fifth-order low-pass filter with butterworth response, to enhance the bandwidth and the gain. The TIA is implemented in 0.18 m Rohm CMOS technology and a 1.8-V supply. The transimpedance amplifier achieves a transimpedance gain of 41dBΩ and -3 dB frequency of 10 GHz with 0.1pF total input capacitance. Layout size of the proposed TIA is 180 μm × 118 μm.
本文介绍了一种采用180nm CMOS技术实现的跨阻放大器(TIA)的设计和性能。所介绍的TIA采用基于陀螺- c结构的浮动有源电感(FAI),这就是为什么它可以在占用更小的芯片面积的同时增加带宽。并对陀螺- c结构和FAI的原理图和特点进行了说明。此外,所提出的TIA还采用电容性退化、宽带匹配网络和可调节级联码(RGC)输入级相结合的方法,将跨阻放大器(TIA)设计变成具有巴特沃斯响应的五阶低通滤波器,以提高带宽和增益。TIA采用0.18 m Rohm CMOS技术和1.8 v电源实现。该跨阻放大器的跨阻增益为41dBΩ,在10 GHz的-3 dB频率下,总输入电容为0.1pF。TIA的版图尺寸为180 μm × 118 μm。
{"title":"Design of a CMOS Broadband Transimpedance Amplifier with Floating Active Inductor","authors":"Xiangyu Chen, Yasuhiro Takahashi","doi":"10.1109/ISVLSI.2019.00050","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00050","url":null,"abstract":"This work presents the design and performance of a transimpedance amplifier (TIA) implemented in a 180nm CMOS technology. The introduced TIA uses a floating active inductor (FAI) based on gyrator-C structure, which is why it can increase the bandwidth while occupying a smaller chip area. And we have explained the schematic and characteristics of the gyrator-C structure and FAI. In addition, the proposed TIA also uses the combination of capacitive degeneration, broadband matching network, and the regulated cascode (RGC) input stage, which turns the transimpedance amplifier (TIA) design into a fifth-order low-pass filter with butterworth response, to enhance the bandwidth and the gain. The TIA is implemented in 0.18 m Rohm CMOS technology and a 1.8-V supply. The transimpedance amplifier achieves a transimpedance gain of 41dBΩ and -3 dB frequency of 10 GHz with 0.1pF total input capacitance. Layout size of the proposed TIA is 180 μm × 118 μm.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"1 1","pages":"230-234"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86905165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A Case Study On Approximate FPGA Design With an Open-Source Image Processing Platform 基于开源图像处理平台的近似FPGA设计案例研究
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00074
Yunxiang Zhang, Xiaokun Yang, Lei Wu, J. Andrian
This paper presents a case study of approximate design with Field-Programmable Gate Array (FPGA), combining an application of a color to grayscale converter and an open image/video processing platform with Verilog hardware description language (HDL). First of all, by integrating two approximations of adders and two approximations of multipliers, together with the exact design, nine different approximations of the design on color to grayscale converter are offered. Second, the image processing platform is presented to demonstrate the proposed work on Nexys-4 FPGA, enabling to capture color images through a low-cost OV7670 camera and display the grayscale results of images on a VGA-interfaced monitor. Experimental results show the difference between different approximations of the design, providing a range of design options corresponding to different quality constrains.
本文介绍了一个使用现场可编程门阵列(FPGA)进行近似设计的案例研究,结合了一个彩色到灰度转换器的应用和一个带有Verilog硬件描述语言(HDL)的开放式图像/视频处理平台。首先,通过对加法器的两个近似和乘法器的两个近似的积分,结合具体的设计,给出了色灰转换器设计的九种不同的近似。其次,给出了图像处理平台,在Nexys-4 FPGA上演示了所提出的工作,使其能够通过低成本的OV7670摄像机捕获彩色图像,并在vga接口监视器上显示图像的灰度结果。实验结果显示了不同近似设计之间的差异,为不同的质量约束提供了一系列设计选项。
{"title":"A Case Study On Approximate FPGA Design With an Open-Source Image Processing Platform","authors":"Yunxiang Zhang, Xiaokun Yang, Lei Wu, J. Andrian","doi":"10.1109/ISVLSI.2019.00074","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00074","url":null,"abstract":"This paper presents a case study of approximate design with Field-Programmable Gate Array (FPGA), combining an application of a color to grayscale converter and an open image/video processing platform with Verilog hardware description language (HDL). First of all, by integrating two approximations of adders and two approximations of multipliers, together with the exact design, nine different approximations of the design on color to grayscale converter are offered. Second, the image processing platform is presented to demonstrate the proposed work on Nexys-4 FPGA, enabling to capture color images through a low-cost OV7670 camera and display the grayscale results of images on a VGA-interfaced monitor. Experimental results show the difference between different approximations of the design, providing a range of design options corresponding to different quality constrains.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"10 1","pages":"372-377"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88591287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
An Improved Automatic Hardware Trojan Generation Platform 一种改进的硬件木马自动生成平台
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00062
Shichao Yu, Weiqiang Liu, Máire O’Neill
Over the past 10 years, various Hardware Trojan (HT) detection techniques have been proposed by the research community. However, the development of HT benchmark suites for testing and evaluating HT detection techniques lags behind. The number of HT-infected circuits available in current public HT benchmarks is somewhat limited and the circuits lack diversity in structure. Therefore, this paper proposes a new method to generate HTs using a highly configurable generation platform based on transition probability. The generation platform is highly configurable in terms of the HT trigger condition, trigger type, payload type and in the number and variety of HT-infected circuits that can be generated. In this research the transition probability of netlists is employed to identify rarely activated internal nodes to target for HT insertion rather than functional simulation as utilised in previous research. The authors believe transition probability provides a more realistic reflection of the netlist activity for use in determining the appropriate position for HT insertion. Finally, the generated HT-infected circuits are tested by a machine learning (ML)-based HT detection technique, which is known as Controllability and Observability for HT Detection (COTD). The resulting false positive and false negative rates illustrate the feasibility of the benchmark suite.
在过去的十年中,研究团体提出了各种硬件木马(HT)检测技术。然而,用于测试和评估高温检测技术的高温测试基准套件的开发滞后。目前公共HT基准中可用的HT感染电路数量有限,并且电路结构缺乏多样性。因此,本文提出了一种基于转移概率的高可配置生成平台来生成高温高温的新方法。生成平台在高温触发条件、触发类型、有效载荷类型以及可生成的高温感染电路的数量和种类方面具有高度可配置性。在本研究中,使用网络列表的转移概率来识别很少激活的内部节点,而不是像以前的研究那样使用功能模拟来定位HT插入的目标。作者认为,跃迁概率可以更真实地反映网络表活动,用于确定HT插入的合适位置。最后,生成的HT感染电路通过基于机器学习(ML)的HT检测技术进行测试,该技术被称为HT检测的可控性和可观察性(COTD)。由此产生的假阳性和假阴性率说明了基准套件的可行性。
{"title":"An Improved Automatic Hardware Trojan Generation Platform","authors":"Shichao Yu, Weiqiang Liu, Máire O’Neill","doi":"10.1109/ISVLSI.2019.00062","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00062","url":null,"abstract":"Over the past 10 years, various Hardware Trojan (HT) detection techniques have been proposed by the research community. However, the development of HT benchmark suites for testing and evaluating HT detection techniques lags behind. The number of HT-infected circuits available in current public HT benchmarks is somewhat limited and the circuits lack diversity in structure. Therefore, this paper proposes a new method to generate HTs using a highly configurable generation platform based on transition probability. The generation platform is highly configurable in terms of the HT trigger condition, trigger type, payload type and in the number and variety of HT-infected circuits that can be generated. In this research the transition probability of netlists is employed to identify rarely activated internal nodes to target for HT insertion rather than functional simulation as utilised in previous research. The authors believe transition probability provides a more realistic reflection of the netlist activity for use in determining the appropriate position for HT insertion. Finally, the generated HT-infected circuits are tested by a machine learning (ML)-based HT detection technique, which is known as Controllability and Observability for HT Detection (COTD). The resulting false positive and false negative rates illustrate the feasibility of the benchmark suite.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"2000 1","pages":"302-307"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88288545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
A Hyper-Parameter Based Margin Calculation Algorithm for Single Flux Quantum Logic Cells 基于超参数的单通量量子逻辑单元余量计算算法
Pub Date : 2019-07-01 DOI: 10.1109/ISVLSI.2019.00120
S. Shahsavani, Massoud Pedram
This paper presents a novel method for evaluating the robustness of single flux quantum (SFQ) logic cells in a superconducting electronic circuit. The proposed method improves the state-of-the-art by accounting for the global sources of variation, clustering cell parameters into hyper-parameters, and considering the co-dependency of these hyper-parameters when calculating a feasible parameter region in which cell functions correctly, given any combination of the parameter values. The average parametric yield inside the reported feasible parameter region is more than 98%. Additionally, a machine learning based method is presented to estimate the parametric yield both inside and outside the feasible parameter region. The average accuracy of the developed yield model is 96% for five SFQ cells.
提出了一种评价超导电子电路中单通量量子逻辑单元鲁棒性的新方法。该方法考虑了全局变化源,将细胞参数聚类为超参数,并在给定任意参数值组合的情况下,在计算细胞正确运行的可行参数区域时考虑这些超参数的相互依赖性,从而提高了技术水平。在报告的可行参数范围内,平均参数成品率大于98%。此外,提出了一种基于机器学习的方法来估计可行参数区域内外的参数产量。对于5个SFQ细胞,所建立的产率模型的平均准确度为96%。
{"title":"A Hyper-Parameter Based Margin Calculation Algorithm for Single Flux Quantum Logic Cells","authors":"S. Shahsavani, Massoud Pedram","doi":"10.1109/ISVLSI.2019.00120","DOIUrl":"https://doi.org/10.1109/ISVLSI.2019.00120","url":null,"abstract":"This paper presents a novel method for evaluating the robustness of single flux quantum (SFQ) logic cells in a superconducting electronic circuit. The proposed method improves the state-of-the-art by accounting for the global sources of variation, clustering cell parameters into hyper-parameters, and considering the co-dependency of these hyper-parameters when calculating a feasible parameter region in which cell functions correctly, given any combination of the parameter values. The average parametric yield inside the reported feasible parameter region is more than 98%. Additionally, a machine learning based method is presented to estimate the parametric yield both inside and outside the feasible parameter region. The average accuracy of the developed yield model is 96% for five SFQ cells.","PeriodicalId":6703,"journal":{"name":"2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)","volume":"38 1","pages":"645-650"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78037517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1