首页 > 最新文献

2019 IEEE 13th International Conference on ASIC (ASICON)最新文献

英文 中文
Hardware Implementation of Convolutional Neural Network for Face Feature Extraction 卷积神经网络人脸特征提取的硬件实现
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983575
Ru Ding, Xuemei Tian, Guoqiang Bai, G. Su, Xingjun Wu
As an important feed-forward neural network in the field of deep learning, convolutional neural network (CNN) has been widely used in image classification, face recognition, natural language processing and document analysis in recent years. CNN has a large amount of data and many multiply and accumulate (MAC) operations. With the diversity of application files, the channel sizes and kernel sizes of CNN are diverse, while the existing hardware platform mostly adopts the average optimization technology, which causes the waste of computing resources. In this paper, a special configurable convolution computing array is designed, which contains 15 convolution units, each PE contains 6×6 MAC operations, it can be configured to calculate three different kernel sizes of 5×5, 3×3 and 1×1. At the same time, pipeline structure is used to synchronize convolution and pooling operations, which reduces the storage of intermediate results. We design the special hardware structure to optimize DeepID network. Tested on Altera Cyclone V FPGA, the peak performance of each convolution layer at 50 MHz is 27 GOPS, and the average utilization of the MAC is 92%.
卷积神经网络(CNN)作为深度学习领域重要的前馈神经网络,近年来在图像分类、人脸识别、自然语言处理和文档分析等领域得到了广泛的应用。CNN的数据量很大,有很多的乘法和累加运算(MAC)。随着应用程序文件的多样性,CNN的通道大小和内核大小也是多种多样的,而现有的硬件平台大多采用平均优化技术,造成了计算资源的浪费。本文设计了一种特殊的可配置卷积计算阵列,该阵列包含15个卷积单元,每个PE包含6×6 MAC操作,可配置计算5×5、3×3和1×1三种不同内核大小。同时,采用流水线结构同步卷积和池化操作,减少了中间结果的存储。我们设计了特殊的硬件结构来优化DeepID网络。在Altera Cyclone V FPGA上测试,50 MHz时每个卷积层的峰值性能为27 GOPS, MAC的平均利用率为92%。
{"title":"Hardware Implementation of Convolutional Neural Network for Face Feature Extraction","authors":"Ru Ding, Xuemei Tian, Guoqiang Bai, G. Su, Xingjun Wu","doi":"10.1109/ASICON47005.2019.8983575","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983575","url":null,"abstract":"As an important feed-forward neural network in the field of deep learning, convolutional neural network (CNN) has been widely used in image classification, face recognition, natural language processing and document analysis in recent years. CNN has a large amount of data and many multiply and accumulate (MAC) operations. With the diversity of application files, the channel sizes and kernel sizes of CNN are diverse, while the existing hardware platform mostly adopts the average optimization technology, which causes the waste of computing resources. In this paper, a special configurable convolution computing array is designed, which contains 15 convolution units, each PE contains 6×6 MAC operations, it can be configured to calculate three different kernel sizes of 5×5, 3×3 and 1×1. At the same time, pipeline structure is used to synchronize convolution and pooling operations, which reduces the storage of intermediate results. We design the special hardware structure to optimize DeepID network. Tested on Altera Cyclone V FPGA, the peak performance of each convolution layer at 50 MHz is 27 GOPS, and the average utilization of the MAC is 92%.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132676008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Performance optimization for LDO regulator based on the differential evolution 基于差分进化的LDO调节器性能优化
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983642
Jintao Li, Yanhan Zeng, Hailong Wu, R. Li, Jun Zhang, Hongzhou Tan
An application of differential evolution for parameter optimization in the low dropout regulator (LDO) is presented in this paper. The parameters optimization by manual work for the analog integrated circuit, such as LDO, is laborious and time-consuming, and it is uncertain to find the relatively good result. In this paper, the differential evolution is used to optimize the parameters and find the relatively good performance of LDO. In order to improve the convergence speed and optimization effect, a new constraint solution and a fast weight-based non-dominated sorting method are proposed. Simulation results show that the gain-bandwidth product,load regulation and line regulation are improved by 206.5%, 58.1% and 87.6%, respectively, compared with the manual solution.
本文介绍了差分进化算法在低差压稳压器参数优化中的应用。对于模拟集成电路,如LDO,手工进行参数优化既费力又耗时,而且不确定是否能找到相对较好的结果。本文采用差分进化方法对参数进行优化,找到了性能相对较好的LDO。为了提高收敛速度和优化效果,提出了一种新的约束解和一种快速的基于权重的非支配排序方法。仿真结果表明,与手动方案相比,该方案的增益-带宽积、负载调节性和线路调节性分别提高了206.5%、58.1%和87.6%。
{"title":"Performance optimization for LDO regulator based on the differential evolution","authors":"Jintao Li, Yanhan Zeng, Hailong Wu, R. Li, Jun Zhang, Hongzhou Tan","doi":"10.1109/ASICON47005.2019.8983642","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983642","url":null,"abstract":"An application of differential evolution for parameter optimization in the low dropout regulator (LDO) is presented in this paper. The parameters optimization by manual work for the analog integrated circuit, such as LDO, is laborious and time-consuming, and it is uncertain to find the relatively good result. In this paper, the differential evolution is used to optimize the parameters and find the relatively good performance of LDO. In order to improve the convergence speed and optimization effect, a new constraint solution and a fast weight-based non-dominated sorting method are proposed. Simulation results show that the gain-bandwidth product,load regulation and line regulation are improved by 206.5%, 58.1% and 87.6%, respectively, compared with the manual solution.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133436993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Scalable Modeling for the CPW Gap Discontinuity at Frequency up to 150 GHz 频率高达150 GHz的CPW间隙不连续的可扩展建模
Pub Date : 2019-10-01 DOI: 10.1109/asicon47005.2019.8983616
Hao Sun, Jun Fu, Wenpu Cui, Tianling Ren, Linlin Liu, Wei Zhou, Quan Wang, Ao Guo
The coplanar waveguide (CPW) gap discontinuity is an important component of many applications in the monolithic microwave integrated circuits (MMICs). Therefore, accurate modeling of the CPW gap discontinuity is very important and fundamental for the design of MMICs. In this work, a new lumped equiva1ent-circuit model which can accurately describe the characterization of the CPW gap discontinuity from 0 to 150 GHz is proposed. The parameters are extracted from electromagnetic (EM) simulations simply and conveniently without any complicated optimization algorithms. And we make the model scalable in gap length, width and space. Satisfactory fitting accuracy is obtained between the model data and EM simulations, which verifies that our model can be implemented in computer-aided design software to reduce design time and save computer resources.
共面波导(CPW)间隙不连续是单片微波集成电路(mmic)中许多应用的重要组成部分。因此,精确的CPW间隙不连续性建模对于mmic的设计是非常重要和基础的。本文提出了一种新的集总等效电路模型,该模型能准确地描述0 ~ 150 GHz的CPW间隙不连续特性。从电磁仿真中提取参数简单方便,无需复杂的优化算法。我们使模型在间隙长度、宽度和空间上可伸缩。模型数据与仿真结果的拟合精度较好,验证了该模型可以在计算机辅助设计软件中实现,减少了设计时间,节约了计算机资源。
{"title":"Scalable Modeling for the CPW Gap Discontinuity at Frequency up to 150 GHz","authors":"Hao Sun, Jun Fu, Wenpu Cui, Tianling Ren, Linlin Liu, Wei Zhou, Quan Wang, Ao Guo","doi":"10.1109/asicon47005.2019.8983616","DOIUrl":"https://doi.org/10.1109/asicon47005.2019.8983616","url":null,"abstract":"The coplanar waveguide (CPW) gap discontinuity is an important component of many applications in the monolithic microwave integrated circuits (MMICs). Therefore, accurate modeling of the CPW gap discontinuity is very important and fundamental for the design of MMICs. In this work, a new lumped equiva1ent-circuit model which can accurately describe the characterization of the CPW gap discontinuity from 0 to 150 GHz is proposed. The parameters are extracted from electromagnetic (EM) simulations simply and conveniently without any complicated optimization algorithms. And we make the model scalable in gap length, width and space. Satisfactory fitting accuracy is obtained between the model data and EM simulations, which verifies that our model can be implemented in computer-aided design software to reduce design time and save computer resources.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114012906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Low-delay Configurable Register for FPGA FPGA低延迟可配置寄存器
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983553
Zhiyin Lu, Jiafeng Liu, Yunbing Pang, Zhengjie Li, Yufan Zhang, Jinmei Lai, Jian Wang
A low-delay configurable register for FPGA is designed in this paper. This design is based on the basic master-slave D flip-flop, uses transmission gates on key nodes to control the register into four modes: register mode, latch mode, synchronous overwrite mode and asynchronous overwrite mode, then inputs desired signals to complete the functions of registers, latches, global initialization, synchronous reset, asynchronous reset, capture and write-back. The control signals and the D input are separated so that control circuit will not affect the register's timing parameters. A pre-simulation is carried out under the 28nm process and the results show that the configurable register's various functions are correct. The timing parameters are equivalent to the non-configurable master-slave D flip-flops, which proves that the control circuit does not affect the timing parameters. In this paper, configurable register has a 41-ps delay of CK to Q, a 7-ps setup time and a 0-ps hold time.
本文设计了一种用于FPGA的低延迟可配置寄存器。本设计以基本主从D触发器为基础,利用关键节点上的传输门将寄存器控制为寄存器模式、锁存模式、同步覆盖模式和异步覆盖模式四种模式,然后输入所需信号完成寄存器、锁存、全局初始化、同步复位、异步复位、捕获和回写功能。控制信号和D输入是分开的,这样控制电路就不会影响寄存器的时序参数。在28nm制程下进行了预仿真,结果表明可配置寄存器的各项功能是正确的。时序参数等效为不可配置的主从D触发器,证明控制电路不影响时序参数。在本文中,可配置寄存器具有41-ps的CK到Q延迟,7-ps的设置时间和0-ps的保持时间。
{"title":"A Low-delay Configurable Register for FPGA","authors":"Zhiyin Lu, Jiafeng Liu, Yunbing Pang, Zhengjie Li, Yufan Zhang, Jinmei Lai, Jian Wang","doi":"10.1109/ASICON47005.2019.8983553","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983553","url":null,"abstract":"A low-delay configurable register for FPGA is designed in this paper. This design is based on the basic master-slave D flip-flop, uses transmission gates on key nodes to control the register into four modes: register mode, latch mode, synchronous overwrite mode and asynchronous overwrite mode, then inputs desired signals to complete the functions of registers, latches, global initialization, synchronous reset, asynchronous reset, capture and write-back. The control signals and the D input are separated so that control circuit will not affect the register's timing parameters. A pre-simulation is carried out under the 28nm process and the results show that the configurable register's various functions are correct. The timing parameters are equivalent to the non-configurable master-slave D flip-flops, which proves that the control circuit does not affect the timing parameters. In this paper, configurable register has a 41-ps delay of CK to Q, a 7-ps setup time and a 0-ps hold time.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"36 9","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114022807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An Ultra-Low Power Cycle-by-Cycle Current Limiter Suitable for Switching-Mode Power Supply with 2.2 MHz Frequency 一种适用于2.2 MHz频率开关电源的超低功耗逐周期限流器
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983457
Yue Shi, Jiawen Wang, Jianwen Cao, Ze-kun Zhou
A high performance, low power cycle-by-cycle current limiter is proposed in this paper, which can prevent switching-mode power supply from damage under over current or short circuit conditions. With the clamping effect formed by feedback loop with operational amplifier, the output current can be proportionally sampled and transferred to a voltage relative to ground. In order to further reduce power consumption, the sampling loop with low bias current is only enabled during the on time of high-side transistors. Besides, an assistant clamping circuit is presented to accelerate the settling time of every cycle for high-speed applications. What's more, a source-input comparator with voltage clamper is adopted to judge the status of output current, which can realize reference level and comparison at the same time for further speed improvement with lower power requirement. The proposed current limiter is implemented in a standard 0.18µm CMOS process, whose verification results within a 3A, 2.2MHz Buck converter show that a 39ns response time with 8.6MHz bandwidth is realized with only 2µA on-state and zero off-state current consumption.
本文提出了一种高性能、低功耗的逐周期限流器,可防止开关电源在过流或短路情况下损坏。利用运放反馈回路形成的箝位效应,可以对输出电流按比例采样并转换成相对于地的电压。为了进一步降低功耗,低偏置电流的采样回路只在高侧晶体管导通时启用。此外,还设计了辅助箝位电路,加快了高速应用中每个周期的稳定时间。采用带电压箝位器的源输入比较器判断输出电流的状态,可以同时实现参考电平和比较,从而在更低的功率要求下进一步提高速度。所提出的限流器在标准的0.18µm CMOS工艺中实现,其在3A, 2.2MHz Buck变换器中的验证结果表明,仅在2µa的导通状态和零的关断状态电流消耗下,实现了39ns的响应时间和8.6MHz的带宽。
{"title":"An Ultra-Low Power Cycle-by-Cycle Current Limiter Suitable for Switching-Mode Power Supply with 2.2 MHz Frequency","authors":"Yue Shi, Jiawen Wang, Jianwen Cao, Ze-kun Zhou","doi":"10.1109/ASICON47005.2019.8983457","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983457","url":null,"abstract":"A high performance, low power cycle-by-cycle current limiter is proposed in this paper, which can prevent switching-mode power supply from damage under over current or short circuit conditions. With the clamping effect formed by feedback loop with operational amplifier, the output current can be proportionally sampled and transferred to a voltage relative to ground. In order to further reduce power consumption, the sampling loop with low bias current is only enabled during the on time of high-side transistors. Besides, an assistant clamping circuit is presented to accelerate the settling time of every cycle for high-speed applications. What's more, a source-input comparator with voltage clamper is adopted to judge the status of output current, which can realize reference level and comparison at the same time for further speed improvement with lower power requirement. The proposed current limiter is implemented in a standard 0.18µm CMOS process, whose verification results within a 3A, 2.2MHz Buck converter show that a 39ns response time with 8.6MHz bandwidth is realized with only 2µA on-state and zero off-state current consumption.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115612315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-speed Classification of AER Data Based on a Low-cost Hardware System 基于低成本硬件系统的AER数据高速分类
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983433
J. Huang, Yingcheng Lin, Wei He, Xichuan Zhou, Cong Shi, N. Wu, Gang Luo
This paper proposes a low-cost hardware system for rapid classification of visual objects encoded in the address-event representation (AER) format. A lightweight statistical inference algorithm is implemented in the proposed hardware to process the asynchronous pixel address streams from an AER visual sensor. The algorithm involves only a few simple operations for patch-based binary feature extraction and Random Ferns inference, which are highly parallelized and accelerated by our dedicated hardware architecture. Simulation results suggested that our hardware system was able to process as many as 50M address events per second and achieved 75% classification accuracy for the popular MNIST-DVS dataset. Compared to other AER classification systems, our low-cost FPGA implementation is more plausible for embedded applications because of its similar accuracy and higher system throughput.
本文提出了一种低成本的硬件系统,用于快速分类以地址-事件表示(AER)格式编码的视觉对象。在该硬件中实现了一种轻量级的统计推理算法来处理来自AER视觉传感器的异步像素地址流。该算法仅涉及基于补丁的二进制特征提取和随机蕨类推断等几个简单的操作,并且通过专用的硬件架构实现了高度并行化和加速。仿真结果表明,我们的硬件系统能够每秒处理多达50M个地址事件,并且对于流行的mist - dvs数据集实现了75%的分类准确率。与其他AER分类系统相比,我们的低成本FPGA实现更适合嵌入式应用,因为它具有相似的精度和更高的系统吞吐量。
{"title":"High-speed Classification of AER Data Based on a Low-cost Hardware System","authors":"J. Huang, Yingcheng Lin, Wei He, Xichuan Zhou, Cong Shi, N. Wu, Gang Luo","doi":"10.1109/ASICON47005.2019.8983433","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983433","url":null,"abstract":"This paper proposes a low-cost hardware system for rapid classification of visual objects encoded in the address-event representation (AER) format. A lightweight statistical inference algorithm is implemented in the proposed hardware to process the asynchronous pixel address streams from an AER visual sensor. The algorithm involves only a few simple operations for patch-based binary feature extraction and Random Ferns inference, which are highly parallelized and accelerated by our dedicated hardware architecture. Simulation results suggested that our hardware system was able to process as many as 50M address events per second and achieved 75% classification accuracy for the popular MNIST-DVS dataset. Compared to other AER classification systems, our low-cost FPGA implementation is more plausible for embedded applications because of its similar accuracy and higher system throughput.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114509298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Method for improving energy efficiency of elliptic curve cryptography algorithm on reconfigurable symmetric cipher processor 在可重构对称密码处理器上提高椭圆曲线密码算法能量效率的方法
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983539
Zhao Tuo, Tao Chen, Wei Li, Danyang Yang
The reconfigurable very-long instruction word (VLIW) cipher processor need a large number of instructions and a long execution period when processing the elliptic curve cryptographic algorithm. By adding a modular operation acceleration unit to the cryptographic processor, the energy efficiency of the cipher processor processing the elliptic curve cryptographic algorithm is improved. After synthesis under the 130nm process standard cell library, the experimental results show that the module operation acceleration unit area is 30 Kgate, accounting for 6% of the processor area, and the 256-bit binary domain point multiplication operation time is reduced from 22.8ms to 5.8ms, and the energy efficiency is improved by about 372%.
可重构超长指令字(VLIW)密码处理器在处理椭圆曲线密码算法时,需要大量的指令和较长的执行周期。通过在密码处理器中增加模块化运算加速单元,提高了密码处理器处理椭圆曲线密码算法的能量效率。在130nm制程标准单元库下合成后,实验结果表明,模块运算加速单位面积为30 Kgate,占处理器面积的6%,256位二进制域点乘法运算时间由22.8ms缩短至5.8ms,能效提高约372%。
{"title":"Method for improving energy efficiency of elliptic curve cryptography algorithm on reconfigurable symmetric cipher processor","authors":"Zhao Tuo, Tao Chen, Wei Li, Danyang Yang","doi":"10.1109/ASICON47005.2019.8983539","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983539","url":null,"abstract":"The reconfigurable very-long instruction word (VLIW) cipher processor need a large number of instructions and a long execution period when processing the elliptic curve cryptographic algorithm. By adding a modular operation acceleration unit to the cryptographic processor, the energy efficiency of the cipher processor processing the elliptic curve cryptographic algorithm is improved. After synthesis under the 130nm process standard cell library, the experimental results show that the module operation acceleration unit area is 30 Kgate, accounting for 6% of the processor area, and the 256-bit binary domain point multiplication operation time is reduced from 22.8ms to 5.8ms, and the energy efficiency is improved by about 372%.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114647271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance Investigation of Uniaxially Tensile Stressed Ge n-FinFETs Formed on Biaxially Strained GeOI Substrates And Its Impact On Ge CMOS Inverters 双轴应变GeOI衬底单轴拉伸应力Ge n- finfet的性能研究及其对Ge CMOS逆变器的影响
Pub Date : 2019-10-01 DOI: 10.1109/asicon47005.2019.8983454
R. Cheng, Ming Tian, Changfeng Wang, Zhimei Cai, Jie Zhang, Yan-Yan Zhang, Yi Zhao
In this work, we studied the performance enhancement of uniaxially tensile stressed n-FinFETs realized on the biaxially strained GeOI (sGOI) wafer and its impact on the performance of a Ge CMOS inverter. Uniaxially strained Ge film with nanoscale film width could be patterned on sGOI substrates and used for the fabrication of strained Ge FinFETs. The performance of this novelly proposed Ge FinFET was compared with the unstrained ones with similar dimensions and fabrication processes. The impact of strain on devices with different geometric parameters are also studied. As the strained FinFETs lead to higher on-current, its impact on the circuit speed was simulated. By comparing the output signal of the strained Ge CMOS inverter with the unstrained one, the former shows obvious speed improvement.
在这项工作中,我们研究了在双轴应变GeOI (sGOI)晶圆上实现的单轴拉伸应力n- finfet的性能增强及其对Ge CMOS逆变器性能的影响。具有纳米级薄膜宽度的单轴应变Ge薄膜可以在sGOI衬底上进行图像化,并用于应变Ge finfet的制作。将这种新型的Ge FinFET的性能与具有相似尺寸和制作工艺的非应变FinFET进行了比较。研究了应变对不同几何参数器件的影响。由于应变finfet会导致更高的导通电流,因此模拟了其对电路速度的影响。将应变后的Ge CMOS逆变器输出信号与未应变后的输出信号进行比较,结果表明,应变后的Ge CMOS逆变器输出速度明显提高。
{"title":"Performance Investigation of Uniaxially Tensile Stressed Ge n-FinFETs Formed on Biaxially Strained GeOI Substrates And Its Impact On Ge CMOS Inverters","authors":"R. Cheng, Ming Tian, Changfeng Wang, Zhimei Cai, Jie Zhang, Yan-Yan Zhang, Yi Zhao","doi":"10.1109/asicon47005.2019.8983454","DOIUrl":"https://doi.org/10.1109/asicon47005.2019.8983454","url":null,"abstract":"In this work, we studied the performance enhancement of uniaxially tensile stressed n-FinFETs realized on the biaxially strained GeOI (sGOI) wafer and its impact on the performance of a Ge CMOS inverter. Uniaxially strained Ge film with nanoscale film width could be patterned on sGOI substrates and used for the fabrication of strained Ge FinFETs. The performance of this novelly proposed Ge FinFET was compared with the unstrained ones with similar dimensions and fabrication processes. The impact of strain on devices with different geometric parameters are also studied. As the strained FinFETs lead to higher on-current, its impact on the circuit speed was simulated. By comparing the output signal of the strained Ge CMOS inverter with the unstrained one, the former shows obvious speed improvement.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114404682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Simple Steady Timing Resilient Sample Based on Delay Data Sense Detection 基于延迟数据感知检测的简单稳定定时弹性采样
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983641
Xuemei Fan, Rujin Wang, Qinghui Zeng, Hao Liu, Shengli Lu
The performance and reliability of integrated circuits are susceptible to PVTA variations. Conventional designs reserve certain timing margin and consider the worst-case to avoid these side effects. Timing resilient circuits can reduce the timing safe margin with the cost of excessive energy overhead and an unsteady state under a low voltage. In this study, we exploit a simple steady timing resilient sample by expanding previous works to save considerable extra power overhead. This sample executes timing errors detection based on the delay data sense detection and is implemented both on latches and data strobe flip-flops to recover errors with merely four extra transistors. The effectiveness and efficiency are evaluated by the design of a systolic array CNN accelerator in the 40-nm process. Simulation results demonstrate that the accelerator can achieve a stable performance without any accuracy loss, with the voltage scaled to 0.57V.
集成电路的性能和可靠性易受PVTA变化的影响。传统设计保留一定的时间余量,并考虑最坏情况以避免这些副作用。时序弹性电路可以降低时序安全裕度,但代价是能量开销过大和低电压下的不稳定。在本研究中,我们通过扩展以前的工作来开发一个简单的稳定定时弹性样本,以节省相当多的额外功率开销。该示例基于延迟数据感测执行时序错误检测,并在锁存器和数据频闪锁触发器上实现,仅用四个额外的晶体管即可恢复错误。通过设计40纳米工艺的收缩阵列CNN加速器来评估其有效性和效率。仿真结果表明,当电压降至0.57V时,该加速器能够在不影响精度的情况下实现稳定的性能。
{"title":"A Simple Steady Timing Resilient Sample Based on Delay Data Sense Detection","authors":"Xuemei Fan, Rujin Wang, Qinghui Zeng, Hao Liu, Shengli Lu","doi":"10.1109/ASICON47005.2019.8983641","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983641","url":null,"abstract":"The performance and reliability of integrated circuits are susceptible to PVTA variations. Conventional designs reserve certain timing margin and consider the worst-case to avoid these side effects. Timing resilient circuits can reduce the timing safe margin with the cost of excessive energy overhead and an unsteady state under a low voltage. In this study, we exploit a simple steady timing resilient sample by expanding previous works to save considerable extra power overhead. This sample executes timing errors detection based on the delay data sense detection and is implemented both on latches and data strobe flip-flops to recover errors with merely four extra transistors. The effectiveness and efficiency are evaluated by the design of a systolic array CNN accelerator in the 40-nm process. Simulation results demonstrate that the accelerator can achieve a stable performance without any accuracy loss, with the voltage scaled to 0.57V.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"95 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123598015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A radiation resistant library based on DICE and fault-tolerant delay filtering techniques in CMOS 0.18μm technology 基于CMOS 0.18μm技术的DICE和容错延迟滤波技术的抗辐射库
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983582
Yangsheng Wang, Yanyan Gao, Chong Feng, N. Zhang
A radiation resistant standard cell library is completed based on SMIC 0.18 μm technology in this paper. Standard cell library includes combinatorial logic cells and sequential logic cells. SET (Single Event Transient) effect always occurs in combinatorial logic cells, and SEU (Single Event Upset) effect always occurs in sequential logic cells. SEL(Single Event Latchup) and TID(Total Ionizing Dose) effects affect all logic cells. According to radiation effects, logic cells in the standard cell library are designed with harden methods at the circuit level and the layout level. Combination logic cells adopt fault-tolerant delay filtering method in circuit level, and sequential logic cells use DICE structure in circuit design. A guard ring has been added to the layout design for the SEL and TID effects. Then physical information and timing information are extracted for all cells, and the library building process is completed. Finally we use EDA tools to verify the availability of the anti-radiation library.
本文基于SMIC 0.18 μm技术完成了一个耐辐射标准单元库。标准单元库包括组合逻辑单元和顺序逻辑单元。SET (Single Event Transient)效应常发生在组合逻辑单元中,而SEU (Single Event Upset)效应常发生在顺序逻辑单元中。SEL(单事件锁定)和TID(总电离剂量)效应影响所有逻辑细胞。根据辐射效应,对标准单元库中的逻辑单元进行了电路级和布局级的强化设计。组合逻辑单元在电路级采用容错延迟滤波方法,顺序逻辑单元在电路设计上采用DICE结构。一个保护环被添加到SEL和TID效果的布局设计中。然后提取所有单元的物理信息和时间信息,完成库的构建过程。最后利用EDA工具验证了抗辐射库的可用性。
{"title":"A radiation resistant library based on DICE and fault-tolerant delay filtering techniques in CMOS 0.18μm technology","authors":"Yangsheng Wang, Yanyan Gao, Chong Feng, N. Zhang","doi":"10.1109/ASICON47005.2019.8983582","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983582","url":null,"abstract":"A radiation resistant standard cell library is completed based on SMIC 0.18 μm technology in this paper. Standard cell library includes combinatorial logic cells and sequential logic cells. SET (Single Event Transient) effect always occurs in combinatorial logic cells, and SEU (Single Event Upset) effect always occurs in sequential logic cells. SEL(Single Event Latchup) and TID(Total Ionizing Dose) effects affect all logic cells. According to radiation effects, logic cells in the standard cell library are designed with harden methods at the circuit level and the layout level. Combination logic cells adopt fault-tolerant delay filtering method in circuit level, and sequential logic cells use DICE structure in circuit design. A guard ring has been added to the layout design for the SEL and TID effects. Then physical information and timing information are extracted for all cells, and the library building process is completed. Finally we use EDA tools to verify the availability of the anti-radiation library.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123004920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2019 IEEE 13th International Conference on ASIC (ASICON)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1