首页 > 最新文献

2019 IEEE 13th International Conference on ASIC (ASICON)最新文献

英文 中文
Multi-Thread Assembling for Fast FEM Power Delivery DC Integrity Analysis 基于多线程装配的快速有限元输电直流完整性分析
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983609
Ke Yang, Shaoyi Peng, S. Tan, Hai-Bao Chen
Power integrity analysis is of great significance in the field of circuit design, especially the design of modern high speed circuit system. For the high performance printed circuit boards (PCBs) and IC design, power delivery network DC integrity checks play an important role. However, the element assembling process in finite element method (FEM) can take significant portion of total computing time. In this paper, a fast finite element assembling method for power network DC integrity checks of PCBs is proposed. We divided the mesh into a serious of bins and elements in different bins could be assembled in parallel. Further more, a dynamic circle shape approximation method is introduced to further control the number of elements due to vias and circular objectives. As a result, the new solver can easily perform progressive trade off between speed and accuracy. Experimental results of two PCB examples on a 3.6-GHz Intel i7 Dual-core CPU show that the proposed multi-thread assembling method can achieve 2X speedup over existing single-thread assembling methods. A dynamic circle shape approximation method is introduced to further control the number of elements and speed up the solver process. The resulting FEM solver leads to 3X speed over a commercial power integrity solver with no more than 0.7% errors.
电源完整性分析在电路设计领域,特别是现代高速电路系统的设计中具有重要的意义。对于高性能印刷电路板(pcb)和集成电路设计,供电网络直流完整性检测起着重要的作用。然而,有限元法中的单元装配过程会占用相当大的计算时间。本文提出了一种用于电网直流完整性检测的快速有限元装配方法。我们将网格划分为一系列的仓,不同仓中的元素可以并行组装。在此基础上,提出了一种动态圆形状逼近方法,以进一步控制因过孔和圆物镜而产生的元件数量。因此,新的求解器可以很容易地在速度和精度之间进行渐进式权衡。在3.6 ghz Intel i7双核CPU上的两个PCB实例的实验结果表明,所提出的多线程组装方法比现有的单线程组装方法的速度提高了2倍。为了进一步控制单元的数量,加快求解速度,引入了一种动态圆形状逼近方法。由此产生的有限元求解器的速度是商用电源完整性求解器的3倍,误差不超过0.7%。
{"title":"Multi-Thread Assembling for Fast FEM Power Delivery DC Integrity Analysis","authors":"Ke Yang, Shaoyi Peng, S. Tan, Hai-Bao Chen","doi":"10.1109/ASICON47005.2019.8983609","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983609","url":null,"abstract":"Power integrity analysis is of great significance in the field of circuit design, especially the design of modern high speed circuit system. For the high performance printed circuit boards (PCBs) and IC design, power delivery network DC integrity checks play an important role. However, the element assembling process in finite element method (FEM) can take significant portion of total computing time. In this paper, a fast finite element assembling method for power network DC integrity checks of PCBs is proposed. We divided the mesh into a serious of bins and elements in different bins could be assembled in parallel. Further more, a dynamic circle shape approximation method is introduced to further control the number of elements due to vias and circular objectives. As a result, the new solver can easily perform progressive trade off between speed and accuracy. Experimental results of two PCB examples on a 3.6-GHz Intel i7 Dual-core CPU show that the proposed multi-thread assembling method can achieve 2X speedup over existing single-thread assembling methods. A dynamic circle shape approximation method is introduced to further control the number of elements and speed up the solver process. The resulting FEM solver leads to 3X speed over a commercial power integrity solver with no more than 0.7% errors.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130742483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and implementation of Serial ATA pbysical layer on FPGA 串行ATA物理层在FPGA上的设计与实现
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983634
Xie Xie, Qinghua Duan, Jiafeng Liu, Jian Wang, Jinmei Lai
An increasing number of high-performance computing system developed on FPGA devices need access to mass storage devices for storing data, the serial ATA protocol is widely used in the modern computer systems for transferring data between the host and hard disks or solid-state drives. This paper describes the design and implementation of serial ATA physical layer core based on the Xilinx GTX transceiver. With the method of cyclically changing the GTX line rate, the SATA hard disk with different line rate can be automatically identified and linked, realizing backward compatibility. An embedded system has also been developed for validating the functionality of our SATA physical layer core. We test our physical layer core with connecting our core to both SATA3 and SATA2 hard disks. The experimental result has indicated our core can not only provide the whole functionality required by the SATA physical layer, but also utilize very few logic resources on FPGA.
越来越多的基于FPGA的高性能计算系统需要访问海量存储设备来存储数据,串行ATA协议被广泛应用于现代计算机系统中,用于主机与硬盘或固态硬盘之间的数据传输。本文介绍了基于赛灵思GTX收发器的串行ATA物理层核的设计与实现。通过循环改变GTX线率的方法,可以自动识别和链接不同线率的SATA硬盘,实现向后兼容。我们还开发了一个嵌入式系统来验证我们的SATA物理层核心的功能。我们通过将我们的核心连接到SATA3和SATA2硬盘来测试我们的物理层核心。实验结果表明,我们的核心不仅可以提供SATA物理层所需的全部功能,而且可以在FPGA上使用很少的逻辑资源。
{"title":"Design and implementation of Serial ATA pbysical layer on FPGA","authors":"Xie Xie, Qinghua Duan, Jiafeng Liu, Jian Wang, Jinmei Lai","doi":"10.1109/ASICON47005.2019.8983634","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983634","url":null,"abstract":"An increasing number of high-performance computing system developed on FPGA devices need access to mass storage devices for storing data, the serial ATA protocol is widely used in the modern computer systems for transferring data between the host and hard disks or solid-state drives. This paper describes the design and implementation of serial ATA physical layer core based on the Xilinx GTX transceiver. With the method of cyclically changing the GTX line rate, the SATA hard disk with different line rate can be automatically identified and linked, realizing backward compatibility. An embedded system has also been developed for validating the functionality of our SATA physical layer core. We test our physical layer core with connecting our core to both SATA3 and SATA2 hard disks. The experimental result has indicated our core can not only provide the whole functionality required by the SATA physical layer, but also utilize very few logic resources on FPGA.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132383003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deploying and Optimizing Convolutional Neural Networks on Heterogeneous Architecture 在异构架构上部署和优化卷积神经网络
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983456
Junning Jiang, Liang Cai, Feng Dong, Kehua Yu, Ke Chen, Wei Qu, Jianfei Jiang
Deploying convolutional neural networks to hardware platform can accelerate the inference and is critical for the application of artificial intelligence. In this paper, we design an FPGA+CPU heterogeneous platform to accelerate CNNs. Dataflow optimizing, accelerator structure optimization and compute precision optimization are proposed to improve performance of the accelerating platform. Different ResNet and MobileNet networks are successfully deployed on the platform. By applying the proposed dataflow optimization and precision optimization, the performance improvement of inference is 3.25× on ResNet. By applying the accelerator structure optimization and precision optimization, the performance improvement of inference is 3.63× on MobileNet.
将卷积神经网络部署到硬件平台上可以加快推理速度,对人工智能的应用至关重要。在本文中,我们设计了一个FPGA+CPU的异构平台来加速cnn。为了提高加速平台的性能,提出了数据流优化、加速器结构优化和计算精度优化。在平台上成功部署了不同的ResNet和MobileNet网络。通过应用所提出的数据流优化和精度优化,在ResNet上的推理性能提高了3.25倍。通过对加速器结构优化和精度优化,在MobileNet上的推理性能提高了3.63倍。
{"title":"Deploying and Optimizing Convolutional Neural Networks on Heterogeneous Architecture","authors":"Junning Jiang, Liang Cai, Feng Dong, Kehua Yu, Ke Chen, Wei Qu, Jianfei Jiang","doi":"10.1109/ASICON47005.2019.8983456","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983456","url":null,"abstract":"Deploying convolutional neural networks to hardware platform can accelerate the inference and is critical for the application of artificial intelligence. In this paper, we design an FPGA+CPU heterogeneous platform to accelerate CNNs. Dataflow optimizing, accelerator structure optimization and compute precision optimization are proposed to improve performance of the accelerating platform. Different ResNet and MobileNet networks are successfully deployed on the platform. By applying the proposed dataflow optimization and precision optimization, the performance improvement of inference is 3.25× on ResNet. By applying the accelerator structure optimization and precision optimization, the performance improvement of inference is 3.63× on MobileNet.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125348843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A digitalized RRAM-based Spiking Neuron Network system with 3-bit weight and unsupervised online learning scheme 基于数字随机存储器的3位权和无监督在线学习机制的脉冲神经元网络系统
Pub Date : 2019-10-01 DOI: 10.1109/asicon47005.2019.8983603
Danqing Wu, Shilin Yan, Haodi Tang, Yu Wang, Jiayun Feng, Xianwu Hu, Jiaxin Cao, Yufeng Xie
Resistive-switching Random Access Memory (RRAM) has emerged as a promising candidate for the artificial synaptic in neuromorphic computation circuits due to its similar electronic characteristics with the synaptic and features such as high integration density, non-volatile retention and supporting matrix-vector multiplication. In this paper, a digitalized RRAM-based fully-connected Spiking Neuron Network (SNN) system with 3-bit weight and unsupervised online learning scheme is proposed. It consists of 64 pre-neurons and 10 post-neurons, all the neurons are realized by digital circuits for low area overhead, low power consumption and high accuracy. An unsupervised online learning scheme based on binary STDP protocol is applied to train the synaptic weights. Experiments show that the system can be used to recognize the learned ten handwritten digits efficiently.
电阻开关随机存取存储器(RRAM)由于具有与神经形态计算电路中人工突触相似的电子特性以及高积分密度、非易失性保留和支持矩阵向量乘法等特点,已成为神经形态计算电路中人工突触的一个有前途的候选器件。提出了一种基于数字随机存储器的3位权全连接峰值神经元网络(SNN)无监督在线学习方案。该系统由64个前神经元和10个后神经元组成,所有神经元均采用数字电路实现,具有面积占用小、功耗低、精度高等优点。采用一种基于二进制STDP协议的无监督在线学习方案来训练突触权值。实验表明,该系统能够有效地识别学习到的10个手写数字。
{"title":"A digitalized RRAM-based Spiking Neuron Network system with 3-bit weight and unsupervised online learning scheme","authors":"Danqing Wu, Shilin Yan, Haodi Tang, Yu Wang, Jiayun Feng, Xianwu Hu, Jiaxin Cao, Yufeng Xie","doi":"10.1109/asicon47005.2019.8983603","DOIUrl":"https://doi.org/10.1109/asicon47005.2019.8983603","url":null,"abstract":"Resistive-switching Random Access Memory (RRAM) has emerged as a promising candidate for the artificial synaptic in neuromorphic computation circuits due to its similar electronic characteristics with the synaptic and features such as high integration density, non-volatile retention and supporting matrix-vector multiplication. In this paper, a digitalized RRAM-based fully-connected Spiking Neuron Network (SNN) system with 3-bit weight and unsupervised online learning scheme is proposed. It consists of 64 pre-neurons and 10 post-neurons, all the neurons are realized by digital circuits for low area overhead, low power consumption and high accuracy. An unsupervised online learning scheme based on binary STDP protocol is applied to train the synaptic weights. Experiments show that the system can be used to recognize the learned ten handwritten digits efficiently.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"34 7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126662255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Grain-Adaptive Computing Structure for FPGA CNN Acceleration 一种FPGA CNN加速的粒度自适应计算结构
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983480
Xinyuan Qu, Zhihong Huang, Ning Mao, Yu Xu, Gang Cai, Zhen Fang
In recent years, because of its superior performance and outstanding accuracy, convolutional neural networks (CNNs) are widely used in high-tech applications such as image classification and speech recognition. But it is more and more difficult to implement CNN in hardware platform due to the scale of CNN is increasing rapidly. FPGA attracts more attention compared with other processors for its excellent balance of flexibility and efficiency. There are many FPGA-based CNN accelerators proposed by previous work. However, in previous work the computing resource (especially DSP) is not fully utilized, either explicitly or covertly, which affects the CNN accelerator's overall performance seriously. In this work, we propose a new formula that provides a more accurate and comprehensive analysis to evaluate computing resource utilization, which can provide guidance for CNN accelerator design optimization. Then we propose a grain-adaptive computing structure for FPGA-based CNN acceleration, which can change flexibly to suit to and optimally utilize the available DSP resource. Due to the improvement of DSP utilization, we can achieve a more satisfactory result for both overall throughput performance and power efficiency. This architecture is implemented on Xilinx xcku115 based on AlexNet, the frequency is 150MHz and the peak power consumption is 30.05W. The overall performance is 1292.40 GOPS, 43.01 GOP/s/W, resulting in 2.28X and 1.94X, 9.44X and 3.02X improvement compared to previous work [6], [9] correspondingly.
近年来,卷积神经网络(convolutional neural network, cnn)由于其优越的性能和优异的准确率,被广泛应用于图像分类、语音识别等高科技领域。但由于CNN的规模在快速增长,在硬件平台上实现CNN的难度越来越大。与其他处理器相比,FPGA以其在灵活性和效率上的优异平衡而备受关注。前人提出了许多基于fpga的CNN加速器。然而,在以往的工作中,无论是显性的还是隐性的,计算资源(尤其是DSP)都没有得到充分的利用,严重影响了CNN加速器的整体性能。在这项工作中,我们提出了一个新的公式,提供了一个更准确和全面的分析来评估计算资源利用率,可以为CNN加速器的设计优化提供指导。然后,我们提出了一种基于fpga的CNN加速的粒度自适应计算结构,该结构可以灵活变化以适应并优化利用可用的DSP资源。由于DSP利用率的提高,我们可以在整体吞吐量性能和功耗效率方面取得更令人满意的结果。该架构在基于AlexNet的Xilinx xcku115上实现,频率为150MHz,峰值功耗为30.05W。总体性能为1292.40 GOPS, 43.01 GOP/s/W,相对于前期工作[6],[9]分别提升2.28倍,1.94倍,9.44倍,3.02倍。
{"title":"A Grain-Adaptive Computing Structure for FPGA CNN Acceleration","authors":"Xinyuan Qu, Zhihong Huang, Ning Mao, Yu Xu, Gang Cai, Zhen Fang","doi":"10.1109/ASICON47005.2019.8983480","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983480","url":null,"abstract":"In recent years, because of its superior performance and outstanding accuracy, convolutional neural networks (CNNs) are widely used in high-tech applications such as image classification and speech recognition. But it is more and more difficult to implement CNN in hardware platform due to the scale of CNN is increasing rapidly. FPGA attracts more attention compared with other processors for its excellent balance of flexibility and efficiency. There are many FPGA-based CNN accelerators proposed by previous work. However, in previous work the computing resource (especially DSP) is not fully utilized, either explicitly or covertly, which affects the CNN accelerator's overall performance seriously. In this work, we propose a new formula that provides a more accurate and comprehensive analysis to evaluate computing resource utilization, which can provide guidance for CNN accelerator design optimization. Then we propose a grain-adaptive computing structure for FPGA-based CNN acceleration, which can change flexibly to suit to and optimally utilize the available DSP resource. Due to the improvement of DSP utilization, we can achieve a more satisfactory result for both overall throughput performance and power efficiency. This architecture is implemented on Xilinx xcku115 based on AlexNet, the frequency is 150MHz and the peak power consumption is 30.05W. The overall performance is 1292.40 GOPS, 43.01 GOP/s/W, resulting in 2.28X and 1.94X, 9.44X and 3.02X improvement compared to previous work [6], [9] correspondingly.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123281830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
20, 000-fps Visual Motion Magnification on Pixel-parallel Vision Chip 像素并行视觉芯片上20000帧/秒的视觉运动放大
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983493
Junxian He, Xichuan Zhou, Yingcheng Lin, C. Sun, Cong Shi, N. Wu, Gang Luo
This paper proposes a pixel-parallel Eulerian Video Magnification (EVM) algorithm for vision chips. The proposed algorithm is optimized for the stereotyped programmable pixel-parallel array processor architecture favored by high-speed vision chips. We also propose an improved pixel-parallel array processor with alternative image border padding modes to satisfy various algorithm requirements. We implemented an FPGA prototype of an improved 128 × 128 pixel-parallel array processor to run the proposed optimized EVM algorithm with a 120 MHz clock. Experimental results show that our pixel-parallel system can magnify subtle motion clues at a very high speed up to 20, 000 frames per second (fps).
提出了一种用于视觉芯片的像素并行欧拉视频放大(EVM)算法。该算法针对高速视觉芯片青睐的可编程像素并行阵列处理器架构进行了优化。我们还提出了一种改进的像素并行阵列处理器,具有可选的图像边框填充模式,以满足各种算法要求。我们实现了一个改进的128 × 128像素并行阵列处理器的FPGA原型,以运行所提出的优化EVM算法,时钟为120 MHz。实验结果表明,我们的像素并行系统可以以高达每秒2万帧的速度放大细微的运动线索。
{"title":"20, 000-fps Visual Motion Magnification on Pixel-parallel Vision Chip","authors":"Junxian He, Xichuan Zhou, Yingcheng Lin, C. Sun, Cong Shi, N. Wu, Gang Luo","doi":"10.1109/ASICON47005.2019.8983493","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983493","url":null,"abstract":"This paper proposes a pixel-parallel Eulerian Video Magnification (EVM) algorithm for vision chips. The proposed algorithm is optimized for the stereotyped programmable pixel-parallel array processor architecture favored by high-speed vision chips. We also propose an improved pixel-parallel array processor with alternative image border padding modes to satisfy various algorithm requirements. We implemented an FPGA prototype of an improved 128 × 128 pixel-parallel array processor to run the proposed optimized EVM algorithm with a 120 MHz clock. Experimental results show that our pixel-parallel system can magnify subtle motion clues at a very high speed up to 20, 000 frames per second (fps).","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121550242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Solution Processed Metal Oxide in Emerging Electronic Devices 新兴电子器件中的溶液加工金属氧化物
Pub Date : 2019-10-01 DOI: 10.1109/asicon47005.2019.8983521
Chun Zhao, Cezhou Zhao, T. Zhao
Recently, solution processed metal oxide (MO) attracts wide interests due to the advantages including low-cost fabrication, procedure simplicity and vacuum-free technique. Within the paper, the synthesis mechanism of metal oxide deposited through solution process is firstly briefly introduced. Then the recent advances and progress on n-type solution processed MO semiconductors as well as the solution processed MO gate dielectrics have been reviewed for thin-film transistors.
近年来,溶液处理金属氧化物(MO)以其制备成本低、工艺简单、无真空等优点引起了广泛的关注。本文首先简要介绍了溶液法沉积金属氧化物的合成机理。然后综述了近年来在薄膜晶体管中n型溶液处理MO半导体和溶液处理MO栅极电介质的研究进展。
{"title":"Solution Processed Metal Oxide in Emerging Electronic Devices","authors":"Chun Zhao, Cezhou Zhao, T. Zhao","doi":"10.1109/asicon47005.2019.8983521","DOIUrl":"https://doi.org/10.1109/asicon47005.2019.8983521","url":null,"abstract":"Recently, solution processed metal oxide (MO) attracts wide interests due to the advantages including low-cost fabrication, procedure simplicity and vacuum-free technique. Within the paper, the synthesis mechanism of metal oxide deposited through solution process is firstly briefly introduced. Then the recent advances and progress on n-type solution processed MO semiconductors as well as the solution processed MO gate dielectrics have been reviewed for thin-film transistors.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126581641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MMV Subspace Pursuit (M-SP) Algorithm for Joint Sparse Multiple Measurement Vectors Recovery 联合稀疏多测量向量恢复的MMV子空间追踪算法
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983646
Sujuan Liu, Lili Zheng, Lei Liu, Qianjin Lin
In this paper, MMV Subspace Pursuit (M-SP) algorithm is proposed for solving joint sparse multiple measurement vectors (MMV) problem. The pre-selection and backtracking mechanisms are used in M-SP, so M-SP not only has higher recovery performance than some existing algorithms, but also significantly reduces the iteration number for improving the signal recovery efficiency. Simulations results show that M-SP and Simultaneous Compressive Sampling Matching Pursuit (SCoSaMP) have almost identical recovery performance and iteration times, but M-SP significantly reduces the computation complexity in per iteration. For example, when sparsity $K$ is 5, the computational complexity of M-SP is 24.0% of that of SCoSaMP in each iteration.
针对联合稀疏多测量向量(MMV)问题,提出了MMV子空间追踪(M-SP)算法。由于M-SP采用了预选和回溯机制,因此M-SP不仅具有比现有算法更高的恢复性能,而且显著减少了迭代次数,提高了信号恢复效率。仿真结果表明,M-SP与同步压缩采样匹配追踪(SCoSaMP)具有几乎相同的恢复性能和迭代次数,但M-SP显著降低了每次迭代的计算复杂度。例如,当稀疏度$K$为5时,每次迭代M-SP的计算复杂度为SCoSaMP的24.0%。
{"title":"MMV Subspace Pursuit (M-SP) Algorithm for Joint Sparse Multiple Measurement Vectors Recovery","authors":"Sujuan Liu, Lili Zheng, Lei Liu, Qianjin Lin","doi":"10.1109/ASICON47005.2019.8983646","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983646","url":null,"abstract":"In this paper, MMV Subspace Pursuit (M-SP) algorithm is proposed for solving joint sparse multiple measurement vectors (MMV) problem. The pre-selection and backtracking mechanisms are used in M-SP, so M-SP not only has higher recovery performance than some existing algorithms, but also significantly reduces the iteration number for improving the signal recovery efficiency. Simulations results show that M-SP and Simultaneous Compressive Sampling Matching Pursuit (SCoSaMP) have almost identical recovery performance and iteration times, but M-SP significantly reduces the computation complexity in per iteration. For example, when sparsity $K$ is 5, the computational complexity of M-SP is 24.0% of that of SCoSaMP in each iteration.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126101995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Inductorless 5-GHz Differential Dual Regulated Cross-Cascode Transimpedance Amplifier using 40 nm CMOS 采用40nm CMOS的无电感5 ghz差分双稳压跨级联码跨阻放大器
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983663
Bai Song Samuel Lee, Hang-Ji Liu, Xiaopeng Yu, Jer-Ming Chen, K. Yeo
This paper presents a new inductorless 5-GHz differential dual regulated cross-cascode transimpedance amplifier (DDRCCTIA) using UMC 40 nm CMOS technology. It consists of a differential cross-coupled input stage (DDRCC) that has a unique dual PMOS and NMOS regulated cascode loops as well as a frequency doubler with active inductor (FDAI) buffer stage. The design has a transimpedance gain of 62.5 dBΩ and bandwidth of 5.02 GHz. The power consumption is 7.34 mW from a 1.8 V supply, input referred noise current of 4.5 pA√Hz and a very small core area of 0.0018 mm2.
提出了一种采用UMC 40 nm CMOS技术的新型无电感5 ghz差分双稳压跨级联码跨阻放大器(DDRCCTIA)。它由差分交叉耦合输入级(DDRCC)组成,该级具有独特的双PMOS和NMOS调节级联环路,以及带有源电感(FDAI)缓冲级的倍频器。该设计的跨阻增益为62.5 dBΩ,带宽为5.02 GHz。功耗为7.34 mW,电源为1.8 V,输入参考噪声电流为4.5 pA√Hz,核心面积非常小,仅为0.0018 mm2。
{"title":"An Inductorless 5-GHz Differential Dual Regulated Cross-Cascode Transimpedance Amplifier using 40 nm CMOS","authors":"Bai Song Samuel Lee, Hang-Ji Liu, Xiaopeng Yu, Jer-Ming Chen, K. Yeo","doi":"10.1109/ASICON47005.2019.8983663","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983663","url":null,"abstract":"This paper presents a new inductorless 5-GHz differential dual regulated cross-cascode transimpedance amplifier (DDRCCTIA) using UMC 40 nm CMOS technology. It consists of a differential cross-coupled input stage (DDRCC) that has a unique dual PMOS and NMOS regulated cascode loops as well as a frequency doubler with active inductor (FDAI) buffer stage. The design has a transimpedance gain of 62.5 dBΩ and bandwidth of 5.02 GHz. The power consumption is 7.34 mW from a 1.8 V supply, input referred noise current of 4.5 pA√Hz and a very small core area of 0.0018 mm2.","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125308611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Flat Pass-Band Method with Two RC Band-Stop Filters for 4-Stage Passive RC Polyphase Filter in Low-IF Receiver Systems 低中频接收系统中4级无源RC多相滤波器的双RC带阻滤波器平通带方法
Pub Date : 2019-10-01 DOI: 10.1109/ASICON47005.2019.8983611
MinhTri Tran, Nene Kushita, A. Kuwana, Haruo Kobayashi
This paper proposes a flat pass-band for a 4-stage passive RC polyphase filter in a blue-tooth low-IF receiver system; there the bandwidth is 8MHz, the center IF frequency is 4MHz, and the required image rejection ratio is <-30dB. Based on the superposition principle, the transfer function of this filter is derived. As the input signals are the wanted signals, there are two local maximum values which are calculated based on Cauchy-Schwarz inequality theorem at 160kHz (1.24dB) and 40MHz (1.24dB). Therefore, two RC band-stop filters are used to improve the pass-band of these local maximum values (improvement of the ripple gain of pass-band from 2dB into 0.47dB). As a result, a 4-stage passive RC poly-phase filter for a low-IF receiver is designed, where an image rejection ratio is -36dB, and the pass-band gain is flat (the ripple gain is 0.47dB).
提出了一种蓝牙低中频接收系统中4级无源RC多相滤波器的平通带;其中带宽为8MHz,中心中频为4MHz,所需的图像抑制比<-30dB。基于叠加原理,推导了该滤波器的传递函数。由于输入信号为所需信号,因此根据柯西-施瓦茨不等式定理在160kHz (1.24dB)和40MHz (1.24dB)处计算出两个局部最大值。因此,使用两个RC带阻滤波器来改善这些局部最大值的通带(将通带纹波增益从2dB提高到0.47dB)。因此,设计了一个用于低中频接收机的4级无源RC多相滤波器,其中图像抑制比为-36dB,通带增益平坦(纹波增益为0.47dB)。
{"title":"Flat Pass-Band Method with Two RC Band-Stop Filters for 4-Stage Passive RC Polyphase Filter in Low-IF Receiver Systems","authors":"MinhTri Tran, Nene Kushita, A. Kuwana, Haruo Kobayashi","doi":"10.1109/ASICON47005.2019.8983611","DOIUrl":"https://doi.org/10.1109/ASICON47005.2019.8983611","url":null,"abstract":"This paper proposes a flat pass-band for a 4-stage passive RC polyphase filter in a blue-tooth low-IF receiver system; there the bandwidth is 8MHz, the center IF frequency is 4MHz, and the required image rejection ratio is <-30dB. Based on the superposition principle, the transfer function of this filter is derived. As the input signals are the wanted signals, there are two local maximum values which are calculated based on Cauchy-Schwarz inequality theorem at 160kHz (1.24dB) and 40MHz (1.24dB). Therefore, two RC band-stop filters are used to improve the pass-band of these local maximum values (improvement of the ripple gain of pass-band from 2dB into 0.47dB). As a result, a 4-stage passive RC poly-phase filter for a low-IF receiver is designed, where an image rejection ratio is -36dB, and the pass-band gain is flat (the ripple gain is 0.47dB).","PeriodicalId":319342,"journal":{"name":"2019 IEEE 13th International Conference on ASIC (ASICON)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131180213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
2019 IEEE 13th International Conference on ASIC (ASICON)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1