首页 > 最新文献

2020 International SoC Design Conference (ISOCC)最新文献

英文 中文
Recent Trend of Neuromorphic Computing Hardware: Intel's Neuromorphic System Perspective 神经形态计算硬件的最新趋势:英特尔的神经形态系统视角
Pub Date : 2020-10-21 DOI: 10.1109/ISOCC50952.2020.9332961
Yoon Seok Yang, Yongtae Kim
Neuromorphic computing has been studied to implement functions inspired by the human brain such as low power, fine-grained parallel processing, and real-time learning beyond the limitations seen by a standard von Neumann processor. In this paper, Intel's Loihi neuromorphic research chip and its hardware systems are introduced and find out how they are applied and used in actual research fields.
神经形态计算已经被研究来实现受人类大脑启发的功能,如低功耗,细粒度并行处理,以及超越标准冯·诺伊曼处理器限制的实时学习。本文介绍了英特尔公司的Loihi神经形态研究芯片及其硬件系统,并了解了它们在实际研究领域中的应用和使用情况。
{"title":"Recent Trend of Neuromorphic Computing Hardware: Intel's Neuromorphic System Perspective","authors":"Yoon Seok Yang, Yongtae Kim","doi":"10.1109/ISOCC50952.2020.9332961","DOIUrl":"https://doi.org/10.1109/ISOCC50952.2020.9332961","url":null,"abstract":"Neuromorphic computing has been studied to implement functions inspired by the human brain such as low power, fine-grained parallel processing, and real-time learning beyond the limitations seen by a standard von Neumann processor. In this paper, Intel's Loihi neuromorphic research chip and its hardware systems are introduced and find out how they are applied and used in actual research fields.","PeriodicalId":270577,"journal":{"name":"2020 International SoC Design Conference (ISOCC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134234971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Early Termination of STDP Learning with Spike Counts in Spiking Neural Networks 尖峰神经网络中带有尖峰计数的STDP学习的早期终止
Pub Date : 2020-10-21 DOI: 10.1109/ISOCC50952.2020.9333061
Sunghyun Choi, Jongsun Park
Spiking neural network (SNN) is considered as one of the most promising candidates for designing neuromorphic hardware due to its low power computing capability. Since SNNs are made from imitating features of the human brain, bio-plausible spike-timing-dependent plasticity (STDP) learning rule can be adjusted to perform unsupervised learning of SNN. In this paper, we present a spike count based early termination technique for STDP learning in SNN. To reduce redundant timesteps and calculations, spike counts of output neurons can be used to terminate the training process beforehand, thus latency and energy can be decreased. The proposed scheme reduces 50.7% of timesteps and 51.1% of total weight update during training with 0.35% accuracy drop in MNIST application.
脉冲神经网络(SNN)由于其低功耗的计算能力而被认为是最有前途的神经形态硬件设计候选之一。由于SNN是模仿人类大脑的特征,因此可以调整生物似是而非的spike- time -dependent plasticity (STDP)学习规则来实现SNN的无监督学习。在本文中,我们提出了一种基于尖峰计数的SNN中STDP学习的早期终止技术。为了减少冗余的时间步长和计算,可以使用输出神经元的峰值计数来提前终止训练过程,从而降低延迟和能量。在MNIST应用中,该方案在训练过程中减少了50.7%的时间步长和51.1%的总权重更新,准确率下降了0.35%。
{"title":"Early Termination of STDP Learning with Spike Counts in Spiking Neural Networks","authors":"Sunghyun Choi, Jongsun Park","doi":"10.1109/ISOCC50952.2020.9333061","DOIUrl":"https://doi.org/10.1109/ISOCC50952.2020.9333061","url":null,"abstract":"Spiking neural network (SNN) is considered as one of the most promising candidates for designing neuromorphic hardware due to its low power computing capability. Since SNNs are made from imitating features of the human brain, bio-plausible spike-timing-dependent plasticity (STDP) learning rule can be adjusted to perform unsupervised learning of SNN. In this paper, we present a spike count based early termination technique for STDP learning in SNN. To reduce redundant timesteps and calculations, spike counts of output neurons can be used to terminate the training process beforehand, thus latency and energy can be decreased. The proposed scheme reduces 50.7% of timesteps and 51.1% of total weight update during training with 0.35% accuracy drop in MNIST application.","PeriodicalId":270577,"journal":{"name":"2020 International SoC Design Conference (ISOCC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126596603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Optical Receiver Front-end for Active Optical Cable in 180 nm CMOS 180nm CMOS有源光缆光接收机前端
Pub Date : 2020-10-21 DOI: 10.1109/ISOCC50952.2020.9333024
Daehyun Koh, Dainel Jeong, Jeongho Hwang, D. Jeong
This paper introduces a 4-channel optical receiver front-end in 180 nm CMOS technology for active optical cable application. The receiver is composed of HDMI 2.0 configuration, occupying area of $2.52 mathrm{mm}^{2}$. Regulated-cascode transimpedance amplifier is used in the chip. Shared inductor and capacitive feedback techniques are applied in limiting amplifiers to enhance bandwidth. The receivers consumes power of 54 mW at the data rate of 6 Gb/s with the energy efficiency of 3 pJ/bit/channel.
介绍了一种应用于有源光缆的180nm CMOS技术的4通道光接收机前端。接收机采用HDMI 2.0配置,占地$2.52 mathm {mm}^{2}$。该芯片采用了调节级联码跨阻放大器。共享电感和电容反馈技术应用于限制放大器,以提高带宽。在6gb /s的数据速率下,接收机功耗为54mw,能量效率为3pj /bit/信道。
{"title":"Optical Receiver Front-end for Active Optical Cable in 180 nm CMOS","authors":"Daehyun Koh, Dainel Jeong, Jeongho Hwang, D. Jeong","doi":"10.1109/ISOCC50952.2020.9333024","DOIUrl":"https://doi.org/10.1109/ISOCC50952.2020.9333024","url":null,"abstract":"This paper introduces a 4-channel optical receiver front-end in 180 nm CMOS technology for active optical cable application. The receiver is composed of HDMI 2.0 configuration, occupying area of $2.52 mathrm{mm}^{2}$. Regulated-cascode transimpedance amplifier is used in the chip. Shared inductor and capacitive feedback techniques are applied in limiting amplifiers to enhance bandwidth. The receivers consumes power of 54 mW at the data rate of 6 Gb/s with the energy efficiency of 3 pJ/bit/channel.","PeriodicalId":270577,"journal":{"name":"2020 International SoC Design Conference (ISOCC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131023497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Memory-like Defect Diagnosis for CMOL FPGAs CMOL fpga类记忆缺陷诊断
Pub Date : 2020-10-21 DOI: 10.1109/ISOCC50952.2020.9332927
Jihye Kim, Hayoung Lee, Seokjun Jang, Hogyeong Kim, Sungho Kang
Nanotechnology is considered important as an alternative technology to overcome the limitations of CMOS technology. While nanotechnology has advantages in terms of power, density and performance, it is essential to obtain defect tolerance using reconfiguration due to its high defect rate. To bypass defect elements, accurate defect diagnosis is important for circuit configuration. CMOL FPGAs are circuit structures combining advantages of CMOS and nanotechnology. In this paper, an accurate defect diagnosis method for CMOL FPGAs using an operation similar to the read of CMOL memory are proposed.
纳米技术被认为是克服CMOS技术局限性的重要替代技术。虽然纳米技术在功率,密度和性能方面具有优势,但由于其高缺品率,必须通过重新配置来获得缺陷容限。为了绕过缺陷元件,准确的缺陷诊断对电路配置至关重要。CMOL fpga是结合CMOS和纳米技术优点的电路结构。本文提出了一种基于CMOL存储器读取操作的CMOL fpga缺陷精确诊断方法。
{"title":"Memory-like Defect Diagnosis for CMOL FPGAs","authors":"Jihye Kim, Hayoung Lee, Seokjun Jang, Hogyeong Kim, Sungho Kang","doi":"10.1109/ISOCC50952.2020.9332927","DOIUrl":"https://doi.org/10.1109/ISOCC50952.2020.9332927","url":null,"abstract":"Nanotechnology is considered important as an alternative technology to overcome the limitations of CMOS technology. While nanotechnology has advantages in terms of power, density and performance, it is essential to obtain defect tolerance using reconfiguration due to its high defect rate. To bypass defect elements, accurate defect diagnosis is important for circuit configuration. CMOL FPGAs are circuit structures combining advantages of CMOS and nanotechnology. In this paper, an accurate defect diagnosis method for CMOL FPGAs using an operation similar to the read of CMOL memory are proposed.","PeriodicalId":270577,"journal":{"name":"2020 International SoC Design Conference (ISOCC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132155508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low Power Approximate Multiplier Using Error Tolerant Adder 使用容错加法器的低功耗近似乘法器
Pub Date : 2020-10-21 DOI: 10.1109/ISOCC50952.2020.9332952
Jaeik Cho, Youngmin Kim
In many applications such as multimedia processing, the power may need to be lowered. Error tolerant adder is used to study carry prediction and approximate multiplication in this paper. The delay of addition between partial products and the results of speculative counters is reduced by the optimized Three-Dimensional Matrix (TDM) suggested in [1] when the speculative counters make no error. The results of the optimized TDM trees are added by the speculative adder for error tolerant computations. The power dissipation is lowered by tolerating the generated errors in the computations deliberately. By the speculative adder, the parts of LSB are approximated in total calculation. Instead of making LSB inaccurate, the total power dissipation is lowered.
在许多应用程序中,例如多媒体处理,可能需要降低功耗。本文采用容错加法器研究进位预测和近似乘法。在投机计数器不出错的情况下,利用[1]中提出的优化的三维矩阵(TDM)减少了部分乘积与投机计数器结果之间的加法延迟。通过推测加法器对优化后的TDM树进行容错计算。通过有意地容忍计算中产生的误差,降低了功耗。通过推测加法器,在总计算中对LSB部分进行了近似。而不是使LSB不准确,总功耗降低。
{"title":"Low Power Approximate Multiplier Using Error Tolerant Adder","authors":"Jaeik Cho, Youngmin Kim","doi":"10.1109/ISOCC50952.2020.9332952","DOIUrl":"https://doi.org/10.1109/ISOCC50952.2020.9332952","url":null,"abstract":"In many applications such as multimedia processing, the power may need to be lowered. Error tolerant adder is used to study carry prediction and approximate multiplication in this paper. The delay of addition between partial products and the results of speculative counters is reduced by the optimized Three-Dimensional Matrix (TDM) suggested in [1] when the speculative counters make no error. The results of the optimized TDM trees are added by the speculative adder for error tolerant computations. The power dissipation is lowered by tolerating the generated errors in the computations deliberately. By the speculative adder, the parts of LSB are approximated in total calculation. Instead of making LSB inaccurate, the total power dissipation is lowered.","PeriodicalId":270577,"journal":{"name":"2020 International SoC Design Conference (ISOCC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132178553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Low Power Loss IGBT Driver Circuit Using Current Drive 采用电流驱动的低功耗IGBT驱动电路
Pub Date : 2020-10-21 DOI: 10.1109/ISOCC50952.2020.9333104
Yudai Abe, A. Iwabuchi, Jun-Ichi Matsuda, A. Kuwana, Takashi Ida, Y. Shibasaki, Haruo Kobayashi
This paper investigates a feasibility of an IGBT driver with current sources to obtain good trade-off between the switching loss and the output voltage overshoot. We consider here an employment of multiple-peak current mirror circuits for the current mode IGBT driver, which can control the current amount in time for pulling the charge from (turn-off case) and pushing one to (turn-on case) the IGBT gate capacitances. We have examined the IGBT turn-off case by pulling the charge from the IGBT gate capacitances, and SPICE simulation results show that our investigated IGBT driver with controlled current sources can achieve lower loss than the one with controlled voltage sources when keeping the overshoot voltage to be equal.
本文研究了带电流源的IGBT驱动器在开关损耗和输出电压超调之间取得良好平衡的可行性。我们在这里考虑采用多峰电流镜像电路作为电流模式IGBT驱动器,它可以及时控制电流量,以便从(关断情况下)拉出电荷并将电荷推到(开通情况下)IGBT门电容。我们通过从IGBT栅极电容中提取电荷来测试IGBT关断情况,SPICE仿真结果表明,在保持过调电压相等的情况下,我们研究的控制电流源的IGBT驱动器比控制电压源的IGBT驱动器具有更低的损耗。
{"title":"Low Power Loss IGBT Driver Circuit Using Current Drive","authors":"Yudai Abe, A. Iwabuchi, Jun-Ichi Matsuda, A. Kuwana, Takashi Ida, Y. Shibasaki, Haruo Kobayashi","doi":"10.1109/ISOCC50952.2020.9333104","DOIUrl":"https://doi.org/10.1109/ISOCC50952.2020.9333104","url":null,"abstract":"This paper investigates a feasibility of an IGBT driver with current sources to obtain good trade-off between the switching loss and the output voltage overshoot. We consider here an employment of multiple-peak current mirror circuits for the current mode IGBT driver, which can control the current amount in time for pulling the charge from (turn-off case) and pushing one to (turn-on case) the IGBT gate capacitances. We have examined the IGBT turn-off case by pulling the charge from the IGBT gate capacitances, and SPICE simulation results show that our investigated IGBT driver with controlled current sources can achieve lower loss than the one with controlled voltage sources when keeping the overshoot voltage to be equal.","PeriodicalId":270577,"journal":{"name":"2020 International SoC Design Conference (ISOCC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132691980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Wideband Distributed Demodulator at 100 GHz 一种100ghz宽带分布式解调器
Pub Date : 2020-10-21 DOI: 10.1109/ISOCC50952.2020.9332955
Zubair Mehmood, M. Seo
This paper presents a high speed on-off Keying (OOK) demodulator using distributed approach which is implemented using 28 nm bulk CMOS process. A 100 GHz demodulator post layout full-wave EM simulation results are realized for the data-rate up to 30 Gbps to check the eye diagram at different power levels. The proposed demodulator achieved 50% of eye opening at 30 Gbps for -16 dBm input power. The design consumes 18.5 mW of power and occupies the chip area of 0.135 mm2-,
本文提出了一种采用分布式方法的高速开关键控(OOK)解调器,该解调器采用28nm块体CMOS工艺实现。在数据速率高达30 Gbps的情况下,实现了100 GHz解调器后置全波电磁仿真结果,验证了不同功率水平下的眼图。提出的解调器在-16 dBm输入功率为30 Gbps时实现了50%的睁眼率。该设计功耗为18.5 mW,芯片面积为0.135 mm2-。
{"title":"A Wideband Distributed Demodulator at 100 GHz","authors":"Zubair Mehmood, M. Seo","doi":"10.1109/ISOCC50952.2020.9332955","DOIUrl":"https://doi.org/10.1109/ISOCC50952.2020.9332955","url":null,"abstract":"This paper presents a high speed on-off Keying (OOK) demodulator using distributed approach which is implemented using 28 nm bulk CMOS process. A 100 GHz demodulator post layout full-wave EM simulation results are realized for the data-rate up to 30 Gbps to check the eye diagram at different power levels. The proposed demodulator achieved 50% of eye opening at 30 Gbps for -16 dBm input power. The design consumes 18.5 mW of power and occupies the chip area of 0.135 mm2-,","PeriodicalId":270577,"journal":{"name":"2020 International SoC Design Conference (ISOCC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133250117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Diagnosis of Scan Chain Faults Based-on Machine-Learning 基于机器学习的扫描链故障诊断
Pub Date : 2020-10-21 DOI: 10.1109/ISOCC50952.2020.9333074
Hyeonchan Lim, Tae Hyun Kim, Seunghwan Kim, Sungho Kang
In order to improve yield of nanometer-scale chips, scan-based test and diagnosis are important. However, the scan chain can be subject to defects due to large hardware incurred by itself, which accounts for considerable portion of total chip area. Hence, scan chain test and diagnosis has played a critical role in recent years. In this paper, an efficient scan chain diagnosis method based on two-stage neural networks is proposed for not only stuck-at fault but also transition fault. Experimental results on benchmark circuits show that the proposed method is 10% more accurate than a previous work and CPU time for training the neural networks is also reduced dramatically.
为了提高纳米级芯片的成品率,基于扫描的测试和诊断是非常重要的。然而,由于扫描链本身所产生的庞大硬件占芯片总面积的相当大一部分,因此扫描链可能存在缺陷。因此,扫描链检测和诊断在近年来发挥了至关重要的作用。本文提出了一种基于两阶段神经网络的扫描链诊断方法,不仅适用于卡滞故障,也适用于过渡故障。在基准电路上的实验结果表明,该方法的准确率比以前的方法提高了10%,并且大大减少了神经网络的CPU训练时间。
{"title":"Diagnosis of Scan Chain Faults Based-on Machine-Learning","authors":"Hyeonchan Lim, Tae Hyun Kim, Seunghwan Kim, Sungho Kang","doi":"10.1109/ISOCC50952.2020.9333074","DOIUrl":"https://doi.org/10.1109/ISOCC50952.2020.9333074","url":null,"abstract":"In order to improve yield of nanometer-scale chips, scan-based test and diagnosis are important. However, the scan chain can be subject to defects due to large hardware incurred by itself, which accounts for considerable portion of total chip area. Hence, scan chain test and diagnosis has played a critical role in recent years. In this paper, an efficient scan chain diagnosis method based on two-stage neural networks is proposed for not only stuck-at fault but also transition fault. Experimental results on benchmark circuits show that the proposed method is 10% more accurate than a previous work and CPU time for training the neural networks is also reduced dramatically.","PeriodicalId":270577,"journal":{"name":"2020 International SoC Design Conference (ISOCC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134613438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Binary Content-Addressable Memory System using Nanoelectromechanical Memory Switch 利用纳米机电存储开关的二进制内容可寻址存储系统
Pub Date : 2020-10-21 DOI: 10.1109/ISOCC50952.2020.9332913
Hyunju Kim, Youngmin Kim
Content-Addressable Memory (CAM) is a type of memory searches its contents with data and outputs addresses of matching words. Conventional CAM designs used dynamic CMOS architecture for high match speed and high density, but such implementation requires use of system clocks, and thus suffer from timing violations and design limitations such as charge sharing. Thus, we propose static based architecture for CAM, using Nano-Electro Mechanical (NEM) Memory Switch for nonvolatile data storage. We design the proposed CAM architectures on commercial 65 nm process with 1.2 V operating voltage.
内容可寻址存储器(CAM)是一种用数据搜索其内容并输出匹配词的地址的存储器。传统的CAM设计采用动态CMOS架构来实现高匹配速度和高密度,但这种实现需要使用系统时钟,因此存在时间冲突和电荷共享等设计限制。因此,我们提出了基于静态的CAM架构,使用纳米机电(NEM)内存开关进行非易失性数据存储。我们在1.2 V工作电压下,在商用65nm制程上设计了CAM架构。
{"title":"Binary Content-Addressable Memory System using Nanoelectromechanical Memory Switch","authors":"Hyunju Kim, Youngmin Kim","doi":"10.1109/ISOCC50952.2020.9332913","DOIUrl":"https://doi.org/10.1109/ISOCC50952.2020.9332913","url":null,"abstract":"Content-Addressable Memory (CAM) is a type of memory searches its contents with data and outputs addresses of matching words. Conventional CAM designs used dynamic CMOS architecture for high match speed and high density, but such implementation requires use of system clocks, and thus suffer from timing violations and design limitations such as charge sharing. Thus, we propose static based architecture for CAM, using Nano-Electro Mechanical (NEM) Memory Switch for nonvolatile data storage. We design the proposed CAM architectures on commercial 65 nm process with 1.2 V operating voltage.","PeriodicalId":270577,"journal":{"name":"2020 International SoC Design Conference (ISOCC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114593770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mixture of Deterministic and Stochastic Quantization Schemes for Lightweight CNN 轻量级CNN的确定性和随机混合量化方案
Pub Date : 2020-10-21 DOI: 10.1109/ISOCC50952.2020.9332958
Sungrae Kim, Hyun Kim
There has been a breakthrough in the field of image classification and object detection, owing to the development of GPU and deep learning. However, because of the huge computation of deep learning, it is hard to use the deep learning algorithms in an embedded platform or a mobile device. Therefore, many compression studies have been conducted, and one of the most popular methods is a parameter quantization. In this paper, we propose an adaptive quantization scheme that reduces the loss of accuracy due to the quantization by properly mixing deterministic and stochastic quantization methods, while retaining the characteristics of the hardware-friendly fixed-point quantization method. By applying the proposed method to the weight parameters of image classification and object detection networks, the proposed method shows better mean average precision (mAP) of up to 0.44% in image classification and 0.91 % in object detection.
由于GPU和深度学习的发展,在图像分类和目标检测领域有了突破。然而,由于深度学习的计算量巨大,很难在嵌入式平台或移动设备中使用深度学习算法。因此,进行了许多压缩研究,其中最流行的方法之一是参数量化。本文提出了一种自适应量化方案,在保留硬件友好的定点量化方法的特点的同时,适当地混合了确定性和随机量化方法,减少了由于量化而导致的精度损失。将该方法应用于图像分类和目标检测网络的权重参数,图像分类的平均精度(mAP)可达0.44%,目标检测的平均精度可达0.91%。
{"title":"Mixture of Deterministic and Stochastic Quantization Schemes for Lightweight CNN","authors":"Sungrae Kim, Hyun Kim","doi":"10.1109/ISOCC50952.2020.9332958","DOIUrl":"https://doi.org/10.1109/ISOCC50952.2020.9332958","url":null,"abstract":"There has been a breakthrough in the field of image classification and object detection, owing to the development of GPU and deep learning. However, because of the huge computation of deep learning, it is hard to use the deep learning algorithms in an embedded platform or a mobile device. Therefore, many compression studies have been conducted, and one of the most popular methods is a parameter quantization. In this paper, we propose an adaptive quantization scheme that reduces the loss of accuracy due to the quantization by properly mixing deterministic and stochastic quantization methods, while retaining the characteristics of the hardware-friendly fixed-point quantization method. By applying the proposed method to the weight parameters of image classification and object detection networks, the proposed method shows better mean average precision (mAP) of up to 0.44% in image classification and 0.91 % in object detection.","PeriodicalId":270577,"journal":{"name":"2020 International SoC Design Conference (ISOCC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117136275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2020 International SoC Design Conference (ISOCC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1