首页 > 最新文献

2015 IEEE Hot Chips 27 Symposium (HCS)最新文献

英文 中文
Winner-Take-All and Loser-Take-All Circuits: Architectures, Applications and Analytical Comparison 赢家通吃和输家通吃电路:架构、应用和分析比较
Pub Date : 2023-11-08 DOI: 10.3390/chips2040016
Ehsan Rahiminejad, Hamed Aminzadeh
Different winner-take-all (WTA) and loser-take-all (LTA) circuits are studied, and their operations are analyzed in this review. The exclusive operation of the current conveyor, binary tree, and time-domain WTA/LTA architectures, as the most important architectures reported in the literature, are compared from the perspectives of power consumption, speed, and precision.
本文研究了不同的赢者通吃(WTA)和输者通吃(LTA)电路,并分析了它们的工作原理。从功耗、速度和精度的角度比较了当前输送机的独占运行、二叉树和时域WTA/LTA架构作为文献中报道的最重要的架构。
{"title":"Winner-Take-All and Loser-Take-All Circuits: Architectures, Applications and Analytical Comparison","authors":"Ehsan Rahiminejad, Hamed Aminzadeh","doi":"10.3390/chips2040016","DOIUrl":"https://doi.org/10.3390/chips2040016","url":null,"abstract":"Different winner-take-all (WTA) and loser-take-all (LTA) circuits are studied, and their operations are analyzed in this review. The exclusive operation of the current conveyor, binary tree, and time-domain WTA/LTA architectures, as the most important architectures reported in the literature, are compared from the perspectives of power consumption, speed, and precision.","PeriodicalId":6666,"journal":{"name":"2015 IEEE Hot Chips 27 Symposium (HCS)","volume":"344 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135392322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Survey of Automotive Radar and Lidar Signal Processing and Architectures 汽车雷达和激光雷达信号处理与体系结构综述
Pub Date : 2023-10-08 DOI: 10.3390/chips2040015
Luigi Giuffrida, Guido Masera, Maurizio Martina
In recent years, the development of Advanced Driver-Assistance Systems (ADASs) is driving the need for more reliable and precise on-vehicle sensing. Radar and lidar are crucial in this framework, since they allow sensing of vehicle’s surroundings. In such a scenario, it is necessary to master these sensing systems, and knowing their similarities and differences is important. Due to ADAS’s intrinsic real-time performance requirements, it is almost mandatory to be aware of the processing algorithms required by radar and lidar to understand what can be optimized and what actions can be taken to approach the real-time requirement. This review aims to present state-of-the-art radar and lidar technology, mainly focusing on modulation schemes and imaging systems, highlighting their weaknesses and strengths. Then, an overview of the sensor data processing algorithms is provided, with some considerations on what type of algorithms can be accelerated in hardware, pointing to some implementations from the literature. In conclusion, the basic concepts of sensor fusion are presented, and a comparison between radar and lidar is performed.
近年来,先进驾驶辅助系统(ADASs)的发展推动了对更可靠、更精确的车载传感的需求。雷达和激光雷达在这个框架中至关重要,因为它们可以感知车辆周围的环境。在这种情况下,有必要掌握这些传感系统,了解它们的异同是很重要的。由于ADAS固有的实时性能要求,了解雷达和激光雷达所需的处理算法几乎是强制性的,以便了解可以优化哪些内容,以及可以采取哪些行动来达到实时要求。本综述旨在介绍最先进的雷达和激光雷达技术,主要关注调制方案和成像系统,突出其优缺点。然后,概述了传感器数据处理算法,并考虑了哪些类型的算法可以在硬件中加速,并指出了文献中的一些实现。最后,介绍了传感器融合的基本概念,并对雷达和激光雷达进行了比较。
{"title":"A Survey of Automotive Radar and Lidar Signal Processing and Architectures","authors":"Luigi Giuffrida, Guido Masera, Maurizio Martina","doi":"10.3390/chips2040015","DOIUrl":"https://doi.org/10.3390/chips2040015","url":null,"abstract":"In recent years, the development of Advanced Driver-Assistance Systems (ADASs) is driving the need for more reliable and precise on-vehicle sensing. Radar and lidar are crucial in this framework, since they allow sensing of vehicle’s surroundings. In such a scenario, it is necessary to master these sensing systems, and knowing their similarities and differences is important. Due to ADAS’s intrinsic real-time performance requirements, it is almost mandatory to be aware of the processing algorithms required by radar and lidar to understand what can be optimized and what actions can be taken to approach the real-time requirement. This review aims to present state-of-the-art radar and lidar technology, mainly focusing on modulation schemes and imaging systems, highlighting their weaknesses and strengths. Then, an overview of the sensor data processing algorithms is provided, with some considerations on what type of algorithms can be accelerated in hardware, pointing to some implementations from the literature. In conclusion, the basic concepts of sensor fusion are presented, and a comparison between radar and lidar is performed.","PeriodicalId":6666,"journal":{"name":"2015 IEEE Hot Chips 27 Symposium (HCS)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135251448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and Performance Analysis of Hardware Realization of 3GPP Physical Layer for 5G Cell Search 面向5G小区搜索的3GPP物理层硬件实现设计与性能分析
Pub Date : 2023-10-07 DOI: 10.3390/chips2040014
Khalid Lodhi, Jayant Chhillar, Sumit J. Darak, Divisha Sharma
5G Cell Search (CS) is the first step for user equipment (UE) to initiate communication with the 5G node B (gNB) every time it is powered ON. In cellular networks, CS is accomplished via synchronization signals (SS) broadcasted by gNB. 5G 3rd generation partnership project (3GPP) specifications offer a detailed discussion on the SS generation at gNB, but a limited understanding of their blind search and detection is available. Unlike 4G, 5G SS may not be transmitted at the center of carrier frequency, and their frequency location is unknown to UE. In this work, we demonstrate the 5G CS by designing 3GPP compatible hardware realization of the physical layer (PHY) of the gNB transmitter and UE receiver. The proposed SS detection explores a novel down-sampling approach resulting in a 60% reduction in on-chip memory and 50% lower search time. Via detailed performance analysis, we analyze the functional correctness, computational complexity, and latency of the proposed approach for different word lengths, signal-to-noise ratio (SNR), and down-sampling factors. We demonstrate end-to-end 5G CS using GNU Radio-based RFNoC framework on the USRP-FPGA platform and achieve 66% faster SS search compared to software. The 3GPP compatibility and demonstration on hardware strengthen the commercial significance of the proposed work.
5G小区搜索(5G Cell Search, CS)是用户设备(UE)每次上电时启动与5G节点B (gNB)通信的第一步。在蜂窝网络中,CS是通过gNB广播的同步信号(SS)完成的。5G第三代合作伙伴计划(3GPP)规范提供了gNB上SS代的详细讨论,但对其盲搜索和检测的理解有限。与4G不同,5G SS可能不会在载波频率中心传输,其频率位置对于UE来说是未知的。在这项工作中,我们通过设计gNB发射机和UE接收机物理层(PHY)的3GPP兼容硬件实现来演示5G CS。提出的SS检测探索了一种新的下采样方法,从而减少了60%的片上内存和50%的搜索时间。通过详细的性能分析,我们分析了该方法在不同字长、信噪比(SNR)和下采样因素下的功能正确性、计算复杂度和延迟。我们在USRP-FPGA平台上使用基于GNU radio的RFNoC框架演示了端到端5G CS,与软件相比,SS搜索速度提高了66%。3GPP的兼容性和硬件上的演示增强了所提出工作的商业意义。
{"title":"Design and Performance Analysis of Hardware Realization of 3GPP Physical Layer for 5G Cell Search","authors":"Khalid Lodhi, Jayant Chhillar, Sumit J. Darak, Divisha Sharma","doi":"10.3390/chips2040014","DOIUrl":"https://doi.org/10.3390/chips2040014","url":null,"abstract":"5G Cell Search (CS) is the first step for user equipment (UE) to initiate communication with the 5G node B (gNB) every time it is powered ON. In cellular networks, CS is accomplished via synchronization signals (SS) broadcasted by gNB. 5G 3rd generation partnership project (3GPP) specifications offer a detailed discussion on the SS generation at gNB, but a limited understanding of their blind search and detection is available. Unlike 4G, 5G SS may not be transmitted at the center of carrier frequency, and their frequency location is unknown to UE. In this work, we demonstrate the 5G CS by designing 3GPP compatible hardware realization of the physical layer (PHY) of the gNB transmitter and UE receiver. The proposed SS detection explores a novel down-sampling approach resulting in a 60% reduction in on-chip memory and 50% lower search time. Via detailed performance analysis, we analyze the functional correctness, computational complexity, and latency of the proposed approach for different word lengths, signal-to-noise ratio (SNR), and down-sampling factors. We demonstrate end-to-end 5G CS using GNU Radio-based RFNoC framework on the USRP-FPGA platform and achieve 66% faster SS search compared to software. The 3GPP compatibility and demonstration on hardware strengthen the commercial significance of the proposed work.","PeriodicalId":6666,"journal":{"name":"2015 IEEE Hot Chips 27 Symposium (HCS)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135301084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Silicon Carbide: Physics, Manufacturing, and Its Role in Large-Scale Vehicle Electrification 碳化硅:物理、制造及其在大规模汽车电气化中的作用
Pub Date : 2023-09-13 DOI: 10.3390/chips2030013
Filippo Di Giovanni
Silicon carbide is changing power electronics; it is enabling massive car electrification owing to its far more efficient operation with respect to mainstream silicon in a large variety of energy conversion systems like the main traction inverter of an electric vehicle (EV). Its superior performance depends upon unique properties such as lower switching and conduction losses, safer high-temperature operation and high-voltage capability. Starting briefly with a description of its physics, more detailed information is then given about some key manufacturing steps such as crystal growth and epitaxy. Afterwards, an overview of its inherent defects and how to mitigate them is presented. Finally, a typical EV’s propulsion inverter is shown, proving the technology’s effectiveness in meeting requirements for mass electrification. Foreword: In recent years, SiC has drawn the attention of a growing number of power electronics designers as the material has good prospects for reducing environmental impacts on a global basis. The goal of this paper, based on the author’s contribution to the introduction of the technology at STMicroelectronics, is to show the potential of silicon carbide in enabling massive car electrification. The company’s SiC MOSFETs, tailored to the automotive industry, are enabling visionary EV makers to pave the way for sustainable e-mobility. The intent of this paper is to describe, for a large crowd of readers, how SiC features can accelerate such a transition by quantifying the benefits they bring in terms of improved efficiency in an EV electric powertrain. The paper also has the ambition to highlight the material’s physics and to give an overview of its production processes, starting from the crystal growth for realizing substrates to the main epitaxy techniques. Some space has been devoted to the analysis of the main crystal defects not present in silicon and whose nature poses new challenges in terms of manufacturing yields and screening. Finally, some insights into the market evolution and on the transition to 200 mm wafers are given.
碳化硅正在改变电力电子;由于在各种能量转换系统(如电动汽车的主牵引逆变器)中,与主流硅相比,它的运行效率要高得多,因此它正在实现大规模的汽车电气化。其优越的性能取决于其独特的特性,如更低的开关和传导损耗,更安全的高温操作和高压能力。从对其物理特性的简要描述开始,然后给出一些关键的制造步骤,如晶体生长和外延的更详细的信息。然后,概述了其固有缺陷以及如何消除这些缺陷。最后,以典型的电动汽车推进逆变器为例,验证了该技术在满足大规模电气化要求方面的有效性。前言:近年来,SiC材料在全球范围内具有良好的减少环境影响的前景,引起了越来越多电力电子设计人员的关注。基于作者对意法半导体技术引进的贡献,本文的目标是展示碳化硅在实现大规模汽车电气化方面的潜力。该公司为汽车行业量身定制的SiC mosfet,使有远见的电动汽车制造商能够为可持续的电动汽车铺平道路。本文的目的是为广大读者描述SiC特性如何通过量化它们在提高电动汽车动力系统效率方面带来的好处来加速这种转变。本文还希望突出材料的物理特性,并概述其生产过程,从实现衬底的晶体生长到主要外延技术。一些空间专门用于分析硅中不存在的主要晶体缺陷,其性质在制造产量和筛选方面提出了新的挑战。最后,对市场演变和向200mm晶圆的过渡给出了一些见解。
{"title":"Silicon Carbide: Physics, Manufacturing, and Its Role in Large-Scale Vehicle Electrification","authors":"Filippo Di Giovanni","doi":"10.3390/chips2030013","DOIUrl":"https://doi.org/10.3390/chips2030013","url":null,"abstract":"Silicon carbide is changing power electronics; it is enabling massive car electrification owing to its far more efficient operation with respect to mainstream silicon in a large variety of energy conversion systems like the main traction inverter of an electric vehicle (EV). Its superior performance depends upon unique properties such as lower switching and conduction losses, safer high-temperature operation and high-voltage capability. Starting briefly with a description of its physics, more detailed information is then given about some key manufacturing steps such as crystal growth and epitaxy. Afterwards, an overview of its inherent defects and how to mitigate them is presented. Finally, a typical EV’s propulsion inverter is shown, proving the technology’s effectiveness in meeting requirements for mass electrification. Foreword: In recent years, SiC has drawn the attention of a growing number of power electronics designers as the material has good prospects for reducing environmental impacts on a global basis. The goal of this paper, based on the author’s contribution to the introduction of the technology at STMicroelectronics, is to show the potential of silicon carbide in enabling massive car electrification. The company’s SiC MOSFETs, tailored to the automotive industry, are enabling visionary EV makers to pave the way for sustainable e-mobility. The intent of this paper is to describe, for a large crowd of readers, how SiC features can accelerate such a transition by quantifying the benefits they bring in terms of improved efficiency in an EV electric powertrain. The paper also has the ambition to highlight the material’s physics and to give an overview of its production processes, starting from the crystal growth for realizing substrates to the main epitaxy techniques. Some space has been devoted to the analysis of the main crystal defects not present in silicon and whose nature poses new challenges in terms of manufacturing yields and screening. Finally, some insights into the market evolution and on the transition to 200 mm wafers are given.","PeriodicalId":6666,"journal":{"name":"2015 IEEE Hot Chips 27 Symposium (HCS)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135740455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synergistic Verification of Hardware Peripherals through Virtual Prototype Aided Cross-Level Methodology Leveraging Coverage-Guided Fuzzing and Co-Simulation 利用覆盖引导模糊和联合仿真的虚拟样机辅助跨层方法对硬件外设进行协同验证
Pub Date : 2023-09-08 DOI: 10.3390/chips2030012
Sallar Ahmadi-Pour, Mathis Logemann, V. Herdt, Rolf Drechsler
In this paper, we propose a Virtual Prototype (VP) driven verification methodology for Hardware (HW) peripherals. In particular, we combine two approaches that complement each other and use the VP as a readily available reference model: We use (A) Coverage-Guided Fuzzing (CGF) which enables comprehensive verification at the unit-level of the Register-Transfer Level (RTL) HW peripheral with a Transaction Level Modeling (TLM) reference, and (B) an application-driven co-simulation-based approach that enables verification of the HW peripheral at the system-level. As a case-study, we utilize a RISC-V Platform Level Interrupt Controller (PLIC) as HW peripheral and use an abstract TLM PLIC implementation from the open source RISC-V VP as the reference model. In our experiments we find three behavioral mismatches and discuss the observation of these, as well as non-functional timing behavior mismatches, that were found through the proposed synergistic approach. Furthermore, we provide a discussion and considerations on the RTL/TLM Transactors, as they embody one keystone in cross-level methods. As the different approaches uncover different mismatches in our case-study (e.g., behavioral mismatches and timing mismatches), we conclude a synergy between the methods to aid in verification efforts.
在本文中,我们提出了一种虚拟样机(VP)驱动的硬件(HW)外设验证方法。特别是,我们结合了两种相互补充的方法,并使用VP作为一个可用的参考模型:我们使用(a)覆盖引导模糊(CGF),它可以在注册-传输级(RTL)硬件外设的单元级别进行全面验证,并使用事务级建模(TLM)参考,以及(B)基于应用程序驱动的协同仿真方法,可以在系统级别验证硬件外设。作为案例研究,我们使用RISC-V平台级中断控制器(PLIC)作为硬件外设,并使用开源RISC-V VP中的抽象TLM PLIC实现作为参考模型。在我们的实验中,我们发现了三种行为错配,并讨论了对这些行为的观察,以及通过提出的协同方法发现的非功能性时序行为错配。此外,我们提供了关于RTL/TLM transtors的讨论和考虑,因为它们体现了跨层方法的一个关键。当不同的方法在我们的案例研究中发现不同的不匹配时(例如,行为不匹配和时间不匹配),我们总结了方法之间的协同作用,以帮助验证工作。
{"title":"Synergistic Verification of Hardware Peripherals through Virtual Prototype Aided Cross-Level Methodology Leveraging Coverage-Guided Fuzzing and Co-Simulation","authors":"Sallar Ahmadi-Pour, Mathis Logemann, V. Herdt, Rolf Drechsler","doi":"10.3390/chips2030012","DOIUrl":"https://doi.org/10.3390/chips2030012","url":null,"abstract":"In this paper, we propose a Virtual Prototype (VP) driven verification methodology for Hardware (HW) peripherals. In particular, we combine two approaches that complement each other and use the VP as a readily available reference model: We use (A) Coverage-Guided Fuzzing (CGF) which enables comprehensive verification at the unit-level of the Register-Transfer Level (RTL) HW peripheral with a Transaction Level Modeling (TLM) reference, and (B) an application-driven co-simulation-based approach that enables verification of the HW peripheral at the system-level. As a case-study, we utilize a RISC-V Platform Level Interrupt Controller (PLIC) as HW peripheral and use an abstract TLM PLIC implementation from the open source RISC-V VP as the reference model. In our experiments we find three behavioral mismatches and discuss the observation of these, as well as non-functional timing behavior mismatches, that were found through the proposed synergistic approach. Furthermore, we provide a discussion and considerations on the RTL/TLM Transactors, as they embody one keystone in cross-level methods. As the different approaches uncover different mismatches in our case-study (e.g., behavioral mismatches and timing mismatches), we conclude a synergy between the methods to aid in verification efforts.","PeriodicalId":6666,"journal":{"name":"2015 IEEE Hot Chips 27 Symposium (HCS)","volume":"54 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85715060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Standard-Cell-Based Comparators for Ultra-Low Voltage Applications: Analysis and Comparisons 超低电压应用的标准电池比较器:分析和比较
Pub Date : 2023-08-18 DOI: 10.3390/chips2030011
Riccardo Della Sala, F. Centurelli, G. Scotti, G. Palumbo
This work is focused on the performance of three different standard-cell-based comparator topologies, considering ultra-low-voltage (ULV) operation. The main application scenarios in which standard-cell-based comparators can be exploited are considered, and a set of figures of merit (FoM) to allow an in-depth comparison among the different topologies is introduced. Then, a set of simulation testbenches are defined in order to simulate and compare the considered topologies implemented in both a 130 nm technology and a 28 nm FDSOI CMOS process. Propagation delay, power consumption and power–delay product are evaluated for different values of the input common mode voltage, as a function of input differential amplitude, and in different supply voltage and temperature conditions. Monte Carlo simulations to evaluate the input offset voltage under mismatch variations are also provided. Simulation results show that the performances of the different comparator topologies are strongly dependent on the input common mode voltage, and that the best values for all the performance figures of merit are achieved by the comparator based on three-input NAND gates, with the only limitation being its non-rail-to-rail input common mode range (ICMR). The performances of the considered comparator topologies have also been simulated for different values of the supply voltage, ranging from 0.3 V to 1.2 V, showing that, even if standard-cell-based comparators can be operated at higher supply voltages by scaling their performances accordingly, the best values of the FoMs are achieved for VDD = 0.3 V.
这项工作的重点是考虑到超低电压(ULV)操作,三种不同的基于标准电池的比较器拓扑的性能。考虑了可以利用基于标准单元的比较器的主要应用场景,并引入了一组优点图(FoM),以便在不同拓扑之间进行深入比较。然后,定义了一组仿真试验台,以模拟和比较在130纳米技术和28纳米FDSOI CMOS工艺中实现的拓扑结构。在不同的电源电压和温度条件下,计算了不同的输入共模电压值、作为输入差幅的函数的传输延迟、功耗和功率延迟积。还提供了蒙特卡罗模拟来评估失配变化下的输入偏置电压。仿真结果表明,不同比较器拓扑结构的性能对输入共模电压有很强的依赖性,基于三输入NAND门的比较器实现了所有性能指标的最佳值,唯一的限制是其非轨对轨输入共模范围(ICMR)。考虑的比较器拓扑的性能也在0.3 V至1.2 V的不同电源电压值下进行了模拟,结果表明,即使基于标准电池的比较器可以通过相应地缩放其性能在更高的电源电压下工作,但在VDD = 0.3 V时,fom的最佳值达到了。
{"title":"Standard-Cell-Based Comparators for Ultra-Low Voltage Applications: Analysis and Comparisons","authors":"Riccardo Della Sala, F. Centurelli, G. Scotti, G. Palumbo","doi":"10.3390/chips2030011","DOIUrl":"https://doi.org/10.3390/chips2030011","url":null,"abstract":"This work is focused on the performance of three different standard-cell-based comparator topologies, considering ultra-low-voltage (ULV) operation. The main application scenarios in which standard-cell-based comparators can be exploited are considered, and a set of figures of merit (FoM) to allow an in-depth comparison among the different topologies is introduced. Then, a set of simulation testbenches are defined in order to simulate and compare the considered topologies implemented in both a 130 nm technology and a 28 nm FDSOI CMOS process. Propagation delay, power consumption and power–delay product are evaluated for different values of the input common mode voltage, as a function of input differential amplitude, and in different supply voltage and temperature conditions. Monte Carlo simulations to evaluate the input offset voltage under mismatch variations are also provided. Simulation results show that the performances of the different comparator topologies are strongly dependent on the input common mode voltage, and that the best values for all the performance figures of merit are achieved by the comparator based on three-input NAND gates, with the only limitation being its non-rail-to-rail input common mode range (ICMR). The performances of the considered comparator topologies have also been simulated for different values of the supply voltage, ranging from 0.3 V to 1.2 V, showing that, even if standard-cell-based comparators can be operated at higher supply voltages by scaling their performances accordingly, the best values of the FoMs are achieved for VDD = 0.3 V.","PeriodicalId":6666,"journal":{"name":"2015 IEEE Hot Chips 27 Symposium (HCS)","volume":"86 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89749116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A-DSCNN: Depthwise Separable Convolutional Neural Network Inference Chip Design Using an Approximate Multiplier 基于近似乘法器的深度可分离卷积神经网络推理芯片设计
Pub Date : 2023-07-19 DOI: 10.3390/chips2030010
Jin-Jia Shang, Nicholas Phipps, I-Chyn Wey, T. Teo
For Convolutional Neural Networks (CNNs), Depthwise Separable CNN (DSCNN) is the preferred architecture for Application Specific Integrated Circuit (ASIC) implementation on edge devices. It benefits from a multi-mode approximate multiplier proposed in this work. The proposed approximate multiplier uses two 4-bit multiplication operations to implement a 12-bit multiplication operation by reusing the same multiplier array. With this approximate multiplier, sequential multiplication operations are pipelined in a modified DSCNN to fully utilize the Processing Element (PE) array in the convolutional layer. Two versions of Approximate-DSCNN (A-DSCNN) accelerators were implemented on TSMC 40 nm CMOS process with a supply voltage of 0.9 V. At a clock frequency of 200 MHz, the designs achieve 4.78 GOPs/mW and 4.89 GOP/mW power efficiency while occupying 1.16 mm2 and 0.398 mm2 area, respectively.
对于卷积神经网络(CNN),深度可分离CNN (DSCNN)是在边缘设备上实现专用集成电路(ASIC)的首选架构。它得益于本工作中提出的多模近似乘法器。所建议的近似乘法器通过重用相同的乘法器数组,使用两个4位乘法操作来实现一个12位乘法操作。利用这个近似乘法器,顺序乘法运算在一个改进的DSCNN中被流水线化,以充分利用卷积层中的处理元素(PE)数组。两个版本的Approximate-DSCNN (a - dscnn)加速器在TSMC 40 nm CMOS工艺上实现,电源电压为0.9 V。在时钟频率为200mhz时,功耗效率分别为4.78 GOPs/mW和4.89 GOP/mW,功耗面积分别为1.16 mm2和0.398 mm2。
{"title":"A-DSCNN: Depthwise Separable Convolutional Neural Network Inference Chip Design Using an Approximate Multiplier","authors":"Jin-Jia Shang, Nicholas Phipps, I-Chyn Wey, T. Teo","doi":"10.3390/chips2030010","DOIUrl":"https://doi.org/10.3390/chips2030010","url":null,"abstract":"For Convolutional Neural Networks (CNNs), Depthwise Separable CNN (DSCNN) is the preferred architecture for Application Specific Integrated Circuit (ASIC) implementation on edge devices. It benefits from a multi-mode approximate multiplier proposed in this work. The proposed approximate multiplier uses two 4-bit multiplication operations to implement a 12-bit multiplication operation by reusing the same multiplier array. With this approximate multiplier, sequential multiplication operations are pipelined in a modified DSCNN to fully utilize the Processing Element (PE) array in the convolutional layer. Two versions of Approximate-DSCNN (A-DSCNN) accelerators were implemented on TSMC 40 nm CMOS process with a supply voltage of 0.9 V. At a clock frequency of 200 MHz, the designs achieve 4.78 GOPs/mW and 4.89 GOP/mW power efficiency while occupying 1.16 mm2 and 0.398 mm2 area, respectively.","PeriodicalId":6666,"journal":{"name":"2015 IEEE Hot Chips 27 Symposium (HCS)","volume":"16 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88626887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On-Chip Adaptive Implementation of Neuromorphic Spiking Sensory Systems with Self-X Capabilities 具有自x能力的神经形态脉冲传感系统的片上自适应实现
Pub Date : 2023-06-06 DOI: 10.3390/chips2020009
H. Abd, A. König
In contemporary devices, the number and diversity of sensors is increasing, thus, requiring both efficient and robust interfacing to the sensors. Implementing the interfacing systems in advanced integration technologies faces numerous issues due to manufacturing deviations, signal swings, noise, etc. The interface sensor designers escape to the time domain and digital design techniques to handle these challenges. Biology gives examples of efficient machines that have vastly outperformed conventional technology. This work pursues a neuromorphic spiking sensory system design with the same efficient style as biology. Our chip, that comprises the essential elements of the adaptive neuromorphic spiking sensory system, such as the neuron, synapse, adaptive coincidence detection (ACD), and self-adaptive spike-to-rank coding (SA-SRC), was manufactured in XFAB CMOS 0.35 μm technology via EUROPRACTICE. The main emphasis of this paper is to present the measurement outcomes of the SA-SRC on-chip, evaluating the efficacy of its adaptation scheme, and assessing its capability to produce spike orders that correspond to the temporal difference between the two spikes received at its inputs. The SA-SRC plays a crucial role in performing the primary function of the adaptive neuromorphic spiking sensory system. The measurement results of the chip confirm the simulation results of our previous work.
在现代设备中,传感器的数量和多样性正在增加,因此,需要有效和稳健的传感器接口。在先进集成技术中实现接口系统面临着制造偏差、信号波动、噪声等诸多问题。接口传感器设计人员逃避到时域和数字设计技术来处理这些挑战。生物学给出了一些高效机器的例子,这些机器的性能大大超过了传统技术。这项工作追求与生物学相同的高效风格的神经形态刺突感觉系统设计。我们的芯片包含自适应神经形态尖峰感觉系统的基本元素,如神经元、突触、自适应重合检测(ACD)和自适应尖峰到秩编码(SA-SRC),采用XFAB CMOS 0.35 μm技术通过EUROPRACTICE制造。本文的主要重点是介绍片上SA-SRC的测量结果,评估其适应方案的有效性,并评估其产生与输入处接收到的两个尖峰之间的时间差异相对应的尖峰顺序的能力。SA-SRC在自适应神经形态刺突感觉系统的主要功能中起着至关重要的作用。该芯片的测量结果证实了我们前期工作的仿真结果。
{"title":"On-Chip Adaptive Implementation of Neuromorphic Spiking Sensory Systems with Self-X Capabilities","authors":"H. Abd, A. König","doi":"10.3390/chips2020009","DOIUrl":"https://doi.org/10.3390/chips2020009","url":null,"abstract":"In contemporary devices, the number and diversity of sensors is increasing, thus, requiring both efficient and robust interfacing to the sensors. Implementing the interfacing systems in advanced integration technologies faces numerous issues due to manufacturing deviations, signal swings, noise, etc. The interface sensor designers escape to the time domain and digital design techniques to handle these challenges. Biology gives examples of efficient machines that have vastly outperformed conventional technology. This work pursues a neuromorphic spiking sensory system design with the same efficient style as biology. Our chip, that comprises the essential elements of the adaptive neuromorphic spiking sensory system, such as the neuron, synapse, adaptive coincidence detection (ACD), and self-adaptive spike-to-rank coding (SA-SRC), was manufactured in XFAB CMOS 0.35 μm technology via EUROPRACTICE. The main emphasis of this paper is to present the measurement outcomes of the SA-SRC on-chip, evaluating the efficacy of its adaptation scheme, and assessing its capability to produce spike orders that correspond to the temporal difference between the two spikes received at its inputs. The SA-SRC plays a crucial role in performing the primary function of the adaptive neuromorphic spiking sensory system. The measurement results of the chip confirm the simulation results of our previous work.","PeriodicalId":6666,"journal":{"name":"2015 IEEE Hot Chips 27 Symposium (HCS)","volume":"48 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84081478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Quantitative Review of Automated Neural Search and On-Device Learning for Tiny Devices 微型设备的自动神经搜索和设备上学习的定量综述
Pub Date : 2023-05-09 DOI: 10.3390/chips2020008
Danilo Pietro Pau, Prem Kumar Ambrose, Fabrizio Maria Aymone
This paper presents a state-of-the-art review of different approaches for Neural Architecture Search targeting resource-constrained devices such as microcontrollers, as well as the implementations of on-device learning techniques for them. Approaches such as MCUNet have been able to drive the design of tiny neural architectures with low memory and computational requirements which can be deployed effectively on microcontrollers. Regarding on-device learning, there are various solutions that have addressed concept drift and have coped with the accuracy drop in real-time data depending on the task targeted, and these rely on a variety of learning methods. For computer vision, MCUNetV3 uses backpropagation and represents a state-of-the-art solution. The Restricted Coulomb Energy Neural Network is a promising method for learning with an extremely low memory footprint and computational complexity, which should be considered for future investigations.
本文介绍了针对资源受限设备(如微控制器)的神经架构搜索的不同方法的最新综述,以及设备上学习技术的实现。像MCUNet这样的方法已经能够驱动具有低内存和计算需求的微型神经架构的设计,这些结构可以有效地部署在微控制器上。关于设备上学习,有各种解决方案可以解决概念漂移问题,并根据目标任务应对实时数据的准确性下降,这些解决方案依赖于各种学习方法。对于计算机视觉,MCUNetV3使用反向传播,代表了最先进的解决方案。限制库仑能量神经网络是一种很有前途的学习方法,具有极低的内存占用和计算复杂度,应该在未来的研究中加以考虑。
{"title":"A Quantitative Review of Automated Neural Search and On-Device Learning for Tiny Devices","authors":"Danilo Pietro Pau, Prem Kumar Ambrose, Fabrizio Maria Aymone","doi":"10.3390/chips2020008","DOIUrl":"https://doi.org/10.3390/chips2020008","url":null,"abstract":"This paper presents a state-of-the-art review of different approaches for Neural Architecture Search targeting resource-constrained devices such as microcontrollers, as well as the implementations of on-device learning techniques for them. Approaches such as MCUNet have been able to drive the design of tiny neural architectures with low memory and computational requirements which can be deployed effectively on microcontrollers. Regarding on-device learning, there are various solutions that have addressed concept drift and have coped with the accuracy drop in real-time data depending on the task targeted, and these rely on a variety of learning methods. For computer vision, MCUNetV3 uses backpropagation and represents a state-of-the-art solution. The Restricted Coulomb Energy Neural Network is a promising method for learning with an extremely low memory footprint and computational complexity, which should be considered for future investigations.","PeriodicalId":6666,"journal":{"name":"2015 IEEE Hot Chips 27 Symposium (HCS)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135806779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-Cost Indirect Measurements for Power-Efficient In-Field Optimization of Configurable Analog Front-Ends with Self-X Properties: A Hardware Implementation 具有自x属性的可配置模拟前端的低成本间接测量节能现场优化:硬件实现
Pub Date : 2023-05-01 DOI: 10.3390/chips2020007
Q. Zaman, S. Alraho, A. König
This paper presents a practical implementation and measurement results of power-efficient chip performance optimization, utilizing low-cost indirect measurement methods to support self-X properties (self-calibration, self-healing, self-optimization, etc.) for in-field optimization of analog front-end sensory electronics with XFAB 0.35 µm complementary metal oxide semiconductor (CMOS) technology. The reconfigurable, fully differential indirect current-feedback instrumentation amplifier (CFIA) performance is intrinsically optimized by employing a single test sinusoidal signal stimulus and measuring the total harmonic distortion (THD) at the output. To enhance the optimization process, the experience replay particle swarm optimization (ERPSO) algorithm is utilized as an artificial intelligence (AI) agent, implemented at the hardware level, to optimize the performance characteristics of the CFIA. The ERPSO algorithm extends the selection producer capabilities of the classical PSO methodology by incorporating an experience replay buffer to mitigate the likelihood of being trapped in local optima. Furthermore, the CFIA circuit has been integrated with a simple power-monitoring module to assess the power consumption of the optimization solution, to achieve a power-efficient and reliable configuration. The optimized chip performance showed an approximate 34% increase in power efficiency while achieving a targeted THD value of −72 dB, utilizing a 1 Vp-p differential input signal with a frequency of 1 MHz, and consuming approximately 53 mW of power. Preliminary tests conducted on the fabricated chip, using the default configuration pattern extrapolated from post-layout simulations, revealed an unacceptable performance behavior of the CFIA. Nevertheless, the proposed in-field optimization successfully restored the circuit’s performance, resulting in a robust design that meets the performance achieved in the design phase.
本文介绍了一种节能芯片性能优化的实际实现和测量结果,利用低成本的间接测量方法支持自x特性(自校准、自修复、自优化等),以XFAB 0.35µm互补金属氧化物半导体(CMOS)技术对模拟前端传感电子器件进行现场优化。可重构、全差分间接电流反馈仪表放大器(CFIA)的性能通过采用单一测试正弦信号刺激和测量输出端的总谐波失真(THD)而得到内在优化。为了增强优化过程,将经验回放粒子群优化(ERPSO)算法作为人工智能(AI)代理,在硬件层面实现,以优化CFIA的性能特征。ERPSO算法扩展了经典PSO方法的选择生产者能力,通过结合经验回放缓冲区来减少陷入局部最优的可能性。此外,CFIA电路集成了一个简单的功率监测模块,以评估优化解决方案的功耗,以实现节能和可靠的配置。优化后的芯片性能显示,在实现- 72 dB的目标THD值的同时,功率效率提高了约34%,使用频率为1 MHz的1 Vp-p差分输入信号,功耗约为53 mW。在预制芯片上进行的初步测试,使用从布局后模拟推断的默认配置模式,揭示了CFIA的不可接受的性能行为。然而,提出的现场优化成功地恢复了电路的性能,从而实现了满足设计阶段性能的稳健设计。
{"title":"Low-Cost Indirect Measurements for Power-Efficient In-Field Optimization of Configurable Analog Front-Ends with Self-X Properties: A Hardware Implementation","authors":"Q. Zaman, S. Alraho, A. König","doi":"10.3390/chips2020007","DOIUrl":"https://doi.org/10.3390/chips2020007","url":null,"abstract":"This paper presents a practical implementation and measurement results of power-efficient chip performance optimization, utilizing low-cost indirect measurement methods to support self-X properties (self-calibration, self-healing, self-optimization, etc.) for in-field optimization of analog front-end sensory electronics with XFAB 0.35 µm complementary metal oxide semiconductor (CMOS) technology. The reconfigurable, fully differential indirect current-feedback instrumentation amplifier (CFIA) performance is intrinsically optimized by employing a single test sinusoidal signal stimulus and measuring the total harmonic distortion (THD) at the output. To enhance the optimization process, the experience replay particle swarm optimization (ERPSO) algorithm is utilized as an artificial intelligence (AI) agent, implemented at the hardware level, to optimize the performance characteristics of the CFIA. The ERPSO algorithm extends the selection producer capabilities of the classical PSO methodology by incorporating an experience replay buffer to mitigate the likelihood of being trapped in local optima. Furthermore, the CFIA circuit has been integrated with a simple power-monitoring module to assess the power consumption of the optimization solution, to achieve a power-efficient and reliable configuration. The optimized chip performance showed an approximate 34% increase in power efficiency while achieving a targeted THD value of −72 dB, utilizing a 1 Vp-p differential input signal with a frequency of 1 MHz, and consuming approximately 53 mW of power. Preliminary tests conducted on the fabricated chip, using the default configuration pattern extrapolated from post-layout simulations, revealed an unacceptable performance behavior of the CFIA. Nevertheless, the proposed in-field optimization successfully restored the circuit’s performance, resulting in a robust design that meets the performance achieved in the design phase.","PeriodicalId":6666,"journal":{"name":"2015 IEEE Hot Chips 27 Symposium (HCS)","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81592534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2015 IEEE Hot Chips 27 Symposium (HCS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1