首页 > 最新文献

2023 24th International Symposium on Quality Electronic Design (ISQED)最新文献

英文 中文
A Deep Learning Approach for Ventricular Arrhythmias Classification using Microcontroller 基于单片机的室性心律失常分类的深度学习方法
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129313
Ya-sine Agrignan, Shangli Zhou, Jun Bai, Sahidul Islam, S. Nabavi, Mimi Xie, Caiwen Ding
Intra-Cardiac Electrogram (IEGM) is widely used to identify life-threatening ventricular arrhythmias in medical devices to prevent sudden cardiac death, e.g., Implantable Cardioverter Defibrillator (ICD). In this paper, we present and explore the development of a machine learning approach for the detection of life-threatening Heart Arrhythmias through IEGM Data from an ICD Device. This work is facilitated by the design and analysis of 2 Convolutional Neural Network (CNN), 1D and 2D CNNs, that perform inference on a Low Power STM Nucleo-32 MCU. Multiple microcontroller software platforms are utilized to construct and deploy the trained models onto the MCU platform for inference measurements. The experimental analysis consists of minimizing Average Inference time and onboard Memory Occupation while maximizing the accuracy of the models. We profile the memory occupation and inference time for different CNN kernels. We develop a 1D CNN structure with a 26.20 ms Average Inference out of 10 measurements taken by the MCU platform. Model Weights in Flash Memory Occupied 5.99 KiB and Model Activations in SRAM (Static Random Access Memory) measure 5.00 KiB. The 1D CNN achieves a Fβ score of 97.8. The 2D CNN Model achieves 11.00 ms of inference, 3.05 KiB of Flash, and 8.09 KiB of SRAM. The 2D CNN achieves a Fβ score of 95.15. Our code is publicly available at https://github.com/Zhoushanglin100/TinyML-HuskyCSDeepical.
在植入式心律转复除颤器(ICD)等医疗器械中,心内电图(IEGM)被广泛用于识别危及生命的室性心律失常,以防止心源性猝死。在本文中,我们提出并探索了一种机器学习方法的发展,该方法通过来自ICD设备的IEGM数据来检测危及生命的心律失常。这项工作是通过设计和分析2卷积神经网络(CNN), 1D和2D CNN,在低功耗STM Nucleo-32 MCU上进行推理而促进的。利用多个微控制器软件平台构建训练好的模型并将其部署到MCU平台上进行推理测量。实验分析包括最小化平均推理时间和板载内存占用,同时最大限度地提高模型的准确性。我们分析了不同CNN核的内存占用和推理时间。我们开发了一个一维CNN结构,在MCU平台进行的10次测量中,平均推理时间为26.20 ms。Flash中占用的模型权重为5.99 KiB, SRAM(静态随机存取存储器)中的模型激活量为5.00 KiB。1D CNN的Fβ得分为97.8。2D CNN模型的推理时间为11.00 ms, Flash占用3.05 KiB, SRAM占用8.09 KiB。2D CNN的Fβ得分为95.15。我们的代码可以在https://github.com/Zhoushanglin100/TinyML-HuskyCSDeepical上公开获得。
{"title":"A Deep Learning Approach for Ventricular Arrhythmias Classification using Microcontroller","authors":"Ya-sine Agrignan, Shangli Zhou, Jun Bai, Sahidul Islam, S. Nabavi, Mimi Xie, Caiwen Ding","doi":"10.1109/ISQED57927.2023.10129313","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129313","url":null,"abstract":"Intra-Cardiac Electrogram (IEGM) is widely used to identify life-threatening ventricular arrhythmias in medical devices to prevent sudden cardiac death, e.g., Implantable Cardioverter Defibrillator (ICD). In this paper, we present and explore the development of a machine learning approach for the detection of life-threatening Heart Arrhythmias through IEGM Data from an ICD Device. This work is facilitated by the design and analysis of 2 Convolutional Neural Network (CNN), 1D and 2D CNNs, that perform inference on a Low Power STM Nucleo-32 MCU. Multiple microcontroller software platforms are utilized to construct and deploy the trained models onto the MCU platform for inference measurements. The experimental analysis consists of minimizing Average Inference time and onboard Memory Occupation while maximizing the accuracy of the models. We profile the memory occupation and inference time for different CNN kernels. We develop a 1D CNN structure with a 26.20 ms Average Inference out of 10 measurements taken by the MCU platform. Model Weights in Flash Memory Occupied 5.99 KiB and Model Activations in SRAM (Static Random Access Memory) measure 5.00 KiB. The 1D CNN achieves a Fβ score of 97.8. The 2D CNN Model achieves 11.00 ms of inference, 3.05 KiB of Flash, and 8.09 KiB of SRAM. The 2D CNN achieves a Fβ score of 95.15. Our code is publicly available at https://github.com/Zhoushanglin100/TinyML-HuskyCSDeepical.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122877883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reproducing Fear Conditioning of Rats with Unmanned Ground Vehicles and Neuromorphic Systems 无人驾驶地面车辆和神经形态系统对大鼠恐惧条件反射的再现
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129317
Noah Zins, Hongyu An
Deep learning accomplishes remarkable success through training with massively labeled datasets. However, the high demands on the datasets impede the feasibility of deep learning in edge computing scenarios and suffer the data scarcity issue. Rather than relying on labeled data, animals learn by interacting with their surroundings and memorizing the relationship between concurrent events. This learning paradigm is referred to as associative memory. The successful implementation of associative memory potentially achieves self-learning schemes analogous to animals to resolve the challenges of deep learning. The state-of-the-art implementations of associative memory are limited to small-scale and offline paradigms. Thus, in this work, we implement associative memory learning with an Unmanned Ground Vehicle (UGV) and neuromorphic chips (Intel Loihi) for an online learning scenario. Our system reproduces the classic associative memory in rats. In specific, our system successfully reproduces the fear conditioning with no pretraining procedure and labeled datasets. In our experiments, the UGV serves as a substitute for the rats. Our UGV autonomously memorizes the cause-and-effect of the light stimulus and vibration stimulus, then exhibits a movement response. During associative memory learning, the synaptic weights are updated by Hebbian learning. The Intel Loihi chip is integrated with our online learning system for processing visual signals. Its average power usages for computing logic and memory are 30 mW and 29 mW, respectively.
深度学习通过大规模标记数据集的训练取得了显著的成功。然而,对数据集的高要求阻碍了深度学习在边缘计算场景下的可行性,并且存在数据稀缺性问题。动物不是依靠标记数据,而是通过与周围环境的互动和记忆同时发生的事件之间的关系来学习。这种学习模式被称为联想记忆。联想记忆的成功实现可能实现类似于动物的自我学习方案,以解决深度学习的挑战。联想记忆的最新实现仅限于小规模和离线范例。因此,在这项工作中,我们使用无人地面车辆(UGV)和神经形态芯片(英特尔Loihi)实现了联想记忆学习,用于在线学习场景。我们的系统在老鼠身上复制了经典的联想记忆。具体来说,我们的系统在没有预训练程序和标记数据集的情况下成功地再现了恐惧条件反射。在我们的实验中,UGV作为大鼠的替代品。我们的UGV自主记忆光刺激和振动刺激的因果关系,然后表现出运动响应。在联想记忆学习过程中,突触权值通过Hebbian学习进行更新。英特尔Loihi芯片与我们的在线学习系统集成,用于处理视觉信号。其计算逻辑和内存的平均功耗分别为30兆瓦和29兆瓦。
{"title":"Reproducing Fear Conditioning of Rats with Unmanned Ground Vehicles and Neuromorphic Systems","authors":"Noah Zins, Hongyu An","doi":"10.1109/ISQED57927.2023.10129317","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129317","url":null,"abstract":"Deep learning accomplishes remarkable success through training with massively labeled datasets. However, the high demands on the datasets impede the feasibility of deep learning in edge computing scenarios and suffer the data scarcity issue. Rather than relying on labeled data, animals learn by interacting with their surroundings and memorizing the relationship between concurrent events. This learning paradigm is referred to as associative memory. The successful implementation of associative memory potentially achieves self-learning schemes analogous to animals to resolve the challenges of deep learning. The state-of-the-art implementations of associative memory are limited to small-scale and offline paradigms. Thus, in this work, we implement associative memory learning with an Unmanned Ground Vehicle (UGV) and neuromorphic chips (Intel Loihi) for an online learning scenario. Our system reproduces the classic associative memory in rats. In specific, our system successfully reproduces the fear conditioning with no pretraining procedure and labeled datasets. In our experiments, the UGV serves as a substitute for the rats. Our UGV autonomously memorizes the cause-and-effect of the light stimulus and vibration stimulus, then exhibits a movement response. During associative memory learning, the synaptic weights are updated by Hebbian learning. The Intel Loihi chip is integrated with our online learning system for processing visual signals. Its average power usages for computing logic and memory are 30 mW and 29 mW, respectively.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134619060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance Analysis of Cylindrical Through Silicon Via with Interfacial Crack 含界面裂纹的圆柱形硅通孔性能分析
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129400
V. Kumari, Maya Chandrakar, M. Majumder
A continuous scaling down of technology drives the microelectronics industry towards the nanoscale regime, wherein various fabrication-related defects such as electromigration induced open/short faults, interfacial cracks, and thermal stress-induced leakage problems primarily dominate the overall performance of a through silicon via (TSV). Interfacial cracking plays a pivotal role in the long-term service reliability of the chip among them. On account of these facts, this paper provides equivalent RLGC fault modeling and performance analysis of thermo-mechanical delamination in TSVs known as interfacial cracks. Considering the MOS effect, an analytical expression is derived using defective parameters to analyze the feasibility and reliability of the defected TSVs at different crack widths and heights. Using a driver-via-load (DVL) setup, performance in terms of power dissipation, power delay product (PDP), and dynamic crosstalk delay are analyzed using a CMOS driver. Encouragingly, considering interfacial cracked TSV, power and crosstalk delay are improved by 74.4% and 65.5%, respectively, at a minimum crack length approaching the defect-free condition.
技术的不断缩小推动着微电子工业向纳米级方向发展,其中各种与制造相关的缺陷,如电迁移引起的开/短故障、界面裂缝和热应力引起的泄漏问题,主要主导着硅通孔(TSV)的整体性能。其中,界面开裂对芯片的长期使用可靠性起着举足轻重的作用。考虑到这些事实,本文提供了等效RLGC断层建模和tsv中被称为界面裂缝的热-机械分层的性能分析。考虑MOS效应,推导了缺陷参数的解析表达式,分析了缺陷tsv在不同裂纹宽度和高度下的可行性和可靠性。采用驱动器-过负载(DVL)设置,分析了CMOS驱动器在功耗、功率延迟积(PDP)和动态串扰延迟方面的性能。令人鼓舞的是,考虑界面裂纹的TSV,在最小裂纹长度接近无缺陷的情况下,功率和串扰延迟分别提高了74.4%和65.5%。
{"title":"Performance Analysis of Cylindrical Through Silicon Via with Interfacial Crack","authors":"V. Kumari, Maya Chandrakar, M. Majumder","doi":"10.1109/ISQED57927.2023.10129400","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129400","url":null,"abstract":"A continuous scaling down of technology drives the microelectronics industry towards the nanoscale regime, wherein various fabrication-related defects such as electromigration induced open/short faults, interfacial cracks, and thermal stress-induced leakage problems primarily dominate the overall performance of a through silicon via (TSV). Interfacial cracking plays a pivotal role in the long-term service reliability of the chip among them. On account of these facts, this paper provides equivalent RLGC fault modeling and performance analysis of thermo-mechanical delamination in TSVs known as interfacial cracks. Considering the MOS effect, an analytical expression is derived using defective parameters to analyze the feasibility and reliability of the defected TSVs at different crack widths and heights. Using a driver-via-load (DVL) setup, performance in terms of power dissipation, power delay product (PDP), and dynamic crosstalk delay are analyzed using a CMOS driver. Encouragingly, considering interfacial cracked TSV, power and crosstalk delay are improved by 74.4% and 65.5%, respectively, at a minimum crack length approaching the defect-free condition.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134077151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Novel Stochastic LSTM Model Inspired by Quantum Machine Learning 基于量子机器学习的随机LSTM模型
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129344
Joseph Lindsay, Ramtin Zand
Works in quantum machine learning (QML) over the past few years indicate that QML algorithms can function just as well as their classical counterparts, and even outperform them in some cases. Among the corpus of recent work, many current QML models take advantage of variational quantum algorithm (VQA) circuits, given that their scale is typically small enough to be compatible with NISQ devices and the method of automatic differentiation for optimizing circuit parameters is familiar to machine learning (ML). While the results bare interesting promise for an era when quantum machines are more readily accessible, if one can achieve similar results through non-quantum methods then there may be a more near-term advantage available to practitioners. To this end, the nature of this work is to investigate the utilization of stochastic methods inspired by a variational quantum version of the long short-term memory (LSTM) model in attempt to approach the reported successes in performance and rapid convergence. By analyzing the performance of classical, stochastic, and quantum methods, this work aims to elucidate if it is possible to achieve some of QML’s major reported benefits on classical machines by incorporating aspects of its stochasticity.
过去几年在量子机器学习(QML)领域的研究表明,QML算法可以像经典算法一样发挥作用,在某些情况下甚至比经典算法表现得更好。在最近的研究中,许多当前的QML模型都利用了变分量子算法(VQA)电路,因为它们的规模通常足够小,可以与NISQ设备兼容,而且用于优化电路参数的自动微分方法与机器学习(ML)很熟悉。虽然这一结果为量子机器更容易使用的时代带来了有趣的前景,但如果人们可以通过非量子方法获得类似的结果,那么从业者可能会获得更短期的优势。为此,这项工作的本质是研究受长短期记忆(LSTM)模型的变分量子版本启发的随机方法的利用,试图接近报道的性能和快速收敛方面的成功。通过分析经典、随机和量子方法的性能,本工作旨在阐明是否有可能通过结合其随机性方面在经典机器上实现QML的一些主要优点。
{"title":"A Novel Stochastic LSTM Model Inspired by Quantum Machine Learning","authors":"Joseph Lindsay, Ramtin Zand","doi":"10.1109/ISQED57927.2023.10129344","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129344","url":null,"abstract":"Works in quantum machine learning (QML) over the past few years indicate that QML algorithms can function just as well as their classical counterparts, and even outperform them in some cases. Among the corpus of recent work, many current QML models take advantage of variational quantum algorithm (VQA) circuits, given that their scale is typically small enough to be compatible with NISQ devices and the method of automatic differentiation for optimizing circuit parameters is familiar to machine learning (ML). While the results bare interesting promise for an era when quantum machines are more readily accessible, if one can achieve similar results through non-quantum methods then there may be a more near-term advantage available to practitioners. To this end, the nature of this work is to investigate the utilization of stochastic methods inspired by a variational quantum version of the long short-term memory (LSTM) model in attempt to approach the reported successes in performance and rapid convergence. By analyzing the performance of classical, stochastic, and quantum methods, this work aims to elucidate if it is possible to achieve some of QML’s major reported benefits on classical machines by incorporating aspects of its stochasticity.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"207 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114216269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Spiking Domain Feature Extraction with Temporal Dynamic Learning 基于时间动态学习的脉冲域特征提取
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129326
Honghao Zheng, Y. Yi
Spiking neural network (SNN) has attracted more and more research attention due to its event-based property. SNNs are more power efficient with such property than a conventional artificial neural network. For transferring the information to spikes, SNNs need an encoding process. With the temporal encoding schemes, SNN can extract the temporal patterns from the original information. A more advanced encoding scheme is a multiplexing temporal encoding which combines several encoding schemes with different timescales to have a larger information density and dynamic range. After that, the spike timing dependence plasticity (STDP) learning algorithm is utilized for training the SNN since the SNN can not be trained with regular training algorithms like backpropagation. In this work, a spiking domain feature extraction neural network with temporal multiplexing encoding is designed on EAGLE and fabricated on the PCB board. The testbench’s power consumption is 400mW. From the test result, a conclusion can be drawn that the network on PCB can transfer the input information to multiplexing temporal encoded spikes and then utilize the spikes to adjust the synaptic weight voltage.
脉冲神经网络(SNN)由于其基于事件的特性而受到越来越多的研究关注。snn具有这种特性,比传统的人工神经网络更节能。为了将信息传输到峰值,snn需要一个编码过程。SNN采用时间编码方案,从原始信息中提取时间模式。一种更高级的编码方案是多路时间编码,它结合了几种不同时间尺度的编码方案,具有更大的信息密度和动态范围。之后,由于SNN无法通过反向传播等常规训练算法进行训练,因此采用尖峰时序依赖可塑性(STDP)学习算法对SNN进行训练。本文在EAGLE上设计了一种具有时序复用编码的尖峰域特征提取神经网络,并在PCB板上制作。试验台的功耗为400mW。从测试结果可以得出结论,PCB上的网络可以将输入信息传输到多路时间编码尖峰,然后利用尖峰来调节突触权重电压。
{"title":"Spiking Domain Feature Extraction with Temporal Dynamic Learning","authors":"Honghao Zheng, Y. Yi","doi":"10.1109/ISQED57927.2023.10129326","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129326","url":null,"abstract":"Spiking neural network (SNN) has attracted more and more research attention due to its event-based property. SNNs are more power efficient with such property than a conventional artificial neural network. For transferring the information to spikes, SNNs need an encoding process. With the temporal encoding schemes, SNN can extract the temporal patterns from the original information. A more advanced encoding scheme is a multiplexing temporal encoding which combines several encoding schemes with different timescales to have a larger information density and dynamic range. After that, the spike timing dependence plasticity (STDP) learning algorithm is utilized for training the SNN since the SNN can not be trained with regular training algorithms like backpropagation. In this work, a spiking domain feature extraction neural network with temporal multiplexing encoding is designed on EAGLE and fabricated on the PCB board. The testbench’s power consumption is 400mW. From the test result, a conclusion can be drawn that the network on PCB can transfer the input information to multiplexing temporal encoded spikes and then utilize the spikes to adjust the synaptic weight voltage.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"364 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124924571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TOTAL: Topology Optimization of Operational Amplifier via Reinforcement Learning 基于强化学习的运算放大器拓扑优化
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129336
Zihao Chen, Songlei Meng, F. Yang, Li Shang, Xuan Zeng
With ever-increasing design complexity and stringent time-to-market pressure, automated topology synthesis tools for operational amplifiers are required to produce designs meeting different specifications. This paper proposes TOTAL, a reinforcement learning-based topology optimization method for operational amplifiers. We decompose the circuit topology design as a Markov decision process to solve the high dimensionality of the design space, with the three-stage cascode paradigm fixed to avoid meaningless structures. Therefore, starting from a basic behavior-level topology, an agent modifies the circuit step by step. Specifically, this agent mainly adopts a graph neural network to understand each design state, including specifications and the design history, and a convolutional neural network to modify the current topology. Every completed circuit is then simulated and evaluated by a customized reward function to guide the agent in finding qualified circuits, among which only the optimal one ever recorded is mapped to the transistor level for further evaluation. Experimental results show that the trained agent can not only generate high-performance circuits, but also be reusable by transferring to other specifications as a pre-trained model and achieving competitive results.
随着设计复杂性和上市时间压力的不断增加,需要用于运算放大器的自动化拓扑合成工具来生产满足不同规格的设计。本文提出了一种基于强化学习的运算放大器拓扑优化方法TOTAL。我们将电路拓扑设计分解为一个马尔可夫决策过程,以解决设计空间的高维性,并固定了三级级联代码范式,以避免无意义的结构。因此,从基本的行为级拓扑开始,智能体逐步修改电路。具体来说,该智能体主要采用图神经网络来理解每个设计状态,包括规格和设计历史,并采用卷积神经网络来修改当前拓扑。然后通过定制的奖励函数对每个完成的电路进行模拟和评估,以指导智能体找到合格的电路,其中只有记录的最优电路才会映射到晶体管级别进行进一步评估。实验结果表明,训练后的智能体不仅可以生成高性能的电路,而且可以作为预训练模型转移到其他规范中,并具有可重用性和竞争性。
{"title":"TOTAL: Topology Optimization of Operational Amplifier via Reinforcement Learning","authors":"Zihao Chen, Songlei Meng, F. Yang, Li Shang, Xuan Zeng","doi":"10.1109/ISQED57927.2023.10129336","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129336","url":null,"abstract":"With ever-increasing design complexity and stringent time-to-market pressure, automated topology synthesis tools for operational amplifiers are required to produce designs meeting different specifications. This paper proposes TOTAL, a reinforcement learning-based topology optimization method for operational amplifiers. We decompose the circuit topology design as a Markov decision process to solve the high dimensionality of the design space, with the three-stage cascode paradigm fixed to avoid meaningless structures. Therefore, starting from a basic behavior-level topology, an agent modifies the circuit step by step. Specifically, this agent mainly adopts a graph neural network to understand each design state, including specifications and the design history, and a convolutional neural network to modify the current topology. Every completed circuit is then simulated and evaluated by a customized reward function to guide the agent in finding qualified circuits, among which only the optimal one ever recorded is mapped to the transistor level for further evaluation. Experimental results show that the trained agent can not only generate high-performance circuits, but also be reusable by transferring to other specifications as a pre-trained model and achieving competitive results.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125071972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Area Efficient Superconducting Unary CNN Accelerator 区域高效超导一元CNN加速器
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129299
Patricia Gonzalez-Guerrero, Kylie Huch, N. Patra, Thom Popovici, George Michelogiannakis
In superconducting circuits, information is carried by ps-wide, µV-tall, Single Flux Quanta (SFQ) pulses. These circuits can operate at frequencies of hundreds of GHz with orders of magnitude lower switching energy than complementary-metal-oxide-semiconductors (CMOS). However, under the stringent area constraints of modern superconductor technologies, fully-fledged, CMOS-inspired superconducting architectures cannot be fabricated at large scales. Unary SFQ (U-SFQ) is an alternative computing paradigm that addresses these area constraints. In U-SFQ, information is mapped to a combination of streams of SFQ pulses and in the temporal domain. In this work, we propose a U-SFQ Convolutional Neural Network (CNN) hardware accelerator capable of comparable peak performance with state-of-the-art superconducting binary (B-SFQ) approaches in 32× less area. CNNs can operate with 5 to 8 bits of resolution with no significant degradation in classification accuracy. The proposed CNN accelerator effortlessly supports this variable resolution and, for less than 7 bits, yields 5×-63× better performance than CMOS and 15×-173× better area efficiency than B-SFQ.
在超导电路中,信息由ps宽、µv高、单通量量子(SFQ)脉冲携带。这些电路可以在数百千兆赫的频率下工作,开关能量比互补金属氧化物半导体(CMOS)低几个数量级。然而,在现代超导体技术严格的面积限制下,完全成熟的、cmos启发的超导体系结构无法大规模制造。一元SFQ (U-SFQ)是解决这些区域限制的另一种计算范式。在U-SFQ中,信息被映射到SFQ脉冲流和时域的组合。在这项工作中,我们提出了一种U-SFQ卷积神经网络(CNN)硬件加速器,能够在32x less区域内与最先进的超导二进制(B-SFQ)方法具有相当的峰值性能。cnn可以在5到8位的分辨率下运行,分类精度没有明显下降。提出的CNN加速器毫不费力地支持这种可变分辨率,并且在小于7位的情况下,产生5×-63×比CMOS更好的性能和15×-173×比B-SFQ更好的面积效率。
{"title":"An Area Efficient Superconducting Unary CNN Accelerator","authors":"Patricia Gonzalez-Guerrero, Kylie Huch, N. Patra, Thom Popovici, George Michelogiannakis","doi":"10.1109/ISQED57927.2023.10129299","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129299","url":null,"abstract":"In superconducting circuits, information is carried by ps-wide, µV-tall, Single Flux Quanta (SFQ) pulses. These circuits can operate at frequencies of hundreds of GHz with orders of magnitude lower switching energy than complementary-metal-oxide-semiconductors (CMOS). However, under the stringent area constraints of modern superconductor technologies, fully-fledged, CMOS-inspired superconducting architectures cannot be fabricated at large scales. Unary SFQ (U-SFQ) is an alternative computing paradigm that addresses these area constraints. In U-SFQ, information is mapped to a combination of streams of SFQ pulses and in the temporal domain. In this work, we propose a U-SFQ Convolutional Neural Network (CNN) hardware accelerator capable of comparable peak performance with state-of-the-art superconducting binary (B-SFQ) approaches in 32× less area. CNNs can operate with 5 to 8 bits of resolution with no significant degradation in classification accuracy. The proposed CNN accelerator effortlessly supports this variable resolution and, for less than 7 bits, yields 5×-63× better performance than CMOS and 15×-173× better area efficiency than B-SFQ.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"193 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121727866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Moving Towards Game-Changing Technology: Fabrication and Application of HfO2 RRAM for In-Memory Computing 迈向改变游戏规则的技术:用于内存计算的HfO2 RRAM的制造和应用
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129352
Kangjun Bai, Daniel Titcombe, Jack Lombardi, C. Thiem, N. Cady
In-memory computing is an emerging computing paradigm that sidesteps challenges inherent to deep learning acceleration in conventional systems. Along with the development of neuromorphic architectures, resistive random-access memory (RRAM) has paved the way for in-memory computing by processing mixed-signal operations in a fully parallel fashion. In this work, we designed and implemented working prototypes of in-memory operators using a custom 65nm CMOS/RRAM technology node fabricated on a 300mm wafer. Specifically, arrays of hafnium-oxide RRAM cells were built in a crossbar structure to support high-throughput matrix multiplications at low energy and area consumption. Building upon these efficient RRAM, applications of pixel detection and flow-based Boolean operations are presented. Our introduced approaches alleviate the intermediate data movement and parallelize the computations, thereby yielding orders of magnitude improvement in energy and area efficiency over the equivalent CMOS design.
随着神经形态架构的发展,电阻式随机存取存储器(RRAM)通过以完全并行的方式处理混合信号操作,为内存计算铺平了道路。在这项工作中,我们使用在300mm晶圆上制造的定制65nm CMOS/RRAM技术节点设计并实现了内存操作器的工作原型。具体来说,氧化铪RRAM电池阵列被构建在一个交叉条形结构中,以支持低能量和面积消耗的高通量矩阵乘法。在这些高效的随机存储器的基础上,提出了像素检测和基于流的布尔运算的应用。我们介绍的方法减轻了中间数据移动和并行计算,从而在能量和面积效率方面比等效CMOS设计有了数量级的提高。
{"title":"Moving Towards Game-Changing Technology: Fabrication and Application of HfO2 RRAM for In-Memory Computing","authors":"Kangjun Bai, Daniel Titcombe, Jack Lombardi, C. Thiem, N. Cady","doi":"10.1109/ISQED57927.2023.10129352","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129352","url":null,"abstract":"In-memory computing is an emerging computing paradigm that sidesteps challenges inherent to deep learning acceleration in conventional systems. Along with the development of neuromorphic architectures, resistive random-access memory (RRAM) has paved the way for in-memory computing by processing mixed-signal operations in a fully parallel fashion. In this work, we designed and implemented working prototypes of in-memory operators using a custom 65nm CMOS/RRAM technology node fabricated on a 300mm wafer. Specifically, arrays of hafnium-oxide RRAM cells were built in a crossbar structure to support high-throughput matrix multiplications at low energy and area consumption. Building upon these efficient RRAM, applications of pixel detection and flow-based Boolean operations are presented. Our introduced approaches alleviate the intermediate data movement and parallelize the computations, thereby yielding orders of magnitude improvement in energy and area efficiency over the equivalent CMOS design.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124302488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of Hardware Accelerators to Compute Parametric Capacitance Tables 计算参数电容表的硬件加速器设计
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129333
S. Koranne
Advances in VLSI process have created a significant computational burden on process calibration, and the generation of capacitance tables of pre-characterized 2D cross-section layout from physical parameters such as dielectrics and layer thickness. In this paper we describe a high-level synthesis approach to design hardware accelerators to alleviate this concern. Design of a hardware accelerator which can produce capacitance tables for multiple layer and corner combinations is presented. An innovative approach (lambda compression) to reduce the volume of output is also described. Simulation shows that our pipelined superscalar approach can generate and solve a capacitance problem in amortized 4us at 500MHz clock, which is three orders of magnitude faster than state-of-art software based solutions. Interestingly, the optimizations suggested by an hardware implementation also give very good results on a CPU implementation, and this is yet another approach to software optimization.
VLSI工艺的进步给工艺校准和根据介质和层厚等物理参数生成预表征二维截面布局的电容表带来了巨大的计算负担。在本文中,我们描述了一个高层次的综合方法来设计硬件加速器,以减轻这种担忧。介绍了一种能够生成多层和拐角组合电容表的硬件加速器的设计。还描述了一种减少输出量的创新方法(lambda压缩)。仿真结果表明,该方法可以在500MHz时钟下产生并解决平摊4us的电容问题,比目前基于软件的解决方案快三个数量级。有趣的是,硬件实现建议的优化在CPU实现上也能得到非常好的结果,这是软件优化的另一种方法。
{"title":"Design of Hardware Accelerators to Compute Parametric Capacitance Tables","authors":"S. Koranne","doi":"10.1109/ISQED57927.2023.10129333","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129333","url":null,"abstract":"Advances in VLSI process have created a significant computational burden on process calibration, and the generation of capacitance tables of pre-characterized 2D cross-section layout from physical parameters such as dielectrics and layer thickness. In this paper we describe a high-level synthesis approach to design hardware accelerators to alleviate this concern. Design of a hardware accelerator which can produce capacitance tables for multiple layer and corner combinations is presented. An innovative approach (lambda compression) to reduce the volume of output is also described. Simulation shows that our pipelined superscalar approach can generate and solve a capacitance problem in amortized 4us at 500MHz clock, which is three orders of magnitude faster than state-of-art software based solutions. Interestingly, the optimizations suggested by an hardware implementation also give very good results on a CPU implementation, and this is yet another approach to software optimization.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130134598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Focusing on the Key Suspicious Trojan Nets with a Collaborative Approach 基于协作方法的关键可疑木马网络研究
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129361
Shih-Jung Pao, Chuan-Pin Huang, Yen-Chi Peng, Ing-Jer Huang
While most gate-level hardware Trojan detection techniques strive to detect as many as possible suspicious nets, this paper suggests another direction: identifying only a few suspicious nets, in order to reduce the subsequent manual investigation effort, since there is no need to trace multiple suspicious nets that lead to the same Trojan module. To accomplish this goal, we adopt a collaborative approach by a combination of structural-based analysis, testability-based analysis, and behavioral-based analysis to minimize the number of suspicious Trojan nets. Extensive experiments are conducted with Trust-HUB benchmark and an industrial processor. The results are very significant: (1) high precision 95.39%, most of identified nets being actual Trojan nets; (2) high true negative rate 99.99%, most normal nets being correctly identified as non-suspicious; (3) 44% less suspicious nets to greatly reduce the subsequent manual investigation effort; while (4) leading to detect 100% of the Trojan modules.
虽然大多数门级硬件木马检测技术都力求检测尽可能多的可疑网络,但本文提出了另一个方向:仅识别少数可疑网络,以减少后续的人工调查工作量,因为不需要跟踪导致同一木马模块的多个可疑网络。为了实现这一目标,我们采用基于结构的分析、基于可测试性的分析和基于行为的分析相结合的协作方法,以最大限度地减少可疑特洛伊网络的数量。在Trust-HUB基准测试和工业处理器上进行了大量实验。结果非常显著:(1)准确率高达95.39%,大部分识别出的网络都是真实的特洛伊网络;(2)真阴性率高,99.99%,大多数正常网被正确识别为无可疑;(3)可疑网少44%,大大减少后续人工排查工作量;(4)导致检测到100%的木马模块。
{"title":"Focusing on the Key Suspicious Trojan Nets with a Collaborative Approach","authors":"Shih-Jung Pao, Chuan-Pin Huang, Yen-Chi Peng, Ing-Jer Huang","doi":"10.1109/ISQED57927.2023.10129361","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129361","url":null,"abstract":"While most gate-level hardware Trojan detection techniques strive to detect as many as possible suspicious nets, this paper suggests another direction: identifying only a few suspicious nets, in order to reduce the subsequent manual investigation effort, since there is no need to trace multiple suspicious nets that lead to the same Trojan module. To accomplish this goal, we adopt a collaborative approach by a combination of structural-based analysis, testability-based analysis, and behavioral-based analysis to minimize the number of suspicious Trojan nets. Extensive experiments are conducted with Trust-HUB benchmark and an industrial processor. The results are very significant: (1) high precision 95.39%, most of identified nets being actual Trojan nets; (2) high true negative rate 99.99%, most normal nets being correctly identified as non-suspicious; (3) 44% less suspicious nets to greatly reduce the subsequent manual investigation effort; while (4) leading to detect 100% of the Trojan modules.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"120 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128052756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2023 24th International Symposium on Quality Electronic Design (ISQED)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1