首页 > 最新文献

2023 24th International Symposium on Quality Electronic Design (ISQED)最新文献

英文 中文
eDRAM-OESP: A novel performance efficient in-embedded-DRAM-compute design for on-edge signal processing application eDRAM-OESP:一种新型的高性能嵌入式dram计算设计,用于边缘信号处理应用
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129307
Mayank Kabra, C. PrashanthH., Kedar Deshpande, M. Rao
In-Memory-Computing (IMC) architectures allow arithmetic and logical functionalities around the memory arrays to effectively use the memory bandwidth and avoid frequent data movement to the processor. As expected, the IMC architecture leads to high throughput performance and significant energy savings primarily due to less workload moving data from memory to the computing core. Embedded DRAM (eDRAM), composed of 1-transistor, 1-capacitor (1T1C) bit cell with logic block enables computing with benefits in terms of power savings and high performance, favorable for embedded computing engines. The work proposes a novel in-eDRAM-compute design employing a 1T1C eDRAM cell with the bit-serial computation that targets 3x throughput efficiency by arranging the operand bits in an interleaved manner. The interleaved eDRAM architecture enables to employ reading corresponding bits of multiple operands from the memory cells at the same time, and also allows to write back post computing in the same activate window, thereby saving on the multiple precharge and activate cycles. Additionally, the interleaved architecture allows pipelining the continuously arriving digitized signal and processes the same. The computing block in the form of a 1-bit adder with a multiplexer unit is optimized for different hardware metrics such as delay, power, and product of power-and-delay (PDP) for adopting the design per the specifications.The eDRAM-based efficient computing design is evaluated for 1-bit adder and further characterized for 8-bit, and 16bit adders, multipliers, and 1-D convolution of varying filter sizes. The proposed design exhibited improvement in computing time by 31% for 16-bit addition and 30.6% for 8-bit addition over the existing state-of-the-art work. The bit-serial in-eDRAM-compute design achieved the best performance of 2.5 ms of computing time and 120 nJ of energy for performing a 1-D convolution operation. The in-eDRAM-compute design is a step towards designing embedded memory with convolutional neural network (CNN) compute capability for customized real-time edge inferencing applications.
内存计算(IMC)架构允许围绕内存阵列的算术和逻辑功能有效地利用内存带宽,并避免频繁地将数据移动到处理器。正如预期的那样,IMC架构带来了高吞吐量性能和显著的能源节约,这主要是由于将数据从内存移动到计算核心的工作负载更少。嵌入式DRAM (eDRAM)由1晶体管1电容(1T1C)位单元和逻辑块组成,具有节能和高性能的优点,有利于嵌入式计算引擎。该工作提出了一种新颖的eDRAM内计算设计,采用1T1C eDRAM单元,其位串行计算通过以交错方式排列操作数位,目标是3倍的吞吐量效率。交错式eDRAM架构能够同时从存储器单元中读取多个操作数的相应位,并且还允许在同一激活窗口中写回后计算,从而节省了多个预充电和激活周期。此外,交错结构允许对连续到达的数字化信号进行流水线处理。以1位加法器和多路复用器单元为形式的计算块针对不同的硬件指标(如延迟、功率和功率与延迟的乘积PDP)进行了优化,以采用符合规范的设计。基于edram的高效计算设计对1位加法器进行了评估,并进一步对8位和16位加法器、乘法器和不同滤波器尺寸的一维卷积进行了表征。与现有的最先进的工作相比,所提出的设计在16位加法和8位加法方面的计算时间分别提高了31%和30.6%。位串行edram计算设计实现了最佳性能,执行一维卷积操作的计算时间为2.5 ms,能量为120 nJ。嵌入式内存计算设计是为定制的实时边缘推理应用设计具有卷积神经网络(CNN)计算能力的嵌入式存储器的一步。
{"title":"eDRAM-OESP: A novel performance efficient in-embedded-DRAM-compute design for on-edge signal processing application","authors":"Mayank Kabra, C. PrashanthH., Kedar Deshpande, M. Rao","doi":"10.1109/ISQED57927.2023.10129307","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129307","url":null,"abstract":"In-Memory-Computing (IMC) architectures allow arithmetic and logical functionalities around the memory arrays to effectively use the memory bandwidth and avoid frequent data movement to the processor. As expected, the IMC architecture leads to high throughput performance and significant energy savings primarily due to less workload moving data from memory to the computing core. Embedded DRAM (eDRAM), composed of 1-transistor, 1-capacitor (1T1C) bit cell with logic block enables computing with benefits in terms of power savings and high performance, favorable for embedded computing engines. The work proposes a novel in-eDRAM-compute design employing a 1T1C eDRAM cell with the bit-serial computation that targets 3x throughput efficiency by arranging the operand bits in an interleaved manner. The interleaved eDRAM architecture enables to employ reading corresponding bits of multiple operands from the memory cells at the same time, and also allows to write back post computing in the same activate window, thereby saving on the multiple precharge and activate cycles. Additionally, the interleaved architecture allows pipelining the continuously arriving digitized signal and processes the same. The computing block in the form of a 1-bit adder with a multiplexer unit is optimized for different hardware metrics such as delay, power, and product of power-and-delay (PDP) for adopting the design per the specifications.The eDRAM-based efficient computing design is evaluated for 1-bit adder and further characterized for 8-bit, and 16bit adders, multipliers, and 1-D convolution of varying filter sizes. The proposed design exhibited improvement in computing time by 31% for 16-bit addition and 30.6% for 8-bit addition over the existing state-of-the-art work. The bit-serial in-eDRAM-compute design achieved the best performance of 2.5 ms of computing time and 120 nJ of energy for performing a 1-D convolution operation. The in-eDRAM-compute design is a step towards designing embedded memory with convolutional neural network (CNN) compute capability for customized real-time edge inferencing applications.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121215472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ACPC: Covert Channel Attack on Last Level Cache using Dynamic Cache Partitioning ACPC:使用动态缓存分区对最后一级缓存的隐蔽通道攻击
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129363
Jaspinder Kaur, Shirshendu Das
The Last Level Cache (LLC) of modern multicore processors is normally shared between different cores and applications. Dynamic cache partitioning is applied to the LLC for fairly distributing the LLC space among the applications. Recently, Covert Channel Attacks (CCA) becomes a major security issue for modern multicore systems. In CCA, two malicious applications: spy and Trojan, run in two different cores. Trojan normally runs in a secure core and knows some secret information. Through CCA, Trojan communicates this information to the spy. A well-known technique to perform such an attack is Prime Probe (P+P). It performs the attack by using the shared behavior of LLC space. Cache partitioning is considered a defense against such CCA. Partitioning makes the applications isolated in the LLC and they cannot evict each other block from the LLC. Hence, the existing P+P based attacks are not possible while dynamic partitioning is applied to LLC. However, in this work, we have proposed a modified CCA (based on P+P) which can establish a covert channel on top of the dynamic cache partitioning technique applied to LLC. Such kinds of attacks must need to be handled carefully in modern processors. A possible defense mechanism for the new attack is also discussed in this paper.
现代多核处理器的最后一级缓存(LLC)通常在不同的核心和应用程序之间共享。为了在应用程序之间公平地分配LLC空间,对LLC应用了动态缓存分区。近年来,隐蔽通道攻击(CCA)成为现代多核系统的主要安全问题。在CCA中,两个恶意应用程序:间谍和特洛伊木马,在两个不同的内核中运行。木马通常运行在一个安全的核心,并知道一些秘密信息。通过CCA,特洛伊木马将此信息传达给间谍。执行这种攻击的一种众所周知的技术是Prime Probe (P+P)。利用LLC空间的共享行为进行攻击。缓存分区被认为是对这种CCA的一种防御。分区使得应用程序隔离在LLC中,它们不能从LLC中驱逐彼此的块。因此,当动态分区应用于LLC时,现有的基于P+P的攻击是不可能的。然而,在这项工作中,我们提出了一种改进的CCA(基于P+P),它可以在应用于LLC的动态缓存分区技术之上建立隐蔽通道。这种攻击必须在现代处理器中谨慎处理。本文还讨论了一种可能的防御机制。
{"title":"ACPC: Covert Channel Attack on Last Level Cache using Dynamic Cache Partitioning","authors":"Jaspinder Kaur, Shirshendu Das","doi":"10.1109/ISQED57927.2023.10129363","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129363","url":null,"abstract":"The Last Level Cache (LLC) of modern multicore processors is normally shared between different cores and applications. Dynamic cache partitioning is applied to the LLC for fairly distributing the LLC space among the applications. Recently, Covert Channel Attacks (CCA) becomes a major security issue for modern multicore systems. In CCA, two malicious applications: spy and Trojan, run in two different cores. Trojan normally runs in a secure core and knows some secret information. Through CCA, Trojan communicates this information to the spy. A well-known technique to perform such an attack is Prime Probe (P+P). It performs the attack by using the shared behavior of LLC space. Cache partitioning is considered a defense against such CCA. Partitioning makes the applications isolated in the LLC and they cannot evict each other block from the LLC. Hence, the existing P+P based attacks are not possible while dynamic partitioning is applied to LLC. However, in this work, we have proposed a modified CCA (based on P+P) which can establish a covert channel on top of the dynamic cache partitioning technique applied to LLC. Such kinds of attacks must need to be handled carefully in modern processors. A possible defense mechanism for the new attack is also discussed in this paper.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"202 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124517840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Security and Reliability Challenges in Machine Learning for EDA: Latest Advances EDA机器学习中的安全性和可靠性挑战:最新进展
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129359
Zhiyao Xie, Tao Zhang, Yifeng Peng
The growing IC complexity has led to a compelling need for design efficiency improvement through new electronic design automation (EDA) methodologies. In recent years, many innovative machine learning (ML)-based solutions have been proposed for EDA applications. While these ML solutions demonstrate great potential in the circuit design flow, however, the hidden security and model reliability problems are rarely discussed until recently. In this paper, we present some latest research advances in the security and reliability challenges in ML for EDA.
日益增长的集成电路复杂性导致迫切需要通过新的电子设计自动化(EDA)方法来提高设计效率。近年来,许多基于机器学习(ML)的创新解决方案被提出用于EDA应用。虽然这些机器学习解决方案在电路设计流程中显示出巨大的潜力,然而,隐藏的安全性和模型可靠性问题直到最近才被讨论。本文介绍了面向EDA的机器学习在安全性和可靠性方面的一些最新研究进展。
{"title":"Security and Reliability Challenges in Machine Learning for EDA: Latest Advances","authors":"Zhiyao Xie, Tao Zhang, Yifeng Peng","doi":"10.1109/ISQED57927.2023.10129359","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129359","url":null,"abstract":"The growing IC complexity has led to a compelling need for design efficiency improvement through new electronic design automation (EDA) methodologies. In recent years, many innovative machine learning (ML)-based solutions have been proposed for EDA applications. While these ML solutions demonstrate great potential in the circuit design flow, however, the hidden security and model reliability problems are rarely discussed until recently. In this paper, we present some latest research advances in the security and reliability challenges in ML for EDA.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"618 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123067898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories 利用多库存储器的DNN加速器的跨层数据流优化
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129330
Man Shi, Steven Colleman, Charlotte VanDeMieroop, Antony Joseph, M. Meijer, W. Dehaene, M. Verhelst
Deep neural networks (DNN) use a wide range of network topologies to achieve high accuracy within diverse applications. This model diversity makes it impossible to identify a single "dataflow" (execution schedule) to perform optimally across all possible layers and network topologies. Several frameworks support the exploration of the best dataflow for a given DNN layer and hardware. However, switching the dataflow from one layer to the next layer within one DNN model can result in hardware inefficiencies stemming from memory data layout mismatch among the layers. Unfortunately, all existing frameworks treat each layer independently and typically model memories as black boxes (one large monolithic wide memory), which ignores the data layout and can not deal with the data layout dependencies of sequential layers. These frameworks are not capable of doing dataflow cross-layer optimization. This work, hence, aims at cross-layer dataflow optimization, taking the data dependency and data layout reshuffling overheads among layers into account. Additionally, we propose to exploit the multibank memories typically present in modern DNN accelerators towards efficiently reshuffling data to support more dataflow at low overhead. These innovations are supported through the Cross-layer Memory-aware Dataflow Scheduler (CMDS). CMDS can model DNN execution energy/latency while considering the different data layout requirements due to the varied optimal dataflow of layers. Compared with the state-of-the-art (SOTA), which performs layer-optimized memory-unaware scheduling, CMDS achieves up to 5.5× energy reduction and 1.35× latency reduction with negligible hardware cost.
深度神经网络(DNN)使用广泛的网络拓扑结构,在不同的应用中实现高精度。这种模型的多样性使得不可能确定一个单一的“数据流”(执行计划),以便在所有可能的层和网络拓扑结构中实现最佳执行。有几个框架支持探索给定深度神经网络层和硬件的最佳数据流。然而,在一个DNN模型中,将数据流从一层切换到下一层可能会导致硬件效率低下,原因是各层之间的内存数据布局不匹配。不幸的是,所有现有的框架都独立对待每一层,并且通常将内存建模为黑盒(一个大的单片宽内存),这忽略了数据布局,无法处理顺序层的数据布局依赖关系。这些框架不能进行数据流跨层优化。因此,本工作的目标是跨层数据流优化,考虑了层间的数据依赖和数据布局重组开销。此外,我们建议利用现代深度神经网络加速器中通常存在的多库存储器来有效地重组数据,以低开销支持更多的数据流。这些创新通过跨层内存感知数据流调度器(CMDS)得到支持。CMDS可以对DNN执行能量/延迟进行建模,同时考虑由于层的最佳数据流不同而导致的不同数据布局要求。与执行层优化内存不感知调度的最先进(SOTA)相比,CMDS实现了高达5.5倍的能耗降低和1.35倍的延迟降低,而硬件成本可以忽略不计。
{"title":"CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories","authors":"Man Shi, Steven Colleman, Charlotte VanDeMieroop, Antony Joseph, M. Meijer, W. Dehaene, M. Verhelst","doi":"10.1109/ISQED57927.2023.10129330","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129330","url":null,"abstract":"Deep neural networks (DNN) use a wide range of network topologies to achieve high accuracy within diverse applications. This model diversity makes it impossible to identify a single \"dataflow\" (execution schedule) to perform optimally across all possible layers and network topologies. Several frameworks support the exploration of the best dataflow for a given DNN layer and hardware. However, switching the dataflow from one layer to the next layer within one DNN model can result in hardware inefficiencies stemming from memory data layout mismatch among the layers. Unfortunately, all existing frameworks treat each layer independently and typically model memories as black boxes (one large monolithic wide memory), which ignores the data layout and can not deal with the data layout dependencies of sequential layers. These frameworks are not capable of doing dataflow cross-layer optimization. This work, hence, aims at cross-layer dataflow optimization, taking the data dependency and data layout reshuffling overheads among layers into account. Additionally, we propose to exploit the multibank memories typically present in modern DNN accelerators towards efficiently reshuffling data to support more dataflow at low overhead. These innovations are supported through the Cross-layer Memory-aware Dataflow Scheduler (CMDS). CMDS can model DNN execution energy/latency while considering the different data layout requirements due to the varied optimal dataflow of layers. Compared with the state-of-the-art (SOTA), which performs layer-optimized memory-unaware scheduling, CMDS achieves up to 5.5× energy reduction and 1.35× latency reduction with negligible hardware cost.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122085485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Metal Inter-layer Via Keep-out-zone in M3D IC: A Critical Process-aware Design Consideration M3D集成电路中通过隔离区的金属中间层:一个关键的工艺感知设计考虑
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129285
M. Vemuri, Umamaheswara Rao Tida
Metal inter-layer via (MIV) in Monolithic three-dimensional integrated circuits (M3D-IC) is used to connect inter-layer devices and provide power and clock signals across multiple layers. The size of MIV is comparable to logic gates because of the significant reduction in substrate layers due to sequential integration. Despite MIV’s small size, the impact of MIV on the performance of adjacent devices should be considered to implement IC designs in M3D-IC technology. In this work, we systematically study the changes in performance of transistors when they are placed near MIV to understand the effect of MIV on adjacent devices when MIV passes through the substrate. Simulation results suggest that the keep-out-zone (KOZ) for MIV should be considered to ensure the reliability of M3DIC technology and this KOZ is highly dependent on the M3DIC process. In this paper, we show that the transistor placed near MIV considering the M1 metal pitch as the separation will have up to 68, 668× increase in leakage current, when the channel doping is 1015cm−3, source/drain doping of 1018cm−3 and substrate layer height of 100 nm. We also show that, this increase in leakage current can also be reduced significantly by having KOZ around MIV, which is dependent on the process.
单片三维集成电路(M3D-IC)中的金属层间通孔(MIV)用于连接层间器件,并提供跨多层的电源和时钟信号。MIV的尺寸与逻辑门相当,因为由于顺序集成而显着减少了衬底层。尽管MIV的尺寸很小,但在M3D-IC技术中实施IC设计时,应考虑MIV对相邻器件性能的影响。在这项工作中,我们系统地研究了当晶体管放置在MIV附近时晶体管性能的变化,以了解当MIV通过衬底时MIV对相邻器件的影响。仿真结果表明,为了保证M3DIC技术的可靠性,必须考虑MIV的保出区(KOZ),该保出区高度依赖于M3DIC工艺。在本文中,我们表明,当沟道掺杂为1015cm−3,源极/漏极掺杂为1018cm−3,衬底层高度为100nm时,考虑M1金属间距作为分离的晶体管放置在MIV附近,泄漏电流增加高达68,668倍。我们还表明,泄漏电流的增加也可以通过在MIV周围设置KOZ来显着降低,这取决于工艺。
{"title":"Metal Inter-layer Via Keep-out-zone in M3D IC: A Critical Process-aware Design Consideration","authors":"M. Vemuri, Umamaheswara Rao Tida","doi":"10.1109/ISQED57927.2023.10129285","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129285","url":null,"abstract":"Metal inter-layer via (MIV) in Monolithic three-dimensional integrated circuits (M3D-IC) is used to connect inter-layer devices and provide power and clock signals across multiple layers. The size of MIV is comparable to logic gates because of the significant reduction in substrate layers due to sequential integration. Despite MIV’s small size, the impact of MIV on the performance of adjacent devices should be considered to implement IC designs in M3D-IC technology. In this work, we systematically study the changes in performance of transistors when they are placed near MIV to understand the effect of MIV on adjacent devices when MIV passes through the substrate. Simulation results suggest that the keep-out-zone (KOZ) for MIV should be considered to ensure the reliability of M3DIC technology and this KOZ is highly dependent on the M3DIC process. In this paper, we show that the transistor placed near MIV considering the M1 metal pitch as the separation will have up to 68, 668× increase in leakage current, when the channel doping is 1015cm−3, source/drain doping of 1018cm−3 and substrate layer height of 100 nm. We also show that, this increase in leakage current can also be reduced significantly by having KOZ around MIV, which is dependent on the process.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114069743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Attributed Graph Transformation for Generating Synthetic Benchmarks for Hardware Security 用于生成硬件安全综合基准的属性图变换
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129284
Juneet Kumar Meka, R. Vemuri
EDA methods in general and hardware security applications in particular require careful benchmarking covering nominal and corner cases of various design parameters. Attributed graph grammars have been used for generating interesting and constraint-satisfying structures in various domains of design. This paper shows how attributed graph transformation systems can be effectively adapted to automatically generate synthetic circuit structures that meet arbitrary constraints on various design parameters and how the method is flexible and scalable. We discuss the method in detail and demonstrate its utility for an example hardware security application, the sequential satisfiability attack.
一般的EDA方法,特别是硬件安全应用程序,需要仔细地对各种设计参数的标称和极端情况进行基准测试。在各种设计领域中,属性图语法被用于生成有趣且满足约束的结构。本文展示了如何有效地适应属性图变换系统来自动生成满足各种设计参数任意约束的合成电路结构,以及该方法的灵活性和可扩展性。我们详细讨论了该方法,并演示了其在硬件安全应用程序——序列可满足性攻击中的实用性。
{"title":"Attributed Graph Transformation for Generating Synthetic Benchmarks for Hardware Security","authors":"Juneet Kumar Meka, R. Vemuri","doi":"10.1109/ISQED57927.2023.10129284","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129284","url":null,"abstract":"EDA methods in general and hardware security applications in particular require careful benchmarking covering nominal and corner cases of various design parameters. Attributed graph grammars have been used for generating interesting and constraint-satisfying structures in various domains of design. This paper shows how attributed graph transformation systems can be effectively adapted to automatically generate synthetic circuit structures that meet arbitrary constraints on various design parameters and how the method is flexible and scalable. We discuss the method in detail and demonstrate its utility for an example hardware security application, the sequential satisfiability attack.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132134634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testbench on a Chip: A Yield Test Vehicle for Resistive Memory Devices 片上测试台:电阻式存储器的良率测试工具
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129298
Luke R. Upton, Guénolé Lallement, M. Scott, Joyce E. S. Taylor, R. Radway, D. Rich, M. Nelson, S. Mitra, B. Murmann
Many emerging resistive memory characterization efforts are constrained to small-batch, device-level studies due to a lack of test structure read/write bandwidth. To address this issue, we present a Yield Test Vehicle (YTV) for characterizing resistive RAM (RRAM) at the array level in SkyWater’s 130 nm technology. The YTV provides 16-bit word read/write access with 7 bits (3.3 µS - 425 µS) of linear reference conductance range, and an onboard controller prevents excessive cell writes responsible for yield deterioration. The 100 mm2 YTV die has an aggregate 8.8 Mb capacity and operates at a clock frequency of up to 50 MHz. The readout’s wide input conductance dynamic range and modular peripheral circuit design allow rapid adaptation for characterizing other resistive memory technologies.
由于缺乏测试结构读/写带宽,许多新兴的电阻式存储器特性研究都局限于小批量、器件级的研究。为了解决这个问题,我们提出了一种Yield Test Vehicle (YTV),用于在SkyWater的130纳米技术中表征阵列级电阻性RAM (RRAM)。YTV提供7位(3.3µS - 425µS)线性参考电导范围的16位字读/写访问,板载控制器防止导致产量下降的过度单元写入。100 mm2 YTV芯片的总容量为8.8 Mb,时钟频率高达50 MHz。该读出器的宽输入电导动态范围和模块化外围电路设计允许快速适应表征其他电阻式存储技术。
{"title":"Testbench on a Chip: A Yield Test Vehicle for Resistive Memory Devices","authors":"Luke R. Upton, Guénolé Lallement, M. Scott, Joyce E. S. Taylor, R. Radway, D. Rich, M. Nelson, S. Mitra, B. Murmann","doi":"10.1109/ISQED57927.2023.10129298","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129298","url":null,"abstract":"Many emerging resistive memory characterization efforts are constrained to small-batch, device-level studies due to a lack of test structure read/write bandwidth. To address this issue, we present a Yield Test Vehicle (YTV) for characterizing resistive RAM (RRAM) at the array level in SkyWater’s 130 nm technology. The YTV provides 16-bit word read/write access with 7 bits (3.3 µS - 425 µS) of linear reference conductance range, and an onboard controller prevents excessive cell writes responsible for yield deterioration. The 100 mm2 YTV die has an aggregate 8.8 Mb capacity and operates at a clock frequency of up to 50 MHz. The readout’s wide input conductance dynamic range and modular peripheral circuit design allow rapid adaptation for characterizing other resistive memory technologies.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129722659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of Machine Learning for Quality Risk Factor Analysis of Electronic Assemblies 机器学习在电子组件质量风险因素分析中的应用
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129339
Brendan Reidy, D. Duggan, Bernard Glasauer, Peng Su, Ramtin Zand
The rapid identification of contributing factors to failures in a high-volume electronic product manufacturing environment is critical to reduce disruption to production and mitigate potential quality and reliability risks. As system complexity and component usage continues to increase, it is becoming more and more challenging to manually process the large volume of data that are continuously generated by production processes. In this paper, we utilize various machine learning (ML) techniques to classify components on the printed circuit board assemblies (PCBAs) as defective or non-defective based on an input feature map including features like the date the component is manufactured, the side of board on which the component is placed, the location of the component on the board, etc. We then implement a feature importance algorithm to detect the underlying cause of the component failure. Three ML models including support vector machine, random forest, and neural network are trained and implemented for feature importance analysis using a dataset obtained from over 10 million components on various PCBA boards. Due to the intrinsic characteristics of the dataset, such as a significant imbalance between defective and non-defective cases, pre-processing techniques such as upsampling and downsampling are necessary to increase the performance of the models. The results show that all the developed ML models can achieve more than 99% accuracy. Finally, we show that our proposed feature importance approach is capable of correctly identifying the main cause of defects for given components.
在大批量电子产品制造环境中,快速识别导致故障的因素对于减少生产中断和减轻潜在的质量和可靠性风险至关重要。随着系统复杂性和组件使用量的不断增加,手工处理生产过程中不断产生的大量数据变得越来越具有挑战性。在本文中,我们利用各种机器学习(ML)技术根据输入特征映射将印刷电路板组件(pcb)上的组件分类为有缺陷或无缺陷,包括组件制造日期,组件放置的电路板侧面,组件在电路板上的位置等特征。然后,我们实现了一个特征重要性算法来检测组件故障的潜在原因。使用从各种PCBA板上超过1000万个组件获得的数据集,对包括支持向量机,随机森林和神经网络在内的三个ML模型进行了训练和实现,用于特征重要性分析。由于数据集的固有特征,如缺陷和非缺陷情况之间的显著不平衡,需要预处理技术,如上采样和下采样,以提高模型的性能。结果表明,所开发的机器学习模型的准确率均达到99%以上。最后,我们证明了我们提出的特征重要性方法能够正确地识别给定组件缺陷的主要原因。
{"title":"Application of Machine Learning for Quality Risk Factor Analysis of Electronic Assemblies","authors":"Brendan Reidy, D. Duggan, Bernard Glasauer, Peng Su, Ramtin Zand","doi":"10.1109/ISQED57927.2023.10129339","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129339","url":null,"abstract":"The rapid identification of contributing factors to failures in a high-volume electronic product manufacturing environment is critical to reduce disruption to production and mitigate potential quality and reliability risks. As system complexity and component usage continues to increase, it is becoming more and more challenging to manually process the large volume of data that are continuously generated by production processes. In this paper, we utilize various machine learning (ML) techniques to classify components on the printed circuit board assemblies (PCBAs) as defective or non-defective based on an input feature map including features like the date the component is manufactured, the side of board on which the component is placed, the location of the component on the board, etc. We then implement a feature importance algorithm to detect the underlying cause of the component failure. Three ML models including support vector machine, random forest, and neural network are trained and implemented for feature importance analysis using a dataset obtained from over 10 million components on various PCBA boards. Due to the intrinsic characteristics of the dataset, such as a significant imbalance between defective and non-defective cases, pre-processing techniques such as upsampling and downsampling are necessary to increase the performance of the models. The results show that all the developed ML models can achieve more than 99% accuracy. Finally, we show that our proposed feature importance approach is capable of correctly identifying the main cause of defects for given components.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125089997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design Space Exploration of Modular Multipliers for ASIC FHE accelerators ASIC FHE加速器模块乘法器的设计空间探索
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129292
Deepraj Soni, M. Nabeel, Homer Gamil, O. Mazonka, Brandon Reagen, R. Karri, M. Maniatakos
Fully homomorphic encryption (FHE) promises data protection by computation on encrypted data, but demands resource-intensive computation. The most fundamental resource of FHE is modular multiplier, which needs to be evaluated for efficient implementation. In this work, we develop and evaluate ASIC implementations of the modular multiplier at the block-level and at the system-level. We study the efficiency of the multipliers in terms of performance-for-area and performance-for-power. Since these ASICs are used in FHE, we explore these multipliers within this system-level context with on-chip memory and interconnect limits. We explore ASIC implementations of modular multiplications using a state-of-the-art 22nm technology node with constant operand throughput to ensure a fair comparison. The study yields key insights about the performance-for-area efficiency and power efficiency of bit-serial and bit-parallel designs: Bit-parallel designs are more efficient than their bitserial counterparts. Montgomery multipliers with constrained modulus are the most power-efficient and area-efficient design. Iterative Montgomery multipliers incur minimum peak power for a polynomial multiplication, making them suitable for low-power voltage sources.
完全同态加密(FHE)承诺通过对加密数据的计算来保护数据,但需要大量的资源计算。FHE最基本的资源是模块乘法器,需要对其进行评估才能有效地实现。在这项工作中,我们在块级和系统级开发和评估模块化乘法器的ASIC实现。我们从面积性能和功率性能两方面研究了乘数器的效率。由于这些asic用于FHE,我们在片上存储器和互连限制的系统级上下文中探索这些乘法器。我们探索模块化乘法的ASIC实现,使用最先进的22nm技术节点,具有恒定的操作数吞吐量,以确保公平的比较。该研究对位串行和位并行设计的面积效率和功率效率产生了关键见解:位并行设计比位串行设计更有效。具有约束模量的Montgomery乘法器是最节能和面积有效的设计。迭代蒙哥马利乘法器产生多项式乘法的最小峰值功率,使它们适合于低功率电压源。
{"title":"Design Space Exploration of Modular Multipliers for ASIC FHE accelerators","authors":"Deepraj Soni, M. Nabeel, Homer Gamil, O. Mazonka, Brandon Reagen, R. Karri, M. Maniatakos","doi":"10.1109/ISQED57927.2023.10129292","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129292","url":null,"abstract":"Fully homomorphic encryption (FHE) promises data protection by computation on encrypted data, but demands resource-intensive computation. The most fundamental resource of FHE is modular multiplier, which needs to be evaluated for efficient implementation. In this work, we develop and evaluate ASIC implementations of the modular multiplier at the block-level and at the system-level. We study the efficiency of the multipliers in terms of performance-for-area and performance-for-power. Since these ASICs are used in FHE, we explore these multipliers within this system-level context with on-chip memory and interconnect limits. We explore ASIC implementations of modular multiplications using a state-of-the-art 22nm technology node with constant operand throughput to ensure a fair comparison. The study yields key insights about the performance-for-area efficiency and power efficiency of bit-serial and bit-parallel designs: Bit-parallel designs are more efficient than their bitserial counterparts. Montgomery multipliers with constrained modulus are the most power-efficient and area-efficient design. Iterative Montgomery multipliers incur minimum peak power for a polynomial multiplication, making them suitable for low-power voltage sources.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129190216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Novel Pseudo-Flash Based Digital Low Dropout (LDO) Voltage Regulator 一种基于伪闪光的数字低差(LDO)稳压器
Pub Date : 2023-04-05 DOI: 10.1109/ISQED57927.2023.10129385
Cheng-Yen Lee, S. Khatri, S. Vrudhula
In this paper, we present a pseudo-flash based digital low dropout (Digital LDO) voltage regulator. The novelty of our pseudo-flash based Digital LDO (PFD-LDO) voltage regulator lies in the fact that we use pseudo-flash (or alternately, flash) transistor subarrays for voltage regulation. By changing the threshold voltage (and thereby, the ON resistance) of these transistors, we can use the same design to meet different regulator specifications. The threshold voltage can be programmed either at the factory by the manufacturer or in the field by the user. This gives the manufacturer the ability to offer a family of LDO regulators with a single design, a significant economic advantage. In addition, aging effects and temperature variations are effectively erased since the threshold voltage of the pseudo-flash (or flash) transistors can be tuned to a fine degree in the field. Similarly, process variations can be cancelled after manufacturing in the factory. These advantages are absent in traditional LDO regulators. Our design uses two subarrays. A coarse subarray is used to reduce the recovery time and output voltage overshoot/undershoot, while a fine subarray regulates the output voltage, minimizing the output voltage ripple. Unlike state-of-the-art LDO regulators, our design can realize multiple specifications with the same circuit. For example, we demonstrate that the Vout of the proposed PFD-LDO regulator can range from 0.7V to 1.7V when the supply voltage VIN ranges from 0.8V to 1.8V, using the same circuit design. Over this voltage range, the proposed PFD-LDO regulator achieves Vshoot < 144mV, trec < 0.41µs and Vripple < 7.3mV when the Imax ranges from 15mA to 250mA.
本文提出了一种基于伪flash的数字低差(digital LDO)稳压器。我们基于伪闪存的数字LDO (PFD-LDO)稳压器的新颖之处在于我们使用伪闪存(或交替使用闪存)晶体管子阵列进行电压调节。通过改变这些晶体管的阈值电压(从而改变导通电阻),我们可以使用相同的设计来满足不同的稳压器规格。阈值电压可以由制造商在工厂或在现场由用户编程。这使得制造商能够提供具有单一设计的LDO稳压器系列,具有显着的经济优势。此外,由于伪闪(或闪)晶体管的阈值电压可以在现场微调到一个很好的程度,因此可以有效地消除老化效应和温度变化。同样,在工厂生产后,可以取消工艺变化。这些优点在传统的LDO调节器中是不存在的。我们的设计使用两个子数组。粗子阵列用于减少恢复时间和输出电压过冲/欠冲,而细子阵列用于调节输出电压,使输出电压纹波最小化。与最先进的LDO稳压器不同,我们的设计可以用同一电路实现多种规格。例如,我们证明了当电源电压VIN范围为0.8V至1.8V时,使用相同的电路设计,所提出的PFD-LDO稳压器的Vout范围可以在0.7V至1.7V之间。在此电压范围内,当Imax为15mA至250mA时,所提出的PFD-LDO稳压器实现了Vshoot < 144mV, trec < 0.41µs和Vripple < 7.3mV。
{"title":"A Novel Pseudo-Flash Based Digital Low Dropout (LDO) Voltage Regulator","authors":"Cheng-Yen Lee, S. Khatri, S. Vrudhula","doi":"10.1109/ISQED57927.2023.10129385","DOIUrl":"https://doi.org/10.1109/ISQED57927.2023.10129385","url":null,"abstract":"In this paper, we present a pseudo-flash based digital low dropout (Digital LDO) voltage regulator. The novelty of our pseudo-flash based Digital LDO (PFD-LDO) voltage regulator lies in the fact that we use pseudo-flash (or alternately, flash) transistor subarrays for voltage regulation. By changing the threshold voltage (and thereby, the ON resistance) of these transistors, we can use the same design to meet different regulator specifications. The threshold voltage can be programmed either at the factory by the manufacturer or in the field by the user. This gives the manufacturer the ability to offer a family of LDO regulators with a single design, a significant economic advantage. In addition, aging effects and temperature variations are effectively erased since the threshold voltage of the pseudo-flash (or flash) transistors can be tuned to a fine degree in the field. Similarly, process variations can be cancelled after manufacturing in the factory. These advantages are absent in traditional LDO regulators. Our design uses two subarrays. A coarse subarray is used to reduce the recovery time and output voltage overshoot/undershoot, while a fine subarray regulates the output voltage, minimizing the output voltage ripple. Unlike state-of-the-art LDO regulators, our design can realize multiple specifications with the same circuit. For example, we demonstrate that the Vout of the proposed PFD-LDO regulator can range from 0.7V to 1.7V when the supply voltage VIN ranges from 0.8V to 1.8V, using the same circuit design. Over this voltage range, the proposed PFD-LDO regulator achieves Vshoot < 144mV, trec < 0.41µs and Vripple < 7.3mV when the Imax ranges from 15mA to 250mA.","PeriodicalId":315053,"journal":{"name":"2023 24th International Symposium on Quality Electronic Design (ISQED)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126635363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2023 24th International Symposium on Quality Electronic Design (ISQED)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1