首页 > 最新文献

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems最新文献

英文 中文
veriSiM: Formal Verification of SPICE Netlists for MAGIC-Based Logic-in-Memory 基于magic的内存逻辑的SPICE网络列表的正式验证
IF 2.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-25 DOI: 10.1109/TCAD.2025.3583199
Chandan Kumar Jha;Simranjeet Singh;Khushboo Qayyum;Ankit Bende;Muhammad Hassan;Vikas Rana;Farhad Merchant;Rolf Drechsler
Advancements in emerging technologies have recently increased the traction of non-von Neumann design styles. One of the most popular design styles in this domain involves using memristors to perform logic operations in memory, known as logic-in-memory (LiM). memristor aided logic (MAGIC) is one of such LiM based design style that is widely used given its benefits in latency and energy. Several prior works have focused on the generation of logic operations, also called micro-operations, for LiM based on the MAGIC design style. Recently, the generation of SPICE netlists for MAGIC design style has been achieved by the MemSPICE tool. While this represents a significant step forward, verifying the correctness of the generated netlists still depends on SPICE-level simulations. These simulations become particularly impractical for medium-to-large designs presenting a bottleneck in the validation process. To address this limitation, in this article, we introduce veriSiM, an automated formal verification methodology for MAGIC-based LiM. More concretely, it ensures the correctness of the generated LiM SPICE netlists against the golden reference Verilog design. Our methodology involves generating clauses from the SPICE netlists and verifying them against clauses generated from the golden reference Verilog design, using the high-performance Z3 solver to perform the equivalence checking. The clause generation process from the SPICE netlists needs to be based on several conditions, which have been identified and discussed in detail. We have used several benchmarks from ISCAS’85, ISCAS’89, and ITC’99 to demonstrate the efficacy of the veriSiM methodology in formally verifying the LiM SPICE netlists.
新兴技术的进步最近增加了非冯·诺伊曼设计风格的吸引力。该领域中最流行的设计风格之一涉及使用忆阻器在内存中执行逻辑操作,称为内存中逻辑(LiM)。忆阻器辅助逻辑(MAGIC)是一种基于LiM的设计风格,由于其在延迟和能量方面的优势而被广泛使用。先前的一些工作集中在基于MAGIC设计风格的LiM的逻辑运算(也称为微运算)的生成上。最近,MemSPICE工具已经实现了MAGIC设计风格的SPICE网络列表的生成。虽然这是向前迈出的重要一步,但验证生成的网络列表的正确性仍然依赖于spice级别的模拟。这些模拟对于在验证过程中出现瓶颈的大中型设计来说尤其不切实际。为了解决这一限制,在本文中,我们介绍了veriSiM,一种用于基于magic的LiM的自动形式化验证方法。更具体地说,它确保了生成的LiM SPICE网络列表与黄金参考Verilog设计的正确性。我们的方法包括从SPICE网络列表中生成子句,并根据黄金参考Verilog设计生成的子句对它们进行验证,使用高性能Z3求解器执行等效性检查。SPICE网络列表中的子句生成过程需要基于几个条件,这些条件已经确定并进行了详细讨论。我们使用了ISCAS ' 85, ISCAS ' 89和ITC ' 99的几个基准来证明veriSiM方法在正式验证LiM SPICE网络列表方面的有效性。
{"title":"veriSiM: Formal Verification of SPICE Netlists for MAGIC-Based Logic-in-Memory","authors":"Chandan Kumar Jha;Simranjeet Singh;Khushboo Qayyum;Ankit Bende;Muhammad Hassan;Vikas Rana;Farhad Merchant;Rolf Drechsler","doi":"10.1109/TCAD.2025.3583199","DOIUrl":"https://doi.org/10.1109/TCAD.2025.3583199","url":null,"abstract":"Advancements in emerging technologies have recently increased the traction of non-von Neumann design styles. One of the most popular design styles in this domain involves using memristors to perform logic operations in memory, known as logic-in-memory (LiM). memristor aided logic (MAGIC) is one of such LiM based design style that is widely used given its benefits in latency and energy. Several prior works have focused on the generation of logic operations, also called micro-operations, for LiM based on the MAGIC design style. Recently, the generation of SPICE netlists for MAGIC design style has been achieved by the MemSPICE tool. While this represents a significant step forward, verifying the correctness of the generated netlists still depends on SPICE-level simulations. These simulations become particularly impractical for medium-to-large designs presenting a bottleneck in the validation process. To address this limitation, in this article, we introduce veriSiM, an automated formal verification methodology for MAGIC-based LiM. More concretely, it ensures the correctness of the generated LiM SPICE netlists against the golden reference Verilog design. Our methodology involves generating clauses from the SPICE netlists and verifying them against clauses generated from the golden reference Verilog design, using the high-performance Z3 solver to perform the equivalence checking. The clause generation process from the SPICE netlists needs to be based on several conditions, which have been identified and discussed in detail. We have used several benchmarks from ISCAS’85, ISCAS’89, and ITC’99 to demonstrate the efficacy of the veriSiM methodology in formally verifying the LiM SPICE netlists.","PeriodicalId":13251,"journal":{"name":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","volume":"45 2","pages":"845-854"},"PeriodicalIF":2.9,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146006865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DANCE++: Differentiable Accelerator/Network Co-Exploration With Hard Constraints and Data-Free Training for Real-World Scenarios DANCE++:具有硬约束和无数据训练的可微分加速器/网络协同探索
IF 2.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-23 DOI: 10.1109/TCAD.2025.3582196
Kanghyun Choi;Deokki Hong;Hyeyoon Lee;Joonsang Yu;Noseong Park;Youngsok Kim;Jinho Lee
Co-exploration of neural architectures and hardware accelerators has emerged as a promising approach to address computational cost problems, especially in low-profile systems. However, existing co-exploration methods based on reinforcement learning or evolutionary search suffer from substantial search costs. To address this, this work presents DANCE++, a differentiable approach toward the co-exploration of hardware and network architecture design. At the heart of DANCE++ is a differentiable evaluator network that models hardware metrics with a neural network, enabling accelerator design through backpropagation. DANCE++ significantly reduces search time and enhances accuracy and hardware cost metrics compared to traditional approaches. To further address real-world scenarios, this work embodies two important practical topics: 1) hard constraints and 2) data dependency. To meet the constraints, such as frame rates, this work proposes a gradient manipulation algorithm that guides differentiable optimization to find hard-constrained solutions. Also to consider cases where training dataset is inaccessible, this work proposes to use data-free training methods in both co-exploration and training phases. To the best of our knowledge, DANCE++ is the first co-exploration method that targets these real-world challenges, supported by extensive experiments demonstrating its effectiveness.
神经架构和硬件加速器的共同探索已经成为解决计算成本问题的一种有前途的方法,特别是在低调的系统中。然而,现有的基于强化学习或进化搜索的协同探索方法存在巨大的搜索成本。为了解决这个问题,这项工作提出了DANCE++,这是一种共同探索硬件和网络架构设计的可微分方法。DANCE++的核心是一个可微分的评估器网络,它用神经网络建模硬件指标,通过反向传播实现加速器设计。与传统方法相比,danc++显著减少了搜索时间,提高了准确性和硬件成本指标。为了进一步解决现实场景,这项工作体现了两个重要的实践主题:1)硬约束和2)数据依赖性。为了满足约束,例如帧速率,本工作提出了一种梯度操作算法,该算法指导可微优化以找到硬约束解。此外,为了考虑训练数据集不可访问的情况,这项工作建议在共同探索和训练阶段使用无数据的训练方法。据我们所知,DANCE++是第一个针对这些现实挑战的协同探索方法,并得到了大量实验的支持,证明了其有效性。
{"title":"DANCE++: Differentiable Accelerator/Network Co-Exploration With Hard Constraints and Data-Free Training for Real-World Scenarios","authors":"Kanghyun Choi;Deokki Hong;Hyeyoon Lee;Joonsang Yu;Noseong Park;Youngsok Kim;Jinho Lee","doi":"10.1109/TCAD.2025.3582196","DOIUrl":"https://doi.org/10.1109/TCAD.2025.3582196","url":null,"abstract":"Co-exploration of neural architectures and hardware accelerators has emerged as a promising approach to address computational cost problems, especially in low-profile systems. However, existing co-exploration methods based on reinforcement learning or evolutionary search suffer from substantial search costs. To address this, this work presents DANCE++, a differentiable approach toward the co-exploration of hardware and network architecture design. At the heart of DANCE++ is a differentiable evaluator network that models hardware metrics with a neural network, enabling accelerator design through backpropagation. DANCE++ significantly reduces search time and enhances accuracy and hardware cost metrics compared to traditional approaches. To further address real-world scenarios, this work embodies two important practical topics: 1) hard constraints and 2) data dependency. To meet the constraints, such as frame rates, this work proposes a gradient manipulation algorithm that guides differentiable optimization to find hard-constrained solutions. Also to consider cases where training dataset is inaccessible, this work proposes to use data-free training methods in both co-exploration and training phases. To the best of our knowledge, DANCE++ is the first co-exploration method that targets these real-world challenges, supported by extensive experiments demonstrating its effectiveness.","PeriodicalId":13251,"journal":{"name":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","volume":"45 2","pages":"915-928"},"PeriodicalIF":2.9,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146006923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LUT-Based Mapping of Logic Functions to Partitioned Memristive Crossbar Using MAGIC for High-Throughput Computing 基于lut的逻辑函数映射到基于MAGIC的分区记忆交叉条的高吞吐量计算
IF 2.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-20 DOI: 10.1109/TCAD.2025.3581866
Pooja Joshi;Anindita Chakraborty;Hafizur Rahaman
Partitioning memristive crossbar is emerging as a new technique for improving computational throughput, and overcoming tradeoffs in memristive processing-in-memory (PIM) architecture. This article presents a complete framework for realizing logic functions inside a partitioned memristive crossbar. This work utilizes a heuristic-based clustering algorithm on and-inverter graphs (AIGs) to minimize the number of lookup tables (LUTs) necessary for mapping Boolean logic functions. In our proposed mapping technique, LUTs are evaluated using single-row memristor aided logic (MAGIC)-based norgates, and mapped inside the partitioned memristive crossbar. The peripheral design for partitioned crossbar utilizes a shared CMOS decoder, and generates Opcodes to control logic computation across all partitions. The experimental results show an average throughput improvement of $12.58{times }$ , $68.47{times }$ , $156.10{times }$ , $107.76{times }$ , and $506.22{times }$ over SIMPLER, ReVAMP-ArC, ReVAMP-DeC, m-AIG-based mapping technique, and CoMIC-3D mapping, respectively. In addition, on average, a 47.4% reduction in computation area is observed.
划分记忆交叉栏是一种提高计算吞吐量和克服记忆内存处理(PIM)体系结构折衷的新技术。本文提出了一个完整的框架来实现在一个分段记忆横杆内的逻辑功能。这项工作利用基于启发式的反相图(AIGs)聚类算法来最小化映射布尔逻辑函数所需的查找表(lut)的数量。在我们提出的映射技术中,lut使用基于单行忆阻辅助逻辑(MAGIC)的规范进行评估,并在划分的忆阻交叉条内进行映射。分块横杆的外设设计采用共享CMOS解码器,并生成操作码来控制所有分块的逻辑计算。实验结果表明,与simple、ReVAMP-ArC、ReVAMP-DeC、m-AIG-based mapping技术和CoMIC-3D mapping技术相比,平均吞吐量分别提高了12.58{times}$、68.47{times}$、156.10{times}$、107.76{times}$和506.22{times}$。此外,计算面积平均减少了47.4%。
{"title":"LUT-Based Mapping of Logic Functions to Partitioned Memristive Crossbar Using MAGIC for High-Throughput Computing","authors":"Pooja Joshi;Anindita Chakraborty;Hafizur Rahaman","doi":"10.1109/TCAD.2025.3581866","DOIUrl":"https://doi.org/10.1109/TCAD.2025.3581866","url":null,"abstract":"Partitioning memristive crossbar is emerging as a new technique for improving computational throughput, and overcoming tradeoffs in memristive processing-in-memory (PIM) architecture. This article presents a complete framework for realizing logic functions inside a partitioned memristive crossbar. This work utilizes a heuristic-based clustering algorithm on and-inverter graphs (AIGs) to minimize the number of lookup tables (LUTs) necessary for mapping Boolean logic functions. In our proposed mapping technique, LUTs are evaluated using single-row memristor aided logic (MAGIC)-based <sc>nor</small>gates, and mapped inside the partitioned memristive crossbar. The peripheral design for partitioned crossbar utilizes a shared CMOS decoder, and generates Opcodes to control logic computation across all partitions. The experimental results show an average throughput improvement of <inline-formula> <tex-math>$12.58{times }$ </tex-math></inline-formula>, <inline-formula> <tex-math>$68.47{times }$ </tex-math></inline-formula>, <inline-formula> <tex-math>$156.10{times }$ </tex-math></inline-formula>, <inline-formula> <tex-math>$107.76{times }$ </tex-math></inline-formula>, and <inline-formula> <tex-math>$506.22{times }$ </tex-math></inline-formula> over SIMPLER, ReVAMP-ArC, ReVAMP-DeC, m-AIG-based mapping technique, and CoMIC-3D mapping, respectively. In addition, on average, a 47.4% reduction in computation area is observed.","PeriodicalId":13251,"journal":{"name":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","volume":"45 1","pages":"359-372"},"PeriodicalIF":2.9,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145904261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems society information 集成电路与系统计算机辅助设计学报
IF 2.7 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-19 DOI: 10.1109/TCAD.2025.3575827
{"title":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems society information","authors":"","doi":"10.1109/TCAD.2025.3575827","DOIUrl":"https://doi.org/10.1109/TCAD.2025.3575827","url":null,"abstract":"","PeriodicalId":13251,"journal":{"name":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","volume":"44 7","pages":"C2-C2"},"PeriodicalIF":2.7,"publicationDate":"2025-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11044996","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144323110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effect of Asymmetric Structures On the Performance of AlGaN/GaN Bilateral IMPATT Device 不对称结构对AlGaN/GaN双侧IMPATT器件性能的影响
IF 2.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-19 DOI: 10.1109/TCAD.2025.3581475
Yang Dai;Yukun Li;Leiyu Gao;Pengzhan Wang;Weiwei Li;Jing Zuo;Cheng Chen;Wu Zhao
The effect caused by asymmetric structures on the performance of the AlGaN/GaN bilateral impact-ionization-avalanche-transit-time (BIMPATT) device is studied in this article. The asymmetric structures include the asymmetry of the bilateral channel lengths by changing the anode position and the asymmetry of the 2-D electron gas (2-DEG) concentrations by changing the AlGaN thickness. The asymmetry affects the direct current (DC), radio frequency (RF), and noise characteristics of the BIMPATT. Large-signal analysis shows that the asymmetric structures improve the operating frequency but reduce the maximum power and efficiency of the BIMPATT. The asymmetric structures show better RF characteristics than the symmetric structure in the high-frequency band. At last, the asymmetric structures increase the noise performance of the BIMPATT. The reason for these phenomena is the asymmetry of avalanche ionization region and intensity caused by the asymmetry of device structure. This article provides more references for the design and performance optimization of the lateral IMPATT devices.
本文研究了不对称结构对AlGaN/GaN双向冲击-电离-雪崩-传递时间(BIMPATT)器件性能的影响。这种不对称结构包括通过改变阳极位置引起的双向通道长度的不对称和通过改变AlGaN厚度引起的二维电子气浓度的不对称。这种不对称性会影响BIMPATT的直流(DC)、射频(RF)和噪声特性。大信号分析表明,非对称结构提高了BIMPATT的工作频率,但降低了其最大功率和效率。非对称结构在高频波段表现出比对称结构更好的射频特性。最后,非对称结构提高了BIMPATT的噪声性能。产生这些现象的原因是器件结构的不对称导致雪崩电离区域和电离强度的不对称。本文为横向IMPATT器件的设计和性能优化提供了更多的参考。
{"title":"Effect of Asymmetric Structures On the Performance of AlGaN/GaN Bilateral IMPATT Device","authors":"Yang Dai;Yukun Li;Leiyu Gao;Pengzhan Wang;Weiwei Li;Jing Zuo;Cheng Chen;Wu Zhao","doi":"10.1109/TCAD.2025.3581475","DOIUrl":"https://doi.org/10.1109/TCAD.2025.3581475","url":null,"abstract":"The effect caused by asymmetric structures on the performance of the AlGaN/GaN bilateral impact-ionization-avalanche-transit-time (BIMPATT) device is studied in this article. The asymmetric structures include the asymmetry of the bilateral channel lengths by changing the anode position and the asymmetry of the 2-D electron gas (2-DEG) concentrations by changing the AlGaN thickness. The asymmetry affects the direct current (DC), radio frequency (RF), and noise characteristics of the BIMPATT. Large-signal analysis shows that the asymmetric structures improve the operating frequency but reduce the maximum power and efficiency of the BIMPATT. The asymmetric structures show better RF characteristics than the symmetric structure in the high-frequency band. At last, the asymmetric structures increase the noise performance of the BIMPATT. The reason for these phenomena is the asymmetry of avalanche ionization region and intensity caused by the asymmetry of device structure. This article provides more references for the design and performance optimization of the lateral IMPATT devices.","PeriodicalId":13251,"journal":{"name":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","volume":"45 1","pages":"335-344"},"PeriodicalIF":2.9,"publicationDate":"2025-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145904301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ASAP: Accelerating Corner-Based Timing Analysis With Bayesian Active Self-Attention Neural Process 用贝叶斯主动自注意神经过程加速拐角时序分析
IF 2.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-19 DOI: 10.1109/TCAD.2025.3581474
Longze Wang;Wei W. Xing;Zhelong Wang;Christos Sotiriou;Nikolaos Sketopoulos;Ning Xu;Yuanqing Cheng
With the advancement of modern nanoscale technology nodes, static timing analysis (STA) has become an indispensable technique for ensuring circuit reliability and performance across diverse process conditions. However, traditional STA methods scale poorly to the explosion of process corners in the nanoscale fabrication technology. Despite some seminal works in using AI to accelerate such processes, they either lack reliability or stability. To this end, we introduce active self-attention neural process (ASAP), a novel approach addressing this challenge by combining both the latest deep learning methods and the classical Bayesian models to deliver scalable and accurate predictions with a self-calibration strategy to ensure reliability. Technically, the ASAP novelly integrates self-attention to help identify and prioritize crucial features under various input conditions and employs neural process to make confidence-based predictions for the final timing results. Furthermore, ASAP is equipped with Active Learning for self-refinement and self-correction. Experimental evaluations on benchmark circuits demonstrate that our method surpasses state-of-the-art work in STA accuracy by 18% in terms of prediction accuracy.
随着现代纳米技术节点的不断进步,静态时序分析(STA)已成为保证电路在不同工艺条件下的可靠性和性能的一项不可或缺的技术。然而,在纳米级制造技术中,传统的STA方法难以满足工艺角爆炸的要求。尽管在利用人工智能加速这一过程方面做了一些开创性的工作,但它们要么缺乏可靠性,要么缺乏稳定性。为此,我们引入了主动自注意神经过程(ASAP),这是一种解决这一挑战的新方法,它结合了最新的深度学习方法和经典的贝叶斯模型,通过自校准策略提供可扩展和准确的预测,以确保可靠性。从技术上讲,ASAP新颖地集成了自关注,以帮助在各种输入条件下识别和优先考虑关键特征,并采用神经过程对最终时序结果进行基于信心的预测。此外,ASAP还配备了主动学习功能,可以自我完善和自我纠正。在基准电路上的实验评估表明,我们的方法在预测精度方面超过了STA精度的最先进工作18%。
{"title":"ASAP: Accelerating Corner-Based Timing Analysis With Bayesian Active Self-Attention Neural Process","authors":"Longze Wang;Wei W. Xing;Zhelong Wang;Christos Sotiriou;Nikolaos Sketopoulos;Ning Xu;Yuanqing Cheng","doi":"10.1109/TCAD.2025.3581474","DOIUrl":"https://doi.org/10.1109/TCAD.2025.3581474","url":null,"abstract":"With the advancement of modern nanoscale technology nodes, static timing analysis (STA) has become an indispensable technique for ensuring circuit reliability and performance across diverse process conditions. However, traditional STA methods scale poorly to the explosion of process corners in the nanoscale fabrication technology. Despite some seminal works in using AI to accelerate such processes, they either lack reliability or stability. To this end, we introduce active self-attention neural process (ASAP), a novel approach addressing this challenge by combining both the latest deep learning methods and the classical Bayesian models to deliver scalable and accurate predictions with a self-calibration strategy to ensure reliability. Technically, the ASAP novelly integrates self-attention to help identify and prioritize crucial features under various input conditions and employs neural process to make confidence-based predictions for the final timing results. Furthermore, ASAP is equipped with Active Learning for self-refinement and self-correction. Experimental evaluations on benchmark circuits demonstrate that our method surpasses state-of-the-art work in STA accuracy by 18% in terms of prediction accuracy.","PeriodicalId":13251,"journal":{"name":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","volume":"45 1","pages":"480-493"},"PeriodicalIF":2.9,"publicationDate":"2025-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145904294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems publication information IEEE集成电路与系统计算机辅助设计汇刊
IF 2.7 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-19 DOI: 10.1109/TCAD.2025.3572228
{"title":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems publication information","authors":"","doi":"10.1109/TCAD.2025.3572228","DOIUrl":"https://doi.org/10.1109/TCAD.2025.3572228","url":null,"abstract":"","PeriodicalId":13251,"journal":{"name":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","volume":"44 7","pages":"C3-C3"},"PeriodicalIF":2.7,"publicationDate":"2025-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11044979","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144322968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HiePlace: Efficient Hierarchical PCB Placement HiePlace:高效分层PCB放置
IF 2.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-17 DOI: 10.1109/TCAD.2025.3580511
Shanyi Li;Zhen Zhuang;Mingyu Liu;Weihua Sheng;Bei Yu;Tsung-Yi Ho
Due to the rapid expansion of printed circuit board (PCB) designs, accompanied by diverse design rules and specific constraints, there has been a substantial increase in manual design engineering efforts. To address this challenge, industries are seeking productivity improvements through automated placement techniques. However, existing placers primarily target VLSI placement and do not align well with PCBs’ unique characteristics. This mismatch arises from both the customization of PCBs and the complexity of the problem, which involves considering various constraints such as priorities, irregularities, and alignment. This article introduces HiePlace, an efficient mathematical-programming (MP)-based placement framework designed explicitly for PCBs. It aims to address the diverse constraints and achieve better performance. To address the issue of time-consuming computation in the direct MP-based algorithm, we present two innovative acceleration techniques: 1) in the initial stage, we introduce a dynamic programming approach to prioritize the placement of core components. This technique effectively reduces the solution space and enhances the overall placement quality and 2) in addition, we propose a relaxation algorithm to minimize the number of boolean variables and further narrow down the solution space. This approach enables more efficient placement results by considering the problem’s specific constraints. Experimental results show that the proposed framework produces $7.7times $ speed up and 66% cost reduction.
由于印刷电路板(PCB)设计的迅速扩展,伴随着不同的设计规则和特定的限制,人工设计工程的工作量大幅增加。为了应对这一挑战,行业正在寻求通过自动化放置技术来提高生产率。然而,现有的贴片机主要针对超大规模集成电路的贴片,并不能很好地与pcb的独特特性相匹配。这种不匹配源于pcb的定制和问题的复杂性,这涉及到考虑各种约束,如优先级、不规则性和对齐。本文介绍了HiePlace,一个有效的基于数学规划(MP)的布局框架,明确为pcb设计。它旨在解决各种约束并实现更好的性能。为了解决直接基于mp的算法中计算时间长的问题,我们提出了两种创新的加速技术:1)在初始阶段,我们引入动态规划方法来优先考虑核心部件的放置。该技术有效地减少了解空间,提高了整体放置质量。2)此外,我们提出了一种松弛算法来最小化布尔变量的数量,进一步缩小了解空间。这种方法通过考虑问题的特定约束,实现更有效的放置结果。实验结果表明,该框架的速度提高了7.7倍,成本降低了66%。
{"title":"HiePlace: Efficient Hierarchical PCB Placement","authors":"Shanyi Li;Zhen Zhuang;Mingyu Liu;Weihua Sheng;Bei Yu;Tsung-Yi Ho","doi":"10.1109/TCAD.2025.3580511","DOIUrl":"https://doi.org/10.1109/TCAD.2025.3580511","url":null,"abstract":"Due to the rapid expansion of printed circuit board (PCB) designs, accompanied by diverse design rules and specific constraints, there has been a substantial increase in manual design engineering efforts. To address this challenge, industries are seeking productivity improvements through automated placement techniques. However, existing placers primarily target VLSI placement and do not align well with PCBs’ unique characteristics. This mismatch arises from both the customization of PCBs and the complexity of the problem, which involves considering various constraints such as priorities, irregularities, and alignment. This article introduces HiePlace, an efficient mathematical-programming (MP)-based placement framework designed explicitly for PCBs. It aims to address the diverse constraints and achieve better performance. To address the issue of time-consuming computation in the direct MP-based algorithm, we present two innovative acceleration techniques: 1) in the initial stage, we introduce a dynamic programming approach to prioritize the placement of core components. This technique effectively reduces the solution space and enhances the overall placement quality and 2) in addition, we propose a relaxation algorithm to minimize the number of boolean variables and further narrow down the solution space. This approach enables more efficient placement results by considering the problem’s specific constraints. Experimental results show that the proposed framework produces <inline-formula> <tex-math>$7.7times $ </tex-math></inline-formula> speed up and 66% cost reduction.","PeriodicalId":13251,"journal":{"name":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","volume":"45 1","pages":"428-440"},"PeriodicalIF":2.9,"publicationDate":"2025-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11039086","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145904266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ring Oscillator-Based PreBond TSV Testing Method With Classification and Grading of Defects 基于环形振荡器的键前TSV缺陷分类分级检测方法
IF 2.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-16 DOI: 10.1109/TCAD.2025.3580310
Xianrui Dou;Huaguo Liang;Zhengfeng Huang;Yingchun Lu;Tian Chen;Jun Liu
The immaturity of manufacturing processes often leads to a high incidence of defects in through-silicon vias (TSVs). Prebond TSV testing is essential for optimizing the yield of chip-based integrated circuits. However, current testing methods are limited by their incomprehensive fault coverage and difficulty detecting subtle defects. Furthermore, these methods exhibit significant performance variability due to changes in process angle, power supply voltage, and temperature (PVT). To overcome these limitations, this article introduces an innovative ring oscillator (RO)-based prebond test method specifically designed for TSVs, with a robust system for defect classification and grading. By sampling each node of the RO oscillating ring, the proposed method enhances the resolution of the Time-to-Digital Converter, thereby improving the defect detection capability. Additionally, a weak current source, constructed utilizing the unique properties of MOS transistors, enables the precise detection of open faults, resistive open defects with $R_{text {open}} geq 1.5~{mathrm {K}} {mathrm {Omega }}$ , and leakage defects with $R_{text {leak}} leq 10~{mathrm {G}} {mathrm {Omega }}$ . To further mitigate the impact of PVT variations on test results, this article integrates advanced machine learning techniques for defect classification and grading, providing valuable insights for fault bin classification and fault diagnosis. This innovative approach contributes significantly to the advancement of 3-D IC reliability assessment.
制造工艺的不成熟往往导致硅通孔(tsv)缺陷的高发。预粘接TSV测试对于优化基于芯片的集成电路的成品率至关重要。然而,现有的检测方法存在着故障覆盖不全面、难以检测细微缺陷等问题。此外,由于工艺角度、电源电压和温度(PVT)的变化,这些方法表现出显著的性能变化。为了克服这些限制,本文介绍了一种专门为tsv设计的创新的基于环形振荡器(RO)的预粘合测试方法,该方法具有强大的缺陷分类和分级系统。该方法通过对RO振荡环的每个节点进行采样,提高了时间-数字转换器的分辨率,从而提高了缺陷检测能力。此外,利用MOS晶体管的独特特性构建的弱电源可以精确检测出开路故障、电阻性开路缺陷($R_{text {open}} geq 1.5~{mathrm {K}} {mathrm {Omega }}$)和泄漏缺陷($R_{text {leak}} leq 10~{mathrm {G}} {mathrm {Omega }}$)。为了进一步减轻PVT变化对测试结果的影响,本文集成了用于缺陷分类和分级的先进机器学习技术,为故障分类和故障诊断提供了有价值的见解。这一创新方法为三维集成电路可靠性评估的发展做出了重要贡献。
{"title":"Ring Oscillator-Based PreBond TSV Testing Method With Classification and Grading of Defects","authors":"Xianrui Dou;Huaguo Liang;Zhengfeng Huang;Yingchun Lu;Tian Chen;Jun Liu","doi":"10.1109/TCAD.2025.3580310","DOIUrl":"https://doi.org/10.1109/TCAD.2025.3580310","url":null,"abstract":"The immaturity of manufacturing processes often leads to a high incidence of defects in through-silicon vias (TSVs). Prebond TSV testing is essential for optimizing the yield of chip-based integrated circuits. However, current testing methods are limited by their incomprehensive fault coverage and difficulty detecting subtle defects. Furthermore, these methods exhibit significant performance variability due to changes in process angle, power supply voltage, and temperature (PVT). To overcome these limitations, this article introduces an innovative ring oscillator (RO)-based prebond test method specifically designed for TSVs, with a robust system for defect classification and grading. By sampling each node of the RO oscillating ring, the proposed method enhances the resolution of the Time-to-Digital Converter, thereby improving the defect detection capability. Additionally, a weak current source, constructed utilizing the unique properties of MOS transistors, enables the precise detection of open faults, resistive open defects with <inline-formula> <tex-math>$R_{text {open}} geq 1.5~{mathrm {K}} {mathrm {Omega }}$ </tex-math></inline-formula>, and leakage defects with <inline-formula> <tex-math>$R_{text {leak}} leq 10~{mathrm {G}} {mathrm {Omega }}$ </tex-math></inline-formula>. To further mitigate the impact of PVT variations on test results, this article integrates advanced machine learning techniques for defect classification and grading, providing valuable insights for fault bin classification and fault diagnosis. This innovative approach contributes significantly to the advancement of 3-D IC reliability assessment.","PeriodicalId":13251,"journal":{"name":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","volume":"45 1","pages":"387-395"},"PeriodicalIF":2.9,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145904284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Efficient Ensemble Framework to Assist Profiled Side-Channel Analysis by Machine Learning 利用机器学习辅助侧通道分析的高效集成框架
IF 2.9 3区 计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Pub Date : 2025-06-16 DOI: 10.1109/TCAD.2025.3580344
Yaoling Ding;Yuwei Zhang;An Wang;Shaofei Sun;Congming Wei;Liehuang Zhu
The application of machine learning techniques in side-channel analysis (SCA) has recently received increased attention. Finding the best hyperparameters to achieve optimal performance for machine learning models in SCA is still a challenging endeavor. In order to solve the problem, we present an efficient ensemble framework designed to support profiled SCA for attacking cryptographic devices with countermeasures. Our proposed framework can partially mitigate the impact of traditional countermeasures employed in cryptographic devices. Additionally, we introduce a novel voting method called elite voting, which leverages candidate keys with higher probabilities to recover the secret key and adjusts the voting weights for better candidate keys. Experimental results illustrate that our proposed framework can effectively recover the right key from cryptographic devices with countermeasures through multiple experiments. It enhances the signal-to-noise ratio of traces and successfully recovers the right key across various datasets. Furthermore, when compared to traditional methods, our elite voting method further enhances the performance of ensemble learning by reducing the number of traces needed to recover the secret key. It exhibits superior performance compared to other ensemble methods, as it can reduce the minimum required number of traces significantly.
机器学习技术在侧信道分析(SCA)中的应用近年来受到越来越多的关注。为SCA中的机器学习模型找到最佳超参数以实现最佳性能仍然是一项具有挑战性的工作。为了解决这个问题,我们提出了一个有效的集成框架,旨在支持配置SCA攻击具有对策的加密设备。我们提出的框架可以部分减轻加密设备中采用的传统对策的影响。此外,我们引入了一种新的投票方法,称为精英投票,它利用具有更高概率的候选密钥来恢复秘密密钥,并调整投票权重以获得更好的候选密钥。实验结果表明,通过多次实验,我们提出的框架可以有效地从具有对抗措施的加密设备中恢复正确的密钥。它提高了迹线的信噪比,并成功地恢复了不同数据集的正确键。此外,与传统方法相比,我们的精英投票方法通过减少恢复密钥所需的跟踪数量,进一步提高了集成学习的性能。与其他集成方法相比,它表现出优越的性能,因为它可以显着减少所需的最小走线数量。
{"title":"An Efficient Ensemble Framework to Assist Profiled Side-Channel Analysis by Machine Learning","authors":"Yaoling Ding;Yuwei Zhang;An Wang;Shaofei Sun;Congming Wei;Liehuang Zhu","doi":"10.1109/TCAD.2025.3580344","DOIUrl":"https://doi.org/10.1109/TCAD.2025.3580344","url":null,"abstract":"The application of machine learning techniques in side-channel analysis (SCA) has recently received increased attention. Finding the best hyperparameters to achieve optimal performance for machine learning models in SCA is still a challenging endeavor. In order to solve the problem, we present an efficient ensemble framework designed to support profiled SCA for attacking cryptographic devices with countermeasures. Our proposed framework can partially mitigate the impact of traditional countermeasures employed in cryptographic devices. Additionally, we introduce a novel voting method called elite voting, which leverages candidate keys with higher probabilities to recover the secret key and adjusts the voting weights for better candidate keys. Experimental results illustrate that our proposed framework can effectively recover the right key from cryptographic devices with countermeasures through multiple experiments. It enhances the signal-to-noise ratio of traces and successfully recovers the right key across various datasets. Furthermore, when compared to traditional methods, our elite voting method further enhances the performance of ensemble learning by reducing the number of traces needed to recover the secret key. It exhibits superior performance compared to other ensemble methods, as it can reduce the minimum required number of traces significantly.","PeriodicalId":13251,"journal":{"name":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","volume":"45 1","pages":"494-505"},"PeriodicalIF":2.9,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145904336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1