首页 > 最新文献

2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC)最新文献

英文 中文
Heterogeneous Graph Attention Network Based Statistical Timing Library Characterization with Parasitic RC Reduction 基于异构图注意网络的统计时序库特征描述与寄生 RC 缩减
Pub Date : 2024-01-22 DOI: 10.1109/ASP-DAC58780.2024.10473881
Xu Cheng, Yuyang Ye, Guoqing He, Qianqian Song, Peng Cao
Statistical timing characterization for standard cell library poses significant challenge to accuracy and runtime cost. Prior analytical and machine learning-based methods neglect the profound influence induced by layout-dependent parasitic resistor and capacitor (RC) network in cell netlist as well as the timing correlation between topological structures of cells and process, voltage, and temperature (PVT) corners, resulting in tremendous simulation effort and/or poor accuracy. In this work, an accurate and efficient statistical cell timing library characterization framework is proposed based on heterogeneous graph attention network (HGAT) assisted with parasitic RC reduction approach, where the transistors and parasitic RC in cell are represented as heterogeneous nodes for graph learning and redundant RC nodes are removed to alleviate node imbalance issue and improve prediction accuracy. The proposed framework was validated with TSMC 22nm standard cells under multiple PVT corners to predict the standard deviation of cell delay with the error of 2.67% on average for all validated cells in terms of relative Root Mean Squared Error (rRMSE) with $3 times $ characterization runtime speedup, achieving $2.7 sim 6.9 times $ accuracy improvement compared with prior works. The predicted statistical timing libraries were further validated with ISCAS’89 benchmark circuits for statistical static timing analysis (SSTA), where the critical path delay at $3 sigma$ percentile point is reported with the average mismatch of $1.34 ps$ compared with foundry-provided library, showing $10.7 sim 14.5 times $ better accuracy than the competitive approaches.
标准单元库的统计时序特性分析对准确性和运行成本提出了巨大挑战。先前的分析和基于机器学习的方法忽视了单元网表中与布局相关的寄生电阻和电容(RC)网络以及单元拓扑结构与工艺、电压和温度(PVT)拐角之间的时序相关性所产生的深远影响,从而导致巨大的仿真工作量和/或较差的准确性。在这项工作中,基于异构图注意网络(HGAT)辅助寄生 RC 减少方法,提出了一种准确、高效的统计单元时序库表征框架。在该框架中,单元中的晶体管和寄生 RC 被表示为图学习的异构节点,多余的 RC 节点被移除,以缓解节点不平衡问题并提高预测精度。在多个PVT角下,使用台积电22纳米标准单元验证了所提出的框架,在相对均方根误差(rRMSE)方面,所有验证单元预测单元延迟标准偏差的平均误差为2.67%,特性化运行速度提高了3倍,与之前的工作相比,准确率提高了2.7 sim 6.9 times。预测的统计时序库通过ISCAS'89统计静态时序分析(SSTA)基准电路进行了进一步验证,与代工厂提供的库相比,报告了3个百分位点的关键路径延迟,平均失配为1.34 ps$,与竞争方法相比,准确度提高了10.7个百分位点14.5倍。
{"title":"Heterogeneous Graph Attention Network Based Statistical Timing Library Characterization with Parasitic RC Reduction","authors":"Xu Cheng, Yuyang Ye, Guoqing He, Qianqian Song, Peng Cao","doi":"10.1109/ASP-DAC58780.2024.10473881","DOIUrl":"https://doi.org/10.1109/ASP-DAC58780.2024.10473881","url":null,"abstract":"Statistical timing characterization for standard cell library poses significant challenge to accuracy and runtime cost. Prior analytical and machine learning-based methods neglect the profound influence induced by layout-dependent parasitic resistor and capacitor (RC) network in cell netlist as well as the timing correlation between topological structures of cells and process, voltage, and temperature (PVT) corners, resulting in tremendous simulation effort and/or poor accuracy. In this work, an accurate and efficient statistical cell timing library characterization framework is proposed based on heterogeneous graph attention network (HGAT) assisted with parasitic RC reduction approach, where the transistors and parasitic RC in cell are represented as heterogeneous nodes for graph learning and redundant RC nodes are removed to alleviate node imbalance issue and improve prediction accuracy. The proposed framework was validated with TSMC 22nm standard cells under multiple PVT corners to predict the standard deviation of cell delay with the error of 2.67% on average for all validated cells in terms of relative Root Mean Squared Error (rRMSE) with $3 times $ characterization runtime speedup, achieving $2.7 sim 6.9 times $ accuracy improvement compared with prior works. The predicted statistical timing libraries were further validated with ISCAS’89 benchmark circuits for statistical static timing analysis (SSTA), where the critical path delay at $3 sigma$ percentile point is reported with the average mismatch of $1.34 ps$ compared with foundry-provided library, showing $10.7 sim 14.5 times $ better accuracy than the competitive approaches.","PeriodicalId":518586,"journal":{"name":"2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"178 2","pages":"171-176"},"PeriodicalIF":0.0,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140531166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Cross-layer Framework for Design Space and Variation Analysis of Non-Volatile Ferroelectric Capacitor-Based Compute-in-Memory Accelerators 基于非易失性铁电电容器的内存计算加速器的跨层设计空间和变异分析框架
Pub Date : 2024-01-22 DOI: 10.1109/ASP-DAC58780.2024.10473887
Yuan-chun Luo, James Read, A. Lu, Shimeng Yu
Using non-volatile “capacitive” crossbar arrays for compute-in-memory (CIM) offers higher energy and area efficiency compared to “resistive” crossbar arrays. However, the impact of device-to-device (D2D) variation and temporal noise on the system-level performance has not been explored yet. In this work, we provide an end-to-end methodology that incorporates experimentally measured D2D variation into the design space exploration from capacitive weight cell design, CIM array with peripheral circuits, to the inference accuracy of SwinV2-T vision transformer and ResNet-50 on the ImageNet dataset. Our framework further assesses the system’s power, performance, and area (PPA) by considering cell design, circuit structure, and model selection. We explore the design space using an early stopping algorithm to produce optimal designs while meeting strict inference accuracy requirements. Overall findings suggest that the capacitive CIM system is robust against D2D variation and noise, outperforming its resistive counterpart by $6.95 times$ and $14.1 times$ for the optimal design in the figure of merit (TOPS/W $times {mathrm {TOPS}}/mathrm{mm}^{2}$) for ResNet-50 and SwinV2-T respectively.
与 "电阻式 "横条阵列相比,使用非易失性 "电容式 "横条阵列进行内存计算(CIM)具有更高的能效和面积效率。然而,设备到设备(D2D)变化和时间噪声对系统级性能的影响尚未得到探讨。在这项工作中,我们提供了一种端到端的方法,将实验测得的 D2D 变化纳入设计空间探索,从电容式权重单元设计、带有外围电路的 CIM 阵列,到 SwinV2-T 视觉转换器和 ResNet-50 在 ImageNet 数据集上的推理准确性。我们的框架通过考虑单元设计、电路结构和模型选择,进一步评估了系统的功耗、性能和面积(PPA)。我们使用早期停止算法探索设计空间,以产生最佳设计,同时满足严格的推理精度要求。总体研究结果表明,电容式 CIM 系统对 D2D 变化和噪声具有很强的鲁棒性,在 ResNet-50 和 SwinV2-T 的优越性图(TOPS/W $times {mathrm {TOPS}}/mathrm{mm}^{2}$ )中,电容式 CIM 系统分别比电阻式 CIM 系统高出 6.95 美元和 14.1 美元。
{"title":"A Cross-layer Framework for Design Space and Variation Analysis of Non-Volatile Ferroelectric Capacitor-Based Compute-in-Memory Accelerators","authors":"Yuan-chun Luo, James Read, A. Lu, Shimeng Yu","doi":"10.1109/ASP-DAC58780.2024.10473887","DOIUrl":"https://doi.org/10.1109/ASP-DAC58780.2024.10473887","url":null,"abstract":"Using non-volatile “capacitive” crossbar arrays for compute-in-memory (CIM) offers higher energy and area efficiency compared to “resistive” crossbar arrays. However, the impact of device-to-device (D2D) variation and temporal noise on the system-level performance has not been explored yet. In this work, we provide an end-to-end methodology that incorporates experimentally measured D2D variation into the design space exploration from capacitive weight cell design, CIM array with peripheral circuits, to the inference accuracy of SwinV2-T vision transformer and ResNet-50 on the ImageNet dataset. Our framework further assesses the system’s power, performance, and area (PPA) by considering cell design, circuit structure, and model selection. We explore the design space using an early stopping algorithm to produce optimal designs while meeting strict inference accuracy requirements. Overall findings suggest that the capacitive CIM system is robust against D2D variation and noise, outperforming its resistive counterpart by $6.95 times$ and $14.1 times$ for the optimal design in the figure of merit (TOPS/W $times {mathrm {TOPS}}/mathrm{mm}^{2}$) for ResNet-50 and SwinV2-T respectively.","PeriodicalId":518586,"journal":{"name":"2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"71 3-4","pages":"159-164"},"PeriodicalIF":0.0,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140530924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Row Planning and Placement for Hybrid-Row-Height Designs 混合行高设计的行规划和布局
Pub Date : 2024-01-22 DOI: 10.1109/ASP-DAC58780.2024.10473801
Ching-Yao Huang, Wai-Kei Mak
Traditionally, a standard cell library is composed of pre-designed cells all of which have identical height so that the cells can be placed in rows of uniform height on a chip. The desire to integrate more logic gates onto a single chip has led to a continuous reduction of row height with reduced number of routing tracks over the years. It has reached a point that not all cells can be designed with the minimum row height due to internal routability issue. Hybrid-row-height IC design with placement rows of different heights has emerged which offers a better sweet spot for performance and area optimization. [7] proposed the first row planning algorithm for hybrid-row-height design based on k-means clustering to determine the row configuration so that the cells in an initial placement can be moved to rows with matching height with as little cell displacement as possible. The biggest limitation of the k-means clustering method is that it only works for designs without any macros. Here we propose an effective and highly flexible dynamic programming approach to determine an optimized row configuration for designs with or without macros. The experimental results show that for designs without any macros, our approach resulted in 30.7% reduction in total cell displacement and 7.4% reduction in the final routed wirelength on average compared to the k-means clustering approach while satisfying the timing constraints. Additional experimental results show that our approach can comfortably handle designs with macros while satisfying the timing constraints.
传统上,标准单元库由预先设计好的单元组成,所有单元的高度完全相同,因此可以在芯片上按统一高度排成单元行。由于希望在单个芯片上集成更多的逻辑门,多年来,随着布线轨道数量的减少,行高不断降低。由于内部布线问题,并非所有单元都能设计成最小行高。于是出现了混合行高集成电路设计,即采用不同高度的放置行,从而为性能和面积优化提供了更好的切入点。文献[7]首次提出了基于 k-means 聚类的混合行高设计行规划算法,以确定行配置,从而在尽可能减少单元位移的情况下,将初始布局中的单元移动到高度匹配的行中。k-means 聚类方法最大的局限性在于它只适用于没有任何宏的设计。在此,我们提出了一种有效且高度灵活的动态编程方法,用于确定有无宏设计的优化行配置。实验结果表明,与 k-means 聚类方法相比,对于没有任何宏的设计,我们的方法平均减少了 30.7% 的总单元位移,减少了 7.4% 的最终布线长度,同时满足了时序约束。其他实验结果表明,我们的方法可以轻松处理带有宏的设计,同时满足时序约束。
{"title":"Row Planning and Placement for Hybrid-Row-Height Designs","authors":"Ching-Yao Huang, Wai-Kei Mak","doi":"10.1109/ASP-DAC58780.2024.10473801","DOIUrl":"https://doi.org/10.1109/ASP-DAC58780.2024.10473801","url":null,"abstract":"Traditionally, a standard cell library is composed of pre-designed cells all of which have identical height so that the cells can be placed in rows of uniform height on a chip. The desire to integrate more logic gates onto a single chip has led to a continuous reduction of row height with reduced number of routing tracks over the years. It has reached a point that not all cells can be designed with the minimum row height due to internal routability issue. Hybrid-row-height IC design with placement rows of different heights has emerged which offers a better sweet spot for performance and area optimization. [7] proposed the first row planning algorithm for hybrid-row-height design based on k-means clustering to determine the row configuration so that the cells in an initial placement can be moved to rows with matching height with as little cell displacement as possible. The biggest limitation of the k-means clustering method is that it only works for designs without any macros. Here we propose an effective and highly flexible dynamic programming approach to determine an optimized row configuration for designs with or without macros. The experimental results show that for designs without any macros, our approach resulted in 30.7% reduction in total cell displacement and 7.4% reduction in the final routed wirelength on average compared to the k-means clustering approach while satisfying the timing constraints. Additional experimental results show that our approach can comfortably handle designs with macros while satisfying the timing constraints.","PeriodicalId":518586,"journal":{"name":"2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC)","volume":"286 6-7","pages":"306-311"},"PeriodicalIF":0.0,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140530974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1