基于芯片的2.5D/3D先进封装系统PSI优化

Yoon-Yi Hwang, S. Moon, Seungki Nam, Jeong HoonAhn
{"title":"基于芯片的2.5D/3D先进封装系统PSI优化","authors":"Yoon-Yi Hwang, S. Moon, Seungki Nam, Jeong HoonAhn","doi":"10.1109/ectc51906.2022.00010","DOIUrl":null,"url":null,"abstract":"In this work, we propose a novel chiplet platform for 2.5D/3D IC Integration. Given specific design requirements, the Samsung chipletadvanced platform engine (SCAPE) can provide an integrated image of suitable advanced packaging solutionsfrom multi-chip module (MCM) or 2.5D silicon interposer or 3D stacked structures, taking into account the evaluation metrics (performance, power and area: PPA) of system and die-to-die (D2D) interconnect. It can also project an optimal design balance between system performance and cost which is closely related to die size. In a chiplet design perspective, multiple solutions for various specifications may be presented simply, but the architecture-based optimal integrated solution can be allowed only right after performance and cost are thoroughly understood. For that purpose, reference architectures are proposed to be analyzed in terms of power, area and latency at the same bandwidth requirement. As the MCM, 2.5D and 3D structures in sequence shorten the D2D distance, it can mitigate the design overhead for chiplet implementation by reducing the interface IP area and required power consumption. In terms of power and area overhead when compared to a 2D monolithic design, for homogeneously split dies, MCM, 2.5D and 3D design cases show that additional power increase 2.1%, 1.1% and 0.04% respectively and show that additional area increase by 5.6%, 2.4% and 2.4% in a HPC/AI case with 450mm2 diesize. In addition, two best heterogeneous practices are created and analyzed. From the experiments, it clearly shows that 3D face-to-face (F2F) structure is the best option with obvious metrics including system power overhead of 0.11% and system area overhead of 1.9% increase for a bandwidth-centric system with 340W and 700mm2like GPU/NPU from MCM, 2.5D and 3D package candidates. Moreover, in the other latency-centric system with150W and 420mm2 like CPU, it can be seen that 3D F2F case with 25μm μ-bump pitchworks up to 12.5X TBps areal BW density and 80μm C4bump pitch also work up to 8.5X W/mm2 areal power density due to their physical limitation. With respect to power and signal integrity (PSI) of interface elements under various packaging candidates, this work is helpful to understand which chiplet configuration is the best option with obvious metrics and physical limitations of advanced packages, and the need to improve interfaces such as μ-bump or C4bump especially in 3D stacked ICs. We also completed a hierarchical impact diagram of configured systems considering the overhead of interface/TSV itself, die split, test circuitry, and P&R affected by the existence of TSVs. Therefore, in considering the movement toward the era of beyond Moore's Law in the performance-/cost-driven semiconductor industry, this work is expected to serve as a future chiplet reference platform which can provide differentiating solutions for quick adoption of designs.","PeriodicalId":139520,"journal":{"name":"2022 IEEE 72nd Electronic Components and Technology Conference (ECTC)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Chiplet-based System PSI Optimization for 2.5D/3D Advanced Packaging Implementation\",\"authors\":\"Yoon-Yi Hwang, S. Moon, Seungki Nam, Jeong HoonAhn\",\"doi\":\"10.1109/ectc51906.2022.00010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work, we propose a novel chiplet platform for 2.5D/3D IC Integration. Given specific design requirements, the Samsung chipletadvanced platform engine (SCAPE) can provide an integrated image of suitable advanced packaging solutionsfrom multi-chip module (MCM) or 2.5D silicon interposer or 3D stacked structures, taking into account the evaluation metrics (performance, power and area: PPA) of system and die-to-die (D2D) interconnect. It can also project an optimal design balance between system performance and cost which is closely related to die size. In a chiplet design perspective, multiple solutions for various specifications may be presented simply, but the architecture-based optimal integrated solution can be allowed only right after performance and cost are thoroughly understood. For that purpose, reference architectures are proposed to be analyzed in terms of power, area and latency at the same bandwidth requirement. As the MCM, 2.5D and 3D structures in sequence shorten the D2D distance, it can mitigate the design overhead for chiplet implementation by reducing the interface IP area and required power consumption. In terms of power and area overhead when compared to a 2D monolithic design, for homogeneously split dies, MCM, 2.5D and 3D design cases show that additional power increase 2.1%, 1.1% and 0.04% respectively and show that additional area increase by 5.6%, 2.4% and 2.4% in a HPC/AI case with 450mm2 diesize. In addition, two best heterogeneous practices are created and analyzed. From the experiments, it clearly shows that 3D face-to-face (F2F) structure is the best option with obvious metrics including system power overhead of 0.11% and system area overhead of 1.9% increase for a bandwidth-centric system with 340W and 700mm2like GPU/NPU from MCM, 2.5D and 3D package candidates. Moreover, in the other latency-centric system with150W and 420mm2 like CPU, it can be seen that 3D F2F case with 25μm μ-bump pitchworks up to 12.5X TBps areal BW density and 80μm C4bump pitch also work up to 8.5X W/mm2 areal power density due to their physical limitation. With respect to power and signal integrity (PSI) of interface elements under various packaging candidates, this work is helpful to understand which chiplet configuration is the best option with obvious metrics and physical limitations of advanced packages, and the need to improve interfaces such as μ-bump or C4bump especially in 3D stacked ICs. We also completed a hierarchical impact diagram of configured systems considering the overhead of interface/TSV itself, die split, test circuitry, and P&R affected by the existence of TSVs. Therefore, in considering the movement toward the era of beyond Moore's Law in the performance-/cost-driven semiconductor industry, this work is expected to serve as a future chiplet reference platform which can provide differentiating solutions for quick adoption of designs.\",\"PeriodicalId\":139520,\"journal\":{\"name\":\"2022 IEEE 72nd Electronic Components and Technology Conference (ECTC)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 72nd Electronic Components and Technology Conference (ECTC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ectc51906.2022.00010\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 72nd Electronic Components and Technology Conference (ECTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ectc51906.2022.00010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

在这项工作中,我们提出了一种新的2.5D/3D集成电路芯片平台。考虑到特定的设计要求,三星芯片先进平台引擎(SCAPE)可以从多芯片模块(MCM)或2.5D硅中间层或3D堆叠结构中提供合适的先进封装解决方案的集成图像,同时考虑到系统和模对模(D2D)互连的评估指标(性能,功率和面积:PPA)。它还可以在与模具尺寸密切相关的系统性能和成本之间实现最佳设计平衡。从芯片设计的角度来看,各种规格的多种解决方案可能会简单地呈现出来,但只有在彻底了解性能和成本之后,才能允许基于架构的最佳集成解决方案。为此,提出了参考架构,以在相同带宽要求下从功耗、面积和延迟方面进行分析。由于MCM、2.5D和3D结构依次缩短了D2D距离,因此可以通过减少接口IP面积和所需功耗来减轻芯片实现的设计开销。在功耗和面积开销方面,与2D单片设计相比,对于均匀分裂的芯片,MCM, 2.5D和3D设计案例分别显示额外的功耗增加2.1%,1.1%和0.04%,并且在450mm2尺寸的HPC/AI案例中显示额外的面积增加5.6%,2.4%和2.4%。此外,还创建并分析了两个最佳异构实践。从实验中,它清楚地表明3D面对面(F2F)结构是最佳选择,具有明显的指标,包括系统功耗开销0.11%,系统面积开销增加1.9%,对于带宽为中心的系统,来自MCM, 2.5D和3D封装候选的340W和700mm2类GPU/NPU。此外,在其他具有150w和420mm2 CPU的以延迟为中心的系统中,可以看到25μm μ bump螺距的3D F2F机箱由于其物理限制,其面功率密度可达12.5 TBps, 80μm C4bump螺距的面功率密度可达8.5X W/mm2。对于不同封装方案下接口元件的功率和信号完整性(PSI),本工作有助于了解在先进封装明显的度量和物理限制下,哪种芯片配置是最佳选择,以及改进μ-bump或C4bump等接口的必要性,特别是在3D堆叠ic中。我们还完成了配置系统的分层影响图,考虑了接口/TSV本身的开销、芯片分裂、测试电路和TSV存在所影响的P&R。因此,考虑到性能/成本驱动的半导体行业走向超越摩尔定律的时代,这项工作有望成为未来的芯片参考平台,可以为快速采用设计提供差异化的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Chiplet-based System PSI Optimization for 2.5D/3D Advanced Packaging Implementation
In this work, we propose a novel chiplet platform for 2.5D/3D IC Integration. Given specific design requirements, the Samsung chipletadvanced platform engine (SCAPE) can provide an integrated image of suitable advanced packaging solutionsfrom multi-chip module (MCM) or 2.5D silicon interposer or 3D stacked structures, taking into account the evaluation metrics (performance, power and area: PPA) of system and die-to-die (D2D) interconnect. It can also project an optimal design balance between system performance and cost which is closely related to die size. In a chiplet design perspective, multiple solutions for various specifications may be presented simply, but the architecture-based optimal integrated solution can be allowed only right after performance and cost are thoroughly understood. For that purpose, reference architectures are proposed to be analyzed in terms of power, area and latency at the same bandwidth requirement. As the MCM, 2.5D and 3D structures in sequence shorten the D2D distance, it can mitigate the design overhead for chiplet implementation by reducing the interface IP area and required power consumption. In terms of power and area overhead when compared to a 2D monolithic design, for homogeneously split dies, MCM, 2.5D and 3D design cases show that additional power increase 2.1%, 1.1% and 0.04% respectively and show that additional area increase by 5.6%, 2.4% and 2.4% in a HPC/AI case with 450mm2 diesize. In addition, two best heterogeneous practices are created and analyzed. From the experiments, it clearly shows that 3D face-to-face (F2F) structure is the best option with obvious metrics including system power overhead of 0.11% and system area overhead of 1.9% increase for a bandwidth-centric system with 340W and 700mm2like GPU/NPU from MCM, 2.5D and 3D package candidates. Moreover, in the other latency-centric system with150W and 420mm2 like CPU, it can be seen that 3D F2F case with 25μm μ-bump pitchworks up to 12.5X TBps areal BW density and 80μm C4bump pitch also work up to 8.5X W/mm2 areal power density due to their physical limitation. With respect to power and signal integrity (PSI) of interface elements under various packaging candidates, this work is helpful to understand which chiplet configuration is the best option with obvious metrics and physical limitations of advanced packages, and the need to improve interfaces such as μ-bump or C4bump especially in 3D stacked ICs. We also completed a hierarchical impact diagram of configured systems considering the overhead of interface/TSV itself, die split, test circuitry, and P&R affected by the existence of TSVs. Therefore, in considering the movement toward the era of beyond Moore's Law in the performance-/cost-driven semiconductor industry, this work is expected to serve as a future chiplet reference platform which can provide differentiating solutions for quick adoption of designs.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Transient Thermal Modeling of Die Bond Process in Multiple Die Stacked Flash Memory Package Development and Application of the Moisture-Dependent Viscoelastic Model of Polyimide in Hygro-Thermo-Mechanical Analysis of Fan-Out Interconnect Superb sinterability of the Cu paste consisting of bimodal size distribution Cu nanoparticles for low-temperature and pressureless sintering of large-area die attachment and the sintering mechanism Demonstration of Substrate Embedded Ni-Zn Ferrite Core Solenoid Inductors Using a Photosensitive Glass Substrate A De-Embedding and Embedding Procedure for High-Speed Channel Eye Diagram Oscilloscope Measurement
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1