基于芯片的2.5D/3D先进封装系统PSI优化

2022 IEEE 72nd Electronic Components and Technology Conference (ECTC) Pub Date : 2022-05-01 DOI:10.1109/ectc51906.2022.00010

Yoon-Yi Hwang, S. Moon, Seungki Nam, Jeong HoonAhn

{"title":"基于芯片的2.5D/3D先进封装系统PSI优化","authors":"Yoon-Yi Hwang, S. Moon, Seungki Nam, Jeong HoonAhn","doi":"10.1109/ectc51906.2022.00010","DOIUrl":null,"url":null,"abstract":"In this work, we propose a novel chiplet platform for 2.5D/3D IC Integration. Given specific design requirements, the Samsung chipletadvanced platform engine (SCAPE) can provide an integrated image of suitable advanced packaging solutionsfrom multi-chip module (MCM) or 2.5D silicon interposer or 3D stacked structures, taking into account the evaluation metrics (performance, power and area: PPA) of system and die-to-die (D2D) interconnect. It can also project an optimal design balance between system performance and cost which is closely related to die size. In a chiplet design perspective, multiple solutions for various specifications may be presented simply, but the architecture-based optimal integrated solution can be allowed only right after performance and cost are thoroughly understood. For that purpose, reference architectures are proposed to be analyzed in terms of power, area and latency at the same bandwidth requirement. As the MCM, 2.5D and 3D structures in sequence shorten the D2D distance, it can mitigate the design overhead for chiplet implementation by reducing the interface IP area and required power consumption. In terms of power and area overhead when compared to a 2D monolithic design, for homogeneously split dies, MCM, 2.5D and 3D design cases show that additional power increase 2.1%, 1.1% and 0.04% respectively and show that additional area increase by 5.6%, 2.4% and 2.4% in a HPC/AI case with 450mm2 diesize. In addition, two best heterogeneous practices are created and analyzed. From the experiments, it clearly shows that 3D face-to-face (F2F) structure is the best option with obvious metrics including system power overhead of 0.11% and system area overhead of 1.9% increase for a bandwidth-centric system with 340W and 700mm2like GPU/NPU from MCM, 2.5D and 3D package candidates. Moreover, in the other latency-centric system with150W and 420mm2 like CPU, it can be seen that 3D F2F case with 25μm μ-bump pitchworks up to 12.5X TBps areal BW density and 80μm C4bump pitch also work up to 8.5X W/mm2 areal power density due to their physical limitation. With respect to power and signal integrity (PSI) of interface elements under various packaging candidates, this work is helpful to understand which chiplet configuration is the best option with obvious metrics and physical limitations of advanced packages, and the need to improve interfaces such as μ-bump or C4bump especially in 3D stacked ICs. We also completed a hierarchical impact diagram of configured systems considering the overhead of interface/TSV itself, die split, test circuitry, and P&R affected by the existence of TSVs. Therefore, in considering the movement toward the era of beyond Moore's Law in the performance-/cost-driven semiconductor industry, this work is expected to serve as a future chiplet reference platform which can provide differentiating solutions for quick adoption of designs.","PeriodicalId":139520,"journal":{"name":"2022 IEEE 72nd Electronic Components and Technology Conference (ECTC)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Chiplet-based System PSI Optimization for 2.5D/3D Advanced Packaging Implementation\",\"authors\":\"Yoon-Yi Hwang, S. Moon, Seungki Nam, Jeong HoonAhn\",\"doi\":\"10.1109/ectc51906.2022.00010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work, we propose a novel chiplet platform for 2.5D/3D IC Integration. Given specific design requirements, the Samsung chipletadvanced platform engine (SCAPE) can provide an integrated image of suitable advanced packaging solutionsfrom multi-chip module (MCM) or 2.5D silicon interposer or 3D stacked structures, taking into account the evaluation metrics (performance, power and area: PPA) of system and die-to-die (D2D) interconnect. It can also project an optimal design balance between system performance and cost which is closely related to die size. In a chiplet design perspective, multiple solutions for various specifications may be presented simply, but the architecture-based optimal integrated solution can be allowed only right after performance and cost are thoroughly understood. For that purpose, reference architectures are proposed to be analyzed in terms of power, area and latency at the same bandwidth requirement. As the MCM, 2.5D and 3D structures in sequence shorten the D2D distance, it can mitigate the design overhead for chiplet implementation by reducing the interface IP area and required power consumption. In terms of power and area overhead when compared to a 2D monolithic design, for homogeneously split dies, MCM, 2.5D and 3D design cases show that additional power increase 2.1%, 1.1% and 0.04% respectively and show that additional area increase by 5.6%, 2.4% and 2.4% in a HPC/AI case with 450mm2 diesize. In addition, two best heterogeneous practices are created and analyzed. From the experiments, it clearly shows that 3D face-to-face (F2F) structure is the best option with obvious metrics including system power overhead of 0.11% and system area overhead of 1.9% increase for a bandwidth-centric system with 340W and 700mm2like GPU/NPU from MCM, 2.5D and 3D package candidates. Moreover, in the other latency-centric system with150W and 420mm2 like CPU, it can be seen that 3D F2F case with 25μm μ-bump pitchworks up to 12.5X TBps areal BW density and 80μm C4bump pitch also work up to 8.5X W/mm2 areal power density due to their physical limitation. With respect to power and signal integrity (PSI) of interface elements under various packaging candidates, this work is helpful to understand which chiplet configuration is the best option with obvious metrics and physical limitations of advanced packages, and the need to improve interfaces such as μ-bump or C4bump especially in 3D stacked ICs. We also completed a hierarchical impact diagram of configured systems considering the overhead of interface/TSV itself, die split, test circuitry, and P&R affected by the existence of TSVs. Therefore, in considering the movement toward the era of beyond Moore's Law in the performance-/cost-driven semiconductor industry, this work is expected to serve as a future chiplet reference platform which can provide differentiating solutions for quick adoption of designs.\",\"PeriodicalId\":139520,\"journal\":{\"name\":\"2022 IEEE 72nd Electronic Components and Technology Conference (ECTC)\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 72nd Electronic Components and Technology Conference (ECTC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ectc51906.2022.00010\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 72nd Electronic Components and Technology Conference (ECTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ectc51906.2022.00010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

在这项工作中，我们提出了一种新的2.5D/3D集成电路芯片平台。考虑到特定的设计要求，三星芯片先进平台引擎(SCAPE)可以从多芯片模块(MCM)或2.5D硅中间层或3D堆叠结构中提供合适的先进封装解决方案的集成图像，同时考虑到系统和模对模(D2D)互连的评估指标(性能，功率和面积:PPA)。它还可以在与模具尺寸密切相关的系统性能和成本之间实现最佳设计平衡。从芯片设计的角度来看，各种规格的多种解决方案可能会简单地呈现出来，但只有在彻底了解性能和成本之后，才能允许基于架构的最佳集成解决方案。为此，提出了参考架构，以在相同带宽要求下从功耗、面积和延迟方面进行分析。由于MCM、2.5D和3D结构依次缩短了D2D距离，因此可以通过减少接口IP面积和所需功耗来减轻芯片实现的设计开销。在功耗和面积开销方面，与2D单片设计相比，对于均匀分裂的芯片，MCM, 2.5D和3D设计案例分别显示额外的功耗增加2.1%，1.1%和0.04%，并且在450mm2尺寸的HPC/AI案例中显示额外的面积增加5.6%，2.4%和2.4%。此外，还创建并分析了两个最佳异构实践。从实验中，它清楚地表明3D面对面(F2F)结构是最佳选择，具有明显的指标，包括系统功耗开销0.11%，系统面积开销增加1.9%，对于带宽为中心的系统，来自MCM, 2.5D和3D封装候选的340W和700mm2类GPU/NPU。此外，在其他具有150w和420mm2 CPU的以延迟为中心的系统中，可以看到25μm μ bump螺距的3D F2F机箱由于其物理限制，其面功率密度可达12.5 TBps, 80μm C4bump螺距的面功率密度可达8.5X W/mm2。对于不同封装方案下接口元件的功率和信号完整性(PSI)，本工作有助于了解在先进封装明显的度量和物理限制下，哪种芯片配置是最佳选择，以及改进μ-bump或C4bump等接口的必要性，特别是在3D堆叠ic中。我们还完成了配置系统的分层影响图，考虑了接口/TSV本身的开销、芯片分裂、测试电路和TSV存在所影响的P&R。因此，考虑到性能/成本驱动的半导体行业走向超越摩尔定律的时代，这项工作有望成为未来的芯片参考平台，可以为快速采用设计提供差异化的解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Chiplet-based System PSI Optimization for 2.5D/3D Advanced Packaging Implementation

In this work, we propose a novel chiplet platform for 2.5D/3D IC Integration. Given specific design requirements, the Samsung chipletadvanced platform engine (SCAPE) can provide an integrated image of suitable advanced packaging solutionsfrom multi-chip module (MCM) or 2.5D silicon interposer or 3D stacked structures, taking into account the evaluation metrics (performance, power and area: PPA) of system and die-to-die (D2D) interconnect. It can also project an optimal design balance between system performance and cost which is closely related to die size. In a chiplet design perspective, multiple solutions for various specifications may be presented simply, but the architecture-based optimal integrated solution can be allowed only right after performance and cost are thoroughly understood. For that purpose, reference architectures are proposed to be analyzed in terms of power, area and latency at the same bandwidth requirement. As the MCM, 2.5D and 3D structures in sequence shorten the D2D distance, it can mitigate the design overhead for chiplet implementation by reducing the interface IP area and required power consumption. In terms of power and area overhead when compared to a 2D monolithic design, for homogeneously split dies, MCM, 2.5D and 3D design cases show that additional power increase 2.1%, 1.1% and 0.04% respectively and show that additional area increase by 5.6%, 2.4% and 2.4% in a HPC/AI case with 450mm2 diesize. In addition, two best heterogeneous practices are created and analyzed. From the experiments, it clearly shows that 3D face-to-face (F2F) structure is the best option with obvious metrics including system power overhead of 0.11% and system area overhead of 1.9% increase for a bandwidth-centric system with 340W and 700mm2like GPU/NPU from MCM, 2.5D and 3D package candidates. Moreover, in the other latency-centric system with150W and 420mm2 like CPU, it can be seen that 3D F2F case with 25μm μ-bump pitchworks up to 12.5X TBps areal BW density and 80μm C4bump pitch also work up to 8.5X W/mm2 areal power density due to their physical limitation. With respect to power and signal integrity (PSI) of interface elements under various packaging candidates, this work is helpful to understand which chiplet configuration is the best option with obvious metrics and physical limitations of advanced packages, and the need to improve interfaces such as μ-bump or C4bump especially in 3D stacked ICs. We also completed a hierarchical impact diagram of configured systems considering the overhead of interface/TSV itself, die split, test circuitry, and P&R affected by the existence of TSVs. Therefore, in considering the movement toward the era of beyond Moore's Law in the performance-/cost-driven semiconductor industry, this work is expected to serve as a future chiplet reference platform which can provide differentiating solutions for quick adoption of designs.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE 72nd Electronic Components and Technology Conference (ECTC)

自引率

0.00%

发文量