A sampling-based acceleration method for heterogeneous chiplet NoC simulations

IF 6.2 2区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Future Generation Computer Systems-The International Journal of Escience Pub Date : 2024-12-04 DOI:10.1016/j.future.2024.107643
Ruoting Xiong , Wei Ren , Chengzhuo Zhang , Tao Li , Geyong Min
{"title":"A sampling-based acceleration method for heterogeneous chiplet NoC simulations","authors":"Ruoting Xiong ,&nbsp;Wei Ren ,&nbsp;Chengzhuo Zhang ,&nbsp;Tao Li ,&nbsp;Geyong Min","doi":"10.1016/j.future.2024.107643","DOIUrl":null,"url":null,"abstract":"<div><div>To tackle the challenges posed by Moore’s Law, Chiplet technology emerges as a promising solution. Chiplets comprising CPUs and accelerators are connected by Networks-on-Chip (NoC) for large-scale integration and efficient communications. However, the slow simulation speed of NoCs has become a bottleneck, limiting the overall performance of chiplet simulations. Existing solutions only focus on accelerating NoC simulation in homogeneous architecture. In this paper, we introduce a novel TOPSIS-based Heterogeneous Trace Score-sampling method (THTS) for faster NoC simulation in heterogeneous architecture. THTS enables quick and accurate sampling of representative NoC traces. Additionally, we propose a weight exploration model to further enhance sampling accuracy. Compared with the traditional NoC sampling method (NoCLabs), THTS reduces the error of the average packet latency by 22.17% and the total simulation time by 1.6 folds. THTS estimates the NoC performance with an average loss less than 5%, while speeding up the NoC simulation by up to 3 times. In addition, under different weight space sizes, the time required for the weight exploration model to solve the optimal weight vector is within seconds, remarkably speeding up the solution process. Notably, the predicted NoC simulation error under the optimal weight is only 1.42%.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"166 ","pages":"Article 107643"},"PeriodicalIF":6.2000,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X24006071","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

To tackle the challenges posed by Moore’s Law, Chiplet technology emerges as a promising solution. Chiplets comprising CPUs and accelerators are connected by Networks-on-Chip (NoC) for large-scale integration and efficient communications. However, the slow simulation speed of NoCs has become a bottleneck, limiting the overall performance of chiplet simulations. Existing solutions only focus on accelerating NoC simulation in homogeneous architecture. In this paper, we introduce a novel TOPSIS-based Heterogeneous Trace Score-sampling method (THTS) for faster NoC simulation in heterogeneous architecture. THTS enables quick and accurate sampling of representative NoC traces. Additionally, we propose a weight exploration model to further enhance sampling accuracy. Compared with the traditional NoC sampling method (NoCLabs), THTS reduces the error of the average packet latency by 22.17% and the total simulation time by 1.6 folds. THTS estimates the NoC performance with an average loss less than 5%, while speeding up the NoC simulation by up to 3 times. In addition, under different weight space sizes, the time required for the weight exploration model to solve the optimal weight vector is within seconds, remarkably speeding up the solution process. Notably, the predicted NoC simulation error under the optimal weight is only 1.42%.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种基于采样的非均匀晶片NoC模拟加速方法
为应对摩尔定律带来的挑战,芯片组技术成为一种前景广阔的解决方案。由 CPU 和加速器组成的芯片通过片上网络(NoC)连接,实现大规模集成和高效通信。然而,NoC 的仿真速度慢已成为瓶颈,限制了芯片组仿真的整体性能。现有的解决方案仅侧重于加速同构架构中的 NoC 仿真。在本文中,我们介绍了一种新颖的基于 TOPSIS 的异构轨迹分数采样方法(THTS),用于加快异构架构中的 NoC 仿真。THTS 能够快速准确地抽取具有代表性的 NoC 迹线。此外,我们还提出了一个权重探索模型,以进一步提高采样精度。与传统的 NoC 采样方法(NoCLabs)相比,THTS 将平均数据包延迟误差减少了 22.17%,总仿真时间减少了 1.6 倍。THTS 估算 NoC 性能的平均损失小于 5%,同时将 NoC 仿真速度提高了 3 倍。此外,在不同权重空间大小的情况下,权重探索模型求解最优权重向量所需的时间均在几秒之内,大大加快了求解过程。值得注意的是,在最优权重下,预测的 NoC 仿真误差仅为 1.42%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
19.90
自引率
2.70%
发文量
376
审稿时长
10.6 months
期刊介绍: Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications. Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration. Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.
期刊最新文献
Self-sovereign identity framework with user-friendly private key generation and rule table Accelerating complex graph queries by summary-based hybrid partitioning for discovering vulnerabilities of distribution equipment DNA: Dual-radio Dual-constraint Node Activation scheduling for energy-efficient data dissemination in IoT Blending lossy and lossless data compression methods to support health data streaming in smart cities Energy–time modelling of distributed multi-population genetic algorithms with dynamic workload in HPC clusters
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1