利用全局伪同步局部同步时钟降低二维网格noc的功耗和延迟

E. Nilsson, Johnny Öberg
{"title":"利用全局伪同步局部同步时钟降低二维网格noc的功耗和延迟","authors":"E. Nilsson, Johnny Öberg","doi":"10.1145/1016720.1016764","DOIUrl":null,"url":null,"abstract":"One of the main problems when designing large ASICs today is to distribute a low power synchronous clock over the whole chip and a lot of remedies to this problem have been proposed over the years. For networks-on-chip (NoC), where computational resources are organised in a 2-D mesh connected together through switches in an on-chip interconnection network, another possibility exists: globally pseudochronous locally synchronous clock distribution. We present a clocking scheme for NoCs that we call globally pseudochronous locally synchronous, in which we distribute a clock with a constant phase difference between the switches. As a consequence of the phase difference, some paths along the NoC switch network become faster than the others. We call these paths data motorways. By adapting the switching policy in the switches to prefer data to use the motorways, we show that the latency within the network is reduced with up to 40% compared to a synchronous reference case. The phase difference between the resources also makes the circuit more tolerant to clock skew. It also distributes the current peaks more evenly across the clock period, which leads to a reduction in peak power, which in turn further reduces the clock skew and the jitter in the clock network.","PeriodicalId":127038,"journal":{"name":"International Conference on Hardware/Software Codesign and System Synthesis, 2004. CODES + ISSS 2004.","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2004-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"47","resultStr":"{\"title\":\"Reducing power and latency in 2-D mesh NoCs using globally pseudochronous locally synchronous clocking\",\"authors\":\"E. Nilsson, Johnny Öberg\",\"doi\":\"10.1145/1016720.1016764\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the main problems when designing large ASICs today is to distribute a low power synchronous clock over the whole chip and a lot of remedies to this problem have been proposed over the years. For networks-on-chip (NoC), where computational resources are organised in a 2-D mesh connected together through switches in an on-chip interconnection network, another possibility exists: globally pseudochronous locally synchronous clock distribution. We present a clocking scheme for NoCs that we call globally pseudochronous locally synchronous, in which we distribute a clock with a constant phase difference between the switches. As a consequence of the phase difference, some paths along the NoC switch network become faster than the others. We call these paths data motorways. By adapting the switching policy in the switches to prefer data to use the motorways, we show that the latency within the network is reduced with up to 40% compared to a synchronous reference case. The phase difference between the resources also makes the circuit more tolerant to clock skew. It also distributes the current peaks more evenly across the clock period, which leads to a reduction in peak power, which in turn further reduces the clock skew and the jitter in the clock network.\",\"PeriodicalId\":127038,\"journal\":{\"name\":\"International Conference on Hardware/Software Codesign and System Synthesis, 2004. CODES + ISSS 2004.\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"47\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Hardware/Software Codesign and System Synthesis, 2004. CODES + ISSS 2004.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1016720.1016764\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Hardware/Software Codesign and System Synthesis, 2004. CODES + ISSS 2004.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1016720.1016764","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 47

摘要

当今设计大型asic时的主要问题之一是在整个芯片上分配低功耗同步时钟,多年来已经提出了许多解决这个问题的方法。对于片上网络(NoC),计算资源被组织在一个二维网格中,通过片上互连网络中的交换机连接在一起,存在另一种可能性:全局伪同步本地同步时钟分布。我们提出了一种noc的时钟方案,我们称之为全局伪同步局部同步,其中我们在交换机之间分配具有恒定相位差的时钟。由于相位差,沿NoC交换网络的一些路径变得比其他路径更快。我们称这些路径为数据高速公路。通过调整交换机中的交换策略来优先使用高速公路上的数据,我们表明,与同步参考情况相比,网络内的延迟减少了40%。资源之间的相位差也使电路更能容忍时钟倾斜。它还在整个时钟周期内更均匀地分配电流峰值,从而导致峰值功率的降低,从而进一步降低时钟网络中的时钟倾斜和抖动。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Reducing power and latency in 2-D mesh NoCs using globally pseudochronous locally synchronous clocking
One of the main problems when designing large ASICs today is to distribute a low power synchronous clock over the whole chip and a lot of remedies to this problem have been proposed over the years. For networks-on-chip (NoC), where computational resources are organised in a 2-D mesh connected together through switches in an on-chip interconnection network, another possibility exists: globally pseudochronous locally synchronous clock distribution. We present a clocking scheme for NoCs that we call globally pseudochronous locally synchronous, in which we distribute a clock with a constant phase difference between the switches. As a consequence of the phase difference, some paths along the NoC switch network become faster than the others. We call these paths data motorways. By adapting the switching policy in the switches to prefer data to use the motorways, we show that the latency within the network is reduced with up to 40% compared to a synchronous reference case. The phase difference between the resources also makes the circuit more tolerant to clock skew. It also distributes the current peaks more evenly across the clock period, which leads to a reduction in peak power, which in turn further reduces the clock skew and the jitter in the clock network.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Efficient mapping of hierarchical trees on coarse-grain reconfigurable architectures Analytical models for leakage power estimation of memory array structures Memory accesses management during high level synthesis A novel deadlock avoidance algorithm and its hardware implementation Fast exploration of bus-based on-chip communication architectures
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1