N-DISE: NDN-based data distribution for large-scale data-intensive science

Yuanhao Wu, Faruk V. Mutlu, Yuezhou Liu, E. Yeh, Ran Liu, C. Iordache, J. Balcas, Harvey Newman, Raimondas Sirvinskas, Michael Lo, Sichen Song, Jason Cong, Lixia Zhang, Sankalpa Timilsina, Susmit Shannigrahi, Chengyu Fan, Davide Pesavento, Junxiao Shi, L. Benmohamed
{"title":"N-DISE: NDN-based data distribution for large-scale data-intensive science","authors":"Yuanhao Wu, Faruk V. Mutlu, Yuezhou Liu, E. Yeh, Ran Liu, C. Iordache, J. Balcas, Harvey Newman, Raimondas Sirvinskas, Michael Lo, Sichen Song, Jason Cong, Lixia Zhang, Sankalpa Timilsina, Susmit Shannigrahi, Chengyu Fan, Davide Pesavento, Junxiao Shi, L. Benmohamed","doi":"10.1145/3517212.3558087","DOIUrl":null,"url":null,"abstract":"To meet unprecedented challenges faced by the world's largest data- and network-intensive science programs, we design and implement a new, highly efficient and field-tested data distribution, caching, access and analysis system for the Large Hadron Collider (LHC) high energy physics (HEP) network and other major science programs. We develop a hierarchical Named Data Networking (NDN) naming scheme for HEP data, implement new consumer and producer applications to interface with the high-performance NDN-DPDK forwarder, and build on recently developed high-throughput NDN caching and forwarding methods. We integrate NDN systems concepts and algorithms with the mainstream data distribution, processing, and management system of the Compact Muon Solenoid (CMS) experiment. We design and prototype stable, high-performance virtual LANs (VLANs) over a continental-scale wide area network testbed. In extensive experiments, our proposed integrated system, named NDN for Data-Intensive Science Experiments (N-DISE), is shown to deliver LHC data over the wide area network (WAN) testbed at throughputs exceeding 31 Gbps between Caltech and StarLight, with dramatically reduced download time.","PeriodicalId":165903,"journal":{"name":"Proceedings of the 9th ACM Conference on Information-Centric Networking","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th ACM Conference on Information-Centric Networking","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3517212.3558087","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

To meet unprecedented challenges faced by the world's largest data- and network-intensive science programs, we design and implement a new, highly efficient and field-tested data distribution, caching, access and analysis system for the Large Hadron Collider (LHC) high energy physics (HEP) network and other major science programs. We develop a hierarchical Named Data Networking (NDN) naming scheme for HEP data, implement new consumer and producer applications to interface with the high-performance NDN-DPDK forwarder, and build on recently developed high-throughput NDN caching and forwarding methods. We integrate NDN systems concepts and algorithms with the mainstream data distribution, processing, and management system of the Compact Muon Solenoid (CMS) experiment. We design and prototype stable, high-performance virtual LANs (VLANs) over a continental-scale wide area network testbed. In extensive experiments, our proposed integrated system, named NDN for Data-Intensive Science Experiments (N-DISE), is shown to deliver LHC data over the wide area network (WAN) testbed at throughputs exceeding 31 Gbps between Caltech and StarLight, with dramatically reduced download time.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
N-DISE:基于ndn的大规模数据密集型科学数据分布
为了应对世界上最大的数据和网络密集型科学项目所面临的前所未有的挑战,我们为大型强子对撞机(LHC)高能物理(HEP)网络和其他重大科学项目设计并实施了一种新的、高效的、经过现场测试的数据分发、缓存、访问和分析系统。我们为HEP数据开发了一个分层命名数据网络(NDN)命名方案,实现了新的消费者和生产者应用程序与高性能NDN- dpdk转发器接口,并建立在最近开发的高吞吐量NDN缓存和转发方法的基础上。我们将NDN系统的概念和算法与紧凑介子螺线管(CMS)实验的主流数据分发、处理和管理系统相结合。我们设计和原型稳定,高性能的虚拟局域网(vlan)在大陆规模的广域网测试平台。在广泛的实验中,我们提出的集成系统,名为数据密集型科学实验NDN (N-DISE),被证明可以在加州理工学院和StarLight之间的广域网(WAN)测试平台上以超过31 Gbps的吞吐量传输LHC数据,大大缩短了下载时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
OPSEL SoK: The evolution of distributed dataset synchronization solutions in NDN Building a secure mHealth data sharing infrastructure over NDN On cache-aware dynamic adaptive streaming over information-centric networking RESTful information-centric networking: statement
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1