KheOps: Cost-effective Repeatability, Reproducibility, and Replicability of Edge-to-Cloud Experiments

Daniel Rosendo, K. Keahey, Alexandru Costan, Matthieu Simonin, P. Valduriez, Gabriel Antoniu
{"title":"KheOps: Cost-effective Repeatability, Reproducibility, and Replicability of Edge-to-Cloud Experiments","authors":"Daniel Rosendo, K. Keahey, Alexandru Costan, Matthieu Simonin, P. Valduriez, Gabriel Antoniu","doi":"10.1145/3589806.3600032","DOIUrl":null,"url":null,"abstract":"Distributed infrastructures for computation and analytics are now evolving towards an interconnected ecosystem allowing complex scientific workflows to be executed across hybrid systems spanning from IoT Edge devices to Clouds, and sometimes to supercomputers (the Computing Continuum). Understanding the performance trade-offs of large-scale workflows deployed on such complex Edge-to-Cloud Continuum is challenging. To achieve this, one needs to systematically perform experiments, to enable their reproducibility and allow other researchers to replicate the study and the obtained conclusions on different infrastructures. This breaks down to the tedious process of reconciling the numerous experimental requirements and constraints with low-level infrastructure design choices. To address the limitations of the main state-of-the-art approaches for distributed, collaborative experimentation, such as Google Colab, Kaggle, and Code Ocean, we propose KheOps, a collaborative environment specifically designed to enable cost-effective reproducibility and replicability of Edge-to-Cloud experiments. KheOps is composed of three core elements: (1) an experiment repository; (2) a notebook environment; and (3) a multi-platform experiment methodology. We illustrate KheOps with a real-life Edge-to-Cloud application. The evaluations explore the point of view of the authors of an experiment described in an article (who aim to make their experiments reproducible) and the perspective of their readers (who aim to replicate the experiment). The results show how KheOps helps authors to systematically perform repeatable and reproducible experiments on the Grid5000 + FIT IoT LAB testbeds. Furthermore, KheOps helps readers to cost-effectively replicate authors experiments in different infrastructures such as Chameleon Cloud + CHI@Edge testbeds, and obtain the same conclusions with high accuracies (> 88% for all performance metrics).","PeriodicalId":393751,"journal":{"name":"Proceedings of the 2023 ACM Conference on Reproducibility and Replicability","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 ACM Conference on Reproducibility and Replicability","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3589806.3600032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Distributed infrastructures for computation and analytics are now evolving towards an interconnected ecosystem allowing complex scientific workflows to be executed across hybrid systems spanning from IoT Edge devices to Clouds, and sometimes to supercomputers (the Computing Continuum). Understanding the performance trade-offs of large-scale workflows deployed on such complex Edge-to-Cloud Continuum is challenging. To achieve this, one needs to systematically perform experiments, to enable their reproducibility and allow other researchers to replicate the study and the obtained conclusions on different infrastructures. This breaks down to the tedious process of reconciling the numerous experimental requirements and constraints with low-level infrastructure design choices. To address the limitations of the main state-of-the-art approaches for distributed, collaborative experimentation, such as Google Colab, Kaggle, and Code Ocean, we propose KheOps, a collaborative environment specifically designed to enable cost-effective reproducibility and replicability of Edge-to-Cloud experiments. KheOps is composed of three core elements: (1) an experiment repository; (2) a notebook environment; and (3) a multi-platform experiment methodology. We illustrate KheOps with a real-life Edge-to-Cloud application. The evaluations explore the point of view of the authors of an experiment described in an article (who aim to make their experiments reproducible) and the perspective of their readers (who aim to replicate the experiment). The results show how KheOps helps authors to systematically perform repeatable and reproducible experiments on the Grid5000 + FIT IoT LAB testbeds. Furthermore, KheOps helps readers to cost-effectively replicate authors experiments in different infrastructures such as Chameleon Cloud + CHI@Edge testbeds, and obtain the same conclusions with high accuracies (> 88% for all performance metrics).
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
KheOps:具有成本效益的可重复性、再现性和边缘到云实验的可复制性
用于计算和分析的分布式基础设施现在正朝着一个相互关联的生态系统发展,允许复杂的科学工作流程跨混合系统执行,从物联网边缘设备到云,有时甚至到超级计算机(计算连续体)。理解部署在这种复杂的边缘到云连续体上的大规模工作流的性能权衡是具有挑战性的。为了实现这一点,需要系统地进行实验,使其可重复性,并允许其他研究人员在不同的基础设施上复制研究和获得的结论。这将分解为调和大量实验需求和约束与低级基础结构设计选择的乏味过程。为了解决分布式、协作实验(如谷歌Colab、Kaggle和Code Ocean)的主要最先进方法的局限性,我们提出了KheOps,这是一个专门设计的协作环境,用于实现边缘到云实验的成本效益可重复性和可复制性。KheOps由三个核心要素组成:(1)实验库;(2)笔记本环境;(3)多平台实验方法。我们用一个实际的边缘到云应用程序来说明KheOps。评估探讨了文章中描述的实验作者的观点(他们的目标是使实验可重复)和读者的观点(他们的目标是重复实验)。结果显示了KheOps如何帮助作者在Grid5000 + FIT物联网实验室测试台上系统地执行可重复和可再现的实验。此外,KheOps帮助读者经济有效地在不同的基础设施(如变色龙云+ CHI@Edge测试平台)上复制作者的实验,并获得相同的结论,准确性高(所有性能指标的> 88%)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Towards Evidence-Based Software Quality Practices for Reproducibility: Preliminary Results and Research Directions Towards Reproducible Execution of Closed-Source Applications from Internet Archives Automatic Reproduction of Workflows in the Snakemake Workflow Catalog and nf-core Registries KheOps: Cost-effective Repeatability, Reproducibility, and Replicability of Edge-to-Cloud Experiments GTP Benchmarks for Gradual Typing Performance
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1