Proceedings of the 2nd International Workshop on Practical Reproducible Evaluation of Computer Systems: Latest Articles

Provenance and Reproducibility in the Automation of a Standard Computational Neuroscience Pipeline

Proceedings of the 2nd International Workshop on Practical Reproducible Evaluation of Computer Systems

Pub Date : 2019-06-17 DOI: 10.1145/3322790.3330592

David B. Stockton, A. Prinz, F. Santamaría

Rapid increase in data volume, compounded by the reproducibility crisis, has led to the need to automate both experimental and computational aspects of neuroscience investigations. Automating neuroscience investigations enables an unprecedented ability to record and inspect how results were achieved. Here we review some of our recent work to integrate provenance and reproducibility measures into a tool called NeuroManager that automates a standard computational neuroscience pipeline, unifying the experiment–data–modeling–analysis cycle and allowing the scientist to focus on model evolution. Through a flexible daily workflow that leverages servers, clusters, and clouds simultaneously, NeuroManager automates manual tasks including database access, job submission, simulation scheduling, and preservation of provenance.


Citations: 0

Scientific Tests and Continuous Integration Strategies to Enhance Reproducibility in the Scientific Software Context


Pub Date : 2019-06-17 DOI: 10.1145/3322790.3330595

M. Krafczyk, A. Shi, A. Bhaskar, D. Marinov, V. Stodden

Continuous integration (CI) is a well-established technique in commercial and open-source software projects, although not routinely used in scientific publishing. In the scientific software context, CI can serve two functions to increase reproducibility of scientific results: providing an established platform for testing the reproducibility of these results, and demonstrating to other scientists how the code and data generate the published results. We explore scientific software testing and CI strategies using two articles published in the areas of applied mathematics and computational physics. We discuss lessons learned from reproducing these articles, and we examine and discuss existing tests. We introduce the notion of a "scientific test" as one that produces computational results from a published article. We then consider full result reproduction within a CI environment. If authors find their work too time- or resource-intensive to easily adapt to a CI context, we recommend the inclusion of results from reduced versions of their work (e.g., run at lower resolution, with shorter time scales, with smaller data sets) alongside their primary results within their article. While these smaller versions may be less interesting scientifically, they can serve to verify that published code and data are working properly. We demonstrate such reduction tests on the two articles studied.
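The idea of a reduced "scientific test" can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the article: the function `simulate` is a hypothetical stand-in for a paper's computation (here, trapezoid-rule integration of sin(x) on [0, pi], whose exact value is 2), and the step counts and tolerances are arbitrary choices. The point is only that the same check can run at full resolution for the published result and at reduced resolution inside a CI job.

```python
import math


def simulate(n_steps: int) -> float:
    """Hypothetical stand-in for a published computation.

    Integrates sin(x) over [0, pi] with the composite trapezoid rule.
    The exact answer is 2.0; accuracy improves as n_steps grows.
    """
    h = math.pi / n_steps
    total = 0.5 * (math.sin(0.0) + math.sin(math.pi))
    for i in range(1, n_steps):
        total += math.sin(i * h)
    return total * h


def scientific_test(n_steps: int, expected: float, tol: float) -> bool:
    """Return True if the computed result matches the reported value within tol."""
    return abs(simulate(n_steps) - expected) <= tol


# Assumed configurations: the full run mirrors the "published" setting,
# the reduced run is cheap enough for a CI job and uses a looser tolerance.
FULL = {"n_steps": 200_000, "expected": 2.0, "tol": 1e-8}
REDUCED = {"n_steps": 1_000, "expected": 2.0, "tol": 1e-5}

if __name__ == "__main__":
    print("reduced test passed:", scientific_test(**REDUCED))
```

A CI configuration would typically call only the reduced variant on every commit and reserve the full variant for occasional or manual runs; how that is wired up depends on the CI service and is not shown here.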


Citations: 23

Implementing Computational Reproducibility in the Whole Tale Environment


Pub Date : 2019-06-17 DOI: 10.1145/3322790.3330594

K. Chard, N. Gaffney, Matthew B. Jones, K. Kowalik, Bertram Ludäscher, J. Nabrzyski, V. Stodden, I. Taylor, M. Turk, C. Willis

We present and define a structured digital object, called a "Tale," for the dissemination and publication of computational scientific findings in the scholarly record. The Tale emerges from the NSF-funded Whole Tale project (wholetale.org), which is developing a computational environment designed to capture the entire computational pipeline associated with a scientific experiment and thereby enable computational reproducibility. A Tale allows researchers to create and package code, data, and information about the workflow and computational environment necessary to support, review, and recreate the computational results reported in published research. The Tale then captures the artifacts and information needed to facilitate understanding, transparency, and execution of the Tale for review and reproducibility at the time of publication.


Citations: 12

Initial Thoughts on Cybersecurity And Reproducibility


Pub Date : 2019-06-17 DOI: 10.1145/3322790.3330593

E. Deelman, V. Stodden, M. Taufer, Von Welch

Cybersecurity, which serves to protect computer systems and data from malicious and accidental abuse and changes, both supports and challenges the reproducibility of computational science. This position paper explores a research agenda by enumerating two types of challenges that emerge at the intersection of cybersecurity and reproducibility: challenges that cybersecurity has in supporting the reproducibility of computational science, and challenges that cybersecurity creates for the reproducibility of computational science.


Citations: 5

Reproducible Computer Network Experiments: A Case Study Using Popper


Pub Date : 2019-06-17 DOI: 10.1145/3322790.3330596

Andrea David, Mariette Souppe, I. Jimenez, K. Obraczka, Sam Mansfield, K. Veenstra, C. Maltzahn

Computer network research experiments can be broadly grouped into three categories: simulated, controlled, and real-world experiments. Simulation frameworks, experiment testbeds, and measurement tools, respectively, are commonly used as the platforms for carrying out network experiments. In many cases, given the nature of computer network experiments, properly configuring these platforms is a complex and time-consuming task, which makes replicating and validating research results quite challenging. This complexity can be reduced by leveraging tools that enable experiment reproducibility. In this paper, we show how a recently proposed reproducibility tool called Popper facilitates the reproduction of networking experiments. In particular, we detail the steps taken to reproduce results in two published articles that rely on simulations. The outcome of this exercise is a generic workflow for carrying out network simulation experiments. In addition, we briefly present two additional Popper workflows for running experiments on controlled testbeds, as well as studies that gather real-world metrics (all code is publicly available on GitHub). We close by providing a list of lessons we learned throughout this process.


Citations: 2

Applicability Study of the PRIMAD Model to LIGO Gravitational Wave Search Workflows


Pub Date : 2019-04-10 DOI: 10.1145/3322790.3330591

Dylan Chapp, Danny Rorabaugh, Duncan A. Brown, E. Deelman, K. Vahi, Von Welch, M. Taufer

The PRIMAD model with its six components (i.e., Platform, Research Objective, Implementation, Methods, Actors, and Data) provides an abstract taxonomy to represent computational experiments and promote reproducibility by design. In this paper, we employ a post-hoc assessment of the model's applicability to a set of Laser Interferometer Gravitational-Wave Observatory (LIGO) workflows from literature sources (i.e., published papers). Our work outlines potential advantages and limitations of the model in terms of its levels of abstraction and means of application.
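One way to picture the six PRIMAD components is as fields of a record describing a single computational experiment. The sketch below is an illustrative assumption, not the representation used in the paper: the class name `PrimadRecord`, the use of plain strings for every field, the helper `changed_components`, and all example values are hypothetical placeholders.

```python
from dataclasses import dataclass, fields
from typing import List


@dataclass(frozen=True)
class PrimadRecord:
    """Hypothetical record with one field per PRIMAD component."""
    platform: str            # P: hardware/software stack the experiment ran on
    research_objective: str  # R: question the experiment is meant to answer
    implementation: str      # I: code that realizes the method
    methods: str             # M: algorithm or procedure being applied
    actors: str              # A: people or groups running the experiment
    data: str                # D: input data and parameters


def changed_components(a: PrimadRecord, b: PrimadRecord) -> List[str]:
    """List the component names whose values differ between two records."""
    return [f.name for f in fields(a) if getattr(a, f.name) != getattr(b, f.name)]


# Placeholder values only; they do not describe any real workflow.
original = PrimadRecord(
    platform="cluster-A",
    research_objective="detect signal X",
    implementation="pipeline v1.0",
    methods="matched filtering",
    actors="original authors",
    data="dataset-1",
)
rerun = PrimadRecord(
    platform="cloud-B",
    research_objective="detect signal X",
    implementation="pipeline v1.0",
    methods="matched filtering",
    actors="independent team",
    data="dataset-1",
)

if __name__ == "__main__":
    print(changed_components(original, rerun))
```

Comparing two records in this way makes explicit which components were varied between an original run and a later one, which is one possible use of such a taxonomy.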


Citations: 2

Proceedings of the 2nd International Workshop on Practical Reproducible Evaluation of Computer Systems


Pub Date : 1900-01-01 DOI: 10.1145/3322790

Citations: 2
