首页 > 最新文献

Data Science Journal最新文献

英文 中文
Implementation of a Federated Information System by Means of Reuse of Research Data Archived in Research Data Repositories 利用研究数据存储库中存档研究数据的重用实现联邦信息系统
Q2 Computer Science Pub Date : 2023-01-01 DOI: 10.5334/dsj-2023-039
Sylvia Melzer, Stefan Thiemann, Simon Schiff, Ralf Möller
At universities, research data is increasingly stored in research data repositories according to a data management plan (DMP) and thus made available for further use. The challenge of reusing hundreds, thousands, or millions of data sets is to obtain an overview of the data in a short period of time and to search through all the data. The high variability of the formats used to store research data requires a new approach to data reusability that focuses on the visualisation and searchability of archived research data, which can also be combined with each other. In this article, we present a practical DMP that describes how information systems can be created on demand by reusing research data archived in research data repositories and how these systems can be merged into a federated information system. As a result, in our projects, information systems have been created in minutes or a couple of hours with few resources. The initial effort to create a federated system remains; however, this allows federated searches to be performed. Extending a federated system to include other information systems can then be accomplished by making a few configurations and manageable adjustments to the source code.
在大学,越来越多的研究数据根据数据管理计划(DMP)存储在研究数据存储库中,从而可供进一步使用。重用数百、数千或数百万个数据集的挑战是在短时间内获得数据的概览并搜索所有数据。用于存储研究数据的格式的高度可变性需要一种新的数据可重用性方法,该方法侧重于存档研究数据的可视化和可搜索性,它们也可以相互结合。在本文中,我们提出了一个实用的DMP,它描述了如何通过重用存档在研究数据存储库中的研究数据来按需创建信息系统,以及如何将这些系统合并到一个联合信息系统中。因此,在我们的项目中,信息系统是在几分钟或几个小时内用很少的资源创建的。创建联邦系统的最初努力仍然存在;但是,这允许执行联邦搜索。扩展联邦系统以包含其他信息系统,然后可以通过对源代码进行一些配置和可管理的调整来完成。
{"title":"Implementation of a Federated Information System by Means of Reuse of Research Data Archived in Research Data Repositories","authors":"Sylvia Melzer, Stefan Thiemann, Simon Schiff, Ralf Möller","doi":"10.5334/dsj-2023-039","DOIUrl":"https://doi.org/10.5334/dsj-2023-039","url":null,"abstract":"At universities, research data is increasingly stored in research data repositories according to a data management plan (DMP) and thus made available for further use. The challenge of reusing hundreds, thousands, or millions of data sets is to obtain an overview of the data in a short period of time and to search through all the data. The high variability of the formats used to store research data requires a new approach to data reusability that focuses on the visualisation and searchability of archived research data, which can also be combined with each other. In this article, we present a practical DMP that describes how information systems can be created on demand by reusing research data archived in research data repositories and how these systems can be merged into a federated information system. As a result, in our projects, information systems have been created in minutes or a couple of hours with few resources. The initial effort to create a federated system remains; however, this allows federated searches to be performed. Extending a federated system to include other information systems can then be accomplished by making a few configurations and manageable adjustments to the source code.","PeriodicalId":35375,"journal":{"name":"Data Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136303138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ENEA PAES: A Web Platform for Supporting Italian Municipalities in Sustainable Energy Action Plan ENEA PAES:支持意大利市政当局实施可持续能源行动计划的网络平台
Q2 Computer Science Pub Date : 2023-01-01 DOI: 10.5334/dsj-2023-037
Fabio Cignini, Enrico Cosimi, Vittoria Cozza, Flavio Fontana, Maurizio Matera, Giangiacomo Ponzo, Maria Salvato, Veronica Tomassetti
The Covenant of Mayors promotes the Sustainable Energy Action Plan (SEAP), aiming to mitigate greenhouse gas (GHG) emissions in line with the European Union’s 2030 and 2050 targets. The Covenant signatories could take enormous advantage from a digital platform that allows SEAP drafting also to no technically skilled users, like majority of them are. The Italian National Agency for New Technologies, Energy and Sustainable Economic Development (ENEA) has developed the PAES platform in order to provide digital support to public administrations (PA) adhering to the Covenant of Mayors. The platform exploits open data and it is fed by energetic data aggregated on a municipal level. The platform offers appropriate functionalities for baseline CO2 emissions inventory (BEI) filling out and a best practice (BP) simulation tool. The latter allows to contextualize each BP and to estimate its effects in terms of the main GHG emission. The BP showing the best estimation results can then be converted into concrete adaptation actions. So, this digital system facilitates local Italian municipalities in the strategic planning and monitoring of adaptation actions taken over time.
市长公约促进可持续能源行动计划(SEAP),旨在根据欧盟2030年和2050年的目标减少温室气体(GHG)排放。《公约》签署国可以从一个数字平台中获得巨大的好处,该平台允许SEAP起草工作也适用于没有技术熟练的用户,就像他们中的大多数人一样。意大利国家新技术、能源和可持续经济发展机构(ENEA)开发了PAES平台,以便为遵守《市长公约》的公共行政部门(PA)提供数字支持。该平台利用开放数据,并由市级汇总的活跃数据提供支持。该平台为基线二氧化碳排放清单(BEI)填写和最佳实践(BP)模拟工具提供了适当的功能。后者允许将每个BP置于环境中,并根据主要温室气体排放估计其影响。显示最佳估计结果的BP可以转化为具体的适应行动。因此,这个数字系统有助于意大利地方市政当局进行战略规划和监测长期采取的适应行动。
{"title":"ENEA PAES: A Web Platform for Supporting Italian Municipalities in Sustainable Energy Action Plan","authors":"Fabio Cignini, Enrico Cosimi, Vittoria Cozza, Flavio Fontana, Maurizio Matera, Giangiacomo Ponzo, Maria Salvato, Veronica Tomassetti","doi":"10.5334/dsj-2023-037","DOIUrl":"https://doi.org/10.5334/dsj-2023-037","url":null,"abstract":"The Covenant of Mayors promotes the Sustainable Energy Action Plan (SEAP), aiming to mitigate greenhouse gas (GHG) emissions in line with the European Union’s 2030 and 2050 targets. The Covenant signatories could take enormous advantage from a digital platform that allows SEAP drafting also to no technically skilled users, like majority of them are. The Italian National Agency for New Technologies, Energy and Sustainable Economic Development (ENEA) has developed the PAES platform in order to provide digital support to public administrations (PA) adhering to the Covenant of Mayors. The platform exploits open data and it is fed by energetic data aggregated on a municipal level. The platform offers appropriate functionalities for baseline CO2 emissions inventory (BEI) filling out and a best practice (BP) simulation tool. The latter allows to contextualize each BP and to estimate its effects in terms of the main GHG emission. The BP showing the best estimation results can then be converted into concrete adaptation actions. So, this digital system facilitates local Italian municipalities in the strategic planning and monitoring of adaptation actions taken over time.","PeriodicalId":35375,"journal":{"name":"Data Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135845686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Programmatic and Scalable Approach to Making Data Management Machine-Actionable 使数据管理机器可操作的程序化和可扩展方法
Q2 Computer Science Pub Date : 2023-01-01 DOI: 10.5334/dsj-2023-026
M. Praetzellis, M. Buys, Xiaoli Chen, J. Chodacki, N. Davies, Kristian Garza, Catherine Nancarrow, Brian Riley, E. Robinson
{"title":"A Programmatic and Scalable Approach to Making Data Management Machine-Actionable","authors":"M. Praetzellis, M. Buys, Xiaoli Chen, J. Chodacki, N. Davies, Kristian Garza, Catherine Nancarrow, Brian Riley, E. Robinson","doi":"10.5334/dsj-2023-026","DOIUrl":"https://doi.org/10.5334/dsj-2023-026","url":null,"abstract":"","PeriodicalId":35375,"journal":{"name":"Data Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71068556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Science in a Pandemic 大流行中的数据科学
Q2 Computer Science Pub Date : 2023-01-01 DOI: 10.5334/dsj-2023-041
Dennis F. X. Mathaisel
Data Science has the potential to provide humanity with critical insight into the massive data being collected during a pandemic. The COVID-19 pandemic presented that opportunity, and Data Science supported an international audience promptly, reliably, effectively, and frequently during that difficult time. The most significant contributions were data visualizations and data dashboards, however, other tools, such as predictive and prescriptive analytics, were equally critical to the effort. The urgency at the start of the pandemic was to quickly communicate information to citizens, governments, and institutions. The change in modality from traditional statistical metrics and tables to data visualizations was extremely significant and helpful to so many. This paper reviews these contributions by demonstrating how the COVID-19 story unfolded through author-generated data visualizations and dashboards, and by providing the community with open-source access to the scripts that generated these visualizations. The open-source access to the (R language) scripts reflects this article’s novelty in the literature. Using publicly available datasets from multiple sources, and employing R toolkits, the author validates the role that Data Science can play in a pandemic, and that can be implemented by anyone with some basic knowledge of scripting languages, like R. The intent is to provide these valuable tools to the community and to demonstrate their effectiveness in the likely event when there is another crisis.
数据科学有可能为人类提供对大流行期间收集的大量数据的关键见解。2019冠状病毒病大流行提供了这一机会,在这一困难时期,数据科学迅速、可靠、有效和频繁地为国际受众提供了支持。最重要的贡献是数据可视化和数据仪表板,然而,其他工具,如预测和规范分析,对这项工作同样至关重要。大流行开始时的当务之急是迅速向公民、政府和机构传达信息。从传统的统计度量和表格到数据可视化的模式变化非常重要,对很多人都很有帮助。本文通过展示作者如何通过生成数据可视化和仪表板展开COVID-19故事,并向社区提供对生成这些可视化脚本的开源访问,回顾了这些贡献。对(R语言)脚本的开源访问反映了本文在文献中的新颖性。作者使用来自多个来源的公开可用数据集,并使用R工具包,验证了数据科学在大流行中可以发挥的作用,并且可以由具有脚本语言(如R)基础知识的任何人实施。其目的是向社区提供这些有价值的工具,并在出现另一场危机时展示它们的有效性。
{"title":"Data Science in a Pandemic","authors":"Dennis F. X. Mathaisel","doi":"10.5334/dsj-2023-041","DOIUrl":"https://doi.org/10.5334/dsj-2023-041","url":null,"abstract":"Data Science has the potential to provide humanity with critical insight into the massive data being collected during a pandemic. The COVID-19 pandemic presented that opportunity, and Data Science supported an international audience promptly, reliably, effectively, and frequently during that difficult time. The most significant contributions were data visualizations and data dashboards, however, other tools, such as predictive and prescriptive analytics, were equally critical to the effort. The urgency at the start of the pandemic was to quickly communicate information to citizens, governments, and institutions. The change in modality from traditional statistical metrics and tables to data visualizations was extremely significant and helpful to so many. This paper reviews these contributions by demonstrating how the COVID-19 story unfolded through author-generated data visualizations and dashboards, and by providing the community with open-source access to the scripts that generated these visualizations. The open-source access to the (R language) scripts reflects this article’s novelty in the literature. Using publicly available datasets from multiple sources, and employing R toolkits, the author validates the role that Data Science can play in a pandemic, and that can be implemented by anyone with some basic knowledge of scripting languages, like R. The intent is to provide these valuable tools to the community and to demonstrate their effectiveness in the likely event when there is another crisis.","PeriodicalId":35375,"journal":{"name":"Data Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135261934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards a Toolbox for Automated Assessment of Machine-Actionable Data Management Plans 面向机器可操作数据管理计划自动评估的工具箱
Q2 Computer Science Pub Date : 2023-01-01 DOI: 10.5334/dsj-2023-028
Tomasz Miksa, M. Suchánek, Jan Slifka, Vojtěch Knaisl, F. Ekaputra, Filip Kovacevic, Annisa Maulida Ningtyas, Alaa El-Ebshihy, R. Pergl
{"title":"Towards a Toolbox for Automated Assessment of Machine-Actionable Data Management Plans","authors":"Tomasz Miksa, M. Suchánek, Jan Slifka, Vojtěch Knaisl, F. Ekaputra, Filip Kovacevic, Annisa Maulida Ningtyas, Alaa El-Ebshihy, R. Pergl","doi":"10.5334/dsj-2023-028","DOIUrl":"https://doi.org/10.5334/dsj-2023-028","url":null,"abstract":"","PeriodicalId":35375,"journal":{"name":"Data Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71068230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Data Management Plan Implementation, Assessments, and Evaluations: Implications and Recommendations 数据管理计划的实施、评估和评价:影响和建议
Q2 Computer Science Pub Date : 2023-01-01 DOI: 10.5334/dsj-2023-027
B. Bishop, Peter Neish, Ji Hyun Kim, Raphaëlle Bats, Anthony J. Million, Jake Carlson, Heather Moulaison Sandy, Minh T. Pham
{"title":"Data Management Plan Implementation, Assessments, and Evaluations: Implications and Recommendations","authors":"B. Bishop, Peter Neish, Ji Hyun Kim, Raphaëlle Bats, Anthony J. Million, Jake Carlson, Heather Moulaison Sandy, Minh T. Pham","doi":"10.5334/dsj-2023-027","DOIUrl":"https://doi.org/10.5334/dsj-2023-027","url":null,"abstract":"","PeriodicalId":35375,"journal":{"name":"Data Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71068161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implementing Informatics Tools with Data Management Plans for Disease Area Research 实施信息学工具与疾病领域研究的数据管理计划
Q2 Computer Science Pub Date : 2023-01-01 DOI: 10.5334/dsj-2023-024
V. Navale, Matthew McAuliffe
{"title":"Implementing Informatics Tools with Data Management Plans for Disease Area Research","authors":"V. Navale, Matthew McAuliffe","doi":"10.5334/dsj-2023-024","DOIUrl":"https://doi.org/10.5334/dsj-2023-024","url":null,"abstract":"","PeriodicalId":35375,"journal":{"name":"Data Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71068504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Engaging with Researchers and Raising Awareness of FAIR and Open Science through the FAIR+ Implementation Survey Tool (FAIRIST) 通过FAIR+实施调查工具(FAIRIST)与研究人员互动,提高对公平和开放科学的认识
Q2 Computer Science Pub Date : 2023-01-01 DOI: 10.5334/dsj-2023-032
Christine R. Kirkpatrick, Kevin Coakley, Julianne Christopher, Inês Dutra
Seven years after the seminal paper on FAIR was published, that introduced the concept of making research outputs Findable, Accessible, Interoperable, and Reusable, researchers still struggle to understand how to implement the principles. For many researchers, FAIR promises long-term benefits for near-term effort, requires skills not yet acquired, and is one more thing in a long list of unfunded mandates and onerous requirements for scientists. Even for those required to, or who are convinced that they must make time for FAIR research practices, their preference is for just-in-time advice properly sized to the scientific artifacts and process. Because of the generality of most FAIR implementation guidance, it is difficult for a researcher to adjust to the advice according to their situation. Technological advances, especially in the area of artificial intelligence (AI) and machine learning (ML), complicate FAIR adoption, as researchers and data stewards ponder how to make software, workflows, and models FAIR and reproducible. The FAIR+ Implementation Survey Tool (FAIRIST) mitigates the problem by integrating research requirements with research proposals in a systematic way. FAIRIST factors in new scholarly outputs, such as nanopublications and notebooks, and the various research artifacts related to AI research (data, models, workflows, and benchmarks). Researchers step through a self-serve survey process and receive a table ready for use in their data management plan (DMP) and/or work plan. while gaining awareness of the FAIR Principles and Open Science concepts. FAIRIST is a model that uses part of the proposal process as a way to do outreach, raise awareness of FAIR dimensions and considerations, while providing timely assistance for competitive proposals.
在关于FAIR的开创性论文发表七年后,该论文引入了使研究成果可查找、可访问、可互操作和可重用的概念,研究人员仍在努力理解如何实现这些原则。对许多研究人员来说,FAIR承诺短期的努力会带来长期的好处,需要一些尚未掌握的技能,而且是一长串没有资金支持的任务和对科学家的繁重要求中的一项。即使对于那些被要求,或者确信他们必须为公平的研究实践腾出时间的人来说,他们的偏好是及时的建议,适当地与科学工件和过程相适应。由于大多数FAIR实施指南的通用性,研究人员很难根据自己的情况调整建议。技术进步,特别是在人工智能(AI)和机器学习(ML)领域,使公平的采用复杂化,因为研究人员和数据管理员思考如何使软件、工作流和模型公平和可复制。FAIR+实施调查工具(FAIRIST)通过系统地整合研究需求和研究建议,缓解了这一问题。FAIRIST考虑了新的学术产出,如纳米出版物和笔记本,以及与人工智能研究相关的各种研究工件(数据、模型、工作流程和基准)。研究人员通过自助调查过程,并收到一个表格,准备在他们的数据管理计划(DMP)和/或工作计划中使用。同时获得对公平原则和开放科学概念的认识。FAIRIST是一种模式,它利用提案过程的一部分进行外联,提高对公平维度和考虑因素的认识,同时为竞争性提案提供及时的帮助。
{"title":"Engaging with Researchers and Raising Awareness of FAIR and Open Science through the FAIR+ Implementation Survey Tool (FAIRIST)","authors":"Christine R. Kirkpatrick, Kevin Coakley, Julianne Christopher, Inês Dutra","doi":"10.5334/dsj-2023-032","DOIUrl":"https://doi.org/10.5334/dsj-2023-032","url":null,"abstract":"Seven years after the seminal paper on FAIR was published, that introduced the concept of making research outputs Findable, Accessible, Interoperable, and Reusable, researchers still struggle to understand how to implement the principles. For many researchers, FAIR promises long-term benefits for near-term effort, requires skills not yet acquired, and is one more thing in a long list of unfunded mandates and onerous requirements for scientists. Even for those required to, or who are convinced that they must make time for FAIR research practices, their preference is for just-in-time advice properly sized to the scientific artifacts and process. Because of the generality of most FAIR implementation guidance, it is difficult for a researcher to adjust to the advice according to their situation. Technological advances, especially in the area of artificial intelligence (AI) and machine learning (ML), complicate FAIR adoption, as researchers and data stewards ponder how to make software, workflows, and models FAIR and reproducible. The FAIR+ Implementation Survey Tool (FAIRIST) mitigates the problem by integrating research requirements with research proposals in a systematic way. FAIRIST factors in new scholarly outputs, such as nanopublications and notebooks, and the various research artifacts related to AI research (data, models, workflows, and benchmarks). Researchers step through a self-serve survey process and receive a table ready for use in their data management plan (DMP) and/or work plan. while gaining awareness of the FAIR Principles and Open Science concepts. FAIRIST is a model that uses part of the proposal process as a way to do outreach, raise awareness of FAIR dimensions and considerations, while providing timely assistance for competitive proposals.","PeriodicalId":35375,"journal":{"name":"Data Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134988993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Management for PalMod-II – A FAIR-Based Strategy for Data Handling in Large Climate Modeling Projects PalMod-II的数据管理——大型气候模拟项目中基于fair的数据处理策略
Q2 Computer Science Pub Date : 2023-01-01 DOI: 10.5334/dsj-2023-034
Swati Gehlot, Karsten Peters-von Gehlen, Andrea Lammert, Hannes Thiemann
PalMod-II was a multi-institutional research project in Germany focusing on enabling and performing global numerical climate simulations with state-of-theart coupled Earth System Models spanning a full glacial cycle from 130 000 years in the past to the present and beyond. The main project goal was the dataset resulting from these simulations and making it available for reuse by the climate science community in-line with the FAIR data principles. In this paper, we present the research data management (RDM) approach developed and employed in PalMod-II to progress towards that project goal. The RDM approach was implemented by RDM professionals specifically funded by PalMod-II, which made it possible to provide RDM services tailored specifically to the project needs. The compilation and maintenance of a project-wide data management plan (DMP) has proven essential for keeping the project on track and serving as a central focal point of any data-related aspects. These include the specification of data responsible scientists, allocation of storage and computaional resources on a high-performance computing system, documentation of simulation output requirements, definition of data standardisation, and publication workflows in-line with the FAIR data principles. Since the RDM approach executed in PalMod-II was first-of-its-kind for all project partners, exhaustive communication at par with the scientists was required to create trust and a collaborative atmosphere within the project. Finally, the RDM approach implemented in PalMod-II facilitated the publication of a flagship dataset for global reuse, and will also be implemented in the follow-up project: PalMod-III.
PalMod-II是德国的一个多机构研究项目,重点是利用最先进的耦合地球系统模型实现和执行全球数值气候模拟,涵盖从过去到现在以及以后的13万年的完整冰期。该项目的主要目标是由这些模拟产生的数据集,并使其符合FAIR数据原则,可供气候科学界重新使用。在本文中,我们介绍了在PalMod-II中开发和使用的研究数据管理(RDM)方法,以实现该项目目标。RDM方法是由PalMod-II专门资助的RDM专业人员实现的,这使得提供专门针对项目需要的RDM服务成为可能。事实证明,编制和维护项目范围的数据管理计划(DMP)对于保持项目的正常运行和作为任何与数据有关的方面的中心焦点至关重要。这些包括数据负责科学家的规范,高性能计算系统上存储和计算资源的分配,模拟输出需求的文档,数据标准化的定义,以及与FAIR数据原则一致的出版工作流程。由于在PalMod-II中执行的RDM方法是所有项目合作伙伴的首创,因此需要与科学家进行详尽的沟通,以在项目中创建信任和协作氛围。最后,在PalMod-II中实施的RDM方法促进了全球重用旗舰数据集的发布,并将在后续项目PalMod-III中实施。
{"title":"Data Management for PalMod-II – A FAIR-Based Strategy for Data Handling in Large Climate Modeling Projects","authors":"Swati Gehlot, Karsten Peters-von Gehlen, Andrea Lammert, Hannes Thiemann","doi":"10.5334/dsj-2023-034","DOIUrl":"https://doi.org/10.5334/dsj-2023-034","url":null,"abstract":"PalMod-II was a multi-institutional research project in Germany focusing on enabling and performing global numerical climate simulations with state-of-theart coupled Earth System Models spanning a full glacial cycle from 130 000 years in the past to the present and beyond. The main project goal was the dataset resulting from these simulations and making it available for reuse by the climate science community in-line with the FAIR data principles. In this paper, we present the research data management (RDM) approach developed and employed in PalMod-II to progress towards that project goal. The RDM approach was implemented by RDM professionals specifically funded by PalMod-II, which made it possible to provide RDM services tailored specifically to the project needs. The compilation and maintenance of a project-wide data management plan (DMP) has proven essential for keeping the project on track and serving as a central focal point of any data-related aspects. These include the specification of data responsible scientists, allocation of storage and computaional resources on a high-performance computing system, documentation of simulation output requirements, definition of data standardisation, and publication workflows in-line with the FAIR data principles. Since the RDM approach executed in PalMod-II was first-of-its-kind for all project partners, exhaustive communication at par with the scientists was required to create trust and a collaborative atmosphere within the project. Finally, the RDM approach implemented in PalMod-II facilitated the publication of a flagship dataset for global reuse, and will also be implemented in the follow-up project: PalMod-III.","PeriodicalId":35375,"journal":{"name":"Data Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135360692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Management Plans for the Photon and Neutron Communities 光子和中子社区的数据管理计划
Q2 Computer Science Pub Date : 2023-01-01 DOI: 10.5334/dsj-2023-030
Marjolaine Bodin, Fredrik Bolmsten, Petra Aulin, T. Ivănoaica, A. Olivo, J. Malka, K. Wrona, Andy Götz
{"title":"Data Management Plans for the Photon and Neutron Communities","authors":"Marjolaine Bodin, Fredrik Bolmsten, Petra Aulin, T. Ivănoaica, A. Olivo, J. Malka, K. Wrona, Andy Götz","doi":"10.5334/dsj-2023-030","DOIUrl":"https://doi.org/10.5334/dsj-2023-030","url":null,"abstract":"","PeriodicalId":35375,"journal":{"name":"Data Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71068311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Data Science Journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1