PalMod-II的数据管理——大型气候模拟项目中基于fair的数据处理策略

Q2 Computer Science Data Science Journal Pub Date : 2023-01-01 DOI:10.5334/dsj-2023-034
Swati Gehlot, Karsten Peters-von Gehlen, Andrea Lammert, Hannes Thiemann
{"title":"PalMod-II的数据管理——大型气候模拟项目中基于fair的数据处理策略","authors":"Swati Gehlot, Karsten Peters-von Gehlen, Andrea Lammert, Hannes Thiemann","doi":"10.5334/dsj-2023-034","DOIUrl":null,"url":null,"abstract":"PalMod-II was a multi-institutional research project in Germany focusing on enabling and performing global numerical climate simulations with state-of-theart coupled Earth System Models spanning a full glacial cycle from 130 000 years in the past to the present and beyond. The main project goal was the dataset resulting from these simulations and making it available for reuse by the climate science community in-line with the FAIR data principles. In this paper, we present the research data management (RDM) approach developed and employed in PalMod-II to progress towards that project goal. The RDM approach was implemented by RDM professionals specifically funded by PalMod-II, which made it possible to provide RDM services tailored specifically to the project needs. The compilation and maintenance of a project-wide data management plan (DMP) has proven essential for keeping the project on track and serving as a central focal point of any data-related aspects. These include the specification of data responsible scientists, allocation of storage and computaional resources on a high-performance computing system, documentation of simulation output requirements, definition of data standardisation, and publication workflows in-line with the FAIR data principles. Since the RDM approach executed in PalMod-II was first-of-its-kind for all project partners, exhaustive communication at par with the scientists was required to create trust and a collaborative atmosphere within the project. Finally, the RDM approach implemented in PalMod-II facilitated the publication of a flagship dataset for global reuse, and will also be implemented in the follow-up project: PalMod-III.","PeriodicalId":35375,"journal":{"name":"Data Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data Management for PalMod-II – A FAIR-Based Strategy for Data Handling in Large Climate Modeling Projects\",\"authors\":\"Swati Gehlot, Karsten Peters-von Gehlen, Andrea Lammert, Hannes Thiemann\",\"doi\":\"10.5334/dsj-2023-034\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"PalMod-II was a multi-institutional research project in Germany focusing on enabling and performing global numerical climate simulations with state-of-theart coupled Earth System Models spanning a full glacial cycle from 130 000 years in the past to the present and beyond. The main project goal was the dataset resulting from these simulations and making it available for reuse by the climate science community in-line with the FAIR data principles. In this paper, we present the research data management (RDM) approach developed and employed in PalMod-II to progress towards that project goal. The RDM approach was implemented by RDM professionals specifically funded by PalMod-II, which made it possible to provide RDM services tailored specifically to the project needs. The compilation and maintenance of a project-wide data management plan (DMP) has proven essential for keeping the project on track and serving as a central focal point of any data-related aspects. These include the specification of data responsible scientists, allocation of storage and computaional resources on a high-performance computing system, documentation of simulation output requirements, definition of data standardisation, and publication workflows in-line with the FAIR data principles. Since the RDM approach executed in PalMod-II was first-of-its-kind for all project partners, exhaustive communication at par with the scientists was required to create trust and a collaborative atmosphere within the project. Finally, the RDM approach implemented in PalMod-II facilitated the publication of a flagship dataset for global reuse, and will also be implemented in the follow-up project: PalMod-III.\",\"PeriodicalId\":35375,\"journal\":{\"name\":\"Data Science Journal\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data Science Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5334/dsj-2023-034\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data Science Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5334/dsj-2023-034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0

摘要

PalMod-II是德国的一个多机构研究项目,重点是利用最先进的耦合地球系统模型实现和执行全球数值气候模拟,涵盖从过去到现在以及以后的13万年的完整冰期。该项目的主要目标是由这些模拟产生的数据集,并使其符合FAIR数据原则,可供气候科学界重新使用。在本文中,我们介绍了在PalMod-II中开发和使用的研究数据管理(RDM)方法,以实现该项目目标。RDM方法是由PalMod-II专门资助的RDM专业人员实现的,这使得提供专门针对项目需要的RDM服务成为可能。事实证明,编制和维护项目范围的数据管理计划(DMP)对于保持项目的正常运行和作为任何与数据有关的方面的中心焦点至关重要。这些包括数据负责科学家的规范,高性能计算系统上存储和计算资源的分配,模拟输出需求的文档,数据标准化的定义,以及与FAIR数据原则一致的出版工作流程。由于在PalMod-II中执行的RDM方法是所有项目合作伙伴的首创,因此需要与科学家进行详尽的沟通,以在项目中创建信任和协作氛围。最后,在PalMod-II中实施的RDM方法促进了全球重用旗舰数据集的发布,并将在后续项目PalMod-III中实施。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Data Management for PalMod-II – A FAIR-Based Strategy for Data Handling in Large Climate Modeling Projects
PalMod-II was a multi-institutional research project in Germany focusing on enabling and performing global numerical climate simulations with state-of-theart coupled Earth System Models spanning a full glacial cycle from 130 000 years in the past to the present and beyond. The main project goal was the dataset resulting from these simulations and making it available for reuse by the climate science community in-line with the FAIR data principles. In this paper, we present the research data management (RDM) approach developed and employed in PalMod-II to progress towards that project goal. The RDM approach was implemented by RDM professionals specifically funded by PalMod-II, which made it possible to provide RDM services tailored specifically to the project needs. The compilation and maintenance of a project-wide data management plan (DMP) has proven essential for keeping the project on track and serving as a central focal point of any data-related aspects. These include the specification of data responsible scientists, allocation of storage and computaional resources on a high-performance computing system, documentation of simulation output requirements, definition of data standardisation, and publication workflows in-line with the FAIR data principles. Since the RDM approach executed in PalMod-II was first-of-its-kind for all project partners, exhaustive communication at par with the scientists was required to create trust and a collaborative atmosphere within the project. Finally, the RDM approach implemented in PalMod-II facilitated the publication of a flagship dataset for global reuse, and will also be implemented in the follow-up project: PalMod-III.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Data Science Journal
Data Science Journal Computer Science-Computer Science (miscellaneous)
CiteScore
5.40
自引率
0.00%
发文量
17
审稿时长
10 weeks
期刊介绍: The Data Science Journal is a peer-reviewed electronic journal publishing papers on the management of data and databases in Science and Technology. Details can be found in the prospectus. The scope of the journal includes descriptions of data systems, their publication on the internet, applications and legal issues. All of the Sciences are covered, including the Physical Sciences, Engineering, the Geosciences and the Biosciences, along with Agriculture and the Medical Science. The journal publishes papers about data and data systems; it does not publish data or data compilations. However it may publish papers about methods of data compilation or analysis.
期刊最新文献
Data on the Margins – Data from LGBTIQ+ Populations in European Social Science Data Archives Insights on Sustainability of Earth Science Data Infrastructure Projects Using OpenBIS as Virtual Research Environment: An ELN-LIMS Open-Source Database Tool as a Framework within the CRC 1411 Design of Particulate Products Umbrella Data Management Plans to Integrate FAIR Data: Lessons From the ISIDORe and BY-COVID Consortia for Pandemic Preparedness The Launch of the <em>Data Science Journal</em>&nbsp;in 2002
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1