利用研究数据存储库中存档研究数据的重用实现联邦信息系统

Q2 Computer Science Data Science Journal Pub Date : 2023-01-01 DOI:10.5334/dsj-2023-039
Sylvia Melzer, Stefan Thiemann, Simon Schiff, Ralf Möller
{"title":"利用研究数据存储库中存档研究数据的重用实现联邦信息系统","authors":"Sylvia Melzer, Stefan Thiemann, Simon Schiff, Ralf Möller","doi":"10.5334/dsj-2023-039","DOIUrl":null,"url":null,"abstract":"At universities, research data is increasingly stored in research data repositories according to a data management plan (DMP) and thus made available for further use. The challenge of reusing hundreds, thousands, or millions of data sets is to obtain an overview of the data in a short period of time and to search through all the data. The high variability of the formats used to store research data requires a new approach to data reusability that focuses on the visualisation and searchability of archived research data, which can also be combined with each other. In this article, we present a practical DMP that describes how information systems can be created on demand by reusing research data archived in research data repositories and how these systems can be merged into a federated information system. As a result, in our projects, information systems have been created in minutes or a couple of hours with few resources. The initial effort to create a federated system remains; however, this allows federated searches to be performed. Extending a federated system to include other information systems can then be accomplished by making a few configurations and manageable adjustments to the source code.","PeriodicalId":35375,"journal":{"name":"Data Science Journal","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Implementation of a Federated Information System by Means of Reuse of Research Data Archived in Research Data Repositories\",\"authors\":\"Sylvia Melzer, Stefan Thiemann, Simon Schiff, Ralf Möller\",\"doi\":\"10.5334/dsj-2023-039\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"At universities, research data is increasingly stored in research data repositories according to a data management plan (DMP) and thus made available for further use. The challenge of reusing hundreds, thousands, or millions of data sets is to obtain an overview of the data in a short period of time and to search through all the data. The high variability of the formats used to store research data requires a new approach to data reusability that focuses on the visualisation and searchability of archived research data, which can also be combined with each other. In this article, we present a practical DMP that describes how information systems can be created on demand by reusing research data archived in research data repositories and how these systems can be merged into a federated information system. As a result, in our projects, information systems have been created in minutes or a couple of hours with few resources. The initial effort to create a federated system remains; however, this allows federated searches to be performed. Extending a federated system to include other information systems can then be accomplished by making a few configurations and manageable adjustments to the source code.\",\"PeriodicalId\":35375,\"journal\":{\"name\":\"Data Science Journal\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data Science Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5334/dsj-2023-039\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data Science Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5334/dsj-2023-039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0

摘要

在大学,越来越多的研究数据根据数据管理计划(DMP)存储在研究数据存储库中,从而可供进一步使用。重用数百、数千或数百万个数据集的挑战是在短时间内获得数据的概览并搜索所有数据。用于存储研究数据的格式的高度可变性需要一种新的数据可重用性方法,该方法侧重于存档研究数据的可视化和可搜索性,它们也可以相互结合。在本文中,我们提出了一个实用的DMP,它描述了如何通过重用存档在研究数据存储库中的研究数据来按需创建信息系统,以及如何将这些系统合并到一个联合信息系统中。因此,在我们的项目中,信息系统是在几分钟或几个小时内用很少的资源创建的。创建联邦系统的最初努力仍然存在;但是,这允许执行联邦搜索。扩展联邦系统以包含其他信息系统,然后可以通过对源代码进行一些配置和可管理的调整来完成。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Implementation of a Federated Information System by Means of Reuse of Research Data Archived in Research Data Repositories
At universities, research data is increasingly stored in research data repositories according to a data management plan (DMP) and thus made available for further use. The challenge of reusing hundreds, thousands, or millions of data sets is to obtain an overview of the data in a short period of time and to search through all the data. The high variability of the formats used to store research data requires a new approach to data reusability that focuses on the visualisation and searchability of archived research data, which can also be combined with each other. In this article, we present a practical DMP that describes how information systems can be created on demand by reusing research data archived in research data repositories and how these systems can be merged into a federated information system. As a result, in our projects, information systems have been created in minutes or a couple of hours with few resources. The initial effort to create a federated system remains; however, this allows federated searches to be performed. Extending a federated system to include other information systems can then be accomplished by making a few configurations and manageable adjustments to the source code.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Data Science Journal
Data Science Journal Computer Science-Computer Science (miscellaneous)
CiteScore
5.40
自引率
0.00%
发文量
17
审稿时长
10 weeks
期刊介绍: The Data Science Journal is a peer-reviewed electronic journal publishing papers on the management of data and databases in Science and Technology. Details can be found in the prospectus. The scope of the journal includes descriptions of data systems, their publication on the internet, applications and legal issues. All of the Sciences are covered, including the Physical Sciences, Engineering, the Geosciences and the Biosciences, along with Agriculture and the Medical Science. The journal publishes papers about data and data systems; it does not publish data or data compilations. However it may publish papers about methods of data compilation or analysis.
期刊最新文献
Data on the Margins – Data from LGBTIQ+ Populations in European Social Science Data Archives Insights on Sustainability of Earth Science Data Infrastructure Projects Using OpenBIS as Virtual Research Environment: An ELN-LIMS Open-Source Database Tool as a Framework within the CRC 1411 Design of Particulate Products Umbrella Data Management Plans to Integrate FAIR Data: Lessons From the ISIDORe and BY-COVID Consortia for Pandemic Preparedness The Launch of the <em>Data Science Journal</em>&nbsp;in 2002
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1