Open Science is based on a collaborative network to develop transparent, accessible, and shared knowledge. Open Research Data Platforms (ORDPs) are deployed to fulfill the needs for data sharing of a specific community and/or scientific discipline. The high variety of research areas creates a barrier to data sharing between research entities. To enable this research data to be found by the research entities that need it, it is necessary to establish access to different ORDPs that are unknown to these research entities. The goal of this article is to provide a quantitative analysis showing the current limitations of data sharing between ORDPs in Open Science. We then propose a solution to improve data access and sharing based on theoretical foundations and an experimental approach.
We propose to extend our theoretical interoperability model, which helps us to define the necessary steps to interoperate ORDPs. We present and discuss a quantitative evaluation of ORDPs’ interoperability. Based on this exploratory study, we propose a solution that enables research entities to discover unknown ORDPs, thereby facilitating access to relevant data. This solution is the Open Science Data Network (OSDN), a decentralized, distributed, and federated network of ORDPs that integrates a query propagation process and robustness features. To enable the deployment of OSDN at an Open Science scale, we designed our solution by considering its adoption cost relative to a non-organized interoperability approach. With two ORDPs integrated into the OSDN, the adoption cost is estimated to be reduced by at least 17%. This reduction approaches 100% as the number of integrated ORDPs increases.
To demonstrate the feasibility of the solution, we developed a Proof of Concept (POC) and applied it to two research projects from different domains and involving distinct research communities. For the first research project, we measured a 7% increase in the volume of accessed data and an 80% reduction in the time needed to find this data. In addition, researcher from this experiment was able to formulate new intra- and interdisciplinary research questions thanks to the newly accessed data. In the second research project, we observed an increase in data volume of up to a factor of 3968. More importantly, this process led to the discovery of new essential data that was previously missing.
扫码关注我们
求助内容:
应助结果提醒方式:
