CollectionLocator Level 1: Metadata-Based Search for Collections in Federated Biobanks
Volodymyr A. Shekhovtsov, Bence Slajcho, Aron Sacherer, Johann Eder
arXiv - CS - Databases, 2024-08-29, arXiv:2408.16422
Abstract
Biobanks are indispensable resources for medical research: they collect
biological material and associated data and make them available for research
projects and medical studies. To serve this purpose, biobank data has to meet
certain criteria, which can be formulated as adherence to the FAIR (findable,
accessible, interoperable, and reusable) principles. We developed a tool,
CollectionLocator, which aims to increase the FAIR compliance of biobank data
by helping researchers identify which biobank and which collection are likely
to contain cases (material and data) that satisfy the requirements of a defined
research project when the detailed sample data is not available due to privacy
restrictions. CollectionLocator builds on an ontology-based metadata model to
address the enormous heterogeneity and to protect the privacy of the donors of
the biological samples and the data. Furthermore, CollectionLocator represents
the data and metadata quality of the collections so that the quality
requirements of the requester can be matched against the quality of the
available data. The concept of CollectionLocator is evaluated with a
proof-of-concept implementation.
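
The general idea of searching collection-level metadata instead of sample data can be illustrated with a minimal sketch. It is not the CollectionLocator implementation: the class names, the `locate` function, the concept codes, the single `completeness` metric, and the coverage score are all assumptions made for illustration, and the abstract does not specify how matching or ranking is actually done.

```python
from dataclasses import dataclass, field


@dataclass
class CollectionMetadata:
    """Aggregated, non-identifying description of one collection (hypothetical schema)."""
    biobank: str
    collection: str
    concepts: set  # ontology concept codes the collection reports covering
    quality: dict = field(default_factory=dict)  # quality metric name -> value in [0, 1]


@dataclass
class Request:
    """What a researcher asks for (hypothetical schema)."""
    required_concepts: set
    min_quality: dict = field(default_factory=dict)  # metric name -> minimum acceptable value


def locate(request, catalog):
    """Return (collection, coverage) pairs, best coverage first.

    Only metadata is inspected; no sample-level records are touched.
    """
    hits = []
    for coll in catalog:
        # Drop collections whose reported quality is below any requested threshold.
        # A metric the collection does not report is treated as 0.0 (an assumption).
        if any(coll.quality.get(metric, 0.0) < minimum
               for metric, minimum in request.min_quality.items()):
            continue
        wanted = request.required_concepts
        # Coverage: fraction of the requested concepts the collection reports.
        coverage = len(wanted & coll.concepts) / len(wanted) if wanted else 1.0
        if coverage > 0:
            hits.append((coll, coverage))
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


# Illustrative catalog; the biobanks, collections, and codes are made up.
catalog = [
    CollectionMetadata("Biobank A", "breast-cancer-cohort",
                       {"ICD10:C50", "material:serum", "data:age"},
                       {"completeness": 0.95}),
    CollectionMetadata("Biobank B", "oncology-mixed",
                       {"ICD10:C50", "data:age"},
                       {"completeness": 0.80}),
    CollectionMetadata("Biobank C", "legacy",
                       {"ICD10:C50", "material:serum", "data:age"},
                       {"completeness": 0.40}),
]

request = Request(required_concepts={"ICD10:C50", "material:serum"},
                  min_quality={"completeness": 0.7})

results = locate(request, catalog)
for coll, coverage in results:
    print(coll.biobank, coll.collection, coverage)
```

In this sketch the quality requirement acts as a hard filter, while concept coverage only ranks the remaining candidates, so a partially matching collection is still reported as "likely to contain" relevant cases. A real system could equally treat both as weighted scores.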