An examination of metadata practices for research data reuse: Characteristics and predictive probability of metadata elements
Authors: Min Sook Park, Hyoungjoo Park
Journal: Malaysian Journal of Library & Information Science, vol. 24, pp. 61-75
Published: 2019-12-01
DOI: https://doi.org/10.22452/mjlis.vol24no3.4
Impact factor: 0.5 (JCR Q3, Information Science & Library Science)
Citations: 1
Abstract
This study explores metadata practices in relation to data reuse in biology. Metadata has long been viewed as a major component of research data management and reuse, yet whether metadata is used in a way that encourages data reuse has been understudied. The study examined the metadata elements used to describe datasets and the predictive probability of those elements for data reuse, under the assumption that citation frequency reflects the frequency of research data reuse. A total of 34,491 cited records from the biology category of the Clarivate Analytics Data Citation Index were analyzed using descriptive comparison and multiple regression analysis to compare usage patterns of metadata elements between data records cited more than twice and those cited only once. Of the five types of metadata elements identified and examined, elements that described the datasets and elements that gave author-related information appeared most often across datasets, whereas DOIs and ORCID identifiers were scarce. Metadata related to authors and funding resources were found to be positive predictors of data reuse, whereas data descriptions and identifiers appeared to have a negative influence. This study contributes to a better understanding of metadata needs for data reuse.