Digital Object Identifiers (DOIs) Prove Highly Effective for Long-Term Data Availability in PLOS ONE

IF 0.4 Q4 INFORMATION SCIENCE & LIBRARY SCIENCE Evidence Based Library and Information Practice Pub Date : 2023-09-24 DOI:10.18438/eblip30378
Hilary Jasmin
{"title":"Digital Object Identifiers (DOIs) Prove Highly Effective for Long-Term Data Availability in PLOS ONE","authors":"Hilary Jasmin","doi":"10.18438/eblip30378","DOIUrl":null,"url":null,"abstract":"A Review of: Federer, L. M. (2022). Long-term availability of data associated with articles in PLOS ONE. PLOS ONE 17(8), Article e0272845. https://doi.org/10.1371/journal.pone.0272845 Objective – To retrieve a range of PLOS ONE data availability statements and quantify their ability to point to the study data efficiently and accurately. Research questions focused on availability over time, availability of URLs versus DOIs, the ability to locate resources using the data availability statement and availability based on data sharing method. Design – Observational study. Setting – PLOS ONE archive. Subjects – A corpus of 47,593 data availability statements from research articles in PLOS ONE between March 1, 2014, and May 31, 2016. Methods – Use of custom R scripts to retrieve 47,593 data availability statements; of these, 6,912 (14.5%) contained at least one URL or DOI. Once these links were extracted, R scripts were run to fetch the resources and record HTTP status codes to determine if the resource was discoverable. To address the potential for the DOI or URL to fetch but not actually contain the appropriate data, the researchers selected at random and manually retrieved the data for 350 URLs and 350 DOIs. Main Results – Of the unique URLs, 75% were able to be automatically retrieved by custom R scripts. In the manual sample of 350 URLs, which was used to test for accuracy of the URLs in containing the data, there was a 78% retrieval rate. Of the unique DOIs, 90% were able to be automatically retrieved by custom R scripts. The manual sample of 350 DOIs had a 98% retrieval rate. Conclusion – DOIs, especially those linked with a repository, had the highest rate of success in retrieving the data attached to the article. While URLs were better than no link at all, URLs are susceptible to content drift and need more management for long-term data availability.","PeriodicalId":45227,"journal":{"name":"Evidence Based Library and Information Practice","volume":null,"pages":null},"PeriodicalIF":0.4000,"publicationDate":"2023-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Evidence Based Library and Information Practice","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18438/eblip30378","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

A Review of: Federer, L. M. (2022). Long-term availability of data associated with articles in PLOS ONE. PLOS ONE 17(8), Article e0272845. https://doi.org/10.1371/journal.pone.0272845 Objective – To retrieve a range of PLOS ONE data availability statements and quantify their ability to point to the study data efficiently and accurately. Research questions focused on availability over time, availability of URLs versus DOIs, the ability to locate resources using the data availability statement and availability based on data sharing method. Design – Observational study. Setting – PLOS ONE archive. Subjects – A corpus of 47,593 data availability statements from research articles in PLOS ONE between March 1, 2014, and May 31, 2016. Methods – Use of custom R scripts to retrieve 47,593 data availability statements; of these, 6,912 (14.5%) contained at least one URL or DOI. Once these links were extracted, R scripts were run to fetch the resources and record HTTP status codes to determine if the resource was discoverable. To address the potential for the DOI or URL to fetch but not actually contain the appropriate data, the researchers selected at random and manually retrieved the data for 350 URLs and 350 DOIs. Main Results – Of the unique URLs, 75% were able to be automatically retrieved by custom R scripts. In the manual sample of 350 URLs, which was used to test for accuracy of the URLs in containing the data, there was a 78% retrieval rate. Of the unique DOIs, 90% were able to be automatically retrieved by custom R scripts. The manual sample of 350 DOIs had a 98% retrieval rate. Conclusion – DOIs, especially those linked with a repository, had the highest rate of success in retrieving the data attached to the article. While URLs were better than no link at all, URLs are susceptible to content drift and need more management for long-term data availability.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
数字对象标识符(DOIs)被证明对PLOS ONE的长期数据可用性非常有效
回顾:费德勒,l.m.(2022)。与PLOS ONE中的文章相关的数据的长期可用性。科学通报,17(8),第0272845篇。https://doi.org/10.1371/journal.pone.0272845目的-检索一系列PLOS ONE数据可用性声明,并量化其有效准确地指向研究数据的能力。研究问题集中在随着时间推移的可用性、url与doi的可用性、使用数据可用性语句定位资源的能力和基于数据共享方法的可用性。设计:观察性研究。设置- PLOS ONE存档。主题:2014年3月1日至2016年5月31日期间,PLOS ONE上的47,593篇研究文章的数据可用性声明的语料。方法-使用自定义R脚本检索47,593个数据可用性语句;其中,6912篇(14.5%)至少包含一个URL或DOI。提取这些链接后,运行R脚本来获取资源并记录HTTP状态码,以确定资源是否可发现。为了解决DOI或URL获取但实际上不包含适当数据的可能性,研究人员随机选择并手动检索350个URL和350个DOI的数据。主要结果——在唯一的url中,75%能够被自定义R脚本自动检索。在350个url的手动样本中,用于测试包含数据的url的准确性,检索率为78%。在唯一的doi中,90%能够通过自定义R脚本自动检索。人工取样350个DOIs,回收率98%。结论- doi,特别是那些与存储库链接的doi,在检索附在文章上的数据时成功率最高。虽然url总比没有链接好,但url容易受到内容漂移的影响,需要对长期数据可用性进行更多的管理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Evidence Based Library and Information Practice
Evidence Based Library and Information Practice INFORMATION SCIENCE & LIBRARY SCIENCE-
CiteScore
0.80
自引率
12.50%
发文量
44
审稿时长
12 weeks
期刊最新文献
Students’ Perspective of the Advantages and Disadvantages of ChatGPT Compared to Reference Librarians Academic Libraries Can Develop AI Chatbots for Virtual Reference Services with Minimal Technical Knowledge and Limited Resources A Study on the Knowledge and Perception of Artificial Intelligence Increasing Student Engagement in a Re-opened Regional Campus Library: Results from a Student Focus Group Gauging Academic Unit Perceptions of Library Services During a Transition in University Budget Models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1