International journal of digital curation最新文献

英文中文

Data Curation in Interdisciplinary and Highly Collaborative Research 跨学科和高度协作研究中的数据管理

International journal of digital curation

Pub Date : 2023-10-01 DOI: 10.2218/ijdc.v17i1.835

Inna Kouper

This paper provides a systematic analysis of publications that discuss data curation in interdisciplinary and highly collaborative research (IHCR). Using content analysis methodology, it examined 159 publications and identified patterns in definitions of interdisciplinarity, projectsâ€™ participants and methodologies, and approaches to data curation. The findings suggest that data is a prominent component in interdisciplinarity. In addition to crossing disciplinary and other boundaries, IHCR is defined as curating and integrating heterogeneous data and creating new forms of knowledge from it. Using personal experiences and descriptive approaches, the publications discussed challenges that data curation in IHCR faces, including an increased overhead in coordination and management, lack of consistent metadata practices, and custom infrastructure that makes interoperability across projects, domains, and repositories difficult. The paper concludes with suggestions for future research.

本文对跨学科和高度合作研究(IHCR)中讨论数据管理的出版物进行了系统分析。使用内容分析方法，它检查了159份出版物，并确定了跨学科定义、项目参与者和方法以及数据管理方法的模式。研究结果表明，数据是跨学科的重要组成部分。除了跨越学科和其他界限外，国际人道主义责任还被定义为管理和整合异构数据，并从中创造新形式的知识。这些出版物利用个人经验和描述性方法，讨论了IHCR数据管理面临的挑战，包括协调和管理方面的开销增加，缺乏一致的元数据实践，以及使项目、领域和存储库之间的互操作性变得困难的定制基础设施。最后，对今后的研究提出了建议。

引用次数: 0

If Data is Used in the Forest and No-one is Around to Hear it, Did it Happen? A Citation Count Investigation 如果数据在森林中被使用，而周围没有人听到，这是真的吗?引文统计调查

International journal of digital curation

Pub Date : 2023-08-22 DOI: 10.2218/ijdc.v17i1.830

S. Borda

In this article I describe the process and results of tracking a citation from a data repository through the article publication process and trying to add a citation event to one of our DOIs. I also discuss some other confusing aspects related to citation counts as indicated in various systems, including reference managers, the publisher’s perspective, aggregators, and DOI minters. I discovered numerous problems with citations. Addressing these problems is important as citations can be key to determining both the original use and reuse of a dataset, especially for repositories that do not track usage by requiring people to login or provide an email to download a dataset. The lack of transparency in some data citation systems and processes obscures how and where data is being used.

在本文中，我将描述通过文章发布过程跟踪来自数据存储库的引用的过程和结果，并尝试将引用事件添加到我们的一个doi中。我还讨论了在各种系统中与引用计数相关的其他一些令人困惑的方面，包括参考管理器、出版商的视角、聚合器和DOI分配器。我发现了很多引用的问题。解决这些问题很重要，因为引用可能是确定数据集的原始使用和重用的关键，特别是对于不通过要求人们登录或提供电子邮件来下载数据集来跟踪使用情况的存储库。一些数据引用系统和流程缺乏透明度，使数据的使用方式和地点变得模糊。

引用次数: 0

Analysis of U.S. Federal Funding Agency Data Sharing Policies 美国联邦资助机构数据共享政策分析

International journal of digital curation

Pub Date : 2023-02-08 DOI: 10.2218/ijdc.v17i1.791

Reid I. Boehm, H. Calkins, Patricia B. Condon, J. Petters, Rachel Woodbrook

Federal funding agencies in the United States (U.S.) continue to work towards implementing their plans to increase public access to funded research and comply with the 2013 Office of Science and Technology memo Increasing Access to the Results of Federally Funded Scientific Research. In this article we report on an analysis of research data sharing policy documents from 17 U.S. federal funding agencies as of February 2021. Our analysis is guided by two questions: 1.) What do the findings suggest about the current state of and trends in U.S. federal funding agency data sharing requirements? 2.) In what ways are universities, institutions, associations, and researchers affected by and responding to these policies? Over the past five years, policy updates were common among these agencies and several themes have been thoroughly developed in that time; however, uncertainty remains around how funded researchers are expected to satisfy these policy requirements.

美国联邦资助机构继续努力实施其计划，以增加公众获得资助研究的机会，并遵守2013年科技办公室备忘录《增加获得联邦资助科学研究成果的机会》。在本文中，我们对截至2021年2月的17个美国联邦资助机构的研究数据共享政策文件进行了分析。我们的分析以两个问题为指导：1.）研究结果对美国联邦资助机构数据共享要求的现状和趋势有何启示？2.）大学、机构、协会和研究人员在哪些方面受到这些政策的影响并对其做出回应？在过去五年中，这些机构的政策更新很常见，在这段时间里，一些主题得到了彻底发展；然而，受资助的研究人员如何满足这些政策要求仍然存在不确定性。

引用次数: 0

Long-Term Preservation and Reusability of Open Access Scholar-Led Press Monographs 开放获取学者主导的出版专著的长期保存和可重用性

International journal of digital curation

Pub Date : 2023-02-05 DOI: 10.2218/ijdc.v17i1.826

Miranda Barnes, Ross Higman, G. Cole, Rupert Gatti, J. Fry

This brief report outlines some initial findings and challenges identified by the Community-Led Open Publication Infrastructures for Monographs (COPIM) project when looking to archive and preserve open access books produced by small, scholar-led presses. This paper is based on the research conducted by Work Package 7 in COPIM, which has a focus on the preservation and archiving of open access monographs in all their complexity, along with any accompanying materials.

这份简短的报告概述了社区主导的专著开放出版基础设施(COPIM)项目在寻求存档和保存由小型学者主导的出版社出版的开放获取图书时发现的一些初步发现和挑战。本文基于COPIM工作包7进行的研究，其重点是保存和归档开放获取专著的所有复杂性，以及任何随附材料。

引用次数: 0

Cluster Analysis of Open Research Data: A Case for Replication Metadata 开放研究数据的聚类分析:复制元数据的一个案例

International journal of digital curation

Pub Date : 2023-02-02 DOI: 10.2218/ijdc.v17i1.833

Ana Trisovic

Research data are often released upon journal publication to enable result verification and reproducibility. For that reason, research dissemination infrastructures typically support diverse datasets coming from numerous disciplines, from tabular data and program code to audio-visual files. Metadata, or data about data, is critical to making research outputs adequately documented and FAIR. Aiming to contribute to the discussions on the development of metadata for research outputs, I conducted an exploratory analysis to determine how research datasets cluster based on what researchers organically deposit together. I use the content of over 40,000 datasets from the Harvard Dataverse research data repository as my sample for the cluster analysis. I find that the majority of the clusters are formed by single-type datasets, while in the rest of the sample, no meaningful clusters can be identified. For the result interpretation, I use the metadata standard employed by DataCite, a leading organization for documenting a scholarly record, and map existing resource types to my results. About 65% of the sample can be described with a single-type metadata (such as Dataset, Software orReport), while the rest would require aggregate metadata types. Though DataCite supports an aggregate type such as a Collection, I argue that a significant number of datasets, in particular those containing both data and code files (about 20% of the sample), would be more accurately described as a Replication resource metadata type. Such resource type would be particularly useful in facilitating research reproducibility.

研究数据通常在期刊发表时发布，以便结果验证和可重复性。因此，研究传播基础设施通常支持来自众多学科的各种数据集，从表格数据和程序代码到视听文件。元数据，或关于数据的数据，对于使研究成果充分记录和公平至关重要。为了促进对研究产出元数据发展的讨论，我进行了探索性分析，以确定研究数据集如何基于研究人员有机沉积在一起。我使用来自Harvard Dataverse研究数据存储库的40,000多个数据集的内容作为聚类分析的样本。我发现大多数聚类是由单一类型的数据集形成的，而在其余的样本中，没有发现有意义的聚类。对于结果解释，我使用DataCite使用的元数据标准，DataCite是记录学术记录的领先组织，并将现有资源类型映射到我的结果。大约65%的样本可以用单一类型的元数据(如Dataset、Software或report)来描述，而其余的则需要聚合元数据类型。虽然DataCite支持聚合类型，比如Collection，但我认为有相当数量的数据集，特别是那些同时包含数据和代码文件的数据集(约占样本的20%)，可以更准确地描述为Replication资源元数据类型。这种资源类型对于促进研究的可重复性特别有用。

{"title":"Cluster Analysis of Open Research Data: A Case for Replication Metadata","authors":"Ana Trisovic","doi":"10.2218/ijdc.v17i1.833","DOIUrl":"https://doi.org/10.2218/ijdc.v17i1.833","url":null,"abstract":"Research data are often released upon journal publication to enable result verification and reproducibility. For that reason, research dissemination infrastructures typically support diverse datasets coming from numerous disciplines, from tabular data and program code to audio-visual files. Metadata, or data about data, is critical to making research outputs adequately documented and FAIR. Aiming to contribute to the discussions on the development of metadata for research outputs, I conducted an exploratory analysis to determine how research datasets cluster based on what researchers organically deposit together. I use the content of over 40,000 datasets from the Harvard Dataverse research data repository as my sample for the cluster analysis. I find that the majority of the clusters are formed by single-type datasets, while in the rest of the sample, no meaningful clusters can be identified. For the result interpretation, I use the metadata standard employed by DataCite, a leading organization for documenting a scholarly record, and map existing resource types to my results. About 65% of the sample can be described with a single-type metadata (such as Dataset, Software orReport), while the rest would require aggregate metadata types. Though DataCite supports an aggregate type such as a Collection, I argue that a significant number of datasets, in particular those containing both data and code files (about 20% of the sample), would be more accurately described as a Replication resource metadata type. Such resource type would be particularly useful in facilitating research reproducibility.","PeriodicalId":87279,"journal":{"name":"International journal of digital curation","volume":"187 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135361093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Proposal for a Maturity Continuum Model for Open Research Data 开放研究数据成熟度连续体模型的建议

International journal of digital curation

Pub Date : 2023-01-27 DOI: 10.2218/ijdc.v17i1.821

M. Guirlet, Gaia Bongi, Elise Point, Grégoire Urvoy, René Schneider

As a contribution to the general effort in research to generalize and improve the practices of Open Research Data (ORD), we developed a model conceptualizing the degrees of maturity of a research community in terms of ORD. This model may be used to assess the ORD capacity or maturity level of a specific research community, to strengthen the use of standards with respect to ORD within this community, and to increase its ORD maturity level. We present the background and our motivations for developing such an instrument as well as the reasoning leading to its design. We present its elements in detail and discuss possible applications.

作为对研究中推广和改进开放研究数据（ORD）实践的总体努力的贡献，我们开发了一个模型，从ORD的角度概念化了研究社区的成熟度。该模型可用于评估特定研究社区的ORD能力或成熟度水平，加强该社区内ORD标准的使用，并提高其ORD成熟度。我们介绍了开发这种仪器的背景和动机，以及设计它的原因。我们详细介绍了它的组成部分，并讨论了可能的应用。

引用次数: 0

Putting the R into PlatfoRms 把R放到平台里

International journal of digital curation

Pub Date : 2022-12-12 DOI: 10.2218/ijdc.v17i1.843

K. Levett, Jonathan Smillie, A. Treloar

This paper looks at the question of how and why to bring about greater reusability of Research Platforms (variously called Virtual Laboratories, Virtual Research Environments, or Science Gateways). It begins with some context for the Australian Research Data Commons, where the authors are based. It then examines the infrastructure concerns that are driving the need for platforms to be created and remain sustainable, and the connection from this to reusability. The paper then proceeds to discuss the ways in which FAIR is being extended to a range of research objects and infrastructure elements, before reviewing the work of the FAIR4VREs WG. The core of the paper is an examination, with examples or case studies, of four different paradigms for platform reusability: accessing, adopting, adapting, and abstracting. The paper concludes by examining actions undertaken by the ARDC to increase the likelihood of reusability.

本文着眼于如何以及为什么使研究平台(各种称为虚拟实验室、虚拟研究环境或科学网关)具有更大的可重用性。文章首先介绍了作者所在的澳大利亚研究数据共享中心的一些背景。然后分析了驱动平台创建和保持可持续性需求的基础设施问题，以及由此与可重用性之间的联系。在回顾FAIR4VREs工作组的工作之前，本文接着讨论了将FAIR扩展到一系列研究对象和基础设施要素的方式。本文的核心是通过示例或案例研究，对平台可重用性的四种不同范式进行考察:访问、采用、适应和抽象。最后，本文考察了ARDC为提高可重用性所采取的行动。

引用次数: 0

Data Showcases: the Data Journal in a Multimodal World 数据展示案例：多模式世界中的数据期刊

International journal of digital curation

Pub Date : 2022-12-06 DOI: 10.2218/ijdc.v17i1.789

L. Breure, P. Doorn, H. Voorbij

As an experiment, the Research Data Journal for the Humanities and Social Sciences (RDJ) has temporarily extended the usual format of the online journal with so-called ‘showcases’, separate web pages containing a quick introduction to a dataset, embedded multimedia, interactive components, and facilities to directly preview and explore the dataset described. The aim was to create a coherent hyper document with content communicated via different media (multimodality) and provide space for new forms of scientific publication such as executable papers (e.g. Jupyter notebooks). This paper discusses the objectives, technical implementations, and the need for innovation in data publishing considering the advanced possibilities of today's digital modes of communication. The data showcases experiment proved to be a useful starting point for an exploration of related developments within and outside the humanities and social sciences. It turns out that small-scale experiments are relatively easy to perform thanks to the easy availability of digital technology. However, real innovation in publishing affects organization and infrastructure and requires the joint effort of publishers, editors, data repositories, and authors. It implies a thorough update of the concept of publication and adaptation of the production process. This paper also pays attention to these obstacles to taking new paths.

作为一项实验，《人文与社会科学研究数据杂志》（RDJ）暂时扩展了在线期刊的常见格式，增加了所谓的“展示”，即包含数据集快速介绍的独立网页、嵌入式多媒体、交互式组件，以及直接预览和探索所述数据集的设施。其目的是创建一个连贯的超文档，其中包含通过不同媒体传播的内容（多模式），并为可执行论文（如Jupyter笔记本）等新形式的科学出版物提供空间。考虑到当今数字通信模式的先进可能性，本文讨论了数据发布的目标、技术实现和创新的必要性。这些数据展示了实验被证明是探索人文科学和社会科学内外相关发展的有用起点。事实证明，由于数字技术的普及，小规模实验相对容易进行。然而，出版业真正的创新会影响组织和基础设施，需要出版商、编辑、数据存储库和作者的共同努力。这意味着彻底更新出版的概念，并对制作过程进行改编。本文还注意到了这些障碍，以采取新的道路。

{"title":"Data Showcases: the Data Journal in a Multimodal World","authors":"L. Breure, P. Doorn, H. Voorbij","doi":"10.2218/ijdc.v17i1.789","DOIUrl":"https://doi.org/10.2218/ijdc.v17i1.789","url":null,"abstract":" \u0000 As an experiment, the Research Data Journal for the Humanities and Social Sciences (RDJ) has temporarily extended the usual format of the online journal with so-called ‘showcases’, separate web pages containing a quick introduction to a dataset, embedded multimedia, interactive components, and facilities to directly preview and explore the dataset described. The aim was to create a coherent hyper document with content communicated via different media (multimodality) and provide space for new forms of scientific publication such as executable papers (e.g. Jupyter notebooks). This paper discusses the objectives, technical implementations, and the need for innovation in data publishing considering the advanced possibilities of today's digital modes of communication. The data showcases experiment proved to be a useful starting point for an exploration of related developments within and outside the humanities and social sciences. It turns out that small-scale experiments are relatively easy to perform thanks to the easy availability of digital technology. However, real innovation in publishing affects organization and infrastructure and requires the joint effort of publishers, editors, data repositories, and authors. It implies a thorough update of the concept of publication and adaptation of the production process. This paper also pays attention to these obstacles to taking new paths.","PeriodicalId":87279,"journal":{"name":"International journal of digital curation","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42515062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Data Curation Strategies to Support Responsible Big Social Research and Big Social Data Reuse 支持负责任的大社会研究和大社会数据重用的数据管理策略

International journal of digital curation

Pub Date : 2022-12-06 DOI: 10.2218/ijdc.v17i1.823

Sara Mannheimer

Big social research repurposes existing data from online sources such as social media, blogs, or online forums, with a goal of advancing knowledge of human behavior and social phenomena. Big social research also presents an array of challenges that can prevent data sharing and reuse. This brief report presents an overview of a larger study that aims to understand the data curation implications of big social research to support use and reuse of big social data. The study, which is based in the United States, identifies six key issues relating to big social research and big social data curation through a review of the literature. It then further investigates perceptions and practices relating to these six key issues through semi-structured interviews with big social researchers and data curators. This report concludes with implications for data curation practice: metadata and documentation, connecting with researchers throughout the research process, data repository services, and advocating for community standards. Supporting responsible practices for using big social data can help scale up social science research, thus enhancing our understanding of human behavior and social phenomena.

大型社会研究重新利用社交媒体、博客或在线论坛等在线来源的现有数据，目的是提高对人类行为和社会现象的了解。大型社会研究也带来了一系列挑战，这些挑战可能会阻碍数据共享和重用。这份简短的报告概述了一项更大的研究，该研究旨在了解大社会研究对数据管理的影响，以支持大社会数据的使用和再利用。这项总部位于美国的研究通过对文献的回顾，确定了与大社会研究和大社会数据管理有关的六个关键问题。然后，它通过对大型社会研究人员和数据策展人的半结构化采访，进一步调查了与这六个关键问题相关的看法和实践。本报告总结了对数据管理实践的影响：元数据和文档，在整个研究过程中与研究人员联系，数据存储库服务，以及倡导社区标准。支持负责任地使用大社会数据可以帮助扩大社会科学研究的规模，从而增强我们对人类行为和社会现象的理解。

{"title":"Data Curation Strategies to Support Responsible Big Social Research and Big Social Data Reuse","authors":"Sara Mannheimer","doi":"10.2218/ijdc.v17i1.823","DOIUrl":"https://doi.org/10.2218/ijdc.v17i1.823","url":null,"abstract":"Big social research repurposes existing data from online sources such as social media, blogs, or online forums, with a goal of advancing knowledge of human behavior and social phenomena. Big social research also presents an array of challenges that can prevent data sharing and reuse. \u0000This brief report presents an overview of a larger study that aims to understand the data curation implications of big social research to support use and reuse of big social data. The study, which is based in the United States, identifies six key issues relating to big social research and big social data curation through a review of the literature. It then further investigates perceptions and practices relating to these six key issues through semi-structured interviews with big social researchers and data curators. \u0000This report concludes with implications for data curation practice: metadata and documentation, connecting with researchers throughout the research process, data repository services, and advocating for community standards. Supporting responsible practices for using big social data can help scale up social science research, thus enhancing our understanding of human behavior and social phenomena.","PeriodicalId":87279,"journal":{"name":"International journal of digital curation","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45352614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Who Writes Scholarly Code? 谁写学术代码？

International journal of digital curation

Pub Date : 2022-11-01 DOI: 10.2218/ijdc.v17i1.839

Sarah Nguyễn, Vicky Rampin

This paper presents original research about the behaviours, histories, demographics, and motivations of scholars who code, specifically how they interact with version control systems locally and on the Web. By understanding patrons through multiple lenses – daily productivity habits, motivations, and scholarly needs – librarians and archivists can tailor services for software management, curation, and long-term reuse, raising the possibility for long-term reproducibility of a multitude of scholarship.

本文介绍了关于编码学者的行为、历史、人口统计和动机的原始研究，特别是他们如何与本地和网络上的版本控制系统进行交互。通过从多个角度了解读者——日常工作习惯、动机和学术需求——图书馆员和档案管理员可以为软件管理、管理和长期重用量身定制服务，从而提高大量学术成果长期可复制的可能性。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

International journal of digital curation

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀