International journal of digital curation: latest articles

Scaling by Optimising: Modularisation of Data Curation Services in Growing Organisations

International journal of digital curation

Pub Date : 2021-01-01 DOI: 10.2218/ijdc.v16i1.650

Hagen Peukert

After a century of theorising and applying management practices, we are in the middle of entering a new stage in management science: digital management. The management of digital data submerges in traditional functions of management and, at the same time, continues to recreate viable solutions and conceptualisations in its established fields, e.g. research data management. Yet, one can observe bilateral synergies and mutual enrichment of traditional and data management practices in all fields. The paper at hand addresses a case in point, in which new and old management practices amalgamate to meet a steadily, in part characterised by leaps and bounds, increasing demand of data curation services in academic institutions. The idea of modularisation, as known from software engineering, is applied to data curation workflows so that economies of scale and scope can be used. While scaling refers to both management science and data science, optimising is understood in the traditional managerial sense, that is, with respect to the cost function. By means of a situation analysis describing how data curation services were applied from one department to the entire institution and an analysis of the factors of influence, a method of modularisation is outlined that converges to an optimal state of curation workflows.

Citations: 0

Assessment, Usability, and Sociocultural Impacts of DataONE

International journal of digital curation

Pub Date : 2021-01-01 DOI: 10.2218/ijdc.v16i1.678

Robert J. Sandusky, Suzie L. Allard, Lynn Baird, L. Cannon, Kevin Crowston, Amy Forrester, Bruce Grant, Rachael Hu, R. Olendorf, Danielle Pollock, A. Specht, C. Tenopir, Rachel Volentine

DataONE, funded from 2009-2019 by the U.S. National Science Foundation, is an early example of a large-scale project that built both a cyberinfrastructure and culture of data discovery, sharing, and reuse. DataONE used a Working Group model, where a diverse group of participants collaborated on targeted research and development activities to achieve broader project goals. This article summarizes the work carried out by two of DataONE’s working groups: Usability & Assessment (2009-2019) and Sociocultural Issues (2009-2014). The activities of these working groups provide a unique longitudinal look at how scientists, librarians, and other key stakeholders engaged in convergence research to identify and analyze practices around research data management through the development of boundary objects, an iterative assessment program, and reflection. Members of the working groups disseminated their findings widely in papers, presentations, and datasets, reaching international audiences through publications in 25 different journals and presentations to over 5,000 people at interdisciplinary venues. The working groups helped inform the DataONE cyberinfrastructure and influenced the evolving data management landscape. By studying working groups over time, the paper also presents lessons learned about the working group model for global large-scale projects that bring together participants from multiple disciplines and communities in convergence research.

Citations: 0

Access Some Areas: Reforming Access Categories for Data in a Social Science Data Archive

International journal of digital curation

Pub Date : 2020-12-31 DOI: 10.2218/ijdc.v15i1.708

Laurence Horton, Anja Perry

In this paper we outline the process of revising data access categories for research data sets in GESIS – a large European social science data archive based in Germany. The challenge is to create a minimal set of workable access conditions that cope with a) facilitating as “open as possible, closed as necessary” expectations for data reuse; b) map on to existing legacy access categories and conditions in a data archive. The paper covers the work done in gathering data on data access categories used by data archives in their existing data catalogues, the choices offered to depositors of data in their user agreements, and work done by other data reuse platforms in categorising access to their data. Finally, we talk through the process of refining a minimal set of data access conditions for the GESIS data archive.

Citations: 0

Mutually Assured Preservation: Fostering Active Preservation Practice through Fire Drills

International journal of digital curation

Pub Date : 2020-12-31 DOI: 10.2218/ijdc.v15i1.724

Bradley J. Daigle

Sound preservation practice is a series of active engagements with the content one hopes to preserve. In many cases, this has not always been the case. Both institutions and services—while not actively encouraging passive preservation—neglect the key components in the stewardship of our historical record. In other words, there is much more to preservation than simply choosing a storage solution and placing one’s content there. The materials need to be verified, checked, and tested against expectations within the service. This is accepted practice for many. However, very few services provide the necessary assurance to test both its own user expectations as well as the depositors’ themselves. Creating a methodology for both depositor and service to be assured that preservation meets expectations is critical. This is happening in very select ways. This paper discusses one such dialogue and its function.

Citations: 0

Sustaining Digital Humanities Collections: Challenges and Community-Centred Strategies

International journal of digital curation

Pub Date : 2020-12-31 DOI: 10.2218/ijdc.v15i1.725

Katrina Fenlon

Since the advent of digital scholarship in the humanities, decades of extensive, distributed scholarly efforts have produced a digital scholarly record that is increasingly scattered, heterogeneous, and independent of curatorial institutions. Digital scholarship produces collections with unique scholarly and cultural value—collections that serve as hubs for collaboration and communication, engage broad audiences, and support new research. Yet, lacking systematic support for digital scholarship in libraries, digital humanities collections are facing a widespread crisis of sustainability. This paper provides outcomes of a multimodal study of sustainability challenges confronting digital collections in the humanities, characterizing institutional and community-oriented strategies for sustaining collections. Strategies that prioritize community engagement with collections and the maintenance of sociotechnical workflows suggest possibilities for novel approaches to collaborative, community-centred sustainability for digital humanities collections.

Citations: 6

Long-Term Data Preservation Data Lifecycle, Standardisation Process, Implementation and Lessons Learned

International journal of digital curation

Pub Date : 2020-12-31 DOI: 10.2218/ijdc.v15i1.715

M. Albani, Iolanda Maggio, Ceos Data Stewardship Interest Group

Science and Earth Observation data represent today a unique and valuable asset for humankind that should be preserved without time constraints and kept accessible and exploitable by current and future generations. In Earth Science, knowledge of the past and tracking of the evolution are at the basis of our capability to effectively respond to the global changes that are putting increasing pressure on the environment, and on human society. This can only be achieved if long time series of data are properly preserved and made accessible to support international initiatives. Within ESA Member States and beyond, Earth Science data holders are increasingly coordinating data preservation efforts to ensure that the valuable data are safeguarded against loss and kept accessible and useable for current and future generations. This task becomes increasingly challenging in view of the existing 40 years’ worth of Earth Science data stored in archives around the world and the massive increase of data volumes expected over the next years from e.g., the European Copernicus Sentinel missions. Long Term Data Preservation (LTDP) aims at maintaining information discoverable and accessible in an independent and understandable way, with supporting information, which helps ensuring authenticity, over the long term. A focal aspect of LTDP is data Curation. Data Curation refers to the management of data throughout its life cycle. Data Curation activities enable data discovery and retrieval, maintain its quality, add value, and allow data re-use over time. It includes all the processes that involve data management, such as pre-ingest initiatives, ingest functions, archival storage and preservation, dissemination, and provision of access for a designated community.

The paper presents specific aspects, of importance during the entire Earth observation data lifecycle, with respect to evolving data volumes and application scenarios. These particular issues are introduced in the section on 'Big Data' and LTDP. The Data Stewardship Reference lifecycle section describes how the data stewardship activities can be efficiently organised, while the following section addresses the overall preservation workflow and shows the technical steps to be taken during Data Curation. Earth Science Data Curation and preservation should be addressed during all mission stages - from the initial mission planning, throughout the entire mission lifetime, and during the post-mission phase. The Data Stewardship Reference Lifecycle gives a high-level overview of the steps useful for implementing Curation and preservation rules on mission data sets from initial conceptualisation or receipt through the iterative Curation cycle.

Citations: 0

Research Data Management (RDM) at the University of Ghana (UG)

International journal of digital curation

Pub Date : 2020-12-31 DOI: 10.2218/ijdc.v15i1.670

B. K. Avuglah

This article explores Research Data Management (RDM) at the University of Ghana (UG). It emphasises on institutional awareness and attitudes, and whether the University Library is officially supporting this emerging strategic interest in research focused Higher Education Institutions (HEIs). Purposive sampling was used to select information-rich respondents from across the University (i.e. Librarians, Research Administrators, ICT Managers and Senior Researchers) who were interviewed on a range of issues about RDM. Institutional documents were also reviewed to corroborate the primary data and get a deeper understanding of the research problem. The study shows that while RDM is recognised at the institutional level as good research practice and integrity issue, the concept is tenuously understood in the local community. Unsurprisingly, however, there was a general appreciation and awareness of the need for RDM and the implications for such critical concerns as security, integrity, continuity and institutional reputation. The library is yet to take a strategic approach to RDM issues and there is clearly a dearth in RDM expertise within the library system. The study recommends that the library must be proactive in advocating and promoting RDM issues at UG, but first, the Librarians must take advantage of numerous existing opportunities to build their capacity.

Citations: 0

Updating the DCC Curation Lifecycle Model

International journal of digital curation

Pub Date : 2020-12-31 DOI: 10.2218/ijdc.v15i1.721

Sayeed Choudhury, Caihong Huang, C. Palmer

The DCC Curation Lifecycle Model has played a vital role in the field of data curation for over a decade. During that time, the scale and complexity of data have changed dramatically, along with the contexts of data production and use. This paper reports on a study examining factors impacting data curation practices and presents recommendations for updating the DCC Curation Lifecycle Model. The study was grounded in a review of other lifecycle models and informed by a site visit to the Digital Curation Centre and consultation with expert practitioners and researchers. Framed by contemporary conditions impacting the conduct of research and provision of data services, the analysis and proposed recommendations account for the prominence of machine-actionable data, the importance of machine learning for data processing and analytics, growth of integrated research workflows, and escalating concerns with fairness, accountability, and transparency of data and algorithms.

Citations: 3

Data Curator in the Middle: Curating Data for a Diverse Community of Stakeholders

International journal of digital curation

Pub Date : 2020-12-31 DOI: 10.2218/ijdc.v15i1.706

R. Geraghty

The Prevention and Early Intervention Research Initiative is an archiving project to preserve the data and reports that were generated by twelve years of philanthropic and state investment into prevention and early intervention approaches in the children and youth sector in Ireland and Northern Ireland. The investment resulted in an extensive collection of evaluation data and reports, which collectively provide an evidence base for continued investment into PEI programmes that are shown to be effective. In 2016, the Prevention and Early Intervention Research Initiative (PEI-RI) was established to preserve the outputs from these evaluations in the national data archives, as a publicly available evidence base. The political and social significance of this collection is manifest in the range of stakeholder groups that the project is engaging with, including the community and not-for-profit organisations that operated the PEI programmes, the research teams from academic institutions that evaluated these programmes, and representatives from government departments that co-funded many of these programmes with Atlantic.

This paper tells the story of the PEI-RI archiving project, describing the steps we’ve taken since 2016 to preserve and promote the PEI data. During the course of the project we realised that it would not be enough to provide access to the data alone, as "[g]enerating and collating the evidence is of no use if it never reaches the commissioners and professionals who need it" (What Works Network, 2014, pp. 6). In the second phase of our project we are creating a range of resources for practitioner and decision maker audiences which provide a pathway to the data using the archival infrastructure.

The project provides a case study of curating a digital collection that is intended for multiple stakeholders with different expectations of the archived material. The PEI-RI data curator is located in the middle of a triad of data creators, data consumers and data archives, and is tasked with balancing the interests, expectations and limitations of each.

Citations: 0

Building the Picture Behind a Dataset

International journal of digital curation

Pub Date : 2020-12-31 DOI: 10.2218/ijdc.v15i1.702

Frances Madden, J. Ashton, J. Cope

As part of the European Commission funded FREYA project The British Library wanted to explore the possibility of developing provenance information in datasets derived from the British Library’s collections, the data.bl.uk collection. Provenance information is defined in this context as ‘information relating to the origin, source and curation of the datasets’. Provenance information is also identified within the FAIR principles as an important aspect of being able to reuse and understand research datasets. According to the FAIR principles, the aim is to understand how to cite and acknowledge the dataset as well as understanding how the dataset was created and has been processed. There is also reference to the importance of this metadata being machine readable. By enhancing the metadata of these datasets with additional persistent identifiers and metadata a fuller picture of the datasets and their content could be understood. This also adds to the veracity and understanding the dataset by end users of data.bl.uk.

Citations: 0
