首页 > 最新文献

IASSIST quarterly最新文献

英文 中文
Mathematics, risk, and messy survey data 数学、风险和混乱的调查数据
Pub Date : 2020-12-18 DOI: 10.29173/iq979
Kristi Thompson, C. Sullivan
Research funder mandates, such as those from the U.S. National Science Foundation (2011), the Canadian Tri-Agency (draft, 2018), and the UK Economic and Social Research Council (2018) now often include requirements for data curation, including where possible data sharing in an approved archive. Data curators need to be prepared for the potential that researchers who have not previously shared data will need assistance with cleaning and depositing datasets so that they can meet these requirements and maintain funding. Data de-identification or anonymization is a major ethical concern in cases where survey data is to be shared, and one which data professionals may find themselves ill-equipped to deal with. This article is intended to provide an accessible and practical introduction to the theory and concepts behind data anonymization and risk assessment, will describe a couple of case studies that demonstrate how these methods were carried out on actual datasets requiring anonymization, and discuss some of the difficulties encountered. Much of the literature dealing with statistical risk assessment of anonymized data is abstract and aimed at computer scientists and mathematicians, while material aimed at practitioners often does not consider more recent developments in the theory of data anonymization. We hope that this article will help bridge this gap.
研究资助者的授权,如美国国家科学基金会(2011年)、加拿大三机构(草案,2018年)和英国经济和社会研究委员会(2018年)的授权,现在通常包括数据管理的要求,包括在可能的情况下在经批准的档案中共享数据。数据管理者需要做好准备,以应对之前没有共享数据的研究人员可能需要帮助清理和存放数据集,从而满足这些要求并保持资金。在共享调查数据的情况下,数据去识别或匿名化是一个主要的道德问题,而数据专业人员可能会发现自己没有能力处理这些问题。本文旨在对数据匿名化和风险评估背后的理论和概念进行简单实用的介绍,并将描述几个案例研究,展示这些方法是如何在需要匿名化的实际数据集上进行的,并讨论遇到的一些困难。许多关于匿名数据统计风险评估的文献都是抽象的,针对的是计算机科学家和数学家,而针对从业者的材料往往没有考虑数据匿名化理论的最新发展。我们希望这篇文章将有助于弥合这一差距。
{"title":"Mathematics, risk, and messy survey data","authors":"Kristi Thompson, C. Sullivan","doi":"10.29173/iq979","DOIUrl":"https://doi.org/10.29173/iq979","url":null,"abstract":"Research funder mandates, such as those from the U.S. National Science Foundation (2011), the Canadian Tri-Agency (draft, 2018), and the UK Economic and Social Research Council (2018) now often include requirements for data curation, including where possible data sharing in an approved archive. Data curators need to be prepared for the potential that researchers who have not previously shared data will need assistance with cleaning and depositing datasets so that they can meet these requirements and maintain funding. Data de-identification or anonymization is a major ethical concern in cases where survey data is to be shared, and one which data professionals may find themselves ill-equipped to deal with. This article is intended to provide an accessible and practical introduction to the theory and concepts behind data anonymization and risk assessment, will describe a couple of case studies that demonstrate how these methods were carried out on actual datasets requiring anonymization, and discuss some of the difficulties encountered. Much of the literature dealing with statistical risk assessment of anonymized data is abstract and aimed at computer scientists and mathematicians, while material aimed at practitioners often does not consider more recent developments in the theory of data anonymization. We hope that this article will help bridge this gap.","PeriodicalId":84870,"journal":{"name":"IASSIST quarterly","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47316922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sustainability through the liaison with data archive users 通过与数据档案用户的联系实现可持续性
Pub Date : 2020-12-18 DOI: 10.29173/iq976
Michaela Kudrnáčová, Ilona Trtíková
As a social science data archive, we focus on collecting research data and archiving it. However, there are more responsibilities that come with data archiving: cooperation on international social surveys (ISSP, ESS), supporting secondary data analysis and much more. Significant part of our work is to communicate with students and researchers, to educate them about data management and data analysis. Although the relationship we have is functional and seems sufficient, we tend to ask ourselves: who are the data archive users and what do they expect from us?We decided to employ user-centered design methods and tools to define a typical user of our services and to find out what their motivations for using our data archive are and what specific functions they use and (do not) appreciate, so we would have a better image of their needs. Moreover, we wondered about the role of open science and its impact on the users’ needs and future requirements arising from the open science environment. Obtained information is a point of departure for redesigning archival services to satisfy new demands our users have regarding more data resources, new techniques of scientific work and better interconnection between different platforms.
作为一个社会科学数据档案馆,我们专注于收集研究数据并将其归档。然而,数据归档带来了更多的责任:国际社会调查(ISSP、ESS)合作、支持二次数据分析等。我们工作的重要部分是与学生和研究人员交流,教育他们有关数据管理和数据分析的知识。尽管我们之间的关系是功能性的,而且似乎足够,但我们倾向于问自己:谁是数据归档用户,他们对我们有什么期望?我们决定采用以用户为中心的设计方法和工具来定义我们服务的典型用户,并了解他们使用我们数据档案的动机,以及他们使用和(不)欣赏的特定功能,这样我们就能更好地了解他们的需求。此外,我们想知道开放科学的作用及其对用户需求和未来需求的影响,这些需求和需求是由开放科学环境引起的。获得的信息是重新设计档案服务的出发点,以满足用户对更多数据资源、科学工作的新技术以及不同平台之间更好的互连的新需求。
{"title":"Sustainability through the liaison with data archive users","authors":"Michaela Kudrnáčová, Ilona Trtíková","doi":"10.29173/iq976","DOIUrl":"https://doi.org/10.29173/iq976","url":null,"abstract":"As a social science data archive, we focus on collecting research data and archiving it. However, there are more responsibilities that come with data archiving: cooperation on international social surveys (ISSP, ESS), supporting secondary data analysis and much more. Significant part of our work is to communicate with students and researchers, to educate them about data management and data analysis. Although the relationship we have is functional and seems sufficient, we tend to ask ourselves: who are the data archive users and what do they expect from us?\u0000We decided to employ user-centered design methods and tools to define a typical user of our services and to find out what their motivations for using our data archive are and what specific functions they use and (do not) appreciate, so we would have a better image of their needs. Moreover, we wondered about the role of open science and its impact on the users’ needs and future requirements arising from the open science environment. Obtained information is a point of departure for redesigning archival services to satisfy new demands our users have regarding more data resources, new techniques of scientific work and better interconnection between different platforms.","PeriodicalId":84870,"journal":{"name":"IASSIST quarterly","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47028009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Capturing their “first” dataset: A graduate course to walk PhD students through the curation of their dissertation data 获取他们的“第一个”数据集:一门研究生课程,带领博士生完成论文数据的管理
Pub Date : 2020-09-23 DOI: 10.29173/iq971
Megan Sapp Nelson, N. Kong
The data set accompanying theses is a valuable intellectual property asset, both from the viewpoint of the PhD student, who can procure employment and build publications and research grants from the work for years to come, and the university, which owns the data and has invested in the work. However, the data set has generally not been captured as a finished product in a similar manner to the published thesis. A course has been developed which walks PhD students through the process of identifying an archival data set, selecting a repository or long term storage location, creating metadata and documentation for the data package, and the deposit process. A pre- and post assessment has been designed to ascertain the level of data literacy the students gain through curating their own dataset. PIs for the projects have input into the repositories and metadata standards selected.  The university thesis office was consulted as the course was developed, so that accurate procedures and practices are reflected throughout the course. This first of a kind class is open to students of any discipline at a Research-1 university. The resulting mixture of data types creates a unique course every time it is offered.
论文附带的数据集是一项宝贵的知识产权资产,无论是从博士生的角度来看,他们可以在未来几年找到工作,并从工作中获得出版物和研究资助,还是从拥有数据并投资于工作的大学的角度来看。然而,数据集通常没有以与发表论文类似的方式作为成品捕获。已经开发了一门课程,指导博士生完成识别档案数据集、选择存储库或长期存储位置、为数据包创建元数据和文档以及存放过程的过程。设计了一个前后评估,以确定学生通过整理自己的数据集获得的数据素养水平。项目的pi已经输入到所选择的存储库和元数据标准中。在课程开发过程中,咨询了大学论文办公室,以便在整个课程中反映准确的程序和实践。这是研究型大学所有学科的学生都可以参加的第一门课程。所产生的混合数据类型每次提供时都会创建一个独特的课程。
{"title":"Capturing their “first” dataset: A graduate course to walk PhD students through the curation of their dissertation data","authors":"Megan Sapp Nelson, N. Kong","doi":"10.29173/iq971","DOIUrl":"https://doi.org/10.29173/iq971","url":null,"abstract":"The data set accompanying theses is a valuable intellectual property asset, both from the viewpoint of the PhD student, who can procure employment and build publications and research grants from the work for years to come, and the university, which owns the data and has invested in the work. However, the data set has generally not been captured as a finished product in a similar manner to the published thesis. A course has been developed which walks PhD students through the process of identifying an archival data set, selecting a repository or long term storage location, creating metadata and documentation for the data package, and the deposit process. A pre- and post assessment has been designed to ascertain the level of data literacy the students gain through curating their own dataset. PIs for the projects have input into the repositories and metadata standards selected.  The university thesis office was consulted as the course was developed, so that accurate procedures and practices are reflected throughout the course. This first of a kind class is open to students of any discipline at a Research-1 university. The resulting mixture of data types creates a unique course every time it is offered.","PeriodicalId":84870,"journal":{"name":"IASSIST quarterly","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45324013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The matter of meta in research data management: Introducing the CESSDA Metadata Office Project 研究数据管理中的元数据问题——CESSDA元数据办公室项目简介
Pub Date : 2020-09-23 DOI: 10.29173/iq970
André Förster, Kerrin Borschewski, S. Bolton, Taina Jääskeläinen
Accompanying the growing importance of research data management, the provision and maintenance of metadata – understood as data about (research) data – have obtained a key role in contextualizing, understanding, and preserving research data. Acknowledging the importance of metadata in the social sciences, the Consortium of European Social Science Data Archives started the Metadata Office project in 2019. This project report presents the various activities of the Metadata Office (MDO). Metadata models, schema, controlled vocabularies and thesauri are covered, including the MDO’s collaboration with the DDI Alliance on multilingual translations of DDI vocabularies for CESSDA Service Providers. The report also summarizes the communication, training and advice provided by MDO, including DDI use across CESSDA, illustrates the impact of the project for the social science and research data management community, and offers an outline regarding future plans of the project.
随着研究数据管理的重要性日益增加,元数据的提供和维护——被理解为关于(研究)数据的数据——在背景化、理解和保存研究数据方面发挥了关键作用。认识到元数据在社会科学中的重要性,欧洲社会科学数据档案联盟于2019年启动了元数据办公室项目。本项目报告介绍元数据办公室(MDO)的各项活动。其中涵盖了元数据模型、模式、受控词汇表和辞典,包括MDO与DDI联盟在为CESSDA服务提供商翻译DDI词汇表的多语言合作。该报告还总结了MDO提供的沟通、培训和建议,包括DDI在整个CESSDA中的使用,说明了该项目对社会科学和研究数据管理界的影响,并提供了关于该项目未来计划的概述。
{"title":"The matter of meta in research data management: Introducing the CESSDA Metadata Office Project","authors":"André Förster, Kerrin Borschewski, S. Bolton, Taina Jääskeläinen","doi":"10.29173/iq970","DOIUrl":"https://doi.org/10.29173/iq970","url":null,"abstract":"Accompanying the growing importance of research data management, the provision and maintenance of metadata – understood as data about (research) data – have obtained a key role in contextualizing, understanding, and preserving research data. Acknowledging the importance of metadata in the social sciences, the Consortium of European Social Science Data Archives started the Metadata Office project in 2019. This project report presents the various activities of the Metadata Office (MDO). Metadata models, schema, controlled vocabularies and thesauri are covered, including the MDO’s collaboration with the DDI Alliance on multilingual translations of DDI vocabularies for CESSDA Service Providers. The report also summarizes the communication, training and advice provided by MDO, including DDI use across CESSDA, illustrates the impact of the project for the social science and research data management community, and offers an outline regarding future plans of the project.","PeriodicalId":84870,"journal":{"name":"IASSIST quarterly","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42000753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Provenance metadata for statistical data: An introduction to Structured Data Transformation Language (SDTL) 统计数据的源元数据:结构化数据转换语言(SDTL)简介
Pub Date : 2020-07-06 DOI: 10.29173/iq983
George Alter, Darrell Donakowski, J. Gager, P. Heus, Carson Hunter, Sanda Ionescu, J. Iverson, H. Jagadish, C. Lagoze, Jared Lyle, Alexander Mueller, Sigbjørn Revheim, M. Richardson, Ørnulf Risnes, Karunakara Seelam, Dan J. Smith, T. Smith, Jie Song, Y. Vaidya, Ole Voldsater
Structured Data Transformation Language (SDTL) provides structured, machine actionable representations of data transformation commands found in statistical analysis software.   The Continuous Capture of Metadata for Statistical Data Project (C2Metadata) created SDTL as part of an automated system that captures provenance metadata from data transformation scripts and adds variable derivations to standard metadata files.  SDTL also has potential for auditing scripts and for translating scripts between languages.  SDTL is expressed in a set of JSON schemas, which are machine actionable and easily serialized to other formats.  Statistical software languages have a number of special features that have been carried into SDTL.  We explain how SDTL handles differences among statistical languages and complex operations, such as merging files and reshaping data tables from “wide” to “long”. 
结构化数据转换语言(SDTL)为统计分析软件中的数据转换命令提供结构化的、机器可操作的表示。统计数据元数据持续捕获项目(C2Metadata)创建了SDTL,作为自动化系统的一部分,该系统从数据转换脚本中捕获出处元数据,并将变量派生添加到标准元数据文件中。SDTL还具有审计脚本和在语言之间翻译脚本的潜力。SDTL用一组JSON模式表示,这些模式可在机器上操作,并且很容易序列化为其他格式。统计软件语言具有许多特殊功能,这些功能已被纳入SDTL中。我们解释了SDTL如何处理统计语言和复杂操作之间的差异,例如合并文件和将数据表从“宽”改为“长”。
{"title":"Provenance metadata for statistical data: An introduction to Structured Data Transformation Language (SDTL)","authors":"George Alter, Darrell Donakowski, J. Gager, P. Heus, Carson Hunter, Sanda Ionescu, J. Iverson, H. Jagadish, C. Lagoze, Jared Lyle, Alexander Mueller, Sigbjørn Revheim, M. Richardson, Ørnulf Risnes, Karunakara Seelam, Dan J. Smith, T. Smith, Jie Song, Y. Vaidya, Ole Voldsater","doi":"10.29173/iq983","DOIUrl":"https://doi.org/10.29173/iq983","url":null,"abstract":"Structured Data Transformation Language (SDTL) provides structured, machine actionable representations of data transformation commands found in statistical analysis software.   The Continuous Capture of Metadata for Statistical Data Project (C2Metadata) created SDTL as part of an automated system that captures provenance metadata from data transformation scripts and adds variable derivations to standard metadata files.  SDTL also has potential for auditing scripts and for translating scripts between languages.  SDTL is expressed in a set of JSON schemas, which are machine actionable and easily serialized to other formats.  Statistical software languages have a number of special features that have been carried into SDTL.  We explain how SDTL handles differences among statistical languages and complex operations, such as merging files and reshaping data tables from “wide” to “long”. ","PeriodicalId":84870,"journal":{"name":"IASSIST quarterly","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47288015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Advocating for reproducibility 提倡可重复性
Pub Date : 2020-07-02 DOI: 10.29173/iq982
H. Dekker, Amy Riegelman
As guest editors, we are excited to publish this special double issue of IASSIST Quarterly. The topics of reproducibility, replicability, and transparency have been addressed in past issues of IASSIST Quarterly and at the IASSIST conference, but this double issue is entirely focused on these issues. In recent years, efforts “to improve the credibility of science by advancing transparency, reproducibility, rigor, and ethics in research” have gained momentum in the social sciences (Center for Effective Global Action, 2020). While few question the spirit of the reproducibility and research transparency movement, it faces significant challenges because it goes against the grain of established practice. We believe the data services community is in a unique position to help advance this movement given our data and technical expertise, training and consulting work, international scope, and established role in data management and preservation, and more. As evidence of the movement, several initiatives exist to support research reproducibility infrastructure and data preservation efforts: Center for Open Science (COS) / Open Science Framework (OSF)[i] Berkeley Initiative for Transparency in the Social Sciences (BITSS)[ii] CUrating for REproducibility (CURE)[iii] Project Tier[iv] Data Curation Network[v] UK Reproducibility Network[vi] While many new initiatives have launched in recent years, prior to the now commonly used phrase “reproducibility crisis” and Ioannidis publishing the essay, “Why Most Published Research Findings are False,” we know that the data services community was supporting reproducibility in a variety of ways (e.g., data management, data preservation, metadata standards) in wellestablished consortiums such as Inter-university Consortium for Political and Social Research (ICPSR) (Ioannidis, 2005). The articles in this issue comprise several very important aspects of reproducible research: Identification of barriers to reproducibility and solutions to such barriers Evidence synthesis as related to transparent reporting and reproducibility Reflection on how information professionals, researchers, and librarians perceive the reproducibility crisis and how they can partner to help solve it. The issue begins with “Reproducibility literature analysis” which looks at existing resources and literature to identify barriers to reproducibility and potential solutions. The authors have compiled a comprehensive list of resources with annotations that include definitions of key concepts pertinent to the reproducibility crisis. The next article addresses data reuse from the perspective of a large research university. The authors examine instances of both successful and failed data reuse instances and identify best practices for librarians interested in conducting research involving the common forms of data collected in an academic library. Systematic reviews are a research approach that involves the quantitative and/or qualitati
作为客座编辑,我们很高兴能出版《国际会计准则与统计系统季刊》的双月刊特刊。可复制性、可复制性和透明度的主题在过去的《国际会计准则和统计系统季刊》和国际会计准则与统计系统会议上都有讨论,但这一双重问题完全集中在这些问题上。近年来,“通过提高研究的透明度、再现性、严谨性和伦理性来提高科学的可信度”的努力在社会科学中获得了势头(有效全球行动中心,2020)。虽然很少有人质疑再现性和研究透明度运动的精神,但它面临着重大挑战,因为它违背了既定的惯例。我们相信,鉴于我们的数据和技术专业知识、培训和咨询工作、国际范围以及在数据管理和保存方面的既定作用等,数据服务界在帮助推动这一运动方面处于独特的地位。作为运动的证据,有几个倡议支持研究再现性基础设施和数据保存工作:开放科学中心(COS)/开放科学框架(OSF)[i]伯克利社会科学透明度倡议(BITSS)[ii]可再生产性CUrating在现在常用的短语“再现性危机”和Ioannidis发表这篇文章之前,“为什么大多数已发表的研究结果都是错误的”,我们知道数据服务社区在建立良好的联盟中以各种方式支持再现性(例如,数据管理、数据保存、元数据标准),如大学间政治和社会研究联盟(ICPSR)(Ioannidis,2005)。本期文章包括可重复性研究的几个非常重要的方面:识别可重复性的障碍和这些障碍的解决方案与透明报告和可重复性相关的证据综合思考信息专业人员、研究人员,图书馆员了解再现性危机,以及他们如何合作帮助解决危机。问题始于“再现性文献分析”,该分析着眼于现有资源和文献,以确定再现性的障碍和潜在的解决方案。作者编制了一份全面的资源清单,并附有注释,其中包括与再现性危机相关的关键概念的定义。下一篇文章从一所大型研究型大学的角度讨论数据重用。作者检查了成功和失败的数据重用实例,并为有兴趣进行涉及学术图书馆收集的常见数据形式的研究的图书馆员确定了最佳实践。系统综述是一种研究方法,涉及对通过综合文献综述收集的数据进行定量和/或定性综合。“支持读者对心理学系统综述信心的方法报告”着眼于心理学系统综述中报告的电子文献搜索的可再现性。再现或复制计算结果的一个根本挑战是研究人员需要提供用于产生这些结果的代码。但是,共享代码并让它为另一个用户正确运行可能会带来重大的技术挑战。在“Reprozip,Reproserver的可复制性、保存和研究访问”中,作者描述了他们正在开发的开源软件,以应对这些挑战。取一篇发表的文章并试图复制结果,这是一种有时在学术课程中用来强调过程中固有困难的练习。本期的最后一篇文章《ReprohackNL 2019:图书馆如何通过社区参与促进研究再现性》描述了这项工作的一种基于图书馆的创新变体。Harrison Dekker,罗德岛大学数据馆员Amy Riegelman,明尼苏达大学有效全球行动参考中心社会科学馆员(2020),关于伯克利社会科学透明度倡议。网址:https://www.bitss.org/about(2020年6月23日查阅)。Ioannidis,J.P.(2005)“为什么大多数发表的研究结果都是假的”,《公共科学图书馆·医学》,2(8),第e124页。doi:https://doi.org/10.1371/journal.pmed.0020124[i]https://osf.io[ii]https://www.bitss.org/[ii]http://cure.web.unc.edu[iv]https://www.projecttier.org/[v]https://datacurationnetwork.org/[vi]https://ukrn.org
{"title":"Advocating for reproducibility","authors":"H. Dekker, Amy Riegelman","doi":"10.29173/iq982","DOIUrl":"https://doi.org/10.29173/iq982","url":null,"abstract":"As guest editors, we are excited to publish this special double issue of IASSIST Quarterly. The topics of reproducibility, replicability, and transparency have been addressed in past issues of IASSIST Quarterly and at the IASSIST conference, but this double issue is entirely focused on these issues. \u0000In recent years, efforts “to improve the credibility of science by advancing transparency, reproducibility, rigor, and ethics in research” have gained momentum in the social sciences (Center for Effective Global Action, 2020). While few question the spirit of the reproducibility and research transparency movement, it faces significant challenges because it goes against the grain of established practice. \u0000We believe the data services community is in a unique position to help advance this movement given our data and technical expertise, training and consulting work, international scope, and established role in data management and preservation, and more. As evidence of the movement, several initiatives exist to support research reproducibility infrastructure and data preservation efforts: \u0000 \u0000Center for Open Science (COS) / Open Science Framework (OSF)[i] \u0000Berkeley Initiative for Transparency in the Social Sciences (BITSS)[ii] \u0000CUrating for REproducibility (CURE)[iii] \u0000Project Tier[iv] \u0000Data Curation Network[v] \u0000UK Reproducibility Network[vi] \u0000 \u0000While many new initiatives have launched in recent years, prior to the now commonly used phrase “reproducibility crisis” and Ioannidis publishing the essay, “Why Most Published Research Findings are False,” we know that the data services community was supporting reproducibility in a variety of ways (e.g., data management, data preservation, metadata standards) in wellestablished consortiums such as Inter-university Consortium for Political and Social Research (ICPSR) (Ioannidis, 2005). \u0000The articles in this issue comprise several very important aspects of reproducible research: \u0000 \u0000Identification of barriers to reproducibility and solutions to such barriers \u0000Evidence synthesis as related to transparent reporting and reproducibility \u0000Reflection on how information professionals, researchers, and librarians perceive the reproducibility crisis and how they can partner to help solve it. \u0000 \u0000The issue begins with “Reproducibility literature analysis” which looks at existing resources and literature to identify barriers to reproducibility and potential solutions. The authors have compiled a comprehensive list of resources with annotations that include definitions of key concepts pertinent to the reproducibility crisis. \u0000The next article addresses data reuse from the perspective of a large research university. The authors examine instances of both successful and failed data reuse instances and identify best practices for librarians interested in conducting research involving the common forms of data collected in an academic library. \u0000Systematic reviews are a research approach that involves the quantitative and/or qualitati","PeriodicalId":84870,"journal":{"name":"IASSIST quarterly","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43698988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reproducibility literature analysis - a federal information professional perspective 再现性文献分析——联邦信息专业的视角
Pub Date : 2020-06-29 DOI: 10.29173/iq967
E. Antognoli, R. Avila, J. Sears, L. Christiansen, J. Tieman, Jacquelyn Hart
This article examines a cross-section of literature and other resources to reveal common reproducibility issues faced by stakeholders regardless of subject area or focus. We identify a variety of issues named as reproducibility barriers, the solutions to such barriers, and reflect on how researchers and information professionals can act to address the ‘reproducibility crisis.’ The finished products of this work include an annotated list of 122 published resources and a primer that identifies and defines key concepts from the resources that contribute to the crisis.
本文考察了文献和其他资源的横截面,以揭示利益相关者面临的常见再现性问题,无论主题领域或重点如何。我们确定了被称为再现性障碍的各种问题,这些障碍的解决方案,并反思了研究人员和信息专业人员如何采取行动来解决“再现性危机”这项工作的成品包括一份由122个已发表资源组成的注释列表,以及一本初级读本,用于识别和定义导致危机的资源中的关键概念。
{"title":"Reproducibility literature analysis - a federal information professional perspective","authors":"E. Antognoli, R. Avila, J. Sears, L. Christiansen, J. Tieman, Jacquelyn Hart","doi":"10.29173/iq967","DOIUrl":"https://doi.org/10.29173/iq967","url":null,"abstract":"This article examines a cross-section of literature and other resources to reveal common reproducibility issues faced by stakeholders regardless of subject area or focus. We identify a variety of issues named as reproducibility barriers, the solutions to such barriers, and reflect on how researchers and information professionals can act to address the ‘reproducibility crisis.’ The finished products of this work include an annotated list of 122 published resources and a primer that identifies and defines key concepts from the resources that contribute to the crisis.","PeriodicalId":84870,"journal":{"name":"IASSIST quarterly","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46110142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Learning from data reuse: successful and failed experiences in a large public research university library 从数据复用中学习:大型公立研究型大学图书馆的成功与失败经验
Pub Date : 2020-06-29 DOI: 10.29173/iq966
J. Scoulas
This paper illustrates a large research university library experience in reusing the data for research collected both within and outside of the library to demonstrate data reuse practice. The purpose of the paper is to 1) demonstrate when and how data are reused in a large public research university library, 2) share tips on what to consider when reusing data, and 3) share challenges and lessons learned from data reuse experiences. This paper presents five proposed opportunities for data reuse conducted by three researchers at the institution’s library which resulted in three successful instances of data reuses and two failed data reuses. Learning from successful and failed experiences is critical to understand what works and what does not work in order to identify best practices for data reuse. This paper will be helpful for librarians who intend to reuse data for publication.
本文展示了大型研究型大学图书馆在重用图书馆内外收集的研究数据方面的经验,以展示数据重用实践。本文的目的是:1)演示大型公立研究型大学图书馆何时以及如何重用数据,2)分享重用数据时应考虑的事项,3)分享从数据重用经验中汲取的挑战和教训。本文介绍了由该机构图书馆的三名研究人员提出的五个数据重用机会,这些机会导致了三次成功的数据重用和两次失败的数据重用。从成功和失败的经验中学习对于理解什么有效,什么无效至关重要,以便确定数据重用的最佳实践。这篇论文将有助于图书馆员谁打算重用数据出版。
{"title":"Learning from data reuse: successful and failed experiences in a large public research university library","authors":"J. Scoulas","doi":"10.29173/iq966","DOIUrl":"https://doi.org/10.29173/iq966","url":null,"abstract":"This paper illustrates a large research university library experience in reusing the data for research collected both within and outside of the library to demonstrate data reuse practice. The purpose of the paper is to 1) demonstrate when and how data are reused in a large public research university library, 2) share tips on what to consider when reusing data, and 3) share challenges and lessons learned from data reuse experiences. This paper presents five proposed opportunities for data reuse conducted by three researchers at the institution’s library which resulted in three successful instances of data reuses and two failed data reuses. Learning from successful and failed experiences is critical to understand what works and what does not work in order to identify best practices for data reuse. This paper will be helpful for librarians who intend to reuse data for publication.","PeriodicalId":84870,"journal":{"name":"IASSIST quarterly","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48328845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Methods reporting that supports reader confidence for systematic reviews in psychology: assessing the reproducibility of electronic searches and first-level screening decisions. 支持读者对心理学系统综述信心的方法报告:评估电子搜索的可重复性和一级筛选决策。
Pub Date : 2020-06-29 DOI: 10.29173/iq968
P. Fehrmann, M. Mamolen
Recent discussions and research in psychology show a significant emphasis on reproducibility. Concerns for reproducibility pertain to methods as well as results. We evaluated the reporting of the electronic search methods used for systematic reviews (SR) published in psychology. Such reports are key for determining the reproducibility of electronic searches. The use of SR has been increasing in psychology, and we report on the status of reporting of electronic searches in recent SR in psychology. We used 12 checklist items to evaluate reporting for basic electronic strategies. Kappa results for those items developed from evidence-based recommendations ranged from fair to almost perfect. Additionally, using a set of those items to represent a “PRISMA” type of recommended reporting showed that only one of the 25 randomly selected psychology SR from 2009-2012 reported recommended information for all items in the set, and none of the 25 psychology SR from 2014-2016 did so. Using a second less stringent set of items found that only 36% of the psychology SR reported basic information that supports confidence in the reproducibility of electronic searches. Similar results were found for a set of psychology SR published in 2017. An area for improvements in SR in psychology involves fuller and clearer reporting of the steps used for electronic searches in SR. Such improvements will provide a strong basis for confidence in the reproducibility of searches. That confidence, in turn, can strengthen reader confidence more generally in the results and conclusions reached in SR in psychology.
最近在心理学上的讨论和研究显示了对再现性的重视。对再现性的关注涉及到方法和结果。我们评估了发表在心理学杂志上的用于系统评价(SR)的电子检索方法的报道。这些报告是确定电子搜索重现性的关键。在心理学中,电子检索的使用越来越多,我们报告了最近心理学中电子检索的报告状况。我们使用了12个清单项目来评估报告的基本电子策略。Kappa从基于证据的建议中得出的这些项目的结果从一般到几乎完美不等。此外,使用一组这些项目来代表“PRISMA”类型的推荐报告表明,2009-2012年随机选择的25个心理学SR中只有一个报告了集合中所有项目的推荐信息,2014-2016年的25个心理学SR都没有这样做。使用第二组不那么严格的项目发现,只有36%的心理学SR报告了支持电子搜索可重复性的基本信息。2017年发表的一组心理学SR也发现了类似的结果。心理学中社会检索的一个改进领域是更全面、更清晰地报告社会检索中电子检索所使用的步骤。这种改进将为对检索的可重复性的信心提供坚实的基础。这种信心,反过来,可以增强读者对心理学中SR的结果和结论的信心。
{"title":"Methods reporting that supports reader confidence for systematic reviews in psychology: assessing the reproducibility of electronic searches and first-level screening decisions.","authors":"P. Fehrmann, M. Mamolen","doi":"10.29173/iq968","DOIUrl":"https://doi.org/10.29173/iq968","url":null,"abstract":"Recent discussions and research in psychology show a significant emphasis on reproducibility. Concerns for reproducibility pertain to methods as well as results. We evaluated the reporting of the electronic search methods used for systematic reviews (SR) published in psychology. Such reports are key for determining the reproducibility of electronic searches. The use of SR has been increasing in psychology, and we report on the status of reporting of electronic searches in recent SR in psychology. \u0000We used 12 checklist items to evaluate reporting for basic electronic strategies. Kappa results for those items developed from evidence-based recommendations ranged from fair to almost perfect. Additionally, using a set of those items to represent a “PRISMA” type of recommended reporting showed that only one of the 25 randomly selected psychology SR from 2009-2012 reported recommended information for all items in the set, and none of the 25 psychology SR from 2014-2016 did so. Using a second less stringent set of items found that only 36% of the psychology SR reported basic information that supports confidence in the reproducibility of electronic searches. Similar results were found for a set of psychology SR published in 2017. \u0000An area for improvements in SR in psychology involves fuller and clearer reporting of the steps used for electronic searches in SR. Such improvements will provide a strong basis for confidence in the reproducibility of searches. That confidence, in turn, can strengthen reader confidence more generally in the results and conclusions reached in SR in psychology.","PeriodicalId":84870,"journal":{"name":"IASSIST quarterly","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47970993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ReprohackNL 2019: how libraries can promote research reproducibility through community engagement ReprohackNL 2019:图书馆如何通过社区参与促进研究再现性
Pub Date : 2020-04-28 DOI: 10.31235/osf.io/6f4zv
K. Hettne, Ricarda K. K. Proppert, L. Nab, L. P. R. Saunero, Daniela Gawehns
University Libraries play a crucial role in moving towards Open Science, contributing to more transparent, reproducible and reusable research. The Center for Digital Scholarship (CDS) at Leiden University (LU) library is a scholarly lab that promotes open science literacy among Leiden’s scholars by two complementary strategies: existing top-down structures are used to provide training and services, while bottom-up initiatives from the research community are actively supported by offering the CDS’s expertise and facilities. An example of how bottom-up initiatives can blossom with the help of library structures such as the CDS is ReproHack. ReproHack – a reproducibility hackathon – is a grass-root initiative by young scholars with the goal of improving research reproducibility in three ways. First, hackathon attendees learn about reproducibility tools and challenges by reproducing published results and providing feedback to authors on their attempt. Second, authors can nominate their work and receive feedback on their reproducibility efforts. Third, the collaborative atmosphere helps building a community interested in making their own research reproducible. A first ReproHack in the Netherlands took place on November 30th, 2019, co-organised by the CDS at the LU Library with 44 participants from the fields of psychology, engineering, biomedicine, and computer science. For 19 papers, 24 feedback forms were returned and five papers were reported as successfully reproduced. Besides the researchers’ learning experience, the event led to recommendations on how to enhance research reproducibility. The ReproHack format therefore provides an opportunity for libraries to improve scientific reproducibility through community engagement.
大学图书馆在迈向开放科学方面发挥着至关重要的作用,为更透明、可复制和可重复使用的研究做出了贡献。莱顿大学图书馆的数字奖学金中心(CDS)是一个学术实验室,通过两种互补的策略促进莱顿学者的开放科学素养:现有的自上而下的结构用于提供培训和服务,而研究界自下而上的举措则通过提供CDS的专业知识和设施得到积极支持。ReproHack是一个自下而上的举措如何在CDS等图书馆结构的帮助下蓬勃发展的例子。ReproHack是一场再现性黑客马拉松,是年轻学者发起的一项草根倡议,目的是通过三种方式提高研究的再现性。首先,黑客马拉松参与者通过复制已发表的结果并向作者提供他们尝试的反馈来了解再现性工具和挑战。其次,作者可以提名他们的作品,并收到关于他们再现性努力的反馈。第三,合作氛围有助于建立一个有兴趣让自己的研究重现的社区。荷兰的第一次ReproHack于2019年11月30日在伦敦大学图书馆由CDS联合组织,来自心理学、工程、生物医学和计算机科学领域的44名参与者参加了会议。对于19篇论文,退回了24份反馈表,据报告成功转载了5篇论文。除了研究人员的学习经验外,该活动还提出了如何提高研究再现性的建议。因此,ReproHack格式为图书馆通过社区参与提高科学再现性提供了机会。
{"title":"ReprohackNL 2019: how libraries can promote research reproducibility through community engagement","authors":"K. Hettne, Ricarda K. K. Proppert, L. Nab, L. P. R. Saunero, Daniela Gawehns","doi":"10.31235/osf.io/6f4zv","DOIUrl":"https://doi.org/10.31235/osf.io/6f4zv","url":null,"abstract":"University Libraries play a crucial role in moving towards Open Science, contributing to more transparent, reproducible and reusable research. The Center for Digital Scholarship (CDS) at Leiden University (LU) library is a scholarly lab that promotes open science literacy among Leiden’s scholars by two complementary strategies: existing top-down structures are used to provide training and services, while bottom-up initiatives from the research community are actively supported by offering the CDS’s expertise and facilities. An example of how bottom-up initiatives can blossom with the help of library structures such as the CDS is ReproHack. ReproHack – a reproducibility hackathon – is a grass-root initiative by young scholars with the goal of improving research reproducibility in three ways. First, hackathon attendees learn about reproducibility tools and challenges by reproducing published results and providing feedback to authors on their attempt. Second, authors can nominate their work and receive feedback on their reproducibility efforts. Third, the collaborative atmosphere helps building a community interested in making their own research reproducible. \u0000A first ReproHack in the Netherlands took place on November 30th, 2019, co-organised by the CDS at the LU Library with 44 participants from the fields of psychology, engineering, biomedicine, and computer science. For 19 papers, 24 feedback forms were returned and five papers were reported as successfully reproduced. Besides the researchers’ learning experience, the event led to recommendations on how to enhance research reproducibility. The ReproHack format therefore provides an opportunity for libraries to improve scientific reproducibility through community engagement.","PeriodicalId":84870,"journal":{"name":"IASSIST quarterly","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48489831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
IASSIST quarterly
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1