Mandy Wigdorowitz, Barbara McGillivray, Marton Ribary
{"title":"数据论文如何为人文社会科学的开放研究做出独特贡献","authors":"Mandy Wigdorowitz, Barbara McGillivray, Marton Ribary","doi":"10.2218/eor.2023.8116","DOIUrl":null,"url":null,"abstract":"The open research movement and initiatives like the FAIR principles have been critical in establishing the importance of data in research, particularly within the sciences. Alongside the sciences, attention to openly available data in Humanities and Social Sciences (HSS) research has gradually grown. This growth is largely attributed to the increased availability of digital collections, the development of new data-intensive methods, an increasingly solid infrastructure, increased pressure from funders, the requirement of data management plans for preservation purposes, and the involvement of research libraries in data curation. In this context, attention to how data is produced, how it is openly and transparently shared, and how it can be reused has generated great interest, accompanied by an inevitable need for reputable data sharing outlets. One such outlet is the data paper – a peer-reviewed publication that focuses on describing a curated dataset. Data papers can be shared in traditional research journals as one subtype of article publication, or, more recently, in data journals which are dedicated to the publication of data papers. This presentation focuses on the work done by the open access Journal of Open Humanities (JOHD) in promoting the practice of publishing data papers with their accompanying open access datasets. JOHD was established with Ubiquity Press in 2015 to promote awareness, use, and reuse of humanities data. JOHD data papers promote the comprehensive description of how a dataset was assembled, where it may be accessed, and any crucial context including the research questions that framed the data gathering, including limitations to the original methods or scope of sources included. JOHD data papers suggest potential future reuses of data, which recent analytics seem to suggest has helped increase the visibility of datasets, and therefore their research impact (Marongiu et al., forthcoming; McGillivray et al., 2022). In addition, an overview of the three key elements (the “golden triangle”) that assess the impact of open research efforts as represented by different research outputs (datasets, data papers and research papers) will be presented, along with proposed initiatives for linking these. In doing so, we aim to (a) find a programmatic way to identify these links by extracting information from available metadata of datasets and verifying their accuracy, and (b) create a “ground truth” in a manual and/or machine-assisted way which would enable the training of more sophisticated NLP-based methods as a next step. We hope to illustrate the importance of including data papers into the research conversation given that they present a unique contribution to addressing global challenges within the open research arena.","PeriodicalId":244254,"journal":{"name":"Edinburgh Open Research","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"How Data Papers Present a Unique Contribution To Open Research In The Humanities And Social Sciences\",\"authors\":\"Mandy Wigdorowitz, Barbara McGillivray, Marton Ribary\",\"doi\":\"10.2218/eor.2023.8116\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The open research movement and initiatives like the FAIR principles have been critical in establishing the importance of data in research, particularly within the sciences. Alongside the sciences, attention to openly available data in Humanities and Social Sciences (HSS) research has gradually grown. This growth is largely attributed to the increased availability of digital collections, the development of new data-intensive methods, an increasingly solid infrastructure, increased pressure from funders, the requirement of data management plans for preservation purposes, and the involvement of research libraries in data curation. In this context, attention to how data is produced, how it is openly and transparently shared, and how it can be reused has generated great interest, accompanied by an inevitable need for reputable data sharing outlets. One such outlet is the data paper – a peer-reviewed publication that focuses on describing a curated dataset. Data papers can be shared in traditional research journals as one subtype of article publication, or, more recently, in data journals which are dedicated to the publication of data papers. This presentation focuses on the work done by the open access Journal of Open Humanities (JOHD) in promoting the practice of publishing data papers with their accompanying open access datasets. JOHD was established with Ubiquity Press in 2015 to promote awareness, use, and reuse of humanities data. JOHD data papers promote the comprehensive description of how a dataset was assembled, where it may be accessed, and any crucial context including the research questions that framed the data gathering, including limitations to the original methods or scope of sources included. JOHD data papers suggest potential future reuses of data, which recent analytics seem to suggest has helped increase the visibility of datasets, and therefore their research impact (Marongiu et al., forthcoming; McGillivray et al., 2022). In addition, an overview of the three key elements (the “golden triangle”) that assess the impact of open research efforts as represented by different research outputs (datasets, data papers and research papers) will be presented, along with proposed initiatives for linking these. In doing so, we aim to (a) find a programmatic way to identify these links by extracting information from available metadata of datasets and verifying their accuracy, and (b) create a “ground truth” in a manual and/or machine-assisted way which would enable the training of more sophisticated NLP-based methods as a next step. We hope to illustrate the importance of including data papers into the research conversation given that they present a unique contribution to addressing global challenges within the open research arena.\",\"PeriodicalId\":244254,\"journal\":{\"name\":\"Edinburgh Open Research\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Edinburgh Open Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2218/eor.2023.8116\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Edinburgh Open Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2218/eor.2023.8116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
开放研究运动和FAIR原则等倡议对于确立数据在研究中的重要性至关重要,尤其是在科学领域。除了科学之外,对人文社会科学(HSS)研究中公开可用数据的关注也在逐渐增长。这一增长在很大程度上归因于数字馆藏的可用性增加、新的数据密集型方法的发展、基础设施的日益稳固、来自资助者的压力增加、以保存为目的的数据管理计划的要求以及研究型图书馆参与数据管理。在这种背景下,对数据如何产生、如何公开透明地共享以及如何重用的关注引起了极大的兴趣,同时也不可避免地需要有信誉的数据共享渠道。其中一个渠道是数据论文——一种同行评审的出版物,专注于描述一个精心策划的数据集。数据论文可以作为文章发表的一个子类型在传统的研究期刊上共享,或者最近在专门发表数据论文的数据期刊上共享。本次演讲的重点是开放人文期刊(JOHD)在促进数据论文及其附带的开放获取数据集的出版实践方面所做的工作。JOHD与Ubiquity Press于2015年成立,旨在促进人文数据的认识、使用和再利用。JOHD数据论文促进对数据集如何组装、可访问的位置以及包括构成数据收集框架的研究问题在内的任何关键背景的全面描述,包括对原始方法或包括的来源范围的限制。JOHD的数据论文提出了数据的潜在未来重用,最近的分析似乎表明,这有助于提高数据集的可见性,从而提高它们的研究影响(Marongiu等人,即将出版;McGillivray et al., 2022)。此外,还将概述三个关键要素(“金三角”),以评估不同研究成果(数据集、数据论文和研究论文)所代表的开放研究工作的影响,并提出将这些因素联系起来的建议。在此过程中,我们的目标是(a)通过从数据集的可用元数据中提取信息并验证其准确性,找到一种可编程的方法来识别这些链接,以及(b)以手动和/或机器辅助的方式创建“基本事实”,这将使更复杂的基于nlp的方法的培训成为下一步。我们希望说明将数据论文纳入研究对话的重要性,因为它们为解决开放研究领域的全球挑战做出了独特的贡献。
How Data Papers Present a Unique Contribution To Open Research In The Humanities And Social Sciences
The open research movement and initiatives like the FAIR principles have been critical in establishing the importance of data in research, particularly within the sciences. Alongside the sciences, attention to openly available data in Humanities and Social Sciences (HSS) research has gradually grown. This growth is largely attributed to the increased availability of digital collections, the development of new data-intensive methods, an increasingly solid infrastructure, increased pressure from funders, the requirement of data management plans for preservation purposes, and the involvement of research libraries in data curation. In this context, attention to how data is produced, how it is openly and transparently shared, and how it can be reused has generated great interest, accompanied by an inevitable need for reputable data sharing outlets. One such outlet is the data paper – a peer-reviewed publication that focuses on describing a curated dataset. Data papers can be shared in traditional research journals as one subtype of article publication, or, more recently, in data journals which are dedicated to the publication of data papers. This presentation focuses on the work done by the open access Journal of Open Humanities (JOHD) in promoting the practice of publishing data papers with their accompanying open access datasets. JOHD was established with Ubiquity Press in 2015 to promote awareness, use, and reuse of humanities data. JOHD data papers promote the comprehensive description of how a dataset was assembled, where it may be accessed, and any crucial context including the research questions that framed the data gathering, including limitations to the original methods or scope of sources included. JOHD data papers suggest potential future reuses of data, which recent analytics seem to suggest has helped increase the visibility of datasets, and therefore their research impact (Marongiu et al., forthcoming; McGillivray et al., 2022). In addition, an overview of the three key elements (the “golden triangle”) that assess the impact of open research efforts as represented by different research outputs (datasets, data papers and research papers) will be presented, along with proposed initiatives for linking these. In doing so, we aim to (a) find a programmatic way to identify these links by extracting information from available metadata of datasets and verifying their accuracy, and (b) create a “ground truth” in a manual and/or machine-assisted way which would enable the training of more sophisticated NLP-based methods as a next step. We hope to illustrate the importance of including data papers into the research conversation given that they present a unique contribution to addressing global challenges within the open research arena.