首页 > 最新文献

arXiv - CS - Digital Libraries最新文献

英文 中文
Literature Filtering for Systematic Reviews with Transformers 用转换器过滤系统性综述文献
Pub Date : 2024-05-30 DOI: arxiv-2405.20354
John Hawkins, David Tivey
Identifying critical research within the growing body of academic work is anessential element of quality research. Systematic review processes, used inevidence-based medicine, formalise this as a procedure that must be followed ina research program. However, it comes with an increasing burden in terms of thetime required to identify the important articles of research for a given topic.In this work, we develop a method for building a general-purpose filteringsystem that matches a research question, posed as a natural languagedescription of the required content, against a candidate set of articlesobtained via the application of broad search terms. Our results demonstratethat transformer models, pre-trained on biomedical literature then fine tunedfor the specific task, offer a promising solution to this problem. The modelcan remove large volumes of irrelevant articles for most research questions.
在不断增长的学术成果中识别关键性研究是高质量研究的一个重要因素。以证据为基础的医学所采用的系统综述流程将此正式确定为研究计划必须遵循的程序。在这项工作中,我们开发了一种构建通用过滤系统的方法,该系统可以将研究问题(以自然语言描述的形式提出)与通过应用广泛的搜索条件获得的候选文章集进行匹配。我们的研究结果表明,在生物医学文献上预先训练、然后针对特定任务进行微调的转换器模型为这一问题提供了一个很有前景的解决方案。该模型可以为大多数研究问题去除大量不相关的文章。
{"title":"Literature Filtering for Systematic Reviews with Transformers","authors":"John Hawkins, David Tivey","doi":"arxiv-2405.20354","DOIUrl":"https://doi.org/arxiv-2405.20354","url":null,"abstract":"Identifying critical research within the growing body of academic work is an\u0000essential element of quality research. Systematic review processes, used in\u0000evidence-based medicine, formalise this as a procedure that must be followed in\u0000a research program. However, it comes with an increasing burden in terms of the\u0000time required to identify the important articles of research for a given topic.\u0000In this work, we develop a method for building a general-purpose filtering\u0000system that matches a research question, posed as a natural language\u0000description of the required content, against a candidate set of articles\u0000obtained via the application of broad search terms. Our results demonstrate\u0000that transformer models, pre-trained on biomedical literature then fine tuned\u0000for the specific task, offer a promising solution to this problem. The model\u0000can remove large volumes of irrelevant articles for most research questions.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"41 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141252532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of the papermilling behavior 检测造纸行为
Pub Date : 2024-05-30 DOI: arxiv-2405.19872
Igor Podlubny
Based on the analysis of the data obtainable from the Web of Sciencepublication and citation database, typical signs of possible papermillingbehavior are described, quantified, and illustrated by examples. A MATLABfunction is provided for the analysis of the outputs from the Web of Science. Anew quantitative indicator -- integrity index, or I-index -- is proposed forusing it along with standard bibliographic and scientometric indicators.
基于对从 Web of Science 出版物和引用数据库中获得的数据的分析,对可能的造纸行为的典型迹象进行了描述、量化,并通过实例进行了说明。提供了一个 MATLAB 函数,用于分析 Web of Science 的输出结果。提出了一个新的量化指标--完整性指数(或称 I 指数),以便与标准书目和科学计量指标一起使用。
{"title":"Detection of the papermilling behavior","authors":"Igor Podlubny","doi":"arxiv-2405.19872","DOIUrl":"https://doi.org/arxiv-2405.19872","url":null,"abstract":"Based on the analysis of the data obtainable from the Web of Science\u0000publication and citation database, typical signs of possible papermilling\u0000behavior are described, quantified, and illustrated by examples. A MATLAB\u0000function is provided for the analysis of the outputs from the Web of Science. A\u0000new quantitative indicator -- integrity index, or I-index -- is proposed for\u0000using it along with standard bibliographic and scientometric indicators.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"41 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141190218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the modification and revocation of open source licences 关于开放源代码许可证的修改和撤销
Pub Date : 2024-05-29 DOI: arxiv-2407.13064
Paul Gagnon, Misha Benjamin, Justine Gauthier, Catherine Regis, Jenny Lee, Alexei Nordell-Markovits
Historically, open source commitments have been deemed irrevocable oncematerials are released under open source licenses. In this paper, the authorsargue for the creation of a subset of rights that allows open sourcecontributors to force users to (i) update to the most recent version of amodel, (ii) accept new use case restrictions, or even (iii) cease using thesoftware entirely. While this would be a departure from the traditional opensource approach, the legal, reputational and moral risks related toopen-sourcing AI models could justify contributors having more control overdownstream uses. Recent legislative changes have also opened the door toliability of open source contributors in certain cases. The authors believethat contributors would welcome the ability to ensure that downstream users areimplementing updates that address issues like bias, guardrail workarounds oradversarial attacks on their contributions. Finally, this paper addresses howthis license category would interplay with RAIL licenses, and how it should beoperationalized and adopted by key stakeholders such as OSS platforms andscanning tools.
从历史上看,一旦材料根据开源许可发布,开源承诺就被视为不可撤销。在本文中,作者主张创建一个权利子集,允许开源贡献者强迫用户(i)更新到最新版本的模型,(ii)接受新的用例限制,甚至(iii)完全停止使用软件。虽然这有别于传统的开源方法,但与开源人工智能模型相关的法律、声誉和道德风险可以证明贡献者有理由对下游使用进行更多控制。最近的立法变革也为开源贡献者在某些情况下的可操作性打开了大门。作者认为,贡献者会欢迎有能力确保下游用户正在实施更新,以解决偏见、护栏变通或对其贡献的对抗性攻击等问题。最后,本文讨论了这一许可证类别如何与 RAIL 许可证相互作用,以及它应该如何被开放源码软件平台和扫描工具等主要利益相关者操作和采用。
{"title":"On the modification and revocation of open source licences","authors":"Paul Gagnon, Misha Benjamin, Justine Gauthier, Catherine Regis, Jenny Lee, Alexei Nordell-Markovits","doi":"arxiv-2407.13064","DOIUrl":"https://doi.org/arxiv-2407.13064","url":null,"abstract":"Historically, open source commitments have been deemed irrevocable once\u0000materials are released under open source licenses. In this paper, the authors\u0000argue for the creation of a subset of rights that allows open source\u0000contributors to force users to (i) update to the most recent version of a\u0000model, (ii) accept new use case restrictions, or even (iii) cease using the\u0000software entirely. While this would be a departure from the traditional open\u0000source approach, the legal, reputational and moral risks related to\u0000open-sourcing AI models could justify contributors having more control over\u0000downstream uses. Recent legislative changes have also opened the door to\u0000liability of open source contributors in certain cases. The authors believe\u0000that contributors would welcome the ability to ensure that downstream users are\u0000implementing updates that address issues like bias, guardrail workarounds or\u0000adversarial attacks on their contributions. Finally, this paper addresses how\u0000this license category would interplay with RAIL licenses, and how it should be\u0000operationalized and adopted by key stakeholders such as OSS platforms and\u0000scanning tools.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"47 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141737146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generation and human-expert evaluation of interesting research ideas using knowledge graphs and large language models 利用知识图谱和大型语言模型生成并由人类专家评估有趣的研究想法
Pub Date : 2024-05-27 DOI: arxiv-2405.17044
Xuemei Gu, Mario Krenn
Advanced artificial intelligence (AI) systems with access to millions ofresearch papers could inspire new research ideas that may not be conceived byhumans alone. However, how interesting are these AI-generated ideas, and howcan we improve their quality? Here, we introduce SciMuse, a system that uses anevolving knowledge graph built from more than 58 million scientific papers togenerate personalized research ideas via an interface to GPT-4. We conducted alarge-scale human evaluation with over 100 research group leaders from the MaxPlanck Society, who ranked more than 4,000 personalized research ideas based ontheir level of interest. This evaluation allows us to understand therelationships between scientific interest and the core properties of theknowledge graph. We find that data-efficient machine learning can predictresearch interest with high precision, allowing us to optimize theinterest-level of generated research ideas. This work represents a step towardsan artificial scientific muse that could catalyze unforeseen collaborations andsuggest interesting avenues for scientists.
先进的人工智能(AI)系统可以访问数百万篇研究论文,可以激发人类无法单独构思的新研究想法。然而,这些由人工智能产生的想法究竟有多有趣,我们又该如何提高它们的质量呢?在这里,我们介绍了SciMuse,这是一个利用从5800多万篇科学论文中构建的不断发展的知识图谱,通过GPT-4接口生成个性化研究想法的系统。我们与马普学会的 100 多位研究小组负责人进行了大规模的人工评估,他们根据自己的兴趣程度对 4000 多个个性化研究构想进行了排序。通过这项评估,我们了解了科学兴趣与知识图谱核心属性之间的关系。我们发现,数据高效的机器学习可以高精度地预测研究兴趣,从而优化生成的研究想法的兴趣等级。这项工作标志着我们向人工科学缪斯迈出了一步,它可以催化不可预见的合作,并为科学家们提出有趣的研究方向。
{"title":"Generation and human-expert evaluation of interesting research ideas using knowledge graphs and large language models","authors":"Xuemei Gu, Mario Krenn","doi":"arxiv-2405.17044","DOIUrl":"https://doi.org/arxiv-2405.17044","url":null,"abstract":"Advanced artificial intelligence (AI) systems with access to millions of\u0000research papers could inspire new research ideas that may not be conceived by\u0000humans alone. However, how interesting are these AI-generated ideas, and how\u0000can we improve their quality? Here, we introduce SciMuse, a system that uses an\u0000evolving knowledge graph built from more than 58 million scientific papers to\u0000generate personalized research ideas via an interface to GPT-4. We conducted a\u0000large-scale human evaluation with over 100 research group leaders from the Max\u0000Planck Society, who ranked more than 4,000 personalized research ideas based on\u0000their level of interest. This evaluation allows us to understand the\u0000relationships between scientific interest and the core properties of the\u0000knowledge graph. We find that data-efficient machine learning can predict\u0000research interest with high precision, allowing us to optimize the\u0000interest-level of generated research ideas. This work represents a step towards\u0000an artificial scientific muse that could catalyze unforeseen collaborations and\u0000suggest interesting avenues for scientists.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141165440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Oil & Water? Diffusion of AI Within and Across Scientific Fields 油和水?人工智能在科学领域内和科学领域间的扩散
Pub Date : 2024-05-24 DOI: arxiv-2405.15828
Eamon Duede, William Dolan, André Bauer, Ian Foster, Karim Lakhani
This study empirically investigates claims of the increasing ubiquity ofartificial intelligence (AI) within roughly 80 million research publicationsacross 20 diverse scientific fields, by examining the change in scholarlyengagement with AI from 1985 through 2022. We observe exponential growth, withAI-engaged publications increasing approximately thirteenfold (13x) across allfields, suggesting a dramatic shift from niche to mainstream. Moreover, weprovide the first empirical examination of the distribution of AI-engagedpublications across publication venues within individual fields, with resultsthat reveal a broadening of AI engagement within disciplines. While thisbroadening engagement suggests a move toward greater disciplinary integrationin every field, increased ubiquity is associated with a semantic tensionbetween AI-engaged research and more traditional disciplinary research. Throughan analysis of tens of millions of document embeddings, we observe a complexinterplay between AI-engaged and non-AI-engaged research within and acrossfields, suggesting that increasing ubiquity is something of an oil-and-waterphenomenon -- AI-engaged work is spreading out over fields, but not mixing wellwith non-AI-engaged work.
人工智能(AI)在20个不同科学领域的大约8000万篇研究论文中越来越无处不在,本研究通过实证研究,考察了从1985年到2022年学术界对人工智能的关注程度的变化。我们观察到了指数级的增长,在所有领域中,人工智能参与的出版物增加了约十三倍(13x),这表明从小众到主流的巨大转变。此外,我们还首次对人工智能参与的出版物在各个领域内的出版场所的分布情况进行了实证研究,结果显示人工智能在学科内的参与范围不断扩大。虽然这种参与范围的扩大表明每个领域都在向更大的学科整合迈进,但这种无处不在的趋势也与人工智能参与研究和更传统的学科研究之间的语义紧张有关。通过对数以千万计的文档嵌入进行分析,我们观察到人工智能参与的研究与非人工智能参与的研究在领域内和跨领域之间的复杂互动,这表明日益普遍化是一种油水分离的现象--人工智能参与的工作正在向各个领域扩散,但并没有与非人工智能参与的工作很好地混合在一起。
{"title":"Oil & Water? Diffusion of AI Within and Across Scientific Fields","authors":"Eamon Duede, William Dolan, André Bauer, Ian Foster, Karim Lakhani","doi":"arxiv-2405.15828","DOIUrl":"https://doi.org/arxiv-2405.15828","url":null,"abstract":"This study empirically investigates claims of the increasing ubiquity of\u0000artificial intelligence (AI) within roughly 80 million research publications\u0000across 20 diverse scientific fields, by examining the change in scholarly\u0000engagement with AI from 1985 through 2022. We observe exponential growth, with\u0000AI-engaged publications increasing approximately thirteenfold (13x) across all\u0000fields, suggesting a dramatic shift from niche to mainstream. Moreover, we\u0000provide the first empirical examination of the distribution of AI-engaged\u0000publications across publication venues within individual fields, with results\u0000that reveal a broadening of AI engagement within disciplines. While this\u0000broadening engagement suggests a move toward greater disciplinary integration\u0000in every field, increased ubiquity is associated with a semantic tension\u0000between AI-engaged research and more traditional disciplinary research. Through\u0000an analysis of tens of millions of document embeddings, we observe a complex\u0000interplay between AI-engaged and non-AI-engaged research within and across\u0000fields, suggesting that increasing ubiquity is something of an oil-and-water\u0000phenomenon -- AI-engaged work is spreading out over fields, but not mixing well\u0000with non-AI-engaged work.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"56 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141165434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rethinking the production and publication of machine-reusable expressions of research findings 重新思考制作和出版可通过机器重复使用的研究成果表达方式
Pub Date : 2024-05-21 DOI: arxiv-2405.13129
Markus Stocker, Lauren Snyder, Matthew Anfuso, Oliver Ludwig, Freya Thießen, Kheir Eddine Farfar, Muhammad Haris, Allard Oelen, Mohamad Yaser Jaradeh
Literature is the primary expression of scientific knowledge and an importantsource of research data. However, scientific knowledge expressed in narrativetext documents is not inherently machine reusable. To facilitate knowledgereuse, e.g. for synthesis research, scientific knowledge must be extracted fromarticles and organized into databases post-publication. The high time costs andinaccuracies associated with completing these activities manually has driventhe development of techniques that automate knowledge extraction. Tackling theproblem with a different mindset, we propose a pre-publication approach, knownas reborn, that ensures scientific knowledge is born reusable, i.e. produced ina machine-reusable format during knowledge production. We implement theapproach using the Open Research Knowledge Graph infrastructure for FAIRscientific knowledge organization. We test the approach with three use cases,and discuss the role of publishers and editors in scaling the approach. Ourresults suggest that the proposed approach is superior compared to classicalmanual and semi-automated post-publication extraction techniques in terms ofknowledge richness and accuracy as well as technological simplicity.
文献是科学知识的主要表达方式,也是研究数据的重要来源。然而,以叙述性文本文档表达的科学知识本身并不能被机器重复使用。为了促进知识的重复使用,例如用于综合研究,必须从文章中提取科学知识,并在出版后将其整理到数据库中。人工完成这些工作的时间成本高、误差大,因此推动了知识提取自动化技术的发展。为了以不同的思维方式解决这个问题,我们提出了一种名为 "重生 "的出版前方法,以确保科学知识生来就可重复使用,即在知识生产过程中生成机器可重复使用的格式。我们利用开放式研究知识图谱基础架构实现了这一方法,用于 FAIR 科学知识组织。我们用三个用例对该方法进行了测试,并讨论了出版商和编辑在扩展该方法中的作用。我们的研究结果表明,与传统的人工和半自动出版后提取技术相比,我们提出的方法在知识丰富度、准确性和技术简易性方面都更胜一筹。
{"title":"Rethinking the production and publication of machine-reusable expressions of research findings","authors":"Markus Stocker, Lauren Snyder, Matthew Anfuso, Oliver Ludwig, Freya Thießen, Kheir Eddine Farfar, Muhammad Haris, Allard Oelen, Mohamad Yaser Jaradeh","doi":"arxiv-2405.13129","DOIUrl":"https://doi.org/arxiv-2405.13129","url":null,"abstract":"Literature is the primary expression of scientific knowledge and an important\u0000source of research data. However, scientific knowledge expressed in narrative\u0000text documents is not inherently machine reusable. To facilitate knowledge\u0000reuse, e.g. for synthesis research, scientific knowledge must be extracted from\u0000articles and organized into databases post-publication. The high time costs and\u0000inaccuracies associated with completing these activities manually has driven\u0000the development of techniques that automate knowledge extraction. Tackling the\u0000problem with a different mindset, we propose a pre-publication approach, known\u0000as reborn, that ensures scientific knowledge is born reusable, i.e. produced in\u0000a machine-reusable format during knowledge production. We implement the\u0000approach using the Open Research Knowledge Graph infrastructure for FAIR\u0000scientific knowledge organization. We test the approach with three use cases,\u0000and discuss the role of publishers and editors in scaling the approach. Our\u0000results suggest that the proposed approach is superior compared to classical\u0000manual and semi-automated post-publication extraction techniques in terms of\u0000knowledge richness and accuracy as well as technological simplicity.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"14 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141149818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transfer Learning Approach for Railway Technical Map (RTM) Component Identification 铁路技术图 (RTM) 组件识别的迁移学习方法
Pub Date : 2024-05-21 DOI: arxiv-2405.13229
Obadage Rochana Rumalshan, Pramuka Weerasinghe, Mohamed Shaheer, Prabhath Gunathilake, Erunika Dayaratna
The extreme popularity over the years for railway transportation urges thenecessity to maintain efficient railway management systems around the globe.Even though, at present, there exist a large collection of Computer AidedDesigned Railway Technical Maps (RTMs) but available only in the portabledocument format (PDF). Using Deep Learning and Optical Character Recognitiontechniques, this research work proposes a generic system to digitize therelevant map component data from a given input image and create a formattedtext file per image. Out of YOLOv3, SSD and Faster-RCNN object detection modelsused, Faster-RCNN yields the highest mean Average Precision (mAP) and thehighest F1 score values 0.68 and 0.76 respectively. Further it is proven fromthe results obtained that, one can improve the results with OCR when the textcontaining image is being sent through a sophisticated pre-processing pipelineto remove distortions.
尽管目前存在大量计算机辅助设计的铁路技术地图(RTM),但这些地图只有PDF格式。利用深度学习和光学字符识别技术,这项研究工作提出了一种通用系统,可从给定的输入图像中数字化相关的地图组件数据,并为每幅图像创建一个格式化文本文件。在使用的 YOLOv3、SSD 和 Faster-RCNN 物体检测模型中,Faster-RCNN 的平均精度(mAP)最高,F1 分数最高,分别为 0.68 和 0.76。此外,所获得的结果还证明,如果将包含文本的图像通过复杂的预处理管道发送,以消除失真,则可以提高 OCR 的效果。
{"title":"Transfer Learning Approach for Railway Technical Map (RTM) Component Identification","authors":"Obadage Rochana Rumalshan, Pramuka Weerasinghe, Mohamed Shaheer, Prabhath Gunathilake, Erunika Dayaratna","doi":"arxiv-2405.13229","DOIUrl":"https://doi.org/arxiv-2405.13229","url":null,"abstract":"The extreme popularity over the years for railway transportation urges the\u0000necessity to maintain efficient railway management systems around the globe.\u0000Even though, at present, there exist a large collection of Computer Aided\u0000Designed Railway Technical Maps (RTMs) but available only in the portable\u0000document format (PDF). Using Deep Learning and Optical Character Recognition\u0000techniques, this research work proposes a generic system to digitize the\u0000relevant map component data from a given input image and create a formatted\u0000text file per image. Out of YOLOv3, SSD and Faster-RCNN object detection models\u0000used, Faster-RCNN yields the highest mean Average Precision (mAP) and the\u0000highest F1 score values 0.68 and 0.76 respectively. Further it is proven from\u0000the results obtained that, one can improve the results with OCR when the text\u0000containing image is being sent through a sophisticated pre-processing pipeline\u0000to remove distortions.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141149685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scientific discourse on YouTube: Motivations for citing research in comments YouTube 上的科学讨论:在评论中引用研究成果的动机
Pub Date : 2024-05-21 DOI: arxiv-2405.12798
Sören Striewski, Olga Zagovora, Isabella Peters
YouTube is a valuable source of user-generated content on a wide range oftopics, and it encourages user participation through the use of a commentsystem. Video content is increasingly addressing scientific topics, and thereis evidence that both academics and consumers use video descriptions and videocomments to refer to academic research and scientific publications. Becausecommenting is a discursive behavior, this study will provide insights on whyindividuals post links to research publications in comments. For this, aqualitative content analysis and iterative coding approach were applied.Furthermore, the reasons for mentioning academic publications in comments werecontrasted with the reasons for citing in scholarly works and with reasons forcommenting on YouTube. We discovered that the primary motives for sharingresearch links were (1) providing more insights into the topic and (2)challenging information offered by other commentators.
YouTube 是用户生成内容的重要来源,内容涉及广泛的主题,并通过使用评论系统鼓励用户参与。视频内容越来越多地涉及科学话题,有证据表明,学者和消费者都使用视频描述和视频评论来参考学术研究和科学出版物。由于评论是一种话语行为,本研究将深入探讨为什么个人会在评论中发布研究出版物的链接。此外,我们还将在评论中提及学术出版物的原因与在学术著作中引用的原因以及在 YouTube 上发表评论的原因进行了对比。我们发现,分享研究链接的主要动机是:(1) 为话题提供更多见解;(2) 质疑其他评论者提供的信息。
{"title":"Scientific discourse on YouTube: Motivations for citing research in comments","authors":"Sören Striewski, Olga Zagovora, Isabella Peters","doi":"arxiv-2405.12798","DOIUrl":"https://doi.org/arxiv-2405.12798","url":null,"abstract":"YouTube is a valuable source of user-generated content on a wide range of\u0000topics, and it encourages user participation through the use of a comment\u0000system. Video content is increasingly addressing scientific topics, and there\u0000is evidence that both academics and consumers use video descriptions and video\u0000comments to refer to academic research and scientific publications. Because\u0000commenting is a discursive behavior, this study will provide insights on why\u0000individuals post links to research publications in comments. For this, a\u0000qualitative content analysis and iterative coding approach were applied.\u0000Furthermore, the reasons for mentioning academic publications in comments were\u0000contrasted with the reasons for citing in scholarly works and with reasons for\u0000commenting on YouTube. We discovered that the primary motives for sharing\u0000research links were (1) providing more insights into the topic and (2)\u0000challenging information offered by other commentators.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"42 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141149686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Amplifying Academic Research through YouTube: Engagement Metrics as Predictors of Citation Impact 通过 YouTube 放大学术研究:作为引文影响力预测指标的参与度指标
Pub Date : 2024-05-21 DOI: arxiv-2405.12734
Olga Zagovora, Talisa Schwal, Katrin Weller
This study explores the interplay between YouTube engagement metrics and theacademic impact of cited publications within video descriptions, amid decliningtrust in traditional journalism and increased reliance on social media forinformation. By analyzing data from Altmetric.com and YouTube's API, itassesses how YouTube video features relate to citation impact. Initial resultssuggest that videos citing scientific publications and garnering highengagement-likes, comments, and references to other publications-may functionas a filtering mechanism or even as a predictor of impactful research.
在人们对传统新闻业的信任度下降、越来越依赖社交媒体获取信息的背景下,本研究探讨了 YouTube 参与度指标与视频描述中引用出版物的学术影响力之间的相互作用。通过分析 Altmetric.com 和 YouTube API 的数据,本研究评估了 YouTube 视频功能与引用影响力之间的关系。初步结果表明,引用科学出版物并获得较高的赞数、评论和对其他出版物的引用的视频可能是一种过滤机制,甚至是有影响力研究的预测指标。
{"title":"Amplifying Academic Research through YouTube: Engagement Metrics as Predictors of Citation Impact","authors":"Olga Zagovora, Talisa Schwal, Katrin Weller","doi":"arxiv-2405.12734","DOIUrl":"https://doi.org/arxiv-2405.12734","url":null,"abstract":"This study explores the interplay between YouTube engagement metrics and the\u0000academic impact of cited publications within video descriptions, amid declining\u0000trust in traditional journalism and increased reliance on social media for\u0000information. By analyzing data from Altmetric.com and YouTube's API, it\u0000assesses how YouTube video features relate to citation impact. Initial results\u0000suggest that videos citing scientific publications and garnering high\u0000engagement-likes, comments, and references to other publications-may function\u0000as a filtering mechanism or even as a predictor of impactful research.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"47 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141149783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The participation of public in knowledge production: a citizen science projects overview 公众参与知识生产:公民科学项目概览
Pub Date : 2024-05-17 DOI: arxiv-2405.10829
Nuria Bautista-Puig, Enrique Orduna-Malea, Philippe Mongeon
Citizen Science (CS) is related to public engagement in scientific research.The tasks in which the citizens can be involved are diverse and can range fromdata collection and tagging images to participation in the planning andresearch design. However, little is known about the involvement degree of thecitizens to CS projects, and the contribution of those projects to theadvancement of knowledge (e.g. scientific outcomes). This study aims to gain abetter understanding by analysing the SciStarter database. A total of 2,346 CSprojects were identified, mainly from Ecology and Environmental Sciences. Ofthese projects, 91% show low participation of the citizens (Level 1 "citizensas sensors" and 2 "citizens as interpreters", from Haklay's scale). In terms ofscientific output, 918 papers indexed in the Web of Science (WoS) wereidentified. The most prolific projects were found to have lower levels ofcitizen involvement, specifically at Levels 1 and 2.
公民科学(CS)与公众参与科学研究有关。公民可以参与的任务多种多样,从数据收集和标记图像到参与规划和研究设计。然而,人们对公民参与 CS 项目的程度以及这些项目对知识进步(如科学成果)的贡献知之甚少。本研究旨在通过分析科学启动者(SciStarter)数据库获得更深入的了解。共确定了 2346 个 CS 项目,主要来自生态学和环境科学。在这些项目中,91%的项目显示公民参与度较低(根据 Haklay 的量表,1 级 "公民作为传感器 "和 2 级 "公民作为解释者")。在科学成果方面,共鉴定了 918 篇被科学网(WoS)收录的论文。发现最多产的项目的公民参与程度较低,特别是在第 1 级和第 2 级。
{"title":"The participation of public in knowledge production: a citizen science projects overview","authors":"Nuria Bautista-Puig, Enrique Orduna-Malea, Philippe Mongeon","doi":"arxiv-2405.10829","DOIUrl":"https://doi.org/arxiv-2405.10829","url":null,"abstract":"Citizen Science (CS) is related to public engagement in scientific research.\u0000The tasks in which the citizens can be involved are diverse and can range from\u0000data collection and tagging images to participation in the planning and\u0000research design. However, little is known about the involvement degree of the\u0000citizens to CS projects, and the contribution of those projects to the\u0000advancement of knowledge (e.g. scientific outcomes). This study aims to gain a\u0000better understanding by analysing the SciStarter database. A total of 2,346 CS\u0000projects were identified, mainly from Ecology and Environmental Sciences. Of\u0000these projects, 91% show low participation of the citizens (Level 1 \"citizens\u0000as sensors\" and 2 \"citizens as interpreters\", from Haklay's scale). In terms of\u0000scientific output, 918 papers indexed in the Web of Science (WoS) were\u0000identified. The most prolific projects were found to have lower levels of\u0000citizen involvement, specifically at Levels 1 and 2.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"98 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141149684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Digital Libraries
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1