评论丰富的索引项提高了NLP系统检索到的评论医学文章排名的相关性和新颖性

IF 3.1 3区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Online Information Review Pub Date : 2022-12-29 DOI:10.1108/oir-05-2022-0283
Kianoosh Rashidi, H. Sotudeh, A. Nikseresht
{"title":"评论丰富的索引项提高了NLP系统检索到的评论医学文章排名的相关性和新颖性","authors":"Kianoosh Rashidi, H. Sotudeh, A. Nikseresht","doi":"10.1108/oir-05-2022-0283","DOIUrl":null,"url":null,"abstract":"PurposeThis study aimed to investigate how the enrichment of medical documents' index terms by their comments improves the relevance and novelty of the top-ranked results retrieved by an NLP system.Design/methodology/approachA semi-experimental pre-test and post-test research was designed to compare NLP-based indexes before and after being expanded by the comment terms. The experiments were conducted on a test collection of 13,957 documents commented by F1000-Prime reviewers. They were indexed at title, abstract, body and full-text levels. In total, 100 seed documents were randomly selected and served as queries. The textual similarity of the documents and queries was calculated using Lucene-more-like-this function and evaluated by the semantic similarity of their MeSH. The results novelty was measured using maximal marginal relevance and evaluated by their MeSH novelties. Normalized discounted cumulative gain was used to compare the basic and expanded indexes' precisions at 10, 20 and 50 top ranks.FindingsThe relevance and novelty of the results ranked at the top precision points was improved after expanding the indexes by the comment terms. The finding implies that meta-texts are effective in representing their mother documents, by adding dynamic elements to their rather static contents. It also provides further evidence about the merits of the application of social intelligence and collective wisdom reflected in the actions and reactions of users in tackling the challenges faced by NLP-based systems.Originality/valueThis is the first study to confirm that social comments on scientific papers improve the performance of information systems in terms of relevance and novelty.Peer reviewThe peer review history for this article is available at: https://publons.com/publon/10.1108/OIR-05-2022-0283.","PeriodicalId":54683,"journal":{"name":"Online Information Review","volume":null,"pages":null},"PeriodicalIF":3.1000,"publicationDate":"2022-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comment-enriched index terms improve the relevance and novelty of the ranking of the commented medical articles retrieved by an NLP system\",\"authors\":\"Kianoosh Rashidi, H. Sotudeh, A. Nikseresht\",\"doi\":\"10.1108/oir-05-2022-0283\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"PurposeThis study aimed to investigate how the enrichment of medical documents' index terms by their comments improves the relevance and novelty of the top-ranked results retrieved by an NLP system.Design/methodology/approachA semi-experimental pre-test and post-test research was designed to compare NLP-based indexes before and after being expanded by the comment terms. The experiments were conducted on a test collection of 13,957 documents commented by F1000-Prime reviewers. They were indexed at title, abstract, body and full-text levels. In total, 100 seed documents were randomly selected and served as queries. The textual similarity of the documents and queries was calculated using Lucene-more-like-this function and evaluated by the semantic similarity of their MeSH. The results novelty was measured using maximal marginal relevance and evaluated by their MeSH novelties. Normalized discounted cumulative gain was used to compare the basic and expanded indexes' precisions at 10, 20 and 50 top ranks.FindingsThe relevance and novelty of the results ranked at the top precision points was improved after expanding the indexes by the comment terms. The finding implies that meta-texts are effective in representing their mother documents, by adding dynamic elements to their rather static contents. It also provides further evidence about the merits of the application of social intelligence and collective wisdom reflected in the actions and reactions of users in tackling the challenges faced by NLP-based systems.Originality/valueThis is the first study to confirm that social comments on scientific papers improve the performance of information systems in terms of relevance and novelty.Peer reviewThe peer review history for this article is available at: https://publons.com/publon/10.1108/OIR-05-2022-0283.\",\"PeriodicalId\":54683,\"journal\":{\"name\":\"Online Information Review\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2022-12-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Online Information Review\",\"FirstCategoryId\":\"91\",\"ListUrlMain\":\"https://doi.org/10.1108/oir-05-2022-0283\",\"RegionNum\":3,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Online Information Review","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1108/oir-05-2022-0283","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

目的本研究旨在探讨通过医学文献的评论来丰富医学文献的索引词,如何提高NLP系统检索到的排名靠前的结果的相关性和新颖性。设计/方法/方法设计半实验前测和后测研究,比较基于nlp的索引被评论词扩展前后的差异。实验是在F1000-Prime审稿人评论的13957篇论文的测试集上进行的。它们按标题、摘要、正文和全文进行索引。总共随机选择100个种子文档作为查询。使用Lucene-more-like-this函数计算文档和查询的文本相似度,并通过其MeSH的语义相似度进行评估。结果的新颖性用最大边际相关性来衡量,并通过其MeSH新颖性来评估。使用归一化贴现累积增益来比较基本指数和扩展指数在10、20和50位的精度。结果:在通过评论项扩展索引后,排名在精度点前的结果的相关性和新颖性得到了提高。这一发现表明,通过向其静态内容添加动态元素,元文本可以有效地表示其母文档。它还提供了进一步的证据,证明在解决基于nlp的系统所面临的挑战时,反映在用户的行动和反应中的社会智能和集体智慧的应用的优点。原创性/价值这是第一个证实对科学论文的社会评论在相关性和新颖性方面提高信息系统性能的研究。同行评议本文的同行评议历史可在:https://publons.com/publon/10.1108/OIR-05-2022-0283。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Comment-enriched index terms improve the relevance and novelty of the ranking of the commented medical articles retrieved by an NLP system
PurposeThis study aimed to investigate how the enrichment of medical documents' index terms by their comments improves the relevance and novelty of the top-ranked results retrieved by an NLP system.Design/methodology/approachA semi-experimental pre-test and post-test research was designed to compare NLP-based indexes before and after being expanded by the comment terms. The experiments were conducted on a test collection of 13,957 documents commented by F1000-Prime reviewers. They were indexed at title, abstract, body and full-text levels. In total, 100 seed documents were randomly selected and served as queries. The textual similarity of the documents and queries was calculated using Lucene-more-like-this function and evaluated by the semantic similarity of their MeSH. The results novelty was measured using maximal marginal relevance and evaluated by their MeSH novelties. Normalized discounted cumulative gain was used to compare the basic and expanded indexes' precisions at 10, 20 and 50 top ranks.FindingsThe relevance and novelty of the results ranked at the top precision points was improved after expanding the indexes by the comment terms. The finding implies that meta-texts are effective in representing their mother documents, by adding dynamic elements to their rather static contents. It also provides further evidence about the merits of the application of social intelligence and collective wisdom reflected in the actions and reactions of users in tackling the challenges faced by NLP-based systems.Originality/valueThis is the first study to confirm that social comments on scientific papers improve the performance of information systems in terms of relevance and novelty.Peer reviewThe peer review history for this article is available at: https://publons.com/publon/10.1108/OIR-05-2022-0283.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Online Information Review
Online Information Review 工程技术-计算机:信息系统
CiteScore
6.90
自引率
16.10%
发文量
67
审稿时长
6 months
期刊介绍: The journal provides a multi-disciplinary forum for scholars from a range of fields, including information studies/iSchools, data studies, internet studies, media and communication studies and information systems. Publishes research on the social, political and ethical aspects of emergent digital information practices and platforms, and welcomes submissions that draw upon critical and socio-technical perspectives in order to address these developments. Welcomes empirical, conceptual and methodological contributions on any topics relevant to the broad field of digital information and communication, however we are particularly interested in receiving submissions that address emerging issues around the below topics. Coverage includes (but is not limited to): •Online communities, social networking and social media, including online political communication; crowdsourcing; positive computing and wellbeing. •The social drivers and implications of emerging data practices, including open data; big data; data journeys and flows; and research data management. •Digital transformations including organisations’ use of information technologies (e.g. Internet of Things and digitisation of user experience) to improve economic and social welfare, health and wellbeing, and protect the environment. •Developments in digital scholarship and the production and use of scholarly content. •Online and digital research methods, including their ethical aspects.
期刊最新文献
The digitalization tendency of young adults: differences by living environment, gender and education Artificial intelligence (AI) for supply chain collaboration: implications on information sharing and trust Navigating the inception stage in online peer production communities: a comparative study on community building activities, user roles and interaction dynamics Finding “fake” in the news: the relationship between social media use, political knowledge, epistemic political efficacy and fake news literacy How do users select the content they share on social media: flow theory perspective
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1