首页 > 最新文献

Research Synthesis Methods最新文献

英文 中文
Bayesian Federated Inference for regression models based on non-shared medical center data. 基于非共享医疗中心数据的贝叶斯联邦推理回归模型。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-10 DOI: 10.1017/rsm.2025.6
Marianne A Jonker, Hassan Pazira, Anthony C C Coolen

To estimate accurately the parameters of a regression model, the sample size must be large enough relative to the number of possible predictors for the model. In practice, sufficient data is often lacking, which can lead to overfitting of the model and, as a consequence, unreliable predictions of the outcome of new patients. Pooling data from different data sets collected in different (medical) centers would alleviate this problem, but is often not feasible due to privacy regulation or logistic problems. An alternative route would be to analyze the local data in the centers separately and combine the statistical inference results with the Bayesian Federated Inference (BFI) methodology. The aim of this approach is to compute from the inference results in separate centers what would have been found if the statistical analysis was performed on the combined data. We explain the methodology under homogeneity and heterogeneity across the populations in the separate centers, and give real life examples for better understanding. Excellent performance of the proposed methodology is shown. An R-package to do all the calculations has been developed and is illustrated in this article. The mathematical details are given in the Appendix.

为了准确地估计回归模型的参数,样本量必须相对于模型可能的预测因子的数量足够大。在实践中,往往缺乏足够的数据,这可能导致模型的过度拟合,从而导致对新患者预后的预测不可靠。汇集来自不同(医疗)中心收集的不同数据集的数据可以缓解这一问题,但由于隐私法规或后勤问题,通常不可行。另一种方法是分别分析中心的本地数据,并将统计推断结果与贝叶斯联邦推断(BFI)方法结合起来。这种方法的目的是从不同中心的推断结果中计算如果对组合数据进行统计分析会发现什么。我们解释了在不同中心人群的同质性和异质性下的方法,并给出了现实生活中的例子,以便更好地理解。结果表明,该方法具有良好的性能。已经开发了一个r包来完成所有的计算,本文将对此进行说明。数学细节在附录中给出。
{"title":"Bayesian Federated Inference for regression models based on non-shared medical center data.","authors":"Marianne A Jonker, Hassan Pazira, Anthony C C Coolen","doi":"10.1017/rsm.2025.6","DOIUrl":"10.1017/rsm.2025.6","url":null,"abstract":"<p><p>To estimate accurately the parameters of a regression model, the sample size must be large enough relative to the number of possible predictors for the model. In practice, sufficient data is often lacking, which can lead to overfitting of the model and, as a consequence, unreliable predictions of the outcome of new patients. Pooling data from different data sets collected in different (medical) centers would alleviate this problem, but is often not feasible due to privacy regulation or logistic problems. An alternative route would be to analyze the local data in the centers separately and combine the statistical inference results with the Bayesian Federated Inference (BFI) methodology. The aim of this approach is to compute from the inference results in separate centers what would have been found if the statistical analysis was performed on the combined data. We explain the methodology under homogeneity and heterogeneity across the populations in the separate centers, and give real life examples for better understanding. Excellent performance of the proposed methodology is shown. An R-package to do all the calculations has been developed and is illustrated in this article. The mathematical details are given in the Appendix.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"383-423"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527543/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive systematic review dataset is a rich resource for training and evaluation of AI systems for title and abstract screening. 一个全面的系统评论数据集是训练和评估人工智能系统进行标题和摘要筛选的丰富资源。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-07 DOI: 10.1017/rsm.2025.1
Gary C K Chan, Estrid He, Janni Leung, Karin Verspoor

When conducting a systematic review, screening the vast body of literature to identify the small set of relevant studies is a labour-intensive and error-prone process. Although there is an increasing number of fully automated tools for screening, their performance is suboptimal and varies substantially across review topic areas. Many of these tools are only trained on small datasets, and most are not tested on a wide range of review topic areas. This study presents two systematic review datasets compiled from more than 8600 systematic reviews and more than 540000 abstracts covering 51 research topic areas in health and medical research. These datasets are the largest of their kinds to date. We demonstrate their utility in training and evaluating language models for title and abstract screening. Our dataset includes detailed metadata of each review, including title, background, objectives and selection criteria. We demonstrated that a small language model trained on this dataset with additional metadata has excellent performance with an average recall above 95% and specificity over 70% across a wide range of review topic areas. Future research can build on our dataset to further improve the performance of fully automated tools for systematic review title and abstract screening.

在进行系统综述时,筛选大量文献以确定一小部分相关研究是一个劳动密集型且容易出错的过程。尽管有越来越多的全自动筛选工具,但它们的性能不是最优的,并且在审查主题领域之间变化很大。这些工具中的许多只在小数据集上进行了训练,并且大多数都没有在广泛的审查主题领域进行测试。本研究提供了两个系统综述数据集,其中包括8600多篇系统综述和54万多篇摘要,涵盖卫生和医学研究的51个研究主题领域。这些数据集是迄今为止同类数据集中最大的。我们展示了它们在训练和评估标题和摘要筛选语言模型中的效用。我们的数据集包括每篇综述的详细元数据,包括标题、背景、目标和选择标准。我们证明了在此数据集上训练的具有额外元数据的小型语言模型具有出色的性能,在广泛的审查主题领域中,平均召回率超过95%,特异性超过70%。未来的研究可以建立在我们的数据集上,进一步提高系统综述标题和摘要筛选的全自动工具的性能。
{"title":"A comprehensive systematic review dataset is a rich resource for training and evaluation of AI systems for title and abstract screening.","authors":"Gary C K Chan, Estrid He, Janni Leung, Karin Verspoor","doi":"10.1017/rsm.2025.1","DOIUrl":"10.1017/rsm.2025.1","url":null,"abstract":"<p><p>When conducting a systematic review, screening the vast body of literature to identify the small set of relevant studies is a labour-intensive and error-prone process. Although there is an increasing number of fully automated tools for screening, their performance is suboptimal and varies substantially across review topic areas. Many of these tools are only trained on small datasets, and most are not tested on a wide range of review topic areas. This study presents two systematic review datasets compiled from more than 8600 systematic reviews and more than 540000 abstracts covering 51 research topic areas in health and medical research. These datasets are the largest of their kinds to date. We demonstrate their utility in training and evaluating language models for title and abstract screening. Our dataset includes detailed metadata of each review, including title, background, objectives and selection criteria. We demonstrated that a small language model trained on this dataset with additional metadata has excellent performance with an average recall above 95% and specificity over 70% across a wide range of review topic areas. Future research can build on our dataset to further improve the performance of fully automated tools for systematic review title and abstract screening.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"308-322"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527522/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Federated Inference for regression models based on non-shared medical center data - ERRATUM. 基于非共享医疗中心数据的贝叶斯联邦推理回归模型-勘误。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 DOI: 10.1017/rsm.2025.23
Marianne A Jonker, Hassan Pazira, Anthony C C Coolen
{"title":"Bayesian Federated Inference for regression models based on non-shared medical center data - ERRATUM.","authors":"Marianne A Jonker, Hassan Pazira, Anthony C C Coolen","doi":"10.1017/rsm.2025.23","DOIUrl":"10.1017/rsm.2025.23","url":null,"abstract":"","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"424"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527526/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Network meta-analysis made simple: A composite likelihood approach. 网络荟萃分析简单:复合似然方法。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-17 DOI: 10.1017/rsm.2024.12
Yu-Lun Liu, Bingyu Zhang, Haitao Chu, Yong Chen

Network meta-analysis (NMA), also known as mixed treatment comparison meta-analysis or multiple treatments meta-analysis, extends conventional pairwise meta-analysis by simultaneously synthesizing multiple interventions in a single integrated analysis. Despite the growing popularity of NMA within comparative effectiveness research, it comes with potential challenges. For example, within-study correlations among treatment comparisons are rarely reported in the published literature. Yet, these correlations are pivotal for valid statistical inference. As demonstrated in earlier studies, ignoring these correlations can inflate mean squared errors of the resulting point estimates and lead to inaccurate standard error estimates. This article introduces a composite likelihood-based approach that ensures accurate statistical inference without requiring knowledge of the within-study correlations. The proposed method is computationally robust and efficient, with substantially reduced computational time compared to the state-of-the-science methods implemented in R packages. The proposed method was evaluated through extensive simulations and applied to two important applications including an NMA comparing interventions for primary open-angle glaucoma, and another comparing treatments for chronic prostatitis and chronic pelvic pain syndrome.

网络荟萃分析(NMA),也被称为混合治疗比较荟萃分析或多种治疗荟萃分析,通过在单一综合分析中同时综合多种干预措施,扩展了传统的两两荟萃分析。尽管NMA在比较有效性研究中越来越受欢迎,但它也带来了潜在的挑战。例如,在已发表的文献中很少报道治疗比较之间的研究内相关性。然而,这些相关性对于有效的统计推断至关重要。正如在早期的研究中所证明的那样,忽略这些相关性会使所得点估计的均方误差增大,并导致不准确的标准误差估计。本文介绍了一种基于复合似然的方法,该方法确保了准确的统计推断,而不需要了解研究内部的相关性。所提出的方法具有计算鲁棒性和效率,与在R包中实现的最新方法相比,大大减少了计算时间。该方法通过广泛的模拟进行评估,并应用于两个重要的应用,包括NMA比较原发性开角型青光眼的干预措施,以及另一个比较慢性前列腺炎和慢性盆腔疼痛综合征的治疗方法。
{"title":"Network meta-analysis made simple: A composite likelihood approach.","authors":"Yu-Lun Liu, Bingyu Zhang, Haitao Chu, Yong Chen","doi":"10.1017/rsm.2024.12","DOIUrl":"10.1017/rsm.2024.12","url":null,"abstract":"<p><p>Network meta-analysis (NMA), also known as mixed treatment comparison meta-analysis or multiple treatments meta-analysis, extends conventional pairwise meta-analysis by simultaneously synthesizing multiple interventions in a single integrated analysis. Despite the growing popularity of NMA within comparative effectiveness research, it comes with potential challenges. For example, within-study correlations among treatment comparisons are rarely reported in the published literature. Yet, these correlations are pivotal for valid statistical inference. As demonstrated in earlier studies, ignoring these correlations can inflate mean squared errors of the resulting point estimates and lead to inaccurate standard error estimates. This article introduces a composite likelihood-based approach that ensures accurate statistical inference without requiring knowledge of the within-study correlations. The proposed method is computationally robust and efficient, with substantially reduced computational time compared to the state-of-the-science methods implemented in R packages. The proposed method was evaluated through extensive simulations and applied to two important applications including an NMA comparing interventions for primary open-angle glaucoma, and another comparing treatments for chronic prostatitis and chronic pelvic pain syndrome.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"272-290"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527528/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CausalMetaR: An R package for performing causally interpretable meta-analyses - ERRATUM. 一个R软件包,用于执行因果关系可解释的元分析-勘误。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 DOI: 10.1017/rsm.2025.22
Guanbo Wang, Sean McGrath, Yi Lian
{"title":"CausalMetaR: An R package for performing causally interpretable meta-analyses - ERRATUM.","authors":"Guanbo Wang, Sean McGrath, Yi Lian","doi":"10.1017/rsm.2025.22","DOIUrl":"10.1017/rsm.2025.22","url":null,"abstract":"","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"441"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527490/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ZIBGLMM: Zero-inflated bivariate generalized linear mixed model for meta-analysis with double-zero-event studies. 双零事件研究的元分析的零膨胀双变量广义线性混合模型。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-21 DOI: 10.1017/rsm.2024.4
Lu Li, Lifeng Lin, Joseph C Cappelleri, Haitao Chu, Yong Chen

Double-zero-event studies (DZS) pose a challenge for accurately estimating the overall treatment effect in meta-analysis (MA). Current approaches, such as continuity correction or omission of DZS, are commonly employed, yet these ad hoc methods can yield biased conclusions. Although the standard bivariate generalized linear mixed model (BGLMM) can accommodate DZS, it fails to address the potential systemic differences between DZS and other studies. In this article, we propose a zero-inflated bivariate generalized linear mixed model (ZIBGLMM) to tackle this issue. This two-component finite mixture model includes zero inflation for a subpopulation with negligible or extremely low risk. We develop both frequentist and Bayesian versions of ZIBGLMM and examine its performance in estimating risk ratios against the BGLMM and conventional two-stage MA that excludes DZS. Through extensive simulation studies and real-world MA case studies, we demonstrate that ZIBGLMM outperforms the BGLMM and conventional two-stage MA that excludes DZS in estimating the true effect size with substantially less bias and comparable coverage probability.

双零事件研究(DZS)在meta分析(MA)中对准确估计整体治疗效果提出了挑战。目前的方法,如连续性校正或遗漏DZS,通常被采用,但这些临时方法可能产生有偏见的结论。虽然标准的二元广义线性混合模型(BGLMM)可以容纳DZS,但它无法解决DZS与其他研究之间潜在的系统性差异。在本文中,我们提出了一个零膨胀的二元广义线性混合模型(ZIBGLMM)来解决这个问题。这种双组分有限混合模型对可忽略不计或极低风险的亚群体包括零膨胀。我们开发了频率和贝叶斯版本的ZIBGLMM,并检查了它在估计风险比方面的性能,而不是BGLMM和排除DZS的传统两阶段MA。通过广泛的模拟研究和现实世界的MA案例研究,我们证明ZIBGLMM在估计真实效应大小方面优于BGLMM和排除DZS的传统两阶段MA,偏差大大减少,覆盖概率相当。
{"title":"ZIBGLMM: Zero-inflated bivariate generalized linear mixed model for meta-analysis with double-zero-event studies.","authors":"Lu Li, Lifeng Lin, Joseph C Cappelleri, Haitao Chu, Yong Chen","doi":"10.1017/rsm.2024.4","DOIUrl":"10.1017/rsm.2024.4","url":null,"abstract":"<p><p>Double-zero-event studies (DZS) pose a challenge for accurately estimating the overall treatment effect in meta-analysis (MA). Current approaches, such as continuity correction or omission of DZS, are commonly employed, yet these ad hoc methods can yield biased conclusions. Although the standard bivariate generalized linear mixed model (BGLMM) can accommodate DZS, it fails to address the potential systemic differences between DZS and other studies. In this article, we propose a zero-inflated bivariate generalized linear mixed model (ZIBGLMM) to tackle this issue. This two-component finite mixture model includes zero inflation for a subpopulation with negligible or extremely low risk. We develop both frequentist and Bayesian versions of ZIBGLMM and examine its performance in estimating risk ratios against the BGLMM and conventional two-stage MA that excludes DZS. Through extensive simulation studies and real-world MA case studies, we demonstrate that ZIBGLMM outperforms the BGLMM and conventional two-stage MA that excludes DZS in estimating the true effect size with substantially less bias and comparable coverage probability.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"251-271"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527523/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Translating systematic searches in the APA PsycInfo database from Ovid to EBSCOhost: A tutorial based on a filter translation. 将APA PsycInfo数据库中的系统搜索从Ovid翻译到EBSCOhost:一个基于过滤器翻译的教程。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-07 DOI: 10.1017/rsm.2024.18
Zahra Premji, Hilary Kraus

Search filters are single-concept systematic search strategies created by experts. Filters are a valuable resource for systematic searchers. Typically, filters are designed for a single database in a single interface. If researchers do not have access to that specific interface, the existing filter will be unusable without translation. Filter translation is a complex process that requires an understanding of information retrieval concepts, as well as the unique indexing and search functionality of databases and interfaces. The authors undertook a project to translate an APA PsycInfo search filter for Randomized Controlled Trials/Clinical Controlled Trials (RCT/CCT), developed by Canada's Drug Agency, from the Wolters Kluwer Health Ovid interface to the EBSCO Information Services EBSCOhost interface. We present here a guide for translation, from the first principles of systematic searching to fine details of the relevant database and interfaces, based on our experience and illustrated by a worked example. We discuss each element of a systematic search in a stepwise process, addressing both the underlying information retrieval concepts and the technical strategies for effective translation between the two interfaces. We end with a discussion on translation challenges, with some guidance on how to mitigate potential impacts on sensitivity. While we have endeavored to explain the workings of this process accessibly for researchers who are not experts in systematic searching, anyone undertaking a search translation project should work with a trained information specialist if they lack information retrieval expertise or are unfamiliar with the inner workings of the database, the original interface, and the destination interface.

搜索过滤器是由专家创建的单概念系统搜索策略。过滤器是系统搜索器的宝贵资源。通常,过滤器是为单个接口中的单个数据库设计的。如果研究人员无法访问特定的接口,那么没有翻译,现有的过滤器将无法使用。过滤器翻译是一个复杂的过程,需要理解信息检索概念,以及数据库和接口的独特索引和搜索功能。作者承担了一个项目,将由加拿大药品管理局开发的APA PsycInfo随机对照试验/临床对照试验(RCT/CCT)搜索过滤器从Wolters Kluwer Health Ovid界面翻译到EBSCO信息服务EBSCOhost界面。在此,我们根据自己的经验,并通过一个工作实例,提供了一个翻译指南,从系统搜索的基本原则到相关数据库和接口的细节。我们在一个逐步的过程中讨论了系统搜索的每个元素,解决了潜在的信息检索概念和在两个接口之间有效转换的技术策略。最后,我们讨论了翻译面临的挑战,并就如何减轻对敏感性的潜在影响提供了一些指导。虽然我们努力为非系统搜索专家的研究人员解释这一过程的工作原理,但任何从事搜索翻译项目的人,如果缺乏信息检索专业知识或不熟悉数据库、原始接口和目标接口的内部工作原理,都应该与受过训练的信息专家合作。
{"title":"Translating systematic searches in the APA PsycInfo database from Ovid to EBSCOhost: A tutorial based on a filter translation.","authors":"Zahra Premji, Hilary Kraus","doi":"10.1017/rsm.2024.18","DOIUrl":"10.1017/rsm.2024.18","url":null,"abstract":"<p><p>Search filters are single-concept systematic search strategies created by experts. Filters are a valuable resource for systematic searchers. Typically, filters are designed for a single database in a single interface. If researchers do not have access to that specific interface, the existing filter will be unusable without translation. Filter translation is a complex process that requires an understanding of information retrieval concepts, as well as the unique indexing and search functionality of databases and interfaces. The authors undertook a project to translate an APA PsycInfo search filter for Randomized Controlled Trials/Clinical Controlled Trials (RCT/CCT), developed by Canada's Drug Agency, from the Wolters Kluwer Health Ovid interface to the EBSCO Information Services EBSCOhost interface. We present here a guide for translation, from the first principles of systematic searching to fine details of the relevant database and interfaces, based on our experience and illustrated by a worked example. We discuss each element of a systematic search in a stepwise process, addressing both the underlying information retrieval concepts and the technical strategies for effective translation between the two interfaces. We end with a discussion on translation challenges, with some guidance on how to mitigate potential impacts on sensitivity. While we have endeavored to explain the workings of this process accessibly for researchers who are not experts in systematic searching, anyone undertaking a search translation project should work with a trained information specialist if they lack information retrieval expertise or are unfamiliar with the inner workings of the database, the original interface, and the destination interface.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"228-250"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527542/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning for identifying randomised controlled trials when conducting systematic reviews: Development and evaluation of its impact on practice. 在进行系统评价时用于识别随机对照试验的机器学习:其对实践影响的发展和评估。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-21 DOI: 10.1017/rsm.2025.3
Xuan Qin, Minghong Yao, Xiaochao Luo, Jiali Liu, Yu Ma, Yanmei Liu, Hao Li, Ke Deng, Kang Zou, Ling Li, Xin Sun

Machine learning (ML) models have been developed to identify randomised controlled trials (RCTs) to accelerate systematic reviews (SRs). However, their use has been limited due to concerns about their performance and practical benefits. We developed a high-recall ensemble learning model using Cochrane RCT data to enhance the identification of RCTs for rapid title and abstract screening in SRs and evaluated the model externally with our annotated RCT datasets. Additionally, we assessed the practical impact in terms of labour time savings and recall improvement under two scenarios: ML-assisted double screening (where ML and one reviewer screened all citations in parallel) and ML-assisted stepwise screening (where ML flagged all potential RCTs, and at least two reviewers subsequently filtered the flagged citations). Our model achieved twice the precision compared to the existing SVM model while maintaining a recall of 0.99 in both internal and external tests. In a practical evaluation with ML-assisted double screening, our model led to significant labour time savings (average 45.4%) and improved recall (average 0.998 compared to 0.919 for a single reviewer). In ML-assisted stepwise screening, the model performed similarly to standard manual screening but with average labour time savings of 74.4%. In conclusion, compared with existing methods, the proposed model can reduce workload while maintaining comparable recall when identifying RCTs during the title and abstract screening stages, thereby accelerating SRs. We propose practical recommendations to effectively apply ML-assisted manual screening when conducting SRs, depending on reviewer availability (ML-assisted double screening) or time constraints (ML-assisted stepwise screening).

机器学习(ML)模型已被开发用于识别随机对照试验(rct),以加速系统评价(SRs)。然而,由于对其性能和实际效益的担忧,它们的使用受到限制。我们利用Cochrane RCT数据开发了一个高查全率集成学习模型,以增强RCT在SRs中的快速标题和摘要筛选的识别,并使用我们的注释RCT数据集对模型进行外部评估。此外,我们评估了在两种情况下节省劳动时间和提高召回率方面的实际影响:机器学习辅助双重筛选(机器学习和一名审稿人并行筛选所有引用)和机器学习辅助逐步筛选(机器学习标记所有潜在的随机对照试验,至少两名审稿人随后过滤标记的引用)。与现有的SVM模型相比,我们的模型实现了两倍的精度,同时在内部和外部测试中保持了0.99的召回率。在使用机器学习辅助双重筛选的实际评估中,我们的模型显著节省了劳动时间(平均45.4%),提高了召回率(平均0.998,而单个评论者的召回率为0.919)。在机器学习辅助逐步筛选中,该模型的表现与标准人工筛选相似,但平均节省了74.4%的劳动时间。综上所述,与现有方法相比,本文提出的模型可以减少工作量,同时在标题和摘要筛选阶段识别rct时保持可比较的召回率,从而加快SRs。根据审稿人的可用性(机器学习辅助的双重筛选)或时间限制(机器学习辅助的逐步筛选),我们提出了在进行SRs时有效应用机器学习辅助的手动筛选的实用建议。
{"title":"Machine learning for identifying randomised controlled trials when conducting systematic reviews: Development and evaluation of its impact on practice.","authors":"Xuan Qin, Minghong Yao, Xiaochao Luo, Jiali Liu, Yu Ma, Yanmei Liu, Hao Li, Ke Deng, Kang Zou, Ling Li, Xin Sun","doi":"10.1017/rsm.2025.3","DOIUrl":"10.1017/rsm.2025.3","url":null,"abstract":"<p><p>Machine learning (ML) models have been developed to identify randomised controlled trials (RCTs) to accelerate systematic reviews (SRs). However, their use has been limited due to concerns about their performance and practical benefits. We developed a high-recall ensemble learning model using Cochrane RCT data to enhance the identification of RCTs for rapid title and abstract screening in SRs and evaluated the model externally with our annotated RCT datasets. Additionally, we assessed the practical impact in terms of labour time savings and recall improvement under two scenarios: ML-assisted double screening (where ML and one reviewer screened all citations in parallel) and ML-assisted stepwise screening (where ML flagged all potential RCTs, and at least two reviewers subsequently filtered the flagged citations). Our model achieved twice the precision compared to the existing SVM model while maintaining a recall of 0.99 in both internal and external tests. In a practical evaluation with ML-assisted double screening, our model led to significant labour time savings (average 45.4%) and improved recall (average 0.998 compared to 0.919 for a single reviewer). In ML-assisted stepwise screening, the model performed similarly to standard manual screening but with average labour time savings of 74.4%. In conclusion, compared with existing methods, the proposed model can reduce workload while maintaining comparable recall when identifying RCTs during the title and abstract screening stages, thereby accelerating SRs. We propose practical recommendations to effectively apply ML-assisted manual screening when conducting SRs, depending on reviewer availability (ML-assisted double screening) or time constraints (ML-assisted stepwise screening).</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"350-363"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527483/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effect modification and non-collapsibility together may lead to conflicting treatment decisions: A review of marginal and conditional estimands and recommendations for decision-making. 效果修改和不可折叠性一起可能导致相互冲突的治疗决策:对边际和条件估计和决策建议的回顾。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-10 DOI: 10.1017/rsm.2025.2
David M Phillippo, Antonio Remiro-Azócar, Anna Heath, Gianluca Baio, Sofia Dias, A E Ades, Nicky J Welton

Effect modification occurs when a covariate alters the relative effectiveness of treatment compared to control. It is widely understood that, when effect modification is present, treatment recommendations may vary by population and by subgroups within the population. Population-adjustment methods are increasingly used to adjust for differences in effect modifiers between study populations and to produce population-adjusted estimates in a relevant target population for decision-making. It is also widely understood that marginal and conditional estimands for non-collapsible effect measures, such as odds ratios or hazard ratios, do not in general coincide even without effect modification. However, the consequences of both non-collapsibility and effect modification together are little-discussed in the literature.In this article, we set out the definitions of conditional and marginal estimands, illustrate their properties when effect modification is present, and discuss the implications for decision-making. In particular, we show that effect modification can result in conflicting treatment rankings between conditional and marginal estimates. This is because conditional and marginal estimands correspond to different decision questions that are no longer aligned when effect modification is present. For time-to-event outcomes, the presence of covariates implies that marginal hazard ratios are time-varying, and effect modification can cause marginal hazard curves to cross. We conclude with practical recommendations for decision-making in the presence of effect modification, based on pragmatic comparisons of both conditional and marginal estimates in the decision target population. Currently, multilevel network meta-regression is the only population-adjustment method capable of producing both conditional and marginal estimates, in any decision target population.

当协变量改变了治疗相对于对照的相对有效性时,效果改变就发生了。人们普遍认为,当效果发生改变时,治疗建议可能因人群和人群中的亚组而异。人口调整方法越来越多地用于调整研究人群之间效应修饰因子的差异,并为决策提供有关目标人群的人口调整估计值。人们还普遍认识到,即使没有效应修正,非可折叠效应度量的边际和条件估计,如优势比或风险比,通常也不会一致。然而,文献中很少讨论非溃散性和效应修饰的后果。在本文中,我们列出了条件估计和边际估计的定义,说明了它们在存在效应修正时的性质,并讨论了决策的含义。我们特别指出,效应修正可能导致条件估计和边际估计之间的治疗排名冲突。这是因为条件估计和边际估计对应于不同的决策问题,当存在效果修改时,这些问题不再对齐。对于时间-事件结果,协变量的存在意味着边际风险比是时变的,效应修正会导致边际风险曲线交叉。基于对决策目标人群的条件估计和边际估计的务实比较,我们总结了在存在效应修正的情况下对决策的实际建议。目前,多层次网络元回归是唯一能够在任何决策目标人群中产生条件估计和边际估计的人口调整方法。
{"title":"Effect modification and non-collapsibility together may lead to conflicting treatment decisions: A review of marginal and conditional estimands and recommendations for decision-making.","authors":"David M Phillippo, Antonio Remiro-Azócar, Anna Heath, Gianluca Baio, Sofia Dias, A E Ades, Nicky J Welton","doi":"10.1017/rsm.2025.2","DOIUrl":"10.1017/rsm.2025.2","url":null,"abstract":"<p><p>Effect modification occurs when a covariate alters the relative effectiveness of treatment compared to control. It is widely understood that, when effect modification is present, treatment recommendations may vary by population and by subgroups within the population. Population-adjustment methods are increasingly used to adjust for differences in effect modifiers between study populations and to produce population-adjusted estimates in a relevant target population for decision-making. It is also widely understood that marginal and conditional estimands for non-collapsible effect measures, such as odds ratios or hazard ratios, do not in general coincide even without effect modification. However, the consequences of both non-collapsibility and effect modification together are little-discussed in the literature.In this article, we set out the definitions of conditional and marginal estimands, illustrate their properties when effect modification is present, and discuss the implications for decision-making. In particular, we show that effect modification can result in conflicting treatment rankings between conditional and marginal estimates. This is because conditional and marginal estimands correspond to different decision questions that are no longer aligned when effect modification is present. For time-to-event outcomes, the presence of covariates implies that marginal hazard ratios are time-varying, and effect modification can cause marginal hazard curves to cross. We conclude with practical recommendations for decision-making in the presence of effect modification, based on pragmatic comparisons of both conditional and marginal estimates in the decision target population. Currently, multilevel network meta-regression is the only population-adjustment method capable of producing both conditional and marginal estimates, in any decision target population.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"323-349"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527544/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Meta-analytic rain cloud plots: Improving evidence communication through data visualization design principles. 元分析雨云图:通过数据可视化设计原则改善证据交流。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-10 DOI: 10.1017/rsm.2025.4
Kaitlyn G Fitzgerald, David Khella, Avery Charles, Elizabeth Tipton

Results of meta-analyses are of interest not only to researchers but often to policy-makers and other decision-makers (e.g., in education and medicine), and visualizations play an important role in communicating data and statistical evidence to the broader public. Therefore, the potential audience of meta-analytic visualizations is broad. However, the most common meta-analytic visualization - the forest plot - uses non-optimal design principles that do not align with data visualization best practices and relies on statistical knowledge and conventions not likely to be familiar to a broad audience. Previously, the Meta-Analytic Rain Cloud (MARC) plot has been shown to be an effective alternative to a forest plot when communicating the results of a small meta-analysis to education practitioners. However, the original MARC plot design was not well-suited for meta-analyses with large numbers of effect sizes as is common across the social sciences. This paper presents an extension of the MARC plot, intended for effective communication of moderate to large meta-analyses (k = 10, 20, 50, 100 studies). We discuss the design principles of the MARC plot, grounded in the data visualization and cognitive science literature. We then present the methods and results of a randomized survey experiment to evaluate the revised MARC plot in comparison to the original MARC plot, the forest plot, and a bar plot. We find that the revised MARC plot is more effective for communicating moderate to large meta-analyses to non-research audiences, offering a 0.30, 0.34, and 1.07 standard deviation improvement in chart users' scores compared to the original MARC plot, forest plot, and bar plot, respectively.

荟萃分析的结果不仅对研究人员感兴趣,而且通常对政策制定者和其他决策者(例如,在教育和医学领域)感兴趣,可视化在向更广泛的公众传达数据和统计证据方面发挥着重要作用。因此,元分析可视化的潜在受众是广泛的。然而,最常见的元分析可视化—森林图—使用非最佳设计原则,这些原则与数据可视化最佳实践不一致,并且依赖于不太可能为广大受众所熟悉的统计知识和惯例。以前,在向教育从业者传达小型元分析结果时,元分析雨云(MARC)图已被证明是森林图的有效替代方案。然而,最初的MARC情节设计并不适合具有大量效应量的元分析,这在社会科学中很常见。本文提出了MARC图的扩展,旨在有效沟通中等到大型荟萃分析(k = 10,20,50,100项研究)。我们在数据可视化和认知科学文献的基础上讨论了MARC图的设计原则。然后,我们提出了一项随机调查实验的方法和结果,将修改后的MARC图与原始MARC图、森林图和柱状图进行比较。我们发现,修订后的MARC图更有效地向非研究受众传达中大型元分析,与原始MARC图、森林图和柱状图相比,图表用户的得分分别提高了0.30、0.34和1.07个标准差。
{"title":"Meta-analytic rain cloud plots: Improving evidence communication through data visualization design principles.","authors":"Kaitlyn G Fitzgerald, David Khella, Avery Charles, Elizabeth Tipton","doi":"10.1017/rsm.2025.4","DOIUrl":"10.1017/rsm.2025.4","url":null,"abstract":"<p><p>Results of meta-analyses are of interest not only to researchers but often to policy-makers and other decision-makers (e.g., in education and medicine), and visualizations play an important role in communicating data and statistical evidence to the broader public. Therefore, the potential audience of meta-analytic visualizations is broad. However, the most common meta-analytic visualization - the forest plot - uses non-optimal design principles that do not align with data visualization best practices and relies on statistical knowledge and conventions not likely to be familiar to a broad audience. Previously, the Meta-Analytic Rain Cloud (MARC) plot has been shown to be an effective alternative to a forest plot when communicating the results of a small meta-analysis to education practitioners. However, the original MARC plot design was not well-suited for meta-analyses with large numbers of effect sizes as is common across the social sciences. This paper presents an extension of the MARC plot, intended for effective communication of moderate to large meta-analyses (<i>k</i> = 10, 20, 50, 100 studies). We discuss the design principles of the MARC plot, grounded in the data visualization and cognitive science literature. We then present the methods and results of a randomized survey experiment to evaluate the revised MARC plot in comparison to the original MARC plot, the forest plot, and a bar plot. We find that the revised MARC plot is more effective for communicating moderate to large meta-analyses to non-research audiences, offering a 0.30, 0.34, and 1.07 standard deviation improvement in chart users' scores compared to the original MARC plot, forest plot, and bar plot, respectively.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"364-382"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527513/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Research Synthesis Methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1