首页 > 最新文献

Research Synthesis Methods最新文献

英文 中文
Quantitative bias analysis for unmeasured confounding in unanchored population-adjusted indirect comparisons. 非锚定人口调整间接比较中未测量混杂的定量偏倚分析。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.13
Shijie Ren, Sa Ren, Nicky J Welton, Mark Strong

Unanchored population-adjusted indirect comparisons (PAICs) such as matching-adjusted indirect comparison (MAIC) and simulated treatment comparison (STC) attracted a significant attention in the health technology assessment field in recent years. These methods allow for indirect comparisons by balancing different patient characteristics in single-arm studies in the case where individual patient-level data are only available for one study. However, the validity of findings from unanchored MAIC/STC analyses is frequently questioned by decision makers, due to the assumption that all potential prognostic factors and effect modifiers are accounted for. Addressing this critical concern, we introduce a sensitivity analysis algorithm for unanchored PAICs by extending quantitative bias analysis techniques traditionally used in epidemiology. Our proposed sensitivity analysis involves simulating important covariates that were not reported by the comparator study when conducting unanchored STC and enables the formal evaluating of the impact of unmeasured confounding in a quantitative manner without additional assumptions. We demonstrate the practical application of this method through a real-world case study of metastatic colorectal cancer, highlighting its utility in enhancing the robustness and credibility of unanchored PAIC results. Our findings emphasise the necessity of formal quantitative sensitivity analysis in interpreting unanchored PAIC results, as it quantifies the robustness of conclusions regarding potential unmeasured confounders and supports more robust, reliable, and informative decision-making in healthcare.

近年来,非锚定人口调整间接比较(PAICs)如匹配调整间接比较(MAIC)和模拟治疗比较(STC)在卫生技术评估领域引起了广泛关注。这些方法允许通过平衡单臂研究中的不同患者特征来进行间接比较,而单个患者水平的数据仅可用于一项研究。然而,非锚定的MAIC/STC分析结果的有效性经常受到决策者的质疑,因为假设所有潜在的预后因素和影响修饰因素都被考虑在内。为了解决这一关键问题,我们通过扩展流行病学中传统使用的定量偏差分析技术,引入了一种针对非锚定PAICs的敏感性分析算法。我们提出的敏感性分析包括在进行非锚定STC时模拟比较研究未报告的重要协变量,并且能够在没有额外假设的情况下以定量方式正式评估未测量混杂的影响。我们通过转移性结直肠癌的真实案例研究展示了该方法的实际应用,强调了其在提高无锚定pac结果的稳健性和可信度方面的实用性。我们的研究结果强调了正式定量敏感性分析在解释非锚定pac结果时的必要性,因为它量化了关于潜在未测量混杂因素的结论的稳健性,并支持医疗保健中更稳健、可靠和信息丰富的决策。
{"title":"Quantitative bias analysis for unmeasured confounding in unanchored population-adjusted indirect comparisons.","authors":"Shijie Ren, Sa Ren, Nicky J Welton, Mark Strong","doi":"10.1017/rsm.2025.13","DOIUrl":"10.1017/rsm.2025.13","url":null,"abstract":"<p><p>Unanchored population-adjusted indirect comparisons (PAICs) such as matching-adjusted indirect comparison (MAIC) and simulated treatment comparison (STC) attracted a significant attention in the health technology assessment field in recent years. These methods allow for indirect comparisons by balancing different patient characteristics in single-arm studies in the case where individual patient-level data are only available for one study. However, the validity of findings from unanchored MAIC/STC analyses is frequently questioned by decision makers, due to the assumption that all potential prognostic factors and effect modifiers are accounted for. Addressing this critical concern, we introduce a sensitivity analysis algorithm for unanchored PAICs by extending quantitative bias analysis techniques traditionally used in epidemiology. Our proposed sensitivity analysis involves simulating important covariates that were not reported by the comparator study when conducting unanchored STC and enables the formal evaluating of the impact of unmeasured confounding in a quantitative manner without additional assumptions. We demonstrate the practical application of this method through a real-world case study of metastatic colorectal cancer, highlighting its utility in enhancing the robustness and credibility of unanchored PAIC results. Our findings emphasise the necessity of formal quantitative sensitivity analysis in interpreting unanchored PAIC results, as it quantifies the robustness of conclusions regarding potential unmeasured confounders and supports more robust, reliable, and informative decision-making in healthcare.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"509-527"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527536/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synthesis of depression outcomes reported on different scales: A comparison of methods for modelling mean differences. 不同量表报告的抑郁结果的综合:建模平均差异方法的比较。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.7
Beatrice C Downing, Nicky J Welton, Hugo Pedder, Ifigeneia Mavranezouli, Odette Megnin-Viggars, A E Ades

Several methods have been proposed for the synthesis of continuous outcomes reported on different scales, including the Standardised Mean Difference (SMD) and the Ratio of Means (RoM). SMDs can be formed by dividing the study mean treatment effect either by a study-specific (Study-SMD) or a scale-specific (Scale-SMD) standard deviation (SD). We compared the performance of RoM to the different standardisation methods with and without meta-regression (MR) on baseline severity, in a Bayesian network meta-analysis (NMA) of 14 treatments for depression, reported on five different scales. There was substantial between-study variation in the SDs reported on the same scale. Based on the Deviance Information Criterion, RoM was preferred as having better model fit than the SMD models. Model fit for SMD models was not improved with meta-regression. Percentage shrinkage was used as a scale-independent measure with higher % shrinkage indicating lower heterogeneity. Heterogeneity was lowest for RoM (20.5% shrinkage), then Scale-SMD (18.2% shrinkage), and highest for Study-SMD (16.7% shrinkage). Model choice impacted which treatment was estimated to be most effective. However, all models picked out the same three highest-ranked treatments using the GRADE criteria. Alongside other indicators, higher shrinkage of RoM models suggests that treatments for depression act multiplicatively rather than additively. Further research is needed to determine whether these findings extend to Patient- and Clinician-Reported Outcomes used in other application areas. Where treatment effects are additive, we recommend using Scale-SMD for standardisation to avoid the additional heterogeneity introduced by Study-SMD.

已经提出了几种方法来综合不同尺度上报告的连续结果,包括标准化平均差(SMD)和均值比(RoM)。通过将研究平均治疗效果除以特定研究(study- smd)或特定量表(Scale-SMD)标准偏差(SD),可以形成标准差。在贝叶斯网络荟萃分析(NMA)中,我们在5个不同的量表上报告了14种抑郁症治疗方法,比较了RoM在基线严重程度上使用和不使用meta回归(MR)的不同标准化方法的表现。在同一量表上报告的SDs在研究间存在大量差异。基于偏差信息准则,RoM模型比SMD模型具有更好的模型拟合。meta回归并没有改善SMD模型的拟合。收缩率被用作一个尺度无关的措施,较高的收缩率表明较低的异质性。RoM的异质性最低(收缩20.5%),然后是Scale-SMD(收缩18.2%),Study-SMD的异质性最高(收缩16.7%)。模型选择影响了哪种治疗被认为是最有效的。然而,使用GRADE标准,所有模型都选择了相同的三个排名最高的治疗方法。与其他指标一样,RoM模型的高收缩率表明,抑郁症的治疗是乘数性的,而不是加法性的。需要进一步的研究来确定这些发现是否延伸到其他应用领域的患者和临床报告的结果。如果治疗效果是累加性的,我们建议使用Scale-SMD进行标准化,以避免Study-SMD引入的额外异质性。
{"title":"Synthesis of depression outcomes reported on different scales: A comparison of methods for modelling mean differences.","authors":"Beatrice C Downing, Nicky J Welton, Hugo Pedder, Ifigeneia Mavranezouli, Odette Megnin-Viggars, A E Ades","doi":"10.1017/rsm.2025.7","DOIUrl":"10.1017/rsm.2025.7","url":null,"abstract":"<p><p>Several methods have been proposed for the synthesis of continuous outcomes reported on different scales, including the Standardised Mean Difference (SMD) and the Ratio of Means (RoM). SMDs can be formed by dividing the study mean treatment effect either by a study-specific (Study-SMD) or a scale-specific (Scale-SMD) standard deviation (SD). We compared the performance of RoM to the different standardisation methods with and without meta-regression (MR) on baseline severity, in a Bayesian network meta-analysis (NMA) of 14 treatments for depression, reported on five different scales. There was substantial between-study variation in the SDs reported on the same scale. Based on the Deviance Information Criterion, RoM was preferred as having better model fit than the SMD models. Model fit for SMD models was not improved with meta-regression. Percentage shrinkage was used as a scale-independent measure with higher % shrinkage indicating lower heterogeneity. Heterogeneity was lowest for RoM (20.5% shrinkage), then Scale-SMD (18.2% shrinkage), and highest for Study-SMD (16.7% shrinkage). Model choice impacted which treatment was estimated to be most effective. However, all models picked out the same three highest-ranked treatments using the GRADE criteria. Alongside other indicators, higher shrinkage of RoM models suggests that treatments for depression act multiplicatively rather than additively. Further research is needed to determine whether these findings extend to Patient- and Clinician-Reported Outcomes used in other application areas. Where treatment effects are additive, we recommend using Scale-SMD for standardisation to avoid the additional heterogeneity introduced by Study-SMD.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"460-478"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527519/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel robust meta-analysis model using the t distribution for outlier accommodation and detection. 使用t分布进行离群调节和检测的新颖稳健元分析模型。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.8
Yue Wang, Jianhua Zhao, Fen Jiang, Lei Shi, Jianxin Pan

Random effects meta-analysis model is an important tool for integrating results from multiple independent studies. However, the standard model is based on the assumption of normal distributions for both random effects and within-study errors, making it susceptible to outlying studies. Although robust modeling using the t distribution is an appealing idea, the existing work, that explores the use of the t distribution only for random effects, involves complicated numerical integration and numerical optimization. In this article, a novel robust meta-analysis model using the t distribution is proposed (tMeta). The novelty is that the marginal distribution of the effect size in tMeta follows the t distribution, enabling that tMeta can simultaneously accommodate and detect outlying studies in a simple and adaptive manner. A simple and fast EM-type algorithm is developed for maximum likelihood estimation. Due to the mathematical tractability of the t distribution, tMeta frees from numerical integration and allows for efficient optimization. Experiments on real data demonstrate that tMeta is compared favorably with related competitors in situations involving mild outliers. Moreover, in the presence of gross outliers, while related competitors may fail, tMeta continues to perform consistently and robustly.

随机效应荟萃分析模型是整合多个独立研究结果的重要工具。然而,对于随机效应和研究内误差,标准模型是基于正态分布的假设,这使得它容易受到外围研究的影响。虽然使用t分布的稳健建模是一个吸引人的想法,但现有的工作,探索使用t分布的随机效应,涉及复杂的数值积分和数值优化。在这篇文章中,我们提出了一种新的使用t分布的稳健元分析模型(tMeta)。新颖之处在于tMeta中效应大小的边际分布遵循t分布,使得tMeta能够以一种简单和自适应的方式同时容纳和检测外围研究。提出了一种简单快速的em型最大似然估计算法。由于t分布的数学可追溯性,tMeta从数值积分中解脱出来,并允许有效的优化。实际数据实验表明,在轻度异常值的情况下,tMeta优于相关竞争对手。此外,在总异常值存在的情况下,虽然相关竞争对手可能会失败,但tMeta将继续保持稳定和强劲的表现。
{"title":"A novel robust meta-analysis model using the <i>t</i> distribution for outlier accommodation and detection.","authors":"Yue Wang, Jianhua Zhao, Fen Jiang, Lei Shi, Jianxin Pan","doi":"10.1017/rsm.2025.8","DOIUrl":"10.1017/rsm.2025.8","url":null,"abstract":"<p><p>Random effects meta-analysis model is an important tool for integrating results from multiple independent studies. However, the standard model is based on the assumption of normal distributions for both random effects and within-study errors, making it susceptible to outlying studies. Although robust modeling using the <i>t</i> distribution is an appealing idea, the existing work, that explores the use of the <i>t</i> distribution only for random effects, involves complicated numerical integration and numerical optimization. In this article, a novel robust meta-analysis model using the <i>t</i> distribution is proposed (<i>t</i>Meta). The novelty is that the marginal distribution of the effect size in <i>t</i>Meta follows the <i>t</i> distribution, enabling that <i>t</i>Meta can simultaneously accommodate and detect outlying studies in a simple and adaptive manner. A simple and fast EM-type algorithm is developed for maximum likelihood estimation. Due to the mathematical tractability of the <i>t</i> distribution, <i>t</i>Meta frees from numerical integration and allows for efficient optimization. Experiments on real data demonstrate that <i>t</i>Meta is compared favorably with related competitors in situations involving mild outliers. Moreover, in the presence of gross outliers, while related competitors may fail, <i>t</i>Meta continues to perform consistently and robustly.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"442-459"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527545/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CausalMetaR: An R package for performing causally interpretable meta-analyses. CausalMetaR:一个R包,用于执行因果关系可解释的元分析。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-12 DOI: 10.1017/rsm.2025.5
Guanbo Wang, Sean McGrath, Yi Lian

Researchers would often like to leverage data from a collection of sources (e.g., meta-analyses of randomized trials, multi-center trials, pooled analyses of observational cohorts) to estimate causal effects in a target population of interest. However, because different data sources typically represent different underlying populations, traditional meta-analytic methods may not produce causally interpretable estimates that apply to any reasonable target population. In this article, we present the CausalMetaR R package, which implements robust and efficient methods to estimate causal effects in a given internal or external target population using multi-source data. The package includes estimators of average and subgroup treatment effects for the entire target population. To produce efficient and robust estimates of causal effects, the package implements doubly robust and non-parametric efficient estimators and supports using flexible data-adaptive (e.g., machine learning techniques) methods and cross-fitting techniques to estimate the nuisance models (e.g., the treatment model, the outcome model). We briefly review the methods, describe the key features of the package, and demonstrate how to use the package through an example. The package aims to facilitate causal analyses in the context of meta-analysis.

研究人员通常喜欢利用来自各种来源的数据(例如,随机试验的荟萃分析,多中心试验,观察队列的汇总分析)来估计目标人群的因果效应。然而,由于不同的数据源通常代表不同的潜在人群,传统的元分析方法可能无法产生适用于任何合理目标人群的因果关系可解释的估计。在本文中,我们介绍了CausalMetaR R包,它实现了鲁棒和有效的方法,可以使用多源数据估计给定内部或外部目标群体中的因果效应。该方案包括对整个目标人群的平均治疗效果和亚组治疗效果的估计。为了产生有效和稳健的因果效应估计,该软件包实现了双稳健和非参数有效估计器,并支持使用灵活的数据自适应(例如,机器学习技术)方法和交叉拟合技术来估计有害模型(例如,治疗模型,结果模型)。我们简要回顾了这些方法,描述了包的主要特性,并通过一个示例演示了如何使用包。该包旨在促进元分析背景下的因果分析。
{"title":"CausalMetaR: An R package for performing causally interpretable meta-analyses.","authors":"Guanbo Wang, Sean McGrath, Yi Lian","doi":"10.1017/rsm.2025.5","DOIUrl":"10.1017/rsm.2025.5","url":null,"abstract":"<p><p>Researchers would often like to leverage data from a collection of sources (e.g., meta-analyses of randomized trials, multi-center trials, pooled analyses of observational cohorts) to estimate causal effects in a target population of interest. However, because different data sources typically represent different underlying populations, traditional meta-analytic methods may not produce causally interpretable estimates that apply to any reasonable target population. In this article, we present the CausalMetaR R package, which implements robust and efficient methods to estimate causal effects in a given internal or external target population using multi-source data. The package includes estimators of average and subgroup treatment effects for the entire target population. To produce efficient and robust estimates of causal effects, the package implements doubly robust and non-parametric efficient estimators and supports using flexible data-adaptive (e.g., machine learning techniques) methods and cross-fitting techniques to estimate the nuisance models (e.g., the treatment model, the outcome model). We briefly review the methods, describe the key features of the package, and demonstrate how to use the package through an example. The package aims to facilitate causal analyses in the context of meta-analysis.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"425-440"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527535/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Federated Inference for regression models based on non-shared medical center data. 基于非共享医疗中心数据的贝叶斯联邦推理回归模型。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-10 DOI: 10.1017/rsm.2025.6
Marianne A Jonker, Hassan Pazira, Anthony C C Coolen

To estimate accurately the parameters of a regression model, the sample size must be large enough relative to the number of possible predictors for the model. In practice, sufficient data is often lacking, which can lead to overfitting of the model and, as a consequence, unreliable predictions of the outcome of new patients. Pooling data from different data sets collected in different (medical) centers would alleviate this problem, but is often not feasible due to privacy regulation or logistic problems. An alternative route would be to analyze the local data in the centers separately and combine the statistical inference results with the Bayesian Federated Inference (BFI) methodology. The aim of this approach is to compute from the inference results in separate centers what would have been found if the statistical analysis was performed on the combined data. We explain the methodology under homogeneity and heterogeneity across the populations in the separate centers, and give real life examples for better understanding. Excellent performance of the proposed methodology is shown. An R-package to do all the calculations has been developed and is illustrated in this article. The mathematical details are given in the Appendix.

为了准确地估计回归模型的参数,样本量必须相对于模型可能的预测因子的数量足够大。在实践中,往往缺乏足够的数据,这可能导致模型的过度拟合,从而导致对新患者预后的预测不可靠。汇集来自不同(医疗)中心收集的不同数据集的数据可以缓解这一问题,但由于隐私法规或后勤问题,通常不可行。另一种方法是分别分析中心的本地数据,并将统计推断结果与贝叶斯联邦推断(BFI)方法结合起来。这种方法的目的是从不同中心的推断结果中计算如果对组合数据进行统计分析会发现什么。我们解释了在不同中心人群的同质性和异质性下的方法,并给出了现实生活中的例子,以便更好地理解。结果表明,该方法具有良好的性能。已经开发了一个r包来完成所有的计算,本文将对此进行说明。数学细节在附录中给出。
{"title":"Bayesian Federated Inference for regression models based on non-shared medical center data.","authors":"Marianne A Jonker, Hassan Pazira, Anthony C C Coolen","doi":"10.1017/rsm.2025.6","DOIUrl":"10.1017/rsm.2025.6","url":null,"abstract":"<p><p>To estimate accurately the parameters of a regression model, the sample size must be large enough relative to the number of possible predictors for the model. In practice, sufficient data is often lacking, which can lead to overfitting of the model and, as a consequence, unreliable predictions of the outcome of new patients. Pooling data from different data sets collected in different (medical) centers would alleviate this problem, but is often not feasible due to privacy regulation or logistic problems. An alternative route would be to analyze the local data in the centers separately and combine the statistical inference results with the Bayesian Federated Inference (BFI) methodology. The aim of this approach is to compute from the inference results in separate centers what would have been found if the statistical analysis was performed on the combined data. We explain the methodology under homogeneity and heterogeneity across the populations in the separate centers, and give real life examples for better understanding. Excellent performance of the proposed methodology is shown. An R-package to do all the calculations has been developed and is illustrated in this article. The mathematical details are given in the Appendix.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"383-423"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527543/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive systematic review dataset is a rich resource for training and evaluation of AI systems for title and abstract screening. 一个全面的系统评论数据集是训练和评估人工智能系统进行标题和摘要筛选的丰富资源。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-07 DOI: 10.1017/rsm.2025.1
Gary C K Chan, Estrid He, Janni Leung, Karin Verspoor

When conducting a systematic review, screening the vast body of literature to identify the small set of relevant studies is a labour-intensive and error-prone process. Although there is an increasing number of fully automated tools for screening, their performance is suboptimal and varies substantially across review topic areas. Many of these tools are only trained on small datasets, and most are not tested on a wide range of review topic areas. This study presents two systematic review datasets compiled from more than 8600 systematic reviews and more than 540000 abstracts covering 51 research topic areas in health and medical research. These datasets are the largest of their kinds to date. We demonstrate their utility in training and evaluating language models for title and abstract screening. Our dataset includes detailed metadata of each review, including title, background, objectives and selection criteria. We demonstrated that a small language model trained on this dataset with additional metadata has excellent performance with an average recall above 95% and specificity over 70% across a wide range of review topic areas. Future research can build on our dataset to further improve the performance of fully automated tools for systematic review title and abstract screening.

在进行系统综述时,筛选大量文献以确定一小部分相关研究是一个劳动密集型且容易出错的过程。尽管有越来越多的全自动筛选工具,但它们的性能不是最优的,并且在审查主题领域之间变化很大。这些工具中的许多只在小数据集上进行了训练,并且大多数都没有在广泛的审查主题领域进行测试。本研究提供了两个系统综述数据集,其中包括8600多篇系统综述和54万多篇摘要,涵盖卫生和医学研究的51个研究主题领域。这些数据集是迄今为止同类数据集中最大的。我们展示了它们在训练和评估标题和摘要筛选语言模型中的效用。我们的数据集包括每篇综述的详细元数据,包括标题、背景、目标和选择标准。我们证明了在此数据集上训练的具有额外元数据的小型语言模型具有出色的性能,在广泛的审查主题领域中,平均召回率超过95%,特异性超过70%。未来的研究可以建立在我们的数据集上,进一步提高系统综述标题和摘要筛选的全自动工具的性能。
{"title":"A comprehensive systematic review dataset is a rich resource for training and evaluation of AI systems for title and abstract screening.","authors":"Gary C K Chan, Estrid He, Janni Leung, Karin Verspoor","doi":"10.1017/rsm.2025.1","DOIUrl":"10.1017/rsm.2025.1","url":null,"abstract":"<p><p>When conducting a systematic review, screening the vast body of literature to identify the small set of relevant studies is a labour-intensive and error-prone process. Although there is an increasing number of fully automated tools for screening, their performance is suboptimal and varies substantially across review topic areas. Many of these tools are only trained on small datasets, and most are not tested on a wide range of review topic areas. This study presents two systematic review datasets compiled from more than 8600 systematic reviews and more than 540000 abstracts covering 51 research topic areas in health and medical research. These datasets are the largest of their kinds to date. We demonstrate their utility in training and evaluating language models for title and abstract screening. Our dataset includes detailed metadata of each review, including title, background, objectives and selection criteria. We demonstrated that a small language model trained on this dataset with additional metadata has excellent performance with an average recall above 95% and specificity over 70% across a wide range of review topic areas. Future research can build on our dataset to further improve the performance of fully automated tools for systematic review title and abstract screening.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"308-322"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527522/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Network meta-analysis made simple: A composite likelihood approach. 网络荟萃分析简单:复合似然方法。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-17 DOI: 10.1017/rsm.2024.12
Yu-Lun Liu, Bingyu Zhang, Haitao Chu, Yong Chen

Network meta-analysis (NMA), also known as mixed treatment comparison meta-analysis or multiple treatments meta-analysis, extends conventional pairwise meta-analysis by simultaneously synthesizing multiple interventions in a single integrated analysis. Despite the growing popularity of NMA within comparative effectiveness research, it comes with potential challenges. For example, within-study correlations among treatment comparisons are rarely reported in the published literature. Yet, these correlations are pivotal for valid statistical inference. As demonstrated in earlier studies, ignoring these correlations can inflate mean squared errors of the resulting point estimates and lead to inaccurate standard error estimates. This article introduces a composite likelihood-based approach that ensures accurate statistical inference without requiring knowledge of the within-study correlations. The proposed method is computationally robust and efficient, with substantially reduced computational time compared to the state-of-the-science methods implemented in R packages. The proposed method was evaluated through extensive simulations and applied to two important applications including an NMA comparing interventions for primary open-angle glaucoma, and another comparing treatments for chronic prostatitis and chronic pelvic pain syndrome.

网络荟萃分析(NMA),也被称为混合治疗比较荟萃分析或多种治疗荟萃分析,通过在单一综合分析中同时综合多种干预措施,扩展了传统的两两荟萃分析。尽管NMA在比较有效性研究中越来越受欢迎,但它也带来了潜在的挑战。例如,在已发表的文献中很少报道治疗比较之间的研究内相关性。然而,这些相关性对于有效的统计推断至关重要。正如在早期的研究中所证明的那样,忽略这些相关性会使所得点估计的均方误差增大,并导致不准确的标准误差估计。本文介绍了一种基于复合似然的方法,该方法确保了准确的统计推断,而不需要了解研究内部的相关性。所提出的方法具有计算鲁棒性和效率,与在R包中实现的最新方法相比,大大减少了计算时间。该方法通过广泛的模拟进行评估,并应用于两个重要的应用,包括NMA比较原发性开角型青光眼的干预措施,以及另一个比较慢性前列腺炎和慢性盆腔疼痛综合征的治疗方法。
{"title":"Network meta-analysis made simple: A composite likelihood approach.","authors":"Yu-Lun Liu, Bingyu Zhang, Haitao Chu, Yong Chen","doi":"10.1017/rsm.2024.12","DOIUrl":"10.1017/rsm.2024.12","url":null,"abstract":"<p><p>Network meta-analysis (NMA), also known as mixed treatment comparison meta-analysis or multiple treatments meta-analysis, extends conventional pairwise meta-analysis by simultaneously synthesizing multiple interventions in a single integrated analysis. Despite the growing popularity of NMA within comparative effectiveness research, it comes with potential challenges. For example, within-study correlations among treatment comparisons are rarely reported in the published literature. Yet, these correlations are pivotal for valid statistical inference. As demonstrated in earlier studies, ignoring these correlations can inflate mean squared errors of the resulting point estimates and lead to inaccurate standard error estimates. This article introduces a composite likelihood-based approach that ensures accurate statistical inference without requiring knowledge of the within-study correlations. The proposed method is computationally robust and efficient, with substantially reduced computational time compared to the state-of-the-science methods implemented in R packages. The proposed method was evaluated through extensive simulations and applied to two important applications including an NMA comparing interventions for primary open-angle glaucoma, and another comparing treatments for chronic prostatitis and chronic pelvic pain syndrome.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"272-290"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527528/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Federated Inference for regression models based on non-shared medical center data - ERRATUM. 基于非共享医疗中心数据的贝叶斯联邦推理回归模型-勘误。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 DOI: 10.1017/rsm.2025.23
Marianne A Jonker, Hassan Pazira, Anthony C C Coolen
{"title":"Bayesian Federated Inference for regression models based on non-shared medical center data - ERRATUM.","authors":"Marianne A Jonker, Hassan Pazira, Anthony C C Coolen","doi":"10.1017/rsm.2025.23","DOIUrl":"10.1017/rsm.2025.23","url":null,"abstract":"","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"424"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527526/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CausalMetaR: An R package for performing causally interpretable meta-analyses - ERRATUM. 一个R软件包,用于执行因果关系可解释的元分析-勘误。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 DOI: 10.1017/rsm.2025.22
Guanbo Wang, Sean McGrath, Yi Lian
{"title":"CausalMetaR: An R package for performing causally interpretable meta-analyses - ERRATUM.","authors":"Guanbo Wang, Sean McGrath, Yi Lian","doi":"10.1017/rsm.2025.22","DOIUrl":"10.1017/rsm.2025.22","url":null,"abstract":"","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"441"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527490/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ZIBGLMM: Zero-inflated bivariate generalized linear mixed model for meta-analysis with double-zero-event studies. 双零事件研究的元分析的零膨胀双变量广义线性混合模型。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-21 DOI: 10.1017/rsm.2024.4
Lu Li, Lifeng Lin, Joseph C Cappelleri, Haitao Chu, Yong Chen

Double-zero-event studies (DZS) pose a challenge for accurately estimating the overall treatment effect in meta-analysis (MA). Current approaches, such as continuity correction or omission of DZS, are commonly employed, yet these ad hoc methods can yield biased conclusions. Although the standard bivariate generalized linear mixed model (BGLMM) can accommodate DZS, it fails to address the potential systemic differences between DZS and other studies. In this article, we propose a zero-inflated bivariate generalized linear mixed model (ZIBGLMM) to tackle this issue. This two-component finite mixture model includes zero inflation for a subpopulation with negligible or extremely low risk. We develop both frequentist and Bayesian versions of ZIBGLMM and examine its performance in estimating risk ratios against the BGLMM and conventional two-stage MA that excludes DZS. Through extensive simulation studies and real-world MA case studies, we demonstrate that ZIBGLMM outperforms the BGLMM and conventional two-stage MA that excludes DZS in estimating the true effect size with substantially less bias and comparable coverage probability.

双零事件研究(DZS)在meta分析(MA)中对准确估计整体治疗效果提出了挑战。目前的方法,如连续性校正或遗漏DZS,通常被采用,但这些临时方法可能产生有偏见的结论。虽然标准的二元广义线性混合模型(BGLMM)可以容纳DZS,但它无法解决DZS与其他研究之间潜在的系统性差异。在本文中,我们提出了一个零膨胀的二元广义线性混合模型(ZIBGLMM)来解决这个问题。这种双组分有限混合模型对可忽略不计或极低风险的亚群体包括零膨胀。我们开发了频率和贝叶斯版本的ZIBGLMM,并检查了它在估计风险比方面的性能,而不是BGLMM和排除DZS的传统两阶段MA。通过广泛的模拟研究和现实世界的MA案例研究,我们证明ZIBGLMM在估计真实效应大小方面优于BGLMM和排除DZS的传统两阶段MA,偏差大大减少,覆盖概率相当。
{"title":"ZIBGLMM: Zero-inflated bivariate generalized linear mixed model for meta-analysis with double-zero-event studies.","authors":"Lu Li, Lifeng Lin, Joseph C Cappelleri, Haitao Chu, Yong Chen","doi":"10.1017/rsm.2024.4","DOIUrl":"10.1017/rsm.2024.4","url":null,"abstract":"<p><p>Double-zero-event studies (DZS) pose a challenge for accurately estimating the overall treatment effect in meta-analysis (MA). Current approaches, such as continuity correction or omission of DZS, are commonly employed, yet these ad hoc methods can yield biased conclusions. Although the standard bivariate generalized linear mixed model (BGLMM) can accommodate DZS, it fails to address the potential systemic differences between DZS and other studies. In this article, we propose a zero-inflated bivariate generalized linear mixed model (ZIBGLMM) to tackle this issue. This two-component finite mixture model includes zero inflation for a subpopulation with negligible or extremely low risk. We develop both frequentist and Bayesian versions of ZIBGLMM and examine its performance in estimating risk ratios against the BGLMM and conventional two-stage MA that excludes DZS. Through extensive simulation studies and real-world MA case studies, we demonstrate that ZIBGLMM outperforms the BGLMM and conventional two-stage MA that excludes DZS in estimating the true effect size with substantially less bias and comparable coverage probability.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"251-271"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527523/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Research Synthesis Methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1