首页 > 最新文献

Research Synthesis Methods最新文献

英文 中文
A critical assessment of matching-adjusted indirect comparisons in relation to target populations. 对与目标人群相关的匹配调整间接比较的关键评估。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.10
Ziren Jiang, Jialing Liu, Demissie Alemayehu, Joseph C Cappelleri, Devin Abrahami, Yong Chen, Haitao Chu

Matching-adjusted indirect comparison (MAIC) has been increasingly applied in health technology assessments (HTA). By reweighting subjects from a trial with individual participant data (IPD) to match the summary statistics of covariates in another trial with aggregate data (AgD), MAIC enables a comparison of the interventions for the AgD trial population. However, when there are imbalances in effect modifiers with different magnitudes of modification across treatments, contradictory conclusions may arise if MAIC is performed with the IPD and AgD swapped between trials. This can lead to the "MAIC paradox," where different entities reach opposing conclusions about which treatment is more effective, despite analyzing the same data. In this paper, we use synthetic data to illustrate this paradox and emphasize the importance of clearly defining the target population in HTA submissions. Additionally, we recommend making de-identified IPD available to HTA agencies, enabling further indirect comparisons that better reflect the overall population represented by both IPD and AgD trials, as well as other relevant target populations for policy decisions. This would help ensure more accurate and consistent assessments of comparative effectiveness.

匹配调整间接比较(MAIC)在卫生技术评价(HTA)中的应用越来越广泛。MAIC通过对具有个体参与者数据(IPD)的试验中的受试者重新加权,使其与具有总体数据(AgD)的另一项试验的协变量汇总统计数据相匹配,从而能够对AgD试验人群的干预措施进行比较。然而,当不同治疗的效果调节剂不平衡时,如果在试验之间交换IPD和AgD进行MAIC,可能会产生矛盾的结论。这可能导致“MAIC悖论”,即尽管分析了相同的数据,但不同的实体对哪种治疗更有效得出了相反的结论。在本文中,我们使用综合数据来说明这一悖论,并强调在HTA提交中明确定义目标人群的重要性。此外,我们建议HTA机构可以使用去识别的IPD,以便进一步进行间接比较,更好地反映IPD和AgD试验所代表的总体人群,以及政策决策的其他相关目标人群。这将有助于确保对相对效力进行更准确和一致的评估。
{"title":"A critical assessment of matching-adjusted indirect comparisons in relation to target populations.","authors":"Ziren Jiang, Jialing Liu, Demissie Alemayehu, Joseph C Cappelleri, Devin Abrahami, Yong Chen, Haitao Chu","doi":"10.1017/rsm.2025.10","DOIUrl":"10.1017/rsm.2025.10","url":null,"abstract":"<p><p>Matching-adjusted indirect comparison (MAIC) has been increasingly applied in health technology assessments (HTA). By reweighting subjects from a trial with individual participant data (IPD) to match the summary statistics of covariates in another trial with aggregate data (AgD), MAIC enables a comparison of the interventions for the AgD trial population. However, when there are imbalances in effect modifiers with different magnitudes of modification across treatments, contradictory conclusions may arise if MAIC is performed with the IPD and AgD swapped between trials. This can lead to the \"MAIC paradox,\" where different entities reach opposing conclusions about which treatment is more effective, despite analyzing the same data. In this paper, we use synthetic data to illustrate this paradox and emphasize the importance of clearly defining the target population in HTA submissions. Additionally, we recommend making de-identified IPD available to HTA agencies, enabling further indirect comparisons that better reflect the overall population represented by both IPD and AgD trials, as well as other relevant target populations for policy decisions. This would help ensure more accurate and consistent assessments of comparative effectiveness.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"569-574"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527493/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sensitivity analysis for reporting bias on the time-dependent summary receiver operating characteristics curve in meta-analysis of prognosis studies with time-to-event outcomes. 在具有时间-事件结局预后研究的meta分析中,报告偏倚对时间相关的总接受者工作特征曲线的敏感性分析。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.14
Yi Zhou, Ao Huang, Satoshi Hattori

In prognosis studies with time-to-event outcomes, the survivals of groups with high/low biomarker expression are often estimated by the Kaplan-Meier method, and the difference between groups is measured by the hazard ratios (HRs). Since the high/low expressions are usually determined by study-specific cutoff values, synthesizing only HRs for summarizing the prognostic capacity of a biomarker brings heterogeneity in the meta-analysis. The time-dependent summary receiver operating characteristics (SROC) curve was proposed as a cutoff-free summary of the prognostic capacity, extended from the SROC curve in meta-analysis of diagnostic studies. However, estimates of the time-dependent SROC curve may be threatened by reporting bias in that studies with significant outcomes, such as HRs, are more likely to be published and selected in meta-analyses. Under this conjecture, this paper proposes a sensitivity analysis method for quantifying and adjusting reporting bias on the time-dependent SROC curve. We model the publication process determined by the significance of the HRs and introduce a sensitivity analysis method based on the conditional likelihood constrained by some expected proportions of published studies. Simulation studies showed that the proposed method could reduce reporting bias given the correctly-specified marginal selection probability. The proposed method is illustrated on the real-world meta-analysis of Ki67 for breast cancer.

在具有时间到事件结果的预后研究中,生物标志物高/低表达组的存活率通常通过Kaplan-Meier方法估计,组间差异通过风险比(hr)来衡量。由于高/低表达通常由研究特定的截止值决定,因此仅合成hr来总结生物标志物的预后能力会在meta分析中带来异质性。时间相关的总接受者工作特征(SROC)曲线被提出作为预后能力的无截止总结,从诊断研究的meta分析中的SROC曲线扩展而来。然而,对时间相关SROC曲线的估计可能会受到报告偏倚的威胁,因为具有显著结果(如hr)的研究更有可能被发表并被荟萃分析选中。在此假设下,本文提出了一种量化和调整随时间变化的SROC曲线报告偏差的敏感性分析方法。我们建立了由hr的显著性决定的发表过程模型,并引入了一种基于条件似然的敏感性分析方法,该方法受发表研究的一些预期比例的约束。仿真研究表明,在给定正确的边际选择概率的情况下,该方法可以减少报告偏差。提出的方法是在现实世界的荟萃分析Ki67乳腺癌说明。
{"title":"Sensitivity analysis for reporting bias on the time-dependent summary receiver operating characteristics curve in meta-analysis of prognosis studies with time-to-event outcomes.","authors":"Yi Zhou, Ao Huang, Satoshi Hattori","doi":"10.1017/rsm.2025.14","DOIUrl":"10.1017/rsm.2025.14","url":null,"abstract":"<p><p>In prognosis studies with time-to-event outcomes, the survivals of groups with high/low biomarker expression are often estimated by the Kaplan-Meier method, and the difference between groups is measured by the hazard ratios (HRs). Since the high/low expressions are usually determined by study-specific cutoff values, synthesizing only HRs for summarizing the prognostic capacity of a biomarker brings heterogeneity in the meta-analysis. The time-dependent summary receiver operating characteristics (SROC) curve was proposed as a cutoff-free summary of the prognostic capacity, extended from the SROC curve in meta-analysis of diagnostic studies. However, estimates of the time-dependent SROC curve may be threatened by reporting bias in that studies with significant outcomes, such as HRs, are more likely to be published and selected in meta-analyses. Under this conjecture, this paper proposes a sensitivity analysis method for quantifying and adjusting reporting bias on the time-dependent SROC curve. We model the publication process determined by the significance of the HRs and introduce a sensitivity analysis method based on the conditional likelihood constrained by some expected proportions of published studies. Simulation studies showed that the proposed method could reduce reporting bias given the correctly-specified marginal selection probability. The proposed method is illustrated on the real-world meta-analysis of Ki67 for breast cancer.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"528-549"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527534/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
metaConvert: an automatic suite for estimation of 11 different effect size measures and flexible conversion across them. metaConvert:一个自动套件,用于估计11种不同的效应大小措施和灵活的转换。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.11
Corentin J Gosling, Samuele Cortese, Marco Solmi, Belen Haza, Eduard Vieta, Richard Delorme, Paolo Fusar-Poli, Joaquim Radua

A fundamental pillar of science is the estimation of the effect size of associations. However, this task is sometimes difficult and error-prone. To facilitate this process, the R package metaConvert automatically calculates and flexibly converts multiple effect size measures. It applies more than 120 formulas to convert any relevant input data into Cohen's d, Hedges' g, mean difference, odds ratio, risk ratio, incidence rate ratio, correlation coefficient, Fisher's r-to-z transformed correlation coefficient, variability ratio, coefficient of variation ratio, or number needed to treat. Researchers unfamiliar with R can use this software through a browser-based graphical interface (https://metaconvert.org/). We hope this suite will help researchers in the life sciences and other disciplines estimate and convert effect sizes more easily and accurately.

科学的一个基本支柱是对关联效应大小的估计。然而,这项任务有时很困难,而且容易出错。为了方便这一过程,R包metaConvert自动计算并灵活转换多个效应大小度量。它使用120多个公式将任何相关输入数据转换为Cohen’s d、Hedges’g、均值差、优势比、风险比、发病率比、相关系数、Fisher’s r- z转换相关系数、变异性比、变异系数比或需要处理的数。不熟悉R的研究人员可以通过基于浏览器的图形界面(https://metaconvert.org/)使用该软件。我们希望这个套件能够帮助生命科学和其他学科的研究人员更容易、更准确地估计和转换效应大小。
{"title":"metaConvert: an automatic suite for estimation of 11 different effect size measures and flexible conversion across them.","authors":"Corentin J Gosling, Samuele Cortese, Marco Solmi, Belen Haza, Eduard Vieta, Richard Delorme, Paolo Fusar-Poli, Joaquim Radua","doi":"10.1017/rsm.2025.11","DOIUrl":"10.1017/rsm.2025.11","url":null,"abstract":"<p><p>A fundamental pillar of science is the estimation of the effect size of associations. However, this task is sometimes difficult and error-prone. To facilitate this process, the R package metaConvert automatically calculates and flexibly converts multiple effect size measures. It applies more than 120 formulas to convert any relevant input data into Cohen's <i>d</i>, Hedges' <i>g</i>, mean difference, odds ratio, risk ratio, incidence rate ratio, correlation coefficient, Fisher's r-to-z transformed correlation coefficient, variability ratio, coefficient of variation ratio, or number needed to treat. Researchers unfamiliar with R can use this software through a browser-based graphical interface (https://metaconvert.org/). We hope this suite will help researchers in the life sciences and other disciplines estimate and convert effect sizes more easily and accurately.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"575-586"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527507/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Treatment recommendations based on network meta-analysis: Rules for risk-averse decision-makers. 基于网络荟萃分析的治疗建议:风险规避决策者的规则。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.17
A E Ades, Annabel L Davies, David M Phillippo, Hugo Pedder, Howard Thom, Beatrice Downing, Deborah M Caldwell, Nicky J Welton

The treatment recommendation based on a network meta-analysis (NMA) is usually the single treatment with the highest expected value (EV) on an evaluative function. We explore approaches that recommend multiple treatments and that penalise uncertainty, making them suitable for risk-averse decision-makers. We introduce loss-adjusted EV (LaEV) and compare it to GRADE and three probability-based rankings. We define properties of a valid ranking under uncertainty and other desirable properties of ranking systems. A two-stage process is proposed: the first identifies treatments superior to the reference treatment; the second identifies those that are also within a minimal clinically important difference (MCID) of the best treatment. Decision rules and ranking systems are compared on stylised examples and 10 NMAs used in NICE (National Institute of Health and Care Excellence) guidelines. Only LaEV reliably delivers valid rankings under uncertainty and has all the desirable properties. In 10 NMAs comparing between 5 and 41 treatments, an EV decision maker would recommend 4-14 treatments, and LaEV 0-3 (median 2) fewer. GRADE rules give rise to anomalies, and, like the probability-based rankings, the number of treatments recommended depends on arbitrary probability cutoffs. Among treatments that are superior to the reference, GRADE privileges the more uncertain ones, and in 3/10 cases, GRADE failed to recommend the treatment with the highest EV and LaEV. A two-stage approach based on MCID ensures that EV- and LaEV-based rules recommend a clinically appropriate number of treatments. For a risk-averse decision maker, LaEV is conservative, simple to implement, and has an independent theoretical foundation.

基于网络元分析(NMA)的治疗建议通常是评价函数期望值(EV)最高的单一治疗。我们探索了推荐多种治疗方法并惩罚不确定性的方法,使其适合厌恶风险的决策者。我们引入损失调整EV (LaEV),并将其与GRADE和三种基于概率的排名进行比较。我们定义了不确定情况下有效排序的性质和排序系统的其他理想性质。提出了两阶段流程:首先确定优于参考处理的处理;第二种识别那些也在最小临床重要差异(MCID)内的最佳治疗。决策规则和排名系统在风格化的例子和NICE(国家健康和护理卓越研究所)指南中使用的10个nma进行了比较。只有LaEV在不确定的情况下可靠地提供有效的排名,并具有所有理想的属性。在10个nma中,比较5 - 41种治疗方法,EV决策者会推荐4-14种治疗方法,LaEV 0-3(中位数2)更少。GRADE规则会产生异常,并且,像基于概率的排名一样,推荐的治疗数量取决于任意的概率截止值。在优于参考的治疗方案中,GRADE优先于更不确定的治疗方案,在3/10的病例中,GRADE未能推荐EV和LaEV最高的治疗方案。基于MCID的两阶段方法确保基于EV和laev的规则推荐临床适当的治疗数量。对于风险厌恶型决策者而言,LaEV具有保守性、实现简单、独立的理论基础。
{"title":"Treatment recommendations based on network meta-analysis: Rules for risk-averse decision-makers.","authors":"A E Ades, Annabel L Davies, David M Phillippo, Hugo Pedder, Howard Thom, Beatrice Downing, Deborah M Caldwell, Nicky J Welton","doi":"10.1017/rsm.2025.17","DOIUrl":"10.1017/rsm.2025.17","url":null,"abstract":"<p><p>The treatment recommendation based on a network meta-analysis (NMA) is usually the single treatment with the highest expected value (EV) on an evaluative function. We explore approaches that recommend multiple treatments and that penalise uncertainty, making them suitable for risk-averse decision-makers. We introduce loss-adjusted EV (LaEV) and compare it to GRADE and three probability-based rankings. We define properties of a valid ranking under uncertainty and other desirable properties of ranking systems. A two-stage process is proposed: the first identifies treatments superior to the reference treatment; the second identifies those that are also within a minimal clinically important difference (MCID) of the best treatment. Decision rules and ranking systems are compared on stylised examples and 10 NMAs used in NICE (National Institute of Health and Care Excellence) guidelines. Only LaEV reliably delivers valid rankings under uncertainty and has all the desirable properties. In 10 NMAs comparing between 5 and 41 treatments, an EV decision maker would recommend 4-14 treatments, and LaEV 0-3 (median 2) fewer. GRADE rules give rise to anomalies, and, like the probability-based rankings, the number of treatments recommended depends on arbitrary probability cutoffs. Among treatments that are superior to the reference, GRADE privileges the more uncertain ones, and in 3/10 cases, GRADE failed to recommend the treatment with the highest EV and LaEV. A two-stage approach based on MCID ensures that EV- and LaEV-based rules recommend a clinically appropriate number of treatments. For a risk-averse decision maker, LaEV is conservative, simple to implement, and has an independent theoretical foundation.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"550-568"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527546/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalizable and scalable multistage biomedical concept normalization leveraging large language models. 利用大型语言模型的可泛化和可扩展的多阶段生物医学概念规范化。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.9
Nicholas J Dobbins

Background: Biomedical entity normalization is critical to biomedical research because the richness of free-text clinical data, such as progress notes, can often be fully leveraged only after translating words and phrases into structured and coded representations suitable for analysis. Large Language Models (LLMs), in turn, have shown great potential and high performance in a variety of natural language processing (NLP) tasks, but their application for normalization remains understudied.

Methods: We applied both proprietary and open-source LLMs in combination with several rule-based normalization systems commonly used in biomedical research. We used a two-step LLM integration approach, (1) using an LLM to generate alternative phrasings of a source utterance, and (2) to prune candidate UMLS concepts, using a variety of prompting methods. We measure results by $F_{beta }$, where we favor recall over precision, and F1.

Results: We evaluated a total of 5,523 concept terms and text contexts from a publicly available dataset of human-annotated biomedical abstracts. Incorporating GPT-3.5-turbo increased overall $F_{beta }$ and F1 in normalization systems +16.5 and +16.2 (OpenAI embeddings), +9.5 and +7.3 (MetaMapLite), +13.9 and +10.9 (QuickUMLS), and +10.5 and +10.3 (BM25), while the open-source Vicuna model achieved +20.2 and +21.7 (OpenAI embeddings), +10.8 and +12.2 (MetaMapLite), +14.7 and +15 (QuickUMLS), and +15.6 and +18.7 (BM25).

Conclusions: Existing general-purpose LLMs, both propriety and open-source, can be leveraged to greatly improve normalization performance using existing tools, with no fine-tuning.

背景:生物医学实体规范化对生物医学研究至关重要,因为自由文本临床数据(如进度记录)的丰富性通常只有在将单词和短语翻译成适合分析的结构化和编码表示后才能充分利用。反过来,大型语言模型(llm)在各种自然语言处理(NLP)任务中显示出巨大的潜力和高性能,但它们在规范化方面的应用仍有待研究。方法:我们结合生物医学研究中常用的几种基于规则的规范化系统,应用专有和开源法学硕士。我们使用了两步LLM集成方法,(1)使用LLM生成源话语的替代短语,(2)使用各种提示方法修剪候选的UMLS概念。我们用F_{beta}$和F1来衡量结果,其中我们倾向于召回率而不是精度。结果:我们从人类注释的生物医学摘要的公开数据集中评估了总共5523个概念术语和文本上下文。采用gpt -3.5 turbo的归一化系统在+16.5和+16.2 (OpenAI嵌入)、+9.5和+7.3 (MetaMapLite)、+13.9和+10.9 (QuickUMLS)、+10.5和+10.3 (BM25)中增加了总体$F_{beta}$和F1,而开源Vicuna模型在+20.2和+21.7 (OpenAI嵌入)、+10.8和+12.2 (MetaMapLite)、+14.7和+15 (QuickUMLS)、+15.6和+18.7 (BM25)中实现了+20.2和+21.7。结论:现有的通用llm,无论是专有的还是开源的,都可以利用现有的工具来极大地提高规范化性能,而无需进行微调。
{"title":"Generalizable and scalable multistage biomedical concept normalization leveraging large language models.","authors":"Nicholas J Dobbins","doi":"10.1017/rsm.2025.9","DOIUrl":"10.1017/rsm.2025.9","url":null,"abstract":"<p><strong>Background: </strong>Biomedical entity normalization is critical to biomedical research because the richness of free-text clinical data, such as progress notes, can often be fully leveraged only after translating words and phrases into structured and coded representations suitable for analysis. Large Language Models (LLMs), in turn, have shown great potential and high performance in a variety of natural language processing (NLP) tasks, but their application for normalization remains understudied.</p><p><strong>Methods: </strong>We applied both proprietary and open-source LLMs in combination with several rule-based normalization systems commonly used in biomedical research. We used a two-step LLM integration approach, (1) using an LLM to generate alternative phrasings of a source utterance, and (2) to prune candidate UMLS concepts, using a variety of prompting methods. We measure results by $F_{beta }$, where we favor recall over precision, and F1.</p><p><strong>Results: </strong>We evaluated a total of 5,523 concept terms and text contexts from a publicly available dataset of human-annotated biomedical abstracts. Incorporating GPT-3.5-turbo increased overall $F_{beta }$ and F1 in normalization systems +16.5 and +16.2 (OpenAI embeddings), +9.5 and +7.3 (MetaMapLite), +13.9 and +10.9 (QuickUMLS), and +10.5 and +10.3 (BM25), while the open-source Vicuna model achieved +20.2 and +21.7 (OpenAI embeddings), +10.8 and +12.2 (MetaMapLite), +14.7 and +15 (QuickUMLS), and +15.6 and +18.7 (BM25).</p><p><strong>Conclusions: </strong>Existing general-purpose LLMs, both propriety and open-source, can be leveraged to greatly improve normalization performance using existing tools, with no fine-tuning.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"479-490"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527512/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the potential of Claude 2 for risk of bias assessment: Using a large language model to assess randomized controlled trials with RoB 2. 探索Claude 2对偏倚风险评估的潜力:使用大型语言模型评估RoB 2的随机对照试验。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.12
Angelika Eisele-Metzger, Judith-Lisa Lieberum, Markus Toews, Waldemar Siemens, Felix Heilmeyer, Christian Haverkamp, Daniel Boehringer, Joerg J Meerpohl

Systematic reviews are essential for evidence-based health care, but conducting them is time- and resource-consuming. To date, efforts have been made to accelerate and (semi-)automate various steps of systematic reviews through the use of artificial intelligence (AI) and the emergence of large language models (LLMs) promises further opportunities. One crucial but complex task within systematic review conduct is assessing the risk of bias (RoB) of included studies. Therefore, the aim of this study was to test the LLM Claude 2 for RoB assessment of 100 randomized controlled trials, published in English language from 2013 onwards, using the revised Cochrane risk of bias tool ('RoB 2'; involving judgements for five specific domains and an overall judgement). We assessed the agreement of RoB judgements by Claude with human judgements published in Cochrane reviews. The observed agreement between Claude and Cochrane authors ranged from 41% for the overall judgement to 71% for domain 4 ('outcome measurement'). Cohen's κ was lowest for domain 5 ('selective reporting'; 0.10 (95% confidence interval (CI): -0.10-0.31)) and highest for domain 3 ('missing data'; 0.31 (95% CI: 0.10-0.52)), indicating slight to fair agreement. Fair agreement was found for the overall judgement (Cohen's κ: 0.22 (95% CI: 0.06-0.38)). Sensitivity analyses using alternative prompting techniques or the more recent version Claude 3 did not result in substantial changes. Currently, Claude's RoB 2 judgements cannot replace human RoB assessment. However, the potential of LLMs to support RoB assessment should be further explored.

系统评价对循证卫生保健至关重要,但进行系统评价既费时又耗资源。迄今为止,通过使用人工智能(AI),已经努力加速和(半)自动化系统审查的各个步骤,并且大型语言模型(llm)的出现预示着更多的机会。系统评价中一个关键但复杂的任务是评估纳入研究的偏倚风险(RoB)。因此,本研究的目的是使用修订后的Cochrane偏倚风险工具(“RoB 2”,涉及对五个特定领域的判断和一个总体判断),对2013年以来发表的100项随机对照试验的RoB评估进行LLM Claude 2测试。我们评估了Claude的RoB判断与Cochrane评论中发表的人类判断的一致性。Claude和Cochrane作者之间观察到的一致性从总体判断的41%到领域4(“结果测量”)的71%不等。Cohen’s κ在域5(“选择性报告”,0.10(95%可信区间(CI): -0.10-0.31))中最低,在域3(“缺失数据”,0.31 (95% CI: 0.10-0.52)中最高,表明稍微一致。总体判断一致(Cohen’s κ: 0.22 (95% CI: 0.06-0.38))。使用替代提示技术或最新版本Claude 3的敏感性分析没有导致实质性的变化。目前,克劳德的RoB 2判断不能取代人类的RoB评估。然而,法学硕士支持RoB评估的潜力还有待进一步探索。
{"title":"Exploring the potential of Claude 2 for risk of bias assessment: Using a large language model to assess randomized controlled trials with RoB 2.","authors":"Angelika Eisele-Metzger, Judith-Lisa Lieberum, Markus Toews, Waldemar Siemens, Felix Heilmeyer, Christian Haverkamp, Daniel Boehringer, Joerg J Meerpohl","doi":"10.1017/rsm.2025.12","DOIUrl":"10.1017/rsm.2025.12","url":null,"abstract":"<p><p>Systematic reviews are essential for evidence-based health care, but conducting them is time- and resource-consuming. To date, efforts have been made to accelerate and (semi-)automate various steps of systematic reviews through the use of artificial intelligence (AI) and the emergence of large language models (LLMs) promises further opportunities. One crucial but complex task within systematic review conduct is assessing the risk of bias (RoB) of included studies. Therefore, the aim of this study was to test the LLM Claude 2 for RoB assessment of 100 randomized controlled trials, published in English language from 2013 onwards, using the revised Cochrane risk of bias tool ('RoB 2'; involving judgements for five specific domains and an overall judgement). We assessed the agreement of RoB judgements by Claude with human judgements published in Cochrane reviews. The observed agreement between Claude and Cochrane authors ranged from 41% for the overall judgement to 71% for domain 4 ('outcome measurement'). Cohen's κ was lowest for domain 5 ('selective reporting'; 0.10 (95% confidence interval (CI): -0.10-0.31)) and highest for domain 3 ('missing data'; 0.31 (95% CI: 0.10-0.52)), indicating slight to fair agreement. Fair agreement was found for the overall judgement (Cohen's κ: 0.22 (95% CI: 0.06-0.38)). Sensitivity analyses using alternative prompting techniques or the more recent version Claude 3 did not result in substantial changes. Currently, Claude's RoB 2 judgements cannot replace human RoB assessment. However, the potential of LLMs to support RoB assessment should be further explored.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"491-508"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527486/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantitative bias analysis for unmeasured confounding in unanchored population-adjusted indirect comparisons. 非锚定人口调整间接比较中未测量混杂的定量偏倚分析。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.13
Shijie Ren, Sa Ren, Nicky J Welton, Mark Strong

Unanchored population-adjusted indirect comparisons (PAICs) such as matching-adjusted indirect comparison (MAIC) and simulated treatment comparison (STC) attracted a significant attention in the health technology assessment field in recent years. These methods allow for indirect comparisons by balancing different patient characteristics in single-arm studies in the case where individual patient-level data are only available for one study. However, the validity of findings from unanchored MAIC/STC analyses is frequently questioned by decision makers, due to the assumption that all potential prognostic factors and effect modifiers are accounted for. Addressing this critical concern, we introduce a sensitivity analysis algorithm for unanchored PAICs by extending quantitative bias analysis techniques traditionally used in epidemiology. Our proposed sensitivity analysis involves simulating important covariates that were not reported by the comparator study when conducting unanchored STC and enables the formal evaluating of the impact of unmeasured confounding in a quantitative manner without additional assumptions. We demonstrate the practical application of this method through a real-world case study of metastatic colorectal cancer, highlighting its utility in enhancing the robustness and credibility of unanchored PAIC results. Our findings emphasise the necessity of formal quantitative sensitivity analysis in interpreting unanchored PAIC results, as it quantifies the robustness of conclusions regarding potential unmeasured confounders and supports more robust, reliable, and informative decision-making in healthcare.

近年来,非锚定人口调整间接比较(PAICs)如匹配调整间接比较(MAIC)和模拟治疗比较(STC)在卫生技术评估领域引起了广泛关注。这些方法允许通过平衡单臂研究中的不同患者特征来进行间接比较,而单个患者水平的数据仅可用于一项研究。然而,非锚定的MAIC/STC分析结果的有效性经常受到决策者的质疑,因为假设所有潜在的预后因素和影响修饰因素都被考虑在内。为了解决这一关键问题,我们通过扩展流行病学中传统使用的定量偏差分析技术,引入了一种针对非锚定PAICs的敏感性分析算法。我们提出的敏感性分析包括在进行非锚定STC时模拟比较研究未报告的重要协变量,并且能够在没有额外假设的情况下以定量方式正式评估未测量混杂的影响。我们通过转移性结直肠癌的真实案例研究展示了该方法的实际应用,强调了其在提高无锚定pac结果的稳健性和可信度方面的实用性。我们的研究结果强调了正式定量敏感性分析在解释非锚定pac结果时的必要性,因为它量化了关于潜在未测量混杂因素的结论的稳健性,并支持医疗保健中更稳健、可靠和信息丰富的决策。
{"title":"Quantitative bias analysis for unmeasured confounding in unanchored population-adjusted indirect comparisons.","authors":"Shijie Ren, Sa Ren, Nicky J Welton, Mark Strong","doi":"10.1017/rsm.2025.13","DOIUrl":"10.1017/rsm.2025.13","url":null,"abstract":"<p><p>Unanchored population-adjusted indirect comparisons (PAICs) such as matching-adjusted indirect comparison (MAIC) and simulated treatment comparison (STC) attracted a significant attention in the health technology assessment field in recent years. These methods allow for indirect comparisons by balancing different patient characteristics in single-arm studies in the case where individual patient-level data are only available for one study. However, the validity of findings from unanchored MAIC/STC analyses is frequently questioned by decision makers, due to the assumption that all potential prognostic factors and effect modifiers are accounted for. Addressing this critical concern, we introduce a sensitivity analysis algorithm for unanchored PAICs by extending quantitative bias analysis techniques traditionally used in epidemiology. Our proposed sensitivity analysis involves simulating important covariates that were not reported by the comparator study when conducting unanchored STC and enables the formal evaluating of the impact of unmeasured confounding in a quantitative manner without additional assumptions. We demonstrate the practical application of this method through a real-world case study of metastatic colorectal cancer, highlighting its utility in enhancing the robustness and credibility of unanchored PAIC results. Our findings emphasise the necessity of formal quantitative sensitivity analysis in interpreting unanchored PAIC results, as it quantifies the robustness of conclusions regarding potential unmeasured confounders and supports more robust, reliable, and informative decision-making in healthcare.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"509-527"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527536/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synthesis of depression outcomes reported on different scales: A comparison of methods for modelling mean differences. 不同量表报告的抑郁结果的综合:建模平均差异方法的比较。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.7
Beatrice C Downing, Nicky J Welton, Hugo Pedder, Ifigeneia Mavranezouli, Odette Megnin-Viggars, A E Ades

Several methods have been proposed for the synthesis of continuous outcomes reported on different scales, including the Standardised Mean Difference (SMD) and the Ratio of Means (RoM). SMDs can be formed by dividing the study mean treatment effect either by a study-specific (Study-SMD) or a scale-specific (Scale-SMD) standard deviation (SD). We compared the performance of RoM to the different standardisation methods with and without meta-regression (MR) on baseline severity, in a Bayesian network meta-analysis (NMA) of 14 treatments for depression, reported on five different scales. There was substantial between-study variation in the SDs reported on the same scale. Based on the Deviance Information Criterion, RoM was preferred as having better model fit than the SMD models. Model fit for SMD models was not improved with meta-regression. Percentage shrinkage was used as a scale-independent measure with higher % shrinkage indicating lower heterogeneity. Heterogeneity was lowest for RoM (20.5% shrinkage), then Scale-SMD (18.2% shrinkage), and highest for Study-SMD (16.7% shrinkage). Model choice impacted which treatment was estimated to be most effective. However, all models picked out the same three highest-ranked treatments using the GRADE criteria. Alongside other indicators, higher shrinkage of RoM models suggests that treatments for depression act multiplicatively rather than additively. Further research is needed to determine whether these findings extend to Patient- and Clinician-Reported Outcomes used in other application areas. Where treatment effects are additive, we recommend using Scale-SMD for standardisation to avoid the additional heterogeneity introduced by Study-SMD.

已经提出了几种方法来综合不同尺度上报告的连续结果,包括标准化平均差(SMD)和均值比(RoM)。通过将研究平均治疗效果除以特定研究(study- smd)或特定量表(Scale-SMD)标准偏差(SD),可以形成标准差。在贝叶斯网络荟萃分析(NMA)中,我们在5个不同的量表上报告了14种抑郁症治疗方法,比较了RoM在基线严重程度上使用和不使用meta回归(MR)的不同标准化方法的表现。在同一量表上报告的SDs在研究间存在大量差异。基于偏差信息准则,RoM模型比SMD模型具有更好的模型拟合。meta回归并没有改善SMD模型的拟合。收缩率被用作一个尺度无关的措施,较高的收缩率表明较低的异质性。RoM的异质性最低(收缩20.5%),然后是Scale-SMD(收缩18.2%),Study-SMD的异质性最高(收缩16.7%)。模型选择影响了哪种治疗被认为是最有效的。然而,使用GRADE标准,所有模型都选择了相同的三个排名最高的治疗方法。与其他指标一样,RoM模型的高收缩率表明,抑郁症的治疗是乘数性的,而不是加法性的。需要进一步的研究来确定这些发现是否延伸到其他应用领域的患者和临床报告的结果。如果治疗效果是累加性的,我们建议使用Scale-SMD进行标准化,以避免Study-SMD引入的额外异质性。
{"title":"Synthesis of depression outcomes reported on different scales: A comparison of methods for modelling mean differences.","authors":"Beatrice C Downing, Nicky J Welton, Hugo Pedder, Ifigeneia Mavranezouli, Odette Megnin-Viggars, A E Ades","doi":"10.1017/rsm.2025.7","DOIUrl":"10.1017/rsm.2025.7","url":null,"abstract":"<p><p>Several methods have been proposed for the synthesis of continuous outcomes reported on different scales, including the Standardised Mean Difference (SMD) and the Ratio of Means (RoM). SMDs can be formed by dividing the study mean treatment effect either by a study-specific (Study-SMD) or a scale-specific (Scale-SMD) standard deviation (SD). We compared the performance of RoM to the different standardisation methods with and without meta-regression (MR) on baseline severity, in a Bayesian network meta-analysis (NMA) of 14 treatments for depression, reported on five different scales. There was substantial between-study variation in the SDs reported on the same scale. Based on the Deviance Information Criterion, RoM was preferred as having better model fit than the SMD models. Model fit for SMD models was not improved with meta-regression. Percentage shrinkage was used as a scale-independent measure with higher % shrinkage indicating lower heterogeneity. Heterogeneity was lowest for RoM (20.5% shrinkage), then Scale-SMD (18.2% shrinkage), and highest for Study-SMD (16.7% shrinkage). Model choice impacted which treatment was estimated to be most effective. However, all models picked out the same three highest-ranked treatments using the GRADE criteria. Alongside other indicators, higher shrinkage of RoM models suggests that treatments for depression act multiplicatively rather than additively. Further research is needed to determine whether these findings extend to Patient- and Clinician-Reported Outcomes used in other application areas. Where treatment effects are additive, we recommend using Scale-SMD for standardisation to avoid the additional heterogeneity introduced by Study-SMD.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"460-478"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527519/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel robust meta-analysis model using the t distribution for outlier accommodation and detection. 使用t分布进行离群调节和检测的新颖稳健元分析模型。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.8
Yue Wang, Jianhua Zhao, Fen Jiang, Lei Shi, Jianxin Pan

Random effects meta-analysis model is an important tool for integrating results from multiple independent studies. However, the standard model is based on the assumption of normal distributions for both random effects and within-study errors, making it susceptible to outlying studies. Although robust modeling using the t distribution is an appealing idea, the existing work, that explores the use of the t distribution only for random effects, involves complicated numerical integration and numerical optimization. In this article, a novel robust meta-analysis model using the t distribution is proposed (tMeta). The novelty is that the marginal distribution of the effect size in tMeta follows the t distribution, enabling that tMeta can simultaneously accommodate and detect outlying studies in a simple and adaptive manner. A simple and fast EM-type algorithm is developed for maximum likelihood estimation. Due to the mathematical tractability of the t distribution, tMeta frees from numerical integration and allows for efficient optimization. Experiments on real data demonstrate that tMeta is compared favorably with related competitors in situations involving mild outliers. Moreover, in the presence of gross outliers, while related competitors may fail, tMeta continues to perform consistently and robustly.

随机效应荟萃分析模型是整合多个独立研究结果的重要工具。然而,对于随机效应和研究内误差,标准模型是基于正态分布的假设,这使得它容易受到外围研究的影响。虽然使用t分布的稳健建模是一个吸引人的想法,但现有的工作,探索使用t分布的随机效应,涉及复杂的数值积分和数值优化。在这篇文章中,我们提出了一种新的使用t分布的稳健元分析模型(tMeta)。新颖之处在于tMeta中效应大小的边际分布遵循t分布,使得tMeta能够以一种简单和自适应的方式同时容纳和检测外围研究。提出了一种简单快速的em型最大似然估计算法。由于t分布的数学可追溯性,tMeta从数值积分中解脱出来,并允许有效的优化。实际数据实验表明,在轻度异常值的情况下,tMeta优于相关竞争对手。此外,在总异常值存在的情况下,虽然相关竞争对手可能会失败,但tMeta将继续保持稳定和强劲的表现。
{"title":"A novel robust meta-analysis model using the <i>t</i> distribution for outlier accommodation and detection.","authors":"Yue Wang, Jianhua Zhao, Fen Jiang, Lei Shi, Jianxin Pan","doi":"10.1017/rsm.2025.8","DOIUrl":"10.1017/rsm.2025.8","url":null,"abstract":"<p><p>Random effects meta-analysis model is an important tool for integrating results from multiple independent studies. However, the standard model is based on the assumption of normal distributions for both random effects and within-study errors, making it susceptible to outlying studies. Although robust modeling using the <i>t</i> distribution is an appealing idea, the existing work, that explores the use of the <i>t</i> distribution only for random effects, involves complicated numerical integration and numerical optimization. In this article, a novel robust meta-analysis model using the <i>t</i> distribution is proposed (<i>t</i>Meta). The novelty is that the marginal distribution of the effect size in <i>t</i>Meta follows the <i>t</i> distribution, enabling that <i>t</i>Meta can simultaneously accommodate and detect outlying studies in a simple and adaptive manner. A simple and fast EM-type algorithm is developed for maximum likelihood estimation. Due to the mathematical tractability of the <i>t</i> distribution, <i>t</i>Meta frees from numerical integration and allows for efficient optimization. Experiments on real data demonstrate that <i>t</i>Meta is compared favorably with related competitors in situations involving mild outliers. Moreover, in the presence of gross outliers, while related competitors may fail, <i>t</i>Meta continues to perform consistently and robustly.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"442-459"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527545/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CausalMetaR: An R package for performing causally interpretable meta-analyses. CausalMetaR:一个R包,用于执行因果关系可解释的元分析。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-03-01 Epub Date: 2025-03-12 DOI: 10.1017/rsm.2025.5
Guanbo Wang, Sean McGrath, Yi Lian

Researchers would often like to leverage data from a collection of sources (e.g., meta-analyses of randomized trials, multi-center trials, pooled analyses of observational cohorts) to estimate causal effects in a target population of interest. However, because different data sources typically represent different underlying populations, traditional meta-analytic methods may not produce causally interpretable estimates that apply to any reasonable target population. In this article, we present the CausalMetaR R package, which implements robust and efficient methods to estimate causal effects in a given internal or external target population using multi-source data. The package includes estimators of average and subgroup treatment effects for the entire target population. To produce efficient and robust estimates of causal effects, the package implements doubly robust and non-parametric efficient estimators and supports using flexible data-adaptive (e.g., machine learning techniques) methods and cross-fitting techniques to estimate the nuisance models (e.g., the treatment model, the outcome model). We briefly review the methods, describe the key features of the package, and demonstrate how to use the package through an example. The package aims to facilitate causal analyses in the context of meta-analysis.

研究人员通常喜欢利用来自各种来源的数据(例如,随机试验的荟萃分析,多中心试验,观察队列的汇总分析)来估计目标人群的因果效应。然而,由于不同的数据源通常代表不同的潜在人群,传统的元分析方法可能无法产生适用于任何合理目标人群的因果关系可解释的估计。在本文中,我们介绍了CausalMetaR R包,它实现了鲁棒和有效的方法,可以使用多源数据估计给定内部或外部目标群体中的因果效应。该方案包括对整个目标人群的平均治疗效果和亚组治疗效果的估计。为了产生有效和稳健的因果效应估计,该软件包实现了双稳健和非参数有效估计器,并支持使用灵活的数据自适应(例如,机器学习技术)方法和交叉拟合技术来估计有害模型(例如,治疗模型,结果模型)。我们简要回顾了这些方法,描述了包的主要特性,并通过一个示例演示了如何使用包。该包旨在促进元分析背景下的因果分析。
{"title":"CausalMetaR: An R package for performing causally interpretable meta-analyses.","authors":"Guanbo Wang, Sean McGrath, Yi Lian","doi":"10.1017/rsm.2025.5","DOIUrl":"10.1017/rsm.2025.5","url":null,"abstract":"<p><p>Researchers would often like to leverage data from a collection of sources (e.g., meta-analyses of randomized trials, multi-center trials, pooled analyses of observational cohorts) to estimate causal effects in a target population of interest. However, because different data sources typically represent different underlying populations, traditional meta-analytic methods may not produce causally interpretable estimates that apply to any reasonable target population. In this article, we present the CausalMetaR R package, which implements robust and efficient methods to estimate causal effects in a given internal or external target population using multi-source data. The package includes estimators of average and subgroup treatment effects for the entire target population. To produce efficient and robust estimates of causal effects, the package implements doubly robust and non-parametric efficient estimators and supports using flexible data-adaptive (e.g., machine learning techniques) methods and cross-fitting techniques to estimate the nuisance models (e.g., the treatment model, the outcome model). We briefly review the methods, describe the key features of the package, and demonstrate how to use the package through an example. The package aims to facilitate causal analyses in the context of meta-analysis.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 2","pages":"425-440"},"PeriodicalIF":6.1,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527535/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Research Synthesis Methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1