Research Synthesis Methods最新文献_第7页

Twenty years of network meta-analysis: Continuing controversies and recent developments 网络荟萃分析二十年：持续争议与最新发展。

IF 5 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Research Synthesis Methods

Pub Date : 2024-01-18 DOI: 10.1002/jrsm.1700

A. E. Ades, Nicky J. Welton, Sofia Dias, David M. Phillippo, Deborah M. Caldwell

Network meta-analysis (NMA) is an extension of pairwise meta-analysis (PMA) which combines evidence from trials on multiple treatments in connected networks. NMA delivers internally consistent estimates of relative treatment efficacy, needed for rational decision making. Over its first 20 years NMA's use has grown exponentially, with applications in both health technology assessment (HTA), primarily re-imbursement decisions and clinical guideline development, and clinical research publications. This has been a period of transition in meta-analysis, first from its roots in educational and social psychology, where large heterogeneous datasets could be explored to find effect modifiers, to smaller pairwise meta-analyses in clinical medicine on average with less than six studies. This has been followed by narrowly-focused estimation of the effects of specific treatments at specific doses in specific populations in sparse networks, where direct comparisons are unavailable or informed by only one or two studies. NMA is a powerful and well-established technique but, in spite of the exponential increase in applications, doubts about the reliability and validity of NMA persist. Here we outline the continuing controversies, and review some recent developments. We suggest that heterogeneity should be minimized, as it poses a threat to the reliability of NMA which has not been fully appreciated, perhaps because it has not been seen as a problem in PMA. More research is needed on the extent of heterogeneity and inconsistency in datasets used for decision making, on formal methods for making recommendations based on NMA, and on the further development of multi-level network meta-regression.

网络荟萃分析（NMA）是成对荟萃分析（PMA）的延伸，它将多种治疗方法的试验证据整合到连接的网络中。NMA 提供理性决策所需的内部一致的相对疗效估计值。在最初的 20 年中，NMA 的应用呈指数级增长，主要应用于健康技术评估 (HTA)（主要是报销决策和临床指南制定）和临床研究出版物。这一时期是荟萃分析的转型期，先是起源于教育学和社会心理学，通过探索大型异质数据集来寻找效应修饰因子，然后是临床医学中平均少于六项研究的小型成对荟萃分析。随后，在缺乏直接比较或只有一两项研究提供信息的稀疏网络中，对特定人群特定剂量的特定治疗效果进行了狭义的估计。NMA 是一种强大而成熟的技术，但尽管其应用呈指数级增长，人们对 NMA 的可靠性和有效性的怀疑依然存在。在此，我们概述了持续存在的争议，并回顾了一些最新进展。我们建议应尽量减少异质性，因为它对 NMA 的可靠性构成了威胁，而这种威胁尚未得到充分认识，这可能是因为在 PMA 中异质性未被视为一个问题。需要对决策所用数据集的异质性和不一致性程度、基于 NMA 提出建议的正式方法以及多层次网络元回归的进一步发展开展更多研究。

{"title":"Twenty years of network meta-analysis: Continuing controversies and recent developments","authors":"A. E. Ades, Nicky J. Welton, Sofia Dias, David M. Phillippo, Deborah M. Caldwell","doi":"10.1002/jrsm.1700","DOIUrl":"10.1002/jrsm.1700","url":null,"abstract":"Network meta-analysis (NMA) is an extension of pairwise meta-analysis (PMA) which combines evidence from trials on multiple treatments in connected networks. NMA delivers internally consistent estimates of relative treatment efficacy, needed for rational decision making. Over its first 20 years NMA's use has grown exponentially, with applications in both health technology assessment (HTA), primarily re-imbursement decisions and clinical guideline development, and clinical research publications. This has been a period of transition in meta-analysis, first from its roots in educational and social psychology, where large heterogeneous datasets could be explored to find effect modifiers, to smaller pairwise meta-analyses in clinical medicine on average with less than six studies. This has been followed by narrowly-focused estimation of the effects of specific treatments at specific doses in specific populations in sparse networks, where direct comparisons are unavailable or informed by only one or two studies. NMA is a powerful and well-established technique but, in spite of the exponential increase in applications, doubts about the reliability and validity of NMA persist. Here we outline the continuing controversies, and review some recent developments. We suggest that heterogeneity should be minimized, as it poses a threat to the reliability of NMA which has not been fully appreciated, perhaps because it has not been seen as a problem in PMA. More research is needed on the extent of heterogeneity and inconsistency in datasets used for decision making, on formal methods for making recommendations based on NMA, and on the further development of multi-level network meta-regression.","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"15 5","pages":"702-727"},"PeriodicalIF":5.0,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jrsm.1700","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139484797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Appropriateness of conducting and reporting random-effects meta-analysis in oncology 在肿瘤学中进行和报告随机效应荟萃分析的适当性。

IF 9.8 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Research Synthesis Methods

Pub Date : 2024-01-14 DOI: 10.1002/jrsm.1702

Jinma Ren, Jia Ma, Joseph C. Cappelleri

A random-effects model is often applied in meta-analysis when considerable heterogeneity among studies is observed due to the differences in patient characteristics, timeframe, treatment regimens, and other study characteristics. Since 2014, the journals Research Synthesis Methods and the Annals of Internal Medicine have published a few noteworthy papers that explained why the most widely used method for pooling heterogeneous studies—the DerSimonian–Laird (DL) estimator—can produce biased estimates with falsely high precision and recommended to use other several alternative methods. Nevertheless, more than half of studies (55.7%) published in top oncology-specific journals during 2015–2022 did not report any detailed method in the random-effects meta-analysis. Of the studies that did report the methodology used, the DL method was still the dominant one reported. Thus, while the authors recommend that Research Synthesis Methods and the Annals of Internal Medicine continue to increase the publication of its articles that report on specific methods for handling heterogeneity and use random-effects estimates that provide more accurate confidence limits than the DL estimator, other journals that publish meta-analyses in oncology (and presumably in other disease areas) are urged to do the same on a much larger scale than currently documented.

由于患者特征、时间框架、治疗方案和其他研究特征的差异，当观察到研究之间存在相当大的异质性时，随机效应模型通常被应用于荟萃分析。自2014年以来，《研究综合方法》（Research Synthesis Methods）杂志和《内科学年鉴》（Annals of Internal Medicine）杂志发表了几篇值得关注的论文，解释了为什么最广泛使用的异质性研究集合方法--DerSimonian-Laird（DL）估计器--会产生有偏差的估计值和虚假的高精度，并建议使用其他几种替代方法。然而，2015-2022年间发表在顶级肿瘤学期刊上的一半以上（55.7%）研究并未报告随机效应荟萃分析的任何详细方法。在报告了所用方法的研究中，DL 方法仍是报告的主要方法。因此，作者建议《研究综合方法》和《内科学年鉴》继续增加发表文章，报告处理异质性的具体方法，并使用随机效应估计值，提供比DL估计值更准确的置信区间，同时敦促其他发表肿瘤学（可能还有其他疾病领域）荟萃分析的期刊也这样做，而且规模要比目前记录的大得多。

{"title":"Appropriateness of conducting and reporting random-effects meta-analysis in oncology","authors":"Jinma Ren, Jia Ma, Joseph C. Cappelleri","doi":"10.1002/jrsm.1702","DOIUrl":"10.1002/jrsm.1702","url":null,"abstract":"A random-effects model is often applied in meta-analysis when considerable heterogeneity among studies is observed due to the differences in patient characteristics, timeframe, treatment regimens, and other study characteristics. Since 2014, the journals Research Synthesis Methods and the Annals of Internal Medicine have published a few noteworthy papers that explained why the most widely used method for pooling heterogeneous studies—the DerSimonian–Laird (DL) estimator—can produce biased estimates with falsely high precision and recommended to use other several alternative methods. Nevertheless, more than half of studies (55.7%) published in top oncology-specific journals during 2015–2022 did not report any detailed method in the random-effects meta-analysis. Of the studies that did report the methodology used, the DL method was still the dominant one reported. Thus, while the authors recommend that Research Synthesis Methods and the Annals of Internal Medicine continue to increase the publication of its articles that report on specific methods for handling heterogeneity and use random-effects estimates that provide more accurate confidence limits than the DL estimator, other journals that publish meta-analyses in oncology (and presumably in other disease areas) are urged to do the same on a much larger scale than currently documented.","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"15 2","pages":"326-331"},"PeriodicalIF":9.8,"publicationDate":"2024-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139465478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Using qualitative comparative analysis as a mixed methods synthesis in systematic mixed studies reviews: Guidance and a worked example 在系统性混合研究综述中使用定性比较分析作为混合方法综述：指南和实例。

IF 9.8 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Research Synthesis Methods

Pub Date : 2024-01-09 DOI: 10.1002/jrsm.1698

Reem El Sherif, Pierre Pluye, Quan Nha Hong, Benoît Rihoux

Qualitative comparative analysis (QCA) is a hybrid method designed to bridge the gap between qualitative and quantitative research in a case-sensitive approach that considers each case holistically as a complex configuration of conditions and outcomes. QCA allows for multiple conjunctural causation, implying that it is often a combination of conditions that produces an outcome, that multiple pathways may lead to the same outcome, and that in different contexts, the same condition may have a different impact on the outcome. This approach to complexity allows QCA to provide a practical understanding for complex, real-world situations, and the context of implementing interventions. There are guides for conducting QCA in primary research and quantitative systematic reviews yet, to our knowledge, no guidance for conducting QCA in systematic mixed studies reviews (SMSRs). Thus, the specific objectives of this paper are to (1) describe a step-by-step approach for novice researchers for using QCA to integrate qualitative and quantitative evidence, including guidance on how to use software; (2) highlight specific challenges; (3) propose potential solutions from a worked example; and (4) provide recommendations for reporting.

定性比较分析（QCA）是一种混合方法，旨在弥合定性研究与定量研究之间的差距，它采用对案例敏感的方法，将每个案例整体视为条件和结果的复杂组合。QCA 允许多重因果关系，这意味着产生结果的往往是各种条件的组合，多种途径可能导致相同的结果，而且在不同的情况下，相同的条件可能对结果产生不同的影响。这种处理复杂性的方法使 QCA 能够为复杂的现实情况和干预措施的实施提供实用的理解。目前已有在初级研究和定量系统综述中开展 QCA 的指南，但据我们所知，还没有在系统性混合研究综述（SMSR）中开展 QCA 的指南。因此，本文的具体目标是：(1) 为新手研究人员描述使用QCA整合定性和定量证据的逐步方法，包括如何使用软件的指导；(2) 强调具体挑战；(3) 从一个工作实例中提出潜在解决方案；(4) 提供报告建议。

{"title":"Using qualitative comparative analysis as a mixed methods synthesis in systematic mixed studies reviews: Guidance and a worked example","authors":"Reem El Sherif, Pierre Pluye, Quan Nha Hong, Benoît Rihoux","doi":"10.1002/jrsm.1698","DOIUrl":"10.1002/jrsm.1698","url":null,"abstract":"Qualitative comparative analysis (QCA) is a hybrid method designed to bridge the gap between qualitative and quantitative research in a case-sensitive approach that considers each case holistically as a complex configuration of conditions and outcomes. QCA allows for multiple conjunctural causation, implying that it is often a combination of conditions that produces an outcome, that multiple pathways may lead to the same outcome, and that in different contexts, the same condition may have a different impact on the outcome. This approach to complexity allows QCA to provide a practical understanding for complex, real-world situations, and the context of implementing interventions. There are guides for conducting QCA in primary research and quantitative systematic reviews yet, to our knowledge, no guidance for conducting QCA in systematic mixed studies reviews (SMSRs). Thus, the specific objectives of this paper are to (1) describe a step-by-step approach for novice researchers for using QCA to integrate qualitative and quantitative evidence, including guidance on how to use software; (2) highlight specific challenges; (3) propose potential solutions from a worked example; and (4) provide recommendations for reporting.","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"15 3","pages":"450-465"},"PeriodicalIF":9.8,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jrsm.1698","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139401167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing recall in automated record screening: A resampling algorithm 提高自动记录筛选的召回率：重采样算法

IF 9.8 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Research Synthesis Methods

Pub Date : 2024-01-07 DOI: 10.1002/jrsm.1690

Zhipeng Hou, Elizabeth Tipton

Literature screening is the process of identifying all relevant records from a pool of candidate paper records in systematic review, meta-analysis, and other research synthesis tasks. This process is time consuming, expensive, and prone to human error. Screening prioritization methods attempt to help reviewers identify most relevant records while only screening a proportion of candidate records with high priority. In previous studies, screening prioritization is often referred to as automatic literature screening or automatic literature identification. Numerous screening prioritization methods have been proposed in recent years. However, there is a lack of screening prioritization methods with reliable performance. Our objective is to develop a screening prioritization algorithm with reliable performance for practical use, for example, an algorithm that guarantees an 80% chance of identifying at least $� � 80 � % �$ of the relevant records. Based on a target-based method proposed in Cormack and Grossman, we propose a screening prioritization algorithm using sampling with replacement. The algorithm is a wrapper algorithm that can work with any current screening prioritization algorithm to guarantee the performance. We prove, with mathematics and probability theory, that the algorithm guarantees the performance. We also run numeric experiments to test the performance of our algorithm when applied in practice. The numeric experiment results show this algorithm achieve reliable performance under different circumstances. The proposed screening prioritization algorithm can be reliably used in real world research synthesis tasks.

文献筛选是从系统综述、荟萃分析和其他研究综述任务的候选纸质记录库中找出所有相关记录的过程。这一过程耗时长、成本高，而且容易出现人为错误。筛选优先级的方法试图帮助审稿人识别最相关的记录，同时只筛选一部分具有高优先级的候选记录。在以往的研究中，筛选优先级通常被称为自动文献筛选或自动文献识别。近年来提出了许多筛选优先级的方法。然而，目前还缺乏性能可靠的筛选优先级排序方法。我们的目标是为实际应用开发一种性能可靠的筛选优先级算法，例如，一种能保证 80% 的几率识别出至少 80% $$ 80% $$ 的相关记录的算法。基于 Cormack 和 Grossman 提出的基于目标的方法，我们提出了一种使用替换抽样的筛选优先级算法。该算法是一种包装算法，可以与当前任何筛选优先级算法一起使用，以保证性能。我们用数学和概率论证明，该算法可以保证性能。我们还进行了数值实验，测试算法在实际应用中的性能。数值实验结果表明，该算法在不同情况下都能实现可靠的性能。所提出的筛选优先级算法可以可靠地应用于现实世界的研究综合任务中。

{"title":"Enhancing recall in automated record screening: A resampling algorithm","authors":"Zhipeng Hou, Elizabeth Tipton","doi":"10.1002/jrsm.1690","DOIUrl":"10.1002/jrsm.1690","url":null,"abstract":"Literature screening is the process of identifying all relevant records from a pool of candidate paper records in systematic review, meta-analysis, and other research synthesis tasks. This process is time consuming, expensive, and prone to human error. Screening prioritization methods attempt to help reviewers identify most relevant records while only screening a proportion of candidate records with high priority. In previous studies, screening prioritization is often referred to as automatic literature screening or automatic literature identification. Numerous screening prioritization methods have been proposed in recent years. However, there is a lack of screening prioritization methods with reliable performance. Our objective is to develop a screening prioritization algorithm with reliable performance for practical use, for example, an algorithm that guarantees an 80% chance of identifying at least <math>\u0000 <mrow>\u0000 <mn>80</mn>\u0000 <mo>%</mo>\u0000 </mrow></math> of the relevant records. Based on a target-based method proposed in Cormack and Grossman, we propose a screening prioritization algorithm using sampling with replacement. The algorithm is a wrapper algorithm that can work with any current screening prioritization algorithm to guarantee the performance. We prove, with mathematics and probability theory, that the algorithm guarantees the performance. We also run numeric experiments to test the performance of our algorithm when applied in practice. The numeric experiment results show this algorithm achieve reliable performance under different circumstances. The proposed screening prioritization algorithm can be reliably used in real world research synthesis tasks.","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"15 3","pages":"372-383"},"PeriodicalIF":9.8,"publicationDate":"2024-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jrsm.1690","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139376893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Advancing the methodology of mapping reviews: A scoping review 推进制图审查方法：范围审查。

IF 9.8 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Research Synthesis Methods

Pub Date : 2024-01-02 DOI: 10.1002/jrsm.1694

Hanan Khalil, Fiona Campbell, Katrina Danial, Danielle Pollock, Zachary Munn, Vivian Welsh, Ashrita Saran, Dimi Hoppe, Andrea C. Tricco

This scoping review aims to identify and systematically review published mapping reviews to assess their commonality and heterogeneity and determine whether additional efforts should be made to standardise methodology and reporting. The following databases were searched; Ovid MEDLINE, Embase, CINAHL, PsycINFO, Campbell collaboration database, Social Science Abstracts, Library and Information Science Abstracts (LISA). Following a pilot-test on a random sample of 20 citations included within title and abstracts, two team members independently completed all screening. Ten articles were piloted at full-text screening, and then each citation was reviewed independently by two team members. Discrepancies at both stages were resolved through discussion. Following a pilot-test on a random sample of five relevant full-text articles, one team member abstracted all the relevant data. Uncertainties in the data abstraction were resolved by another team member. A total of 335 articles were eligible for this scoping review and subsequently included. There was an increasing growth in the number of published mapping reviews over the years from 5 in 2010 to 73 in 2021. Moreover, there was a significant variability in reporting the included mapping reviews including their research question, priori protocol, methodology, data synthesis and reporting. This work has further highlighted the gaps in evidence synthesis methodologies. Further guidance developed by evidence synthesis organisations, such as JBI and Campbell, has the potential to clarify challenges experienced by researchers, given the magnitude of mapping reviews published every year.

本范围界定综述旨在对已发表的制图综述进行识别和系统综述，以评估其共性和异质性，并确定是否应进一步努力实现方法和报告的标准化。我们检索了以下数据库：Ovid MEDLINE、Embase、CINAHL、PsycINFO、Campbell 合作数据库、《社会科学文摘》、《图书馆与信息科学文摘》（LISA）。在对包含在标题和摘要中的 20 篇引文进行随机抽样试验后，两名小组成员独立完成了所有筛选工作。先对 10 篇文章进行全文筛选，然后由两名小组成员独立审查每条引文。在这两个阶段出现的差异均通过讨论解决。在对随机抽取的五篇相关全文文章进行试点测试后，由一名团队成员对所有相关数据进行摘要。数据摘要中的不确定性由另一名团队成员解决。共有 335 篇文章符合此次范围界定审查的条件，随后被纳入审查范围。多年来，已发表的绘图综述数量不断增加，从 2010 年的 5 篇增加到 2021 年的 73 篇。此外，所收录的绘图综述在研究问题、先验方案、方法论、数据综合和报告等方面的报告差异很大。这项工作进一步凸显了证据合成方法的不足。鉴于每年发表的图谱综述数量巨大，JBI 和 Campbell 等证据合成组织制定的进一步指南有可能澄清研究人员遇到的挑战。

{"title":"Advancing the methodology of mapping reviews: A scoping review","authors":"Hanan Khalil, Fiona Campbell, Katrina Danial, Danielle Pollock, Zachary Munn, Vivian Welsh, Ashrita Saran, Dimi Hoppe, Andrea C. Tricco","doi":"10.1002/jrsm.1694","DOIUrl":"10.1002/jrsm.1694","url":null,"abstract":"This scoping review aims to identify and systematically review published mapping reviews to assess their commonality and heterogeneity and determine whether additional efforts should be made to standardise methodology and reporting. The following databases were searched; Ovid MEDLINE, Embase, CINAHL, PsycINFO, Campbell collaboration database, Social Science Abstracts, Library and Information Science Abstracts (LISA). Following a pilot-test on a random sample of 20 citations included within title and abstracts, two team members independently completed all screening. Ten articles were piloted at full-text screening, and then each citation was reviewed independently by two team members. Discrepancies at both stages were resolved through discussion. Following a pilot-test on a random sample of five relevant full-text articles, one team member abstracted all the relevant data. Uncertainties in the data abstraction were resolved by another team member. A total of 335 articles were eligible for this scoping review and subsequently included. There was an increasing growth in the number of published mapping reviews over the years from 5 in 2010 to 73 in 2021. Moreover, there was a significant variability in reporting the included mapping reviews including their research question, priori protocol, methodology, data synthesis and reporting. This work has further highlighted the gaps in evidence synthesis methodologies. Further guidance developed by evidence synthesis organisations, such as JBI and Campbell, has the potential to clarify challenges experienced by researchers, given the magnitude of mapping reviews published every year.","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"15 3","pages":"384-397"},"PeriodicalIF":9.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jrsm.1694","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139085188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Meta-analysis and partial correlation coefficients: A matter of weights 元分析和偏相关系数：权重问题

IF 9.8 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Research Synthesis Methods

Pub Date : 2023-12-29 DOI: 10.1002/jrsm.1697

Sanghyun Hong, W. Robert Reed

This study builds on the simulation framework of a recent paper by Stanley and Doucouliagos (Research Synthesis Methods 2023;14;515–519). S&D use simulations to make the argument that meta-analyses using partial correlation coefficients (PCCs) should employ a “suboptimal” estimator of the PCC standard error when constructing weights for fixed effect and random effects estimation. We address concerns that their simulations and subsequent recommendation may give meta-analysts a misleading impression. While the estimator they promote dominates the “correct” formula in their Monte Carlo framework, there are other estimators that perform even better. We conclude that more research is needed before best practice recommendations can be made for meta-analyses with PCCs.

本研究以 Stanley 和 Doucouliagos 最近发表的论文（Research Synthesis Methods 2023;14;515-519）的模拟框架为基础。S&D利用模拟提出了一个论点：使用偏相关系数（PCC）进行荟萃分析时，在构建固定效应和随机效应估计的权重时，应采用偏相关系数标准误差的 "次优 "估计器。我们担心他们的模拟和随后的建议可能会给元分析者造成误导。虽然他们所推荐的估计器在蒙特卡罗框架中占据了 "正确 "公式的优势，但还有其他估计器表现得更好。我们的结论是，在为使用 PCC 的元分析提出最佳实践建议之前，还需要进行更多的研究。

引用次数: 0

Bayesian random-effects meta-analysis with empirical heterogeneity priors for application in health technology assessment with very few studies 采用经验异质性先验的贝叶斯随机效应荟萃分析，应用于研究极少的卫生技术评估。

IF 9.8 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Research Synthesis Methods

Pub Date : 2023-12-28 DOI: 10.1002/jrsm.1685

Jona Lilienthal, Sibylle Sturtz, Christoph Schürmann, Matthias Maiworm, Christian Röver, Tim Friede, Ralf Bender

In Bayesian random-effects meta-analysis, the use of weakly informative prior distributions is of particular benefit in cases where only a few studies are included, a situation often encountered in health technology assessment (HTA). Suggestions for empirical prior distributions are available in the literature but it is unknown whether these are adequate in the context of HTA. Therefore, a database of all relevant meta-analyses conducted by the Institute for Quality and Efficiency in Health Care (IQWiG, Germany) was constructed to derive empirical prior distributions for the heterogeneity parameter suitable for HTA. Previously, an extension to the normal-normal hierarchical model had been suggested for this purpose. For different effect measures, this extended model was applied on the database to conservatively derive a prior distribution for the heterogeneity parameter. Comparison of a Bayesian approach using the derived priors with IQWiG's current standard approach for evidence synthesis shows favorable properties. Therefore, these prior distributions are recommended for future meta-analyses in HTA settings and could be embedded into the IQWiG evidence synthesis approach in the case of very few studies.

在贝叶斯随机效应荟萃分析中，使用弱信息先验分布对仅纳入少数研究的情况特别有益，而这正是卫生技术评估（HTA）中经常遇到的情况。文献中提供了经验先验分布的建议，但这些建议是否适用于 HTA 还不得而知。因此，我们建立了一个由医疗质量与效率研究所（IQWiG，德国）进行的所有相关荟萃分析的数据库，以推导出适合 HTA 的异质性参数的经验先验分布。在此之前，曾有人为此建议对正态-正态层次模型进行扩展。针对不同的效应量，在数据库中应用了这一扩展模型，以保守的方式得出异质性参数的先验分布。将使用推导出的先验值的贝叶斯方法与 IQWiG 目前用于证据综合的标准方法进行比较，结果显示两者具有良好的特性。因此，建议在未来的 HTA 环境中进行荟萃分析时使用这些先验分布，在研究极少的情况下，可将其嵌入 IQWiG 证据综合方法中。

{"title":"Bayesian random-effects meta-analysis with empirical heterogeneity priors for application in health technology assessment with very few studies","authors":"Jona Lilienthal, Sibylle Sturtz, Christoph Schürmann, Matthias Maiworm, Christian Röver, Tim Friede, Ralf Bender","doi":"10.1002/jrsm.1685","DOIUrl":"10.1002/jrsm.1685","url":null,"abstract":"In Bayesian random-effects meta-analysis, the use of weakly informative prior distributions is of particular benefit in cases where only a few studies are included, a situation often encountered in health technology assessment (HTA). Suggestions for empirical prior distributions are available in the literature but it is unknown whether these are adequate in the context of HTA. Therefore, a database of all relevant meta-analyses conducted by the Institute for Quality and Efficiency in Health Care (IQWiG, Germany) was constructed to derive empirical prior distributions for the heterogeneity parameter suitable for HTA. Previously, an extension to the normal-normal hierarchical model had been suggested for this purpose. For different effect measures, this extended model was applied on the database to conservatively derive a prior distribution for the heterogeneity parameter. Comparison of a Bayesian approach using the derived priors with IQWiG's current standard approach for evidence synthesis shows favorable properties. Therefore, these prior distributions are recommended for future meta-analyses in HTA settings and could be embedded into the IQWiG evidence synthesis approach in the case of very few studies.","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"15 2","pages":"275-287"},"PeriodicalIF":9.8,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jrsm.1685","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139047915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Consensus on the definition and assessment of external validity of randomized controlled trials: A Delphi study 关于随机对照试验外部有效性的定义和评估的共识：德尔菲研究。

IF 9.8 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Research Synthesis Methods

Pub Date : 2023-12-25 DOI: 10.1002/jrsm.1688

Andres Jung, Tobias Braun, Susan Armijo-Olivo, Dimitris Challoumas, Kerstin Luedtke

External validity is an important parameter that needs to be considered for decision making in health research, but no widely accepted measurement tool for the assessment of external validity of randomized controlled trials (RCTs) exists. One of the most limiting factors for creating such a tool is probably the substantial heterogeneity and lack of consensus in this field. The objective of this study was to reach consensus on a definition of external validity and on criteria to assess the external validity of RCTs included in systematic reviews. A three-round online Delphi study was conducted. The development of the Delphi survey was based on findings from a previous systematic review. Potential panelists were identified through a comprehensive web search. Consensus was reached when at least 67% of the panelists agreed to a proposal. Eighty-four panelists from different countries and various disciplines participated in at least one round of this study. Consensus was reached on the definition of external validity (“External validity is the extent to which results of trials provide an acceptable basis for generalization to other circumstances such as variations in populations, settings, interventions, outcomes, or other relevant contextual factors”), and on 14 criteria to assess the external validity of RCTs in systematic reviews. The results of this Delphi study provide a consensus-based reference standard for future tool development. Future research should focus on adapting, pilot testing, and validating these criteria to develop measurement tools for the assessment of external validity.

外部效度是健康研究决策中需要考虑的一个重要参数，但目前还没有被广泛接受的随机对照试验（RCT）外部效度评估工具。创建此类工具的最大限制因素之一可能是该领域存在大量异质性且缺乏共识。本研究的目的是就外部有效性的定义以及评估系统综述中随机对照试验外部有效性的标准达成共识。我们进行了三轮在线德尔菲研究。德尔菲调查是根据之前的系统综述结果制定的。通过全面的网络搜索确定了潜在的专家组成员。当至少 67% 的专家组成员同意一项建议时，即达成共识。来自不同国家和不同学科的 84 位专家组成员至少参与了一轮研究。我们就外部效度的定义（"外部效度是指试验结果在多大程度上为推广到其他情况（如人群、环境、干预措施、结果或其他相关背景因素的变化）提供了可接受的基础"）以及系统综述中评估 RCT 外部效度的 14 项标准达成了共识。这项德尔菲研究的结果为今后的工具开发提供了一个基于共识的参考标准。未来的研究应侧重于调整、试点测试和验证这些标准，以开发用于评估外部有效性的测量工具。

{"title":"Consensus on the definition and assessment of external validity of randomized controlled trials: A Delphi study","authors":"Andres Jung, Tobias Braun, Susan Armijo-Olivo, Dimitris Challoumas, Kerstin Luedtke","doi":"10.1002/jrsm.1688","DOIUrl":"10.1002/jrsm.1688","url":null,"abstract":"External validity is an important parameter that needs to be considered for decision making in health research, but no widely accepted measurement tool for the assessment of external validity of randomized controlled trials (RCTs) exists. One of the most limiting factors for creating such a tool is probably the substantial heterogeneity and lack of consensus in this field. The objective of this study was to reach consensus on a definition of external validity and on criteria to assess the external validity of RCTs included in systematic reviews. A three-round online Delphi study was conducted. The development of the Delphi survey was based on findings from a previous systematic review. Potential panelists were identified through a comprehensive web search. Consensus was reached when at least 67% of the panelists agreed to a proposal. Eighty-four panelists from different countries and various disciplines participated in at least one round of this study. Consensus was reached on the definition of external validity (“External validity is the extent to which results of trials provide an acceptable basis for generalization to other circumstances such as variations in populations, settings, interventions, outcomes, or other relevant contextual factors”), and on 14 criteria to assess the external validity of RCTs in systematic reviews. The results of this Delphi study provide a consensus-based reference standard for future tool development. Future research should focus on adapting, pilot testing, and validating these criteria to develop measurement tools for the assessment of external validity.","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"15 2","pages":"288-302"},"PeriodicalIF":9.8,"publicationDate":"2023-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jrsm.1688","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139037243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Automated data analysis of unstructured grey literature in health research: A mapping review 对健康研究中的非结构化灰色文献进行自动数据分析：绘图审查。

IF 9.8 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Research Synthesis Methods

Pub Date : 2023-12-19 DOI: 10.1002/jrsm.1692

Lena Schmidt, Saleh Mohamed, Nick Meader, Jaume Bacardit, Dawn Craig

The amount of grey literature and ‘softer’ intelligence from social media or websites is vast. Given the long lead-times of producing high-quality peer-reviewed health information, this is causing a demand for new ways to provide prompt input for secondary research. To our knowledge, this is the first review of automated data extraction methods or tools for health-related grey literature and soft data, with a focus on (semi)automating horizon scans, health technology assessments (HTA), evidence maps, or other literature reviews. We searched six databases to cover both health- and computer-science literature. After deduplication, 10% of the search results were screened by two reviewers, the remainder was single-screened up to an estimated 95% sensitivity; screening was stopped early after screening an additional 1000 results with no new includes. All full texts were retrieved, screened, and extracted by a single reviewer and 10% were checked in duplicate. We included 84 papers covering automation for health-related social media, internet fora, news, patents, government agencies and charities, or trial registers. From each paper, we extracted data about important functionalities for users of the tool or method; information about the level of support and reliability; and about practical challenges and research gaps. Poor availability of code, data, and usable tools leads to low transparency regarding performance and duplication of work. Financial implications, scalability, integration into downstream workflows, and meaningful evaluations should be carefully planned before starting to develop a tool, given the vast amounts of data and opportunities those tools offer to expedite research.

灰色文献和来自社交媒体或网站的 "软 "情报数量巨大。鉴于高质量的同行评审健康信息的制作周期较长，这就需要有新的方法来为二次研究提供及时的输入。据我们所知，这是首次对与健康相关的灰色文献和软数据的自动数据提取方法或工具进行综述，重点关注地平线扫描、健康技术评估 (HTA)、证据图谱或其他文献综述的（半）自动化。我们搜索了六个数据库，涵盖了健康和计算机科学文献。重复数据删除后，10% 的搜索结果由两名审稿人进行筛选，其余的则进行单项筛选，灵敏度估计为 95%；在筛选了另外 1000 个结果且没有新收录后，我们提前停止了筛选。所有全文均由一名审稿人进行检索、筛选和提取，10%的全文进行了重复检查。我们共收录了 84 篇论文，涉及与健康相关的社交媒体、互联网论坛、新闻、专利、政府机构和慈善机构或试验登记册的自动化。我们从每篇论文中提取了有关工具或方法用户重要功能的数据、有关支持水平和可靠性的信息，以及有关实际挑战和研究空白的数据。代码、数据和可用工具的可用性差，导致绩效透明度低和工作重复。鉴于这些工具可提供大量数据和机会以加快研究，因此在开始开发工具之前，应仔细规划其财务影响、可扩展性、与下游工作流程的整合以及有意义的评估。

{"title":"Automated data analysis of unstructured grey literature in health research: A mapping review","authors":"Lena Schmidt, Saleh Mohamed, Nick Meader, Jaume Bacardit, Dawn Craig","doi":"10.1002/jrsm.1692","DOIUrl":"10.1002/jrsm.1692","url":null,"abstract":"The amount of grey literature and ‘softer’ intelligence from social media or websites is vast. Given the long lead-times of producing high-quality peer-reviewed health information, this is causing a demand for new ways to provide prompt input for secondary research. To our knowledge, this is the first review of automated data extraction methods or tools for health-related grey literature and soft data, with a focus on (semi)automating horizon scans, health technology assessments (HTA), evidence maps, or other literature reviews. We searched six databases to cover both health- and computer-science literature. After deduplication, 10% of the search results were screened by two reviewers, the remainder was single-screened up to an estimated 95% sensitivity; screening was stopped early after screening an additional 1000 results with no new includes. All full texts were retrieved, screened, and extracted by a single reviewer and 10% were checked in duplicate. We included 84 papers covering automation for health-related social media, internet fora, news, patents, government agencies and charities, or trial registers. From each paper, we extracted data about important functionalities for users of the tool or method; information about the level of support and reliability; and about practical challenges and research gaps. Poor availability of code, data, and usable tools leads to low transparency regarding performance and duplication of work. Financial implications, scalability, integration into downstream workflows, and meaningful evaluations should be carefully planned before starting to develop a tool, given the vast amounts of data and opportunities those tools offer to expedite research.","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"15 2","pages":"178-197"},"PeriodicalIF":9.8,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jrsm.1692","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138794448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Assessment of temporal instability in the applied ecology and conservation evidence base 评估应用生态学和保护证据库中的时间不稳定性。

IF 9.8 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Research Synthesis Methods

Pub Date : 2023-12-19 DOI: 10.1002/jrsm.1691

Elizabeth Brisco, Elena Kulinskaya, Julia Koricheva

Outcomes of meta-analyses are increasingly used to inform evidence-based decision making in various research fields. However, a number of recent studies have reported rapid temporal changes in magnitude and significance of the reported effects which could make policy-relevant recommendations from meta-analyses to quickly go out of date. We assessed the extent and patterns of temporal trends in magnitude and statistical significance of the cumulative effects in meta-analyses in applied ecology and conservation published between 2004 and 2018. Of the 121 meta-analyses analysed, 93% showed a temporal trend in cumulative effect magnitude or significance with 27% of the datasets exhibiting temporal trends in both. The most common trend was the early study effect when at least one of the first 5 years effect size estimates exhibited more than 50% magnitude difference to the subsequent estimate. The observed temporal trends persisted in majority of datasets once moderators were accounted for. Only 5 datasets showed significant changes in sample size over time which could potentially explain the observed temporal change in the cumulative effects. Year of publication of meta-analysis had no significant effect on presence of temporal trends in cumulative effects. Our results show that temporal changes in magnitude and statistical significance in applied ecology are widespread and represent a serious potential threat to use of meta-analyses for decision-making in conservation and environmental management. We recommend use of cumulative meta-analyses and call for more studies exploring the causes of the temporal effects.

荟萃分析的结果越来越多地被用于各研究领域的循证决策。然而，最近的一些研究报告显示，所报告的效应大小和显著性在时间上发生了快速变化，这可能会使荟萃分析中与政策相关的建议很快过时。我们评估了 2004 年至 2018 年间发表的应用生态学和保护荟萃分析中累积效应的规模和统计意义的时间趋势的程度和模式。在分析的 121 项元分析中，93% 显示出累积效应大小或显著性的时间趋势，27% 的数据集显示出两者的时间趋势。最常见的趋势是早期研究效应，即前 5 年的效应大小估计值中至少有一年与随后的估计值相差 50%以上。一旦考虑到调节因素，观察到的时间趋势在大多数数据集中持续存在。只有 5 个数据集的样本量随时间发生了显著变化，这可能解释了所观察到的累积效应的时间变化。荟萃分析的发表年份对累积效应的时间趋势没有明显影响。我们的研究结果表明，在应用生态学中，影响程度和统计意义的时间变化非常普遍，这对在保护和环境管理决策中使用荟萃分析构成了严重的潜在威胁。我们建议使用累积荟萃分析，并呼吁开展更多研究，探索时间效应的原因。

{"title":"Assessment of temporal instability in the applied ecology and conservation evidence base","authors":"Elizabeth Brisco, Elena Kulinskaya, Julia Koricheva","doi":"10.1002/jrsm.1691","DOIUrl":"10.1002/jrsm.1691","url":null,"abstract":"Outcomes of meta-analyses are increasingly used to inform evidence-based decision making in various research fields. However, a number of recent studies have reported rapid temporal changes in magnitude and significance of the reported effects which could make policy-relevant recommendations from meta-analyses to quickly go out of date. We assessed the extent and patterns of temporal trends in magnitude and statistical significance of the cumulative effects in meta-analyses in applied ecology and conservation published between 2004 and 2018. Of the 121 meta-analyses analysed, 93% showed a temporal trend in cumulative effect magnitude or significance with 27% of the datasets exhibiting temporal trends in both. The most common trend was the early study effect when at least one of the first 5 years effect size estimates exhibited more than 50% magnitude difference to the subsequent estimate. The observed temporal trends persisted in majority of datasets once moderators were accounted for. Only 5 datasets showed significant changes in sample size over time which could potentially explain the observed temporal change in the cumulative effects. Year of publication of meta-analysis had no significant effect on presence of temporal trends in cumulative effects. Our results show that temporal changes in magnitude and statistical significance in applied ecology are widespread and represent a serious potential threat to use of meta-analyses for decision-making in conservation and environmental management. We recommend use of cumulative meta-analyses and call for more studies exploring the causes of the temporal effects.","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"15 3","pages":"398-412"},"PeriodicalIF":9.8,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jrsm.1691","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138794446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0