首页 > 最新文献

Cochrane Evidence Synthesis and Methods最新文献

英文 中文
Associations of Social and Demographic Factors on the Outcomes of Ocular Melanoma and Other Adult Ocular Neoplasms in the United States: A Systematic Review. 在美国,社会和人口因素与眼部黑色素瘤和其他成人眼部肿瘤预后的关联:一项系统综述。
Pub Date : 2026-03-10 eCollection Date: 2026-03-01 DOI: 10.1002/cesm.70075
Daniel Shaughnessy, Vijay Joshi, Natalia Dellavalle, Louis Leslie, Michael Edwards, Timothy Waxweiler, Tianjing Li, Riaz Qureshi

Introduction: Social determinants of health (SDOH), including economic stability, education access and quality, healthcare access and quality, neighborhood and built environment, and social and community context, shape gaps in health outcomes across many conditions. Ocular neoplasms are no exception. Cancers such as uveal melanoma, conjunctival squamous cell carcinoma, ocular lymphoma, and ocular Kaposi sarcoma may be especially vulnerable to social and demographic influences. We systematically reviewed documented associations between SDOH and these ocular cancers in the United States.

Methods: Following a pre-registered protocol, we searched MEDLINE, Embase, and Web of Science (from January 2000 to November 2023) for primary studies of any design that evaluated one or more relationships between SDOH and outcomes related to the ocular cancers listed above. Outcomes included cancer incidence, stage at diagnosis, treatment patterns, survival, and mortality. We extracted study design, population, exposure, and outcome characteristics, classified each exposure-outcome association by its direction (e.g., favorable, unfavorable, or null), and assessed the risk of bias using a modified Newcastle-Ottawa Scale. Due to heterogeneity in exposure and outcome definitions, we narratively synthesized findings by SDOH domain.

Results: We included 21 studies examining 167 unique associations. Social and community context, typically represented as race and ethnicity, was the most frequently studied domain, followed by economic stability (e.g., income) and healthcare access and quality (e.g., insurance type or travel distance). Across domains, lower socioeconomic status, public or no insurance, minority racial and ethnic identity, and care at academic centers generally are associated with later stage at diagnosis, higher odds of enucleation, or worse survival. Higher income, private insurance, and treatment at experienced facilities were often associated to earlier presentation and better outcomes.

Conclusion: SDOH have a measurable and often unfavorable relationship with the diagnosis, management, and prognosis of rare adult ocular cancers in the United States. Standardized SDOH exposures and measurements, prospective data collection, and adjustment for confounding are necessary to strengthen the evidence and guide multi-domain interventions (e.g., expanded insurance, travel assistance to high-volume centers, and community eye-health initiatives) aimed at narrowing these gaps.

健康的社会决定因素(SDOH),包括经济稳定性、教育机会和质量、医疗保健机会和质量、邻里和建成环境以及社会和社区背景,在许多条件下形成了健康结果的差距。眼部肿瘤也不例外。诸如葡萄膜黑色素瘤、结膜鳞状细胞癌、眼淋巴瘤和眼卡波西肉瘤等癌症可能特别容易受到社会和人口统计学的影响。我们系统地回顾了美国SDOH和这些眼部癌症之间的文献关联。方法:按照预先注册的方案,我们检索了MEDLINE、Embase和Web of Science(2000年1月至2023年11月),以评估SDOH与上述眼癌相关结果之间一种或多种关系的任何设计的初步研究。结果包括癌症发病率、诊断阶段、治疗模式、生存率和死亡率。我们提取了研究设计、人群、暴露和结果特征,根据其方向(例如,有利、不利或无效)对每个暴露-结果关联进行分类,并使用改进的纽卡斯尔-渥太华量表评估偏倚风险。由于暴露和结果定义的异质性,我们通过SDOH域叙述性地综合了研究结果。结果:我们纳入了21项研究,检查了167种独特的关联。社会和社区背景(通常以种族和族裔表示)是最常被研究的领域,其次是经济稳定性(如收入)和医疗保健获取和质量(如保险类型或旅行距离)。在各个领域,较低的社会经济地位、公共或无保险、少数种族和民族身份以及在学术中心的护理通常与较晚的诊断阶段、较高的去核几率或较差的存活率有关。较高的收入、私人保险和在经验丰富的机构接受治疗往往与早期就诊和更好的结果有关。结论:在美国,SDOH与罕见成人眼癌的诊断、治疗和预后有明显的不利关系。标准化的SDOH暴露和测量、前瞻性数据收集和对混杂因素的调整对于加强证据和指导旨在缩小这些差距的多领域干预措施(例如,扩大保险、向大容量中心提供旅行援助和社区眼健康倡议)是必要的。
{"title":"Associations of Social and Demographic Factors on the Outcomes of Ocular Melanoma and Other Adult Ocular Neoplasms in the United States: A Systematic Review.","authors":"Daniel Shaughnessy, Vijay Joshi, Natalia Dellavalle, Louis Leslie, Michael Edwards, Timothy Waxweiler, Tianjing Li, Riaz Qureshi","doi":"10.1002/cesm.70075","DOIUrl":"https://doi.org/10.1002/cesm.70075","url":null,"abstract":"<p><strong>Introduction: </strong>Social determinants of health (SDOH), including economic stability, education access and quality, healthcare access and quality, neighborhood and built environment, and social and community context, shape gaps in health outcomes across many conditions. Ocular neoplasms are no exception. Cancers such as uveal melanoma, conjunctival squamous cell carcinoma, ocular lymphoma, and ocular Kaposi sarcoma may be especially vulnerable to social and demographic influences. We systematically reviewed documented associations between SDOH and these ocular cancers in the United States.</p><p><strong>Methods: </strong>Following a pre-registered protocol, we searched MEDLINE, Embase, and Web of Science (from January 2000 to November 2023) for primary studies of any design that evaluated one or more relationships between SDOH and outcomes related to the ocular cancers listed above. Outcomes included cancer incidence, stage at diagnosis, treatment patterns, survival, and mortality. We extracted study design, population, exposure, and outcome characteristics, classified each exposure-outcome association by its direction (e.g., favorable, unfavorable, or null), and assessed the risk of bias using a modified Newcastle-Ottawa Scale. Due to heterogeneity in exposure and outcome definitions, we narratively synthesized findings by SDOH domain.</p><p><strong>Results: </strong>We included 21 studies examining 167 unique associations. Social and community context, typically represented as race and ethnicity, was the most frequently studied domain, followed by economic stability (e.g., income) and healthcare access and quality (e.g., insurance type or travel distance). Across domains, lower socioeconomic status, public or no insurance, minority racial and ethnic identity, and care at academic centers generally are associated with later stage at diagnosis, higher odds of enucleation, or worse survival. Higher income, private insurance, and treatment at experienced facilities were often associated to earlier presentation and better outcomes.</p><p><strong>Conclusion: </strong>SDOH have a measurable and often unfavorable relationship with the diagnosis, management, and prognosis of rare adult ocular cancers in the United States. Standardized SDOH exposures and measurements, prospective data collection, and adjustment for confounding are necessary to strengthen the evidence and guide multi-domain interventions (e.g., expanded insurance, travel assistance to high-volume centers, and community eye-health initiatives) aimed at narrowing these gaps.</p>","PeriodicalId":100286,"journal":{"name":"Cochrane Evidence Synthesis and Methods","volume":"4 2","pages":"e70075"},"PeriodicalIF":0.0,"publicationDate":"2026-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12977123/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147446569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Large Language Models to Address Contextual Questions in Systematic Reviews. 使用大型语言模型来解决系统评论中的上下文问题。
Pub Date : 2026-02-26 eCollection Date: 2026-03-01 DOI: 10.1002/cesm.70060
Susanne Hempel, Kimny Sysawang, Haley K Holmer, Erin Tokutomi, Suchitra Iyer, Zhen Wang, Edi Kuhn, Mohammad Hassan Murad

Objectives: Systematic evidence reviews (SERs) produced by the U.S. Agency for Healthcare Research and Quality (AHRQ) Evidence-based Practice Center (EPC) Program use contextual questions to provide context and background information on the topic. There is currently no standardized approach to address contextual questions in systematic reviews. This study explored the use of publicly available large language models (LLMs) in addressing contextual questions.

Study design: Using a set of 20 published and 5 yet to be published SERs, we selected one contextual question per report and used it as a prompt to elicit answers from an LLM (ChatGPT, Bard, Claude, or Perplexity). Two independent reviewers rated the results using a priori established evaluation criteria (https://osf.io/4k3cu/), comparing the response in the SER to LLM-generated responses. The study was guided by six research questions addressing feasibility, validity of content, validity of structure, mistakes, congruence between responses, and incremental validity of using LLMs to address contextual questions.

Results: Using minimal prompt engineering produced relevant responses and documented the feasibility of LLM-generated answers to contextual questions. Responses differed in content and format and are not reproducible (e.g., LLMs update regularly), but LLMs were able to produce articulate, clinically plausible, and well-structured responses. We detected few factual errors, contradictions, and no instance of suspected bias, but citations supporting LLM-generated responses could often not be produced or could not be verified ('confabulations'). Congruence with human generated responses varied, with LLM-generated responses providing more background on the topic and SERs providing more nuanced answers in response to the contextual question. Results regarding incremental validity were mixed and may depend on the tool.

Conclusion: LLMs are potentially helpful in addressing contextual questions in systematic reviews but human expertise remains essential for using the generated information in a meaningful way.

目的:由美国卫生保健研究和质量机构(AHRQ)循证实践中心(EPC)项目制作的系统证据审查(SERs)使用上下文问题来提供有关该主题的上下文和背景信息。目前还没有标准化的方法来解决系统评价中的上下文问题。本研究探索了在处理上下文问题时使用公开可用的大型语言模型(llm)。研究设计:使用一组20个已发表的和5个尚未发表的SERs,我们在每个报告中选择一个上下文问题,并将其用作提示,以引出法学硕士(ChatGPT, Bard, Claude或Perplexity)的答案。两名独立审稿人使用先验建立的评估标准(https://osf)对结果进行评级。io/4k3cu/),将SER中的响应与llm生成的响应进行比较。本研究以六个研究问题为指导,包括可行性、内容效度、结构效度、错误、回答一致性和使用法学硕士解决上下文问题的增量效度。结果:使用最小的提示工程产生了相关的响应,并记录了法学硕士生成的上下文问题答案的可行性。响应在内容和格式上有所不同,并且不可重复(例如,法学硕士定期更新),但法学硕士能够产生清晰,临床合理且结构良好的响应。我们几乎没有发现事实错误、矛盾,也没有发现可疑的偏见,但支持法学硕士生成的回应的引用通常无法产生或无法验证(“虚构”)。与人类生成的响应的一致性各不相同,法学硕士生成的响应提供了更多关于主题的背景,而SERs提供了更细致的答案来响应上下文问题。关于增量效度的结果是混合的,可能取决于工具。结论:法学硕士可能有助于解决系统评价中的上下文问题,但人类专业知识对于以有意义的方式使用生成的信息仍然至关重要。
{"title":"Using Large Language Models to Address Contextual Questions in Systematic Reviews.","authors":"Susanne Hempel, Kimny Sysawang, Haley K Holmer, Erin Tokutomi, Suchitra Iyer, Zhen Wang, Edi Kuhn, Mohammad Hassan Murad","doi":"10.1002/cesm.70060","DOIUrl":"https://doi.org/10.1002/cesm.70060","url":null,"abstract":"<p><strong>Objectives: </strong>Systematic evidence reviews (SERs) produced by the U.S. Agency for Healthcare Research and Quality (AHRQ) Evidence-based Practice Center (EPC) Program use contextual questions to provide context and background information on the topic. There is currently no standardized approach to address contextual questions in systematic reviews. This study explored the use of publicly available large language models (LLMs) in addressing contextual questions.</p><p><strong>Study design: </strong>Using a set of 20 published and 5 yet to be published SERs, we selected one contextual question per report and used it as a prompt to elicit answers from an LLM (ChatGPT, Bard, Claude, or Perplexity). Two independent reviewers rated the results using a priori established evaluation criteria (https://osf.io/4k3cu/), comparing the response in the SER to LLM-generated responses. The study was guided by six research questions addressing feasibility, validity of content, validity of structure, mistakes, congruence between responses, and incremental validity of using LLMs to address contextual questions.</p><p><strong>Results: </strong>Using minimal prompt engineering produced relevant responses and documented the feasibility of LLM-generated answers to contextual questions. Responses differed in content and format and are not reproducible (e.g., LLMs update regularly), but LLMs were able to produce articulate, clinically plausible, and well-structured responses. We detected few factual errors, contradictions, and no instance of suspected bias, but citations supporting LLM-generated responses could often not be produced or could not be verified ('confabulations'). Congruence with human generated responses varied, with LLM-generated responses providing more background on the topic and SERs providing more nuanced answers in response to the contextual question. Results regarding incremental validity were mixed and may depend on the tool.</p><p><strong>Conclusion: </strong>LLMs are potentially helpful in addressing contextual questions in systematic reviews but human expertise remains essential for using the generated information in a meaningful way.</p>","PeriodicalId":100286,"journal":{"name":"Cochrane Evidence Synthesis and Methods","volume":"4 2","pages":"e70060"},"PeriodicalIF":0.0,"publicationDate":"2026-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12948247/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147328851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to “Health Equity in Systematic Reviews: A Tutorial—Part 1 Getting Started With Health Equity in Your Review”, “Health Equity in Systematic Reviews: A Tutorial—Part 2 Implementing Health Equity Throughout Your Methods”, “Meta-Analysis Using Time-to-Event Data: A Tutorial” and “Split Body Trials in Systematic Reviews and Meta-Analyses: A Tutorial” 更正了“系统评价中的健康公平:教程-第1部分:如何开始你的评价中的健康公平”、“系统评价中的健康公平:教程-第2部分:在你的方法中实现健康公平”、“使用事件时间数据的荟萃分析:教程”和“系统评价和荟萃分析中的分离体试验:教程”。
Pub Date : 2026-02-19 DOI: 10.1002/cesm.70062

J. Petkovic, J. P. Pardo, V. Welch, et al., “Health Equity in Systematic Reviews: A Tutorial—Part 1 Getting Started With Health Equity in Your Review,” Cochrane Evidence Synthesis and Methods 3 (2025): 1–6, https://doi.org/10.1002/cesm.70055.

J. Petkovic, J. P. Pardo, V. Welch, et al., “Health Equity in Systematic Reviews: A Tutorial—Part 2 Implementing Health Equity Throughout Your Methods,” Cochrane Evidence Synthesis and Methods 3 (2025): 1–7, https://doi.org/10.1002/cesm.70054.

Krishan, A. and Dwan, K. (2025), Meta-Analysis Using Time-to-Event Data: A Tutorial. Cochrane Evidence Synthesis and Methods, 3: e70041. https://doi.org/10.1002/cesm.70041

Livingstone, N., Dwan, K. and Chaplin, M. (2025), Split Body Trials in Systematic Reviews and Meta-Analyses: A Tutorial. Cochrane Evidence Synthesis and Methods, 3: e70052. https://doi.org/10.1002/cesm.70052

The two articles by Petkovic et al. (1) were incorrectly labelled as “Methods Article”. They have now been corrected to “Tutorial”.

In Petkovic et al. (1), Appendix A, the row for ‘Religion’ states “This refers to the religious beliefs and can affect health equity when the choices related to these beliefs are imposed on a person by their family or community”. This should state “This refers to religious beliefs, which can influence health and well-being by fostering community support and shared values, and may also affect health equity (positively and negatively) when choices related to these beliefs are shaped by family or community expectations”.

In Petkovic et al. (2), the following Conflict of Interest statement was missing:

CONFLICT OF INTEREST STATEMENT

All authors are members of the leadership team of the Cochrane Health Equity Thematic Group. The authors have no other conflicts to declare.

The article by Krishan & Dwan (3) was incorrectly labelled as “Brief Report”. This has now been corrected to “Tutorial”.

The article by Livingstone, Dwan & Chaplin (4) was incorrectly labelled as “Editorial”. This has now been corrected to “Tutorial”.

These have now been corrected in the published articles.

We apologize for these errors.

[这更正了文章DOI: 10.1002/cesm.70055。][更正文章DOI: 10.1002/cesm.70054][更正文章DOI: 10.1002/cesm.70041][更正文章DOI: 10.1002/cesm.70052]。
{"title":"Correction to “Health Equity in Systematic Reviews: A Tutorial—Part 1 Getting Started With Health Equity in Your Review”, “Health Equity in Systematic Reviews: A Tutorial—Part 2 Implementing Health Equity Throughout Your Methods”, “Meta-Analysis Using Time-to-Event Data: A Tutorial” and “Split Body Trials in Systematic Reviews and Meta-Analyses: A Tutorial”","authors":"","doi":"10.1002/cesm.70062","DOIUrl":"10.1002/cesm.70062","url":null,"abstract":"<p>J. Petkovic, J. P. Pardo, V. Welch, et al., “Health Equity in Systematic Reviews: A Tutorial—Part 1 Getting Started With Health Equity in Your Review,” <i>Cochrane Evidence Synthesis and Methods</i> 3 (2025): 1–6, https://doi.org/10.1002/cesm.70055.</p><p>J. Petkovic, J. P. Pardo, V. Welch, et al., “Health Equity in Systematic Reviews: A Tutorial—Part 2 Implementing Health Equity Throughout Your Methods,” <i>Cochrane Evidence Synthesis and Methods</i> 3 (2025): 1–7, https://doi.org/10.1002/cesm.70054.</p><p>Krishan, A. and Dwan, K. (2025), Meta-Analysis Using Time-to-Event Data: A Tutorial. Cochrane Evidence Synthesis and Methods, 3: e70041. https://doi.org/10.1002/cesm.70041</p><p>Livingstone, N., Dwan, K. and Chaplin, M. (2025), Split Body Trials in Systematic Reviews and Meta-Analyses: A Tutorial. Cochrane Evidence Synthesis and Methods, 3: e70052. https://doi.org/10.1002/cesm.70052</p><p>The two articles by Petkovic <i>et al.</i> (<span>1</span>) were incorrectly labelled as “Methods Article”. They have now been corrected to “Tutorial”.</p><p>In Petkovic <i>et al.</i> (<span>1</span>), Appendix A, the row for ‘Religion’ states “This refers to the religious beliefs and can affect health equity when the choices related to these beliefs are imposed on a person by their family or community”. This should state “This refers to religious beliefs, which can influence health and well-being by fostering community support and shared values, and may also affect health equity (positively and negatively) when choices related to these beliefs are shaped by family or community expectations”.</p><p>In Petkovic <i>et al.</i> (<span>2</span>), the following Conflict of Interest statement was missing:</p><p><b>CONFLICT OF INTEREST STATEMENT</b></p><p>All authors are members of the leadership team of the Cochrane Health Equity Thematic Group. The authors have no other conflicts to declare.</p><p>The article by Krishan &amp; Dwan (<span>3</span>) was incorrectly labelled as “Brief Report”. This has now been corrected to “Tutorial”.</p><p>The article by Livingstone, Dwan &amp; Chaplin (<span>4</span>) was incorrectly labelled as “Editorial”. This has now been corrected to “Tutorial”.</p><p>These have now been corrected in the published articles.</p><p>We apologize for these errors.</p>","PeriodicalId":100286,"journal":{"name":"Cochrane Evidence Synthesis and Methods","volume":"4 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12919368/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147273839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sensitivity and Precision of Search Strategies Built Using a Text-Mining Word Frequency Tool (PubReMiner) Compared to Current Best Practice for Building Search Strategies: A Study Within a Review (SWAR) 使用文本挖掘词频工具(PubReMiner)构建搜索策略的灵敏度和精度与当前构建搜索策略的最佳实践的比较:一篇综述中的研究(SWAR)。
Pub Date : 2026-02-18 DOI: 10.1002/cesm.70074
Andrew Dullea, Marie Carrigan, Lydia O'Sullivan, Isabelle Delaunois, Helen Clark, Martin Boudou, Martina Giusti, Kieran A. Walsh, Patricia Harrington, Susan M. Smith, Máirín Ryan

Objective

PubReMiner is a text-mining tool that analyses a seed set of citations to assess word frequency in titles, abstracts, and Medical Subject Headings (MeSH). This study aimed to determine the sensitivity and precision of search strategies developed using the PubReMiner tool compared to conventional search strategies developed by a librarian at our institution.

Methods

Twelve consecutive reviews conducted at our center were included from September 2023 to January 2025. These reviews included various types of evidence synthesis, including rapid reviews and systematic reviews, covering a variety of topics. One librarian developed a comprehensive search strategy, which included a conventional MEDLINE search for each review. Separately, two librarians independently developed MEDLINE search strategies using PubReMiner-generated word frequency tables (PubReMiner 1 and PubReMiner 2). All search strategies were constructed by experienced librarians using predefined work instructions. Primary outcomes were sensitivity and precision. Secondary outcomes included the number needed to read, the number of unique references retrieved, and the time taken to construct each strategy.

Results

Sensitivity of PubReMiner strategies was generally lower than that of conventional strategies; however, in one review, PubReMiner achieved a higher sensitivity (83.87%) than the conventional strategy (58.06%). Only the sensitivity outcome showed a statistically significant difference between search methods (Friedman test p = 0.0065). No statistically significant difference in precision between the searches was identified. PubReMiner strategies were typically faster to construct but yielded inconsistent performance across reviews and between librarians.

Conclusion

While PubReMiner offers efficiency advantages, its inconsistent performance in retrieving relevant studies suggests that it should not replace conventional search strategies. The study illustrates the value of multi-review SWARs in producing evidence that informs evidence synthesis practices.

目的:PubReMiner是一个文本挖掘工具,用于分析引文种子集,以评估标题、摘要和医学主题标题(MeSH)中的词频。本研究旨在确定使用PubReMiner工具开发的搜索策略与我们机构图书管理员开发的传统搜索策略的敏感性和准确性。方法:纳入2023年9月至2025年1月在本中心连续进行的12篇综述。这些综述包括各种类型的证据综合,包括快速综述和系统综述,涵盖各种主题。一位图书管理员开发了一种全面的搜索策略,其中包括对每篇评论进行传统的MEDLINE搜索。另外,两位图书馆员使用PubReMiner生成的词频表(PubReMiner 1和PubReMiner 2)独立开发了MEDLINE搜索策略。所有的搜索策略都是由经验丰富的图书馆员使用预定义的工作指令构建的。主要结局为敏感性和精密度。次要结果包括需要阅读的数量,检索到的唯一参考文献的数量,以及构建每种策略所花费的时间。结果:PubReMiner策略的敏感性普遍低于常规策略;然而,在一篇综述中,PubReMiner获得了比传统策略(58.06%)更高的灵敏度(83.87%)。只有敏感性结果显示搜索方法之间有统计学显著差异(Friedman检验p = 0.0065)。两种搜索方法的精确度没有统计学上的显著差异。PubReMiner策略的构建速度通常更快,但在评审和图书管理员之间产生了不一致的性能。结论:虽然PubReMiner具有效率优势,但其在检索相关研究时的不一致表现表明,它不应取代传统的搜索策略。该研究说明了多重评审SWARs在产生证据方面的价值,这些证据为证据合成实践提供了依据。
{"title":"Sensitivity and Precision of Search Strategies Built Using a Text-Mining Word Frequency Tool (PubReMiner) Compared to Current Best Practice for Building Search Strategies: A Study Within a Review (SWAR)","authors":"Andrew Dullea,&nbsp;Marie Carrigan,&nbsp;Lydia O'Sullivan,&nbsp;Isabelle Delaunois,&nbsp;Helen Clark,&nbsp;Martin Boudou,&nbsp;Martina Giusti,&nbsp;Kieran A. Walsh,&nbsp;Patricia Harrington,&nbsp;Susan M. Smith,&nbsp;Máirín Ryan","doi":"10.1002/cesm.70074","DOIUrl":"10.1002/cesm.70074","url":null,"abstract":"<div>\u0000 \u0000 \u0000 <section>\u0000 \u0000 <h3> Objective</h3>\u0000 \u0000 <p>PubReMiner is a text-mining tool that analyses a seed set of citations to assess word frequency in titles, abstracts, and Medical Subject Headings (MeSH). This study aimed to determine the sensitivity and precision of search strategies developed using the PubReMiner tool compared to conventional search strategies developed by a librarian at our institution.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Methods</h3>\u0000 \u0000 <p>Twelve consecutive reviews conducted at our center were included from September 2023 to January 2025. These reviews included various types of evidence synthesis, including rapid reviews and systematic reviews, covering a variety of topics. One librarian developed a comprehensive search strategy, which included a conventional MEDLINE search for each review. Separately, two librarians independently developed MEDLINE search strategies using PubReMiner-generated word frequency tables (PubReMiner 1 and PubReMiner 2). All search strategies were constructed by experienced librarians using predefined work instructions. Primary outcomes were sensitivity and precision. Secondary outcomes included the number needed to read, the number of unique references retrieved, and the time taken to construct each strategy.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Results</h3>\u0000 \u0000 <p>Sensitivity of PubReMiner strategies was generally lower than that of conventional strategies; however, in one review, PubReMiner achieved a higher sensitivity (83.87%) than the conventional strategy (58.06%). Only the sensitivity outcome showed a statistically significant difference between search methods (Friedman test <i>p</i> = 0.0065). No statistically significant difference in precision between the searches was identified. PubReMiner strategies were typically faster to construct but yielded inconsistent performance across reviews and between librarians.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Conclusion</h3>\u0000 \u0000 <p>While PubReMiner offers efficiency advantages, its inconsistent performance in retrieving relevant studies suggests that it should not replace conventional search strategies. The study illustrates the value of multi-review SWARs in producing evidence that informs evidence synthesis practices.</p>\u0000 </section>\u0000 </div>","PeriodicalId":100286,"journal":{"name":"Cochrane Evidence Synthesis and Methods","volume":"4 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12915468/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146230274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ninety-Seven Percent of Trials Investigating Robotic Interventions in Physiotherapy Contained Abstract Spin: A Meta-Research Review 研究物理治疗中机器人干预的97%的试验包含抽象旋转:一项元研究综述。
Pub Date : 2026-02-12 DOI: 10.1002/cesm.70072
Hilary Tier, Jana Verveer, David B. Anderson, Camila Quel De Oliveira, Nicci Bartley, Poonam Mehta, Rafael Z. Pinto, Arianne P. Verhagen, Alana B. McCambridge, Peter W. Stubbs
<div> <section> <h3> Background</h3> <p>Abstract spin involves misrepresenting or misreporting study findings in the abstract of an article. The abstract is the most easily accessible part of the article and may determine if an article is read, purchased or the findings are implemented into practice. Trials using new technologies, such as robotics, may be particularly vulnerable to spin due to the high costs associated with research and development.</p> </section> <section> <h3> Objective</h3> <p>To identify and assess abstract spin in physiotherapy clinical trials investigating robotic interventions.</p> </section> <section> <h3> Design</h3> <p>Meta-research review.</p> </section> <section> <h3> Methods</h3> <p>We searched the Physiotherapy Evidence Database (PEDro) in August 2024 for two-armed clinical trials investigating robotic interventions compared to nonrobotic interventions, in any patient population. Article screening and data extraction were performed by two people independently. Quality assessment was performed using the PEDro scale with PEDro scores ≥ 6 deemed high quality. Abstract spin was assessed by two independent raters using a 7-item checklist. Spin items were scored “present,” “not present” or “not applicable.” Data were presented as counts and percentages.</p> </section> <section> <h3> Results</h3> <p>We included 160 trials, of which 95% were in neurological physiotherapy and 61% of trials were high quality. Almost all trials (97%) contained at least one item of spin. Most often abstracts failed to mention adverse events (90%) or overenthusiastically interpretated non-significant (primary) outcomes (77%). One percent of abstracts clearly omitted negative primary outcomes, and 23% of abstracts recommended treatments without clinically important effects on the primary outcomes. These low spin percentages were due to many trials not reporting any negative finding and trials not providing a clinical recommendation in the abstract.</p> </section> <section> <h3> Conclusion</h3> <p>Ninety-seven percent of abstracts in trials investigating robotic interventions in physiotherapy contained spin. Academic journals should be conscious of the high prevalence of abstract spin in robotic trials and consider implementing stricter author guidelines or peer-review practices to ensure abstracts truly reflect the study findings.</p>
背景:spin涉及在文章摘要中歪曲或误报研究结果。摘要是文章中最容易获得的部分,它可以决定一篇文章是否被阅读、购买或将研究结果付诸实践。由于与研究和开发相关的高成本,使用新技术(如机器人技术)的试验可能特别容易受到影响。目的:在研究机器人干预的物理治疗临床试验中识别和评估抽象旋转。设计:元研究综述。方法:我们于2024年8月检索了物理治疗证据数据库(PEDro),在任何患者群体中进行双臂临床试验,研究机器人干预与非机器人干预的比较。文章筛选和数据提取由两人独立完成。采用PEDro量表进行质量评价,PEDro评分≥6为高质量。摘要旋转由两名独立评分者使用7项检查表进行评估。旋转项目被打分为“存在”、“不存在”或“不适用”。数据以计数和百分比表示。结果:我们纳入160项试验,其中95%为神经物理治疗,61%为高质量试验。几乎所有的试验(97%)至少包含一个旋转项目。大多数摘要没有提及不良事件(90%)或过度热情地解释非重要(主要)结局(77%)。1%的摘要明确省略了负面的主要结果,23%的摘要推荐了对主要结果没有临床重要影响的治疗方法。这些低自旋百分比是由于许多试验没有报告任何负面发现和试验没有在摘要中提供临床推荐。结论:在研究机器人干预物理治疗的试验中,97%的摘要含有旋转。学术期刊应该意识到机器人试验中抽象旋转的高普遍性,并考虑实施更严格的作者指南或同行评审实践,以确保摘要真实反映研究结果。
{"title":"Ninety-Seven Percent of Trials Investigating Robotic Interventions in Physiotherapy Contained Abstract Spin: A Meta-Research Review","authors":"Hilary Tier,&nbsp;Jana Verveer,&nbsp;David B. Anderson,&nbsp;Camila Quel De Oliveira,&nbsp;Nicci Bartley,&nbsp;Poonam Mehta,&nbsp;Rafael Z. Pinto,&nbsp;Arianne P. Verhagen,&nbsp;Alana B. McCambridge,&nbsp;Peter W. Stubbs","doi":"10.1002/cesm.70072","DOIUrl":"10.1002/cesm.70072","url":null,"abstract":"&lt;div&gt;\u0000 \u0000 \u0000 &lt;section&gt;\u0000 \u0000 &lt;h3&gt; Background&lt;/h3&gt;\u0000 \u0000 &lt;p&gt;Abstract spin involves misrepresenting or misreporting study findings in the abstract of an article. The abstract is the most easily accessible part of the article and may determine if an article is read, purchased or the findings are implemented into practice. Trials using new technologies, such as robotics, may be particularly vulnerable to spin due to the high costs associated with research and development.&lt;/p&gt;\u0000 &lt;/section&gt;\u0000 \u0000 &lt;section&gt;\u0000 \u0000 &lt;h3&gt; Objective&lt;/h3&gt;\u0000 \u0000 &lt;p&gt;To identify and assess abstract spin in physiotherapy clinical trials investigating robotic interventions.&lt;/p&gt;\u0000 &lt;/section&gt;\u0000 \u0000 &lt;section&gt;\u0000 \u0000 &lt;h3&gt; Design&lt;/h3&gt;\u0000 \u0000 &lt;p&gt;Meta-research review.&lt;/p&gt;\u0000 &lt;/section&gt;\u0000 \u0000 &lt;section&gt;\u0000 \u0000 &lt;h3&gt; Methods&lt;/h3&gt;\u0000 \u0000 &lt;p&gt;We searched the Physiotherapy Evidence Database (PEDro) in August 2024 for two-armed clinical trials investigating robotic interventions compared to nonrobotic interventions, in any patient population. Article screening and data extraction were performed by two people independently. Quality assessment was performed using the PEDro scale with PEDro scores ≥ 6 deemed high quality. Abstract spin was assessed by two independent raters using a 7-item checklist. Spin items were scored “present,” “not present” or “not applicable.” Data were presented as counts and percentages.&lt;/p&gt;\u0000 &lt;/section&gt;\u0000 \u0000 &lt;section&gt;\u0000 \u0000 &lt;h3&gt; Results&lt;/h3&gt;\u0000 \u0000 &lt;p&gt;We included 160 trials, of which 95% were in neurological physiotherapy and 61% of trials were high quality. Almost all trials (97%) contained at least one item of spin. Most often abstracts failed to mention adverse events (90%) or overenthusiastically interpretated non-significant (primary) outcomes (77%). One percent of abstracts clearly omitted negative primary outcomes, and 23% of abstracts recommended treatments without clinically important effects on the primary outcomes. These low spin percentages were due to many trials not reporting any negative finding and trials not providing a clinical recommendation in the abstract.&lt;/p&gt;\u0000 &lt;/section&gt;\u0000 \u0000 &lt;section&gt;\u0000 \u0000 &lt;h3&gt; Conclusion&lt;/h3&gt;\u0000 \u0000 &lt;p&gt;Ninety-seven percent of abstracts in trials investigating robotic interventions in physiotherapy contained spin. Academic journals should be conscious of the high prevalence of abstract spin in robotic trials and consider implementing stricter author guidelines or peer-review practices to ensure abstracts truly reflect the study findings.&lt;/p&gt;\u0000 ","PeriodicalId":100286,"journal":{"name":"Cochrane Evidence Synthesis and Methods","volume":"4 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12904089/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146204511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Promoting the Implementation of Co-Produced Cochrane Evidence: An Exploratory Study of Improving Partnering With Consumers 促进共同生产科克伦证据的实施:改善与消费者合作的探索性研究。
Pub Date : 2026-02-03 DOI: 10.1002/cesm.70071
Bronwen Merner, Louisa Walsh, Janet Jull, Nora Refahi, Vasileios Tsialtas, Benjamin Shemesh, Mel Kotze, Rebecca Ryan

Introduction

Co-production of evidence syntheses has the potential to facilitate translation of research findings into policy and practice. However, few studies have explored the process of implementing co-produced evidence. This gap limits our understanding of how, and to what extent, co-production promotes knowledge translation.

In this study, we used an implementation science lens to explore factors influencing the implementation of the Best Practice Principles in partnering with consumers (BPP) in hospitals in Melbourne, Australia. The BPP were developed as part of a co-produced Cochrane qualitative evidence synthesis exploring consumers' and health providers' experiences and perceptions of partnering. We use the findings of our study to develop strategies for evidence synthesis teams engaged in co-production to optimize the implementation of their review findings.

Methods

This exploratory, qualitative study was informed by cooperative inquiry and normalization process theory (NPT). A six-member panel, including researchers, policy makers and consumers, guided data collection and analysis. Data collection involved semi-structured interviews with eleven participants (including consumer engagement leads, consumer representatives, and a policymaker) about how to implement the BPP in Melbourne hospitals. Interviews were analyzed using framework analysis.

Results

Interview participants reported the BPP were relevant to practice, consumer-centered, practical, and flexible. There were several additional factors that could impact their uptake into practice. These included integration of the BPP into government policies and guidelines, evidence of the cost/benefit of BPP implementation, endorsement from health service leadership, involvement of consumers throughout the implementation process, a structured implementation, and flexible measurement of implementation success.

Conclusion

This exploratory study suggested that the BPP, a tool developed through a co-produced Cochrane qualitative evidence synthesis, promoted knowledge translation. Other factors at the macro- (political and economic), meso- (systems and organizations), and micro- (individual) levels could influence the implementation's success. Implications for evidence synthesis teams aiming to optimize the knowledge translation of their review results are discussed.

导言:证据综合的联合制作有可能促进将研究成果转化为政策和实践。然而,很少有研究探索实施共同产生证据的过程。这一差距限制了我们对合拍片如何以及在多大程度上促进知识翻译的理解。在这项研究中,我们使用实施科学的视角来探索影响澳大利亚墨尔本医院在与消费者合作(BPP)中实施最佳实践原则的因素。BPP是作为共同制作的Cochrane定性证据综合的一部分开发的,该综合探讨了消费者和卫生服务提供者对合作的经验和看法。我们利用我们的研究结果为参与联合制作的证据合成团队制定策略,以优化其审查结果的实施。方法:采用合作问询与规范化过程理论(NPT)进行探索性定性研究。一个包括研究人员、政策制定者和消费者在内的六人小组指导数据收集和分析。数据收集涉及与11位参与者(包括消费者参与领导、消费者代表和政策制定者)进行的半结构化访谈,内容涉及如何在墨尔本医院实施BPP。访谈采用框架分析法进行分析。结果:受访者报告BPP与实践相关,以消费者为中心,实用,灵活。还有几个其他因素可能会影响他们的实践。这些措施包括将BPP纳入政府政策和指导方针、BPP实施成本/收益的证据、卫生服务部门领导的认可、消费者在整个实施过程中的参与、结构化的实施以及对实施成功的灵活衡量。结论:本探索性研究表明,通过Cochrane合作开发的定性证据合成工具BPP促进了知识翻译。宏观(政治和经济)、中观(系统和组织)和微观(个人)层面的其他因素可能影响实施的成功。对旨在优化其综述结果的知识转化的证据合成团队的影响进行了讨论。
{"title":"Promoting the Implementation of Co-Produced Cochrane Evidence: An Exploratory Study of Improving Partnering With Consumers","authors":"Bronwen Merner,&nbsp;Louisa Walsh,&nbsp;Janet Jull,&nbsp;Nora Refahi,&nbsp;Vasileios Tsialtas,&nbsp;Benjamin Shemesh,&nbsp;Mel Kotze,&nbsp;Rebecca Ryan","doi":"10.1002/cesm.70071","DOIUrl":"10.1002/cesm.70071","url":null,"abstract":"<div>\u0000 \u0000 \u0000 <section>\u0000 \u0000 <h3> Introduction</h3>\u0000 \u0000 <p>Co-production of evidence syntheses has the potential to facilitate translation of research findings into policy and practice. However, few studies have explored the process of implementing co-produced evidence. This gap limits our understanding of how, and to what extent, co-production promotes knowledge translation.</p>\u0000 \u0000 <p>In this study, we used an implementation science lens to explore factors influencing the implementation of the Best Practice Principles in partnering with consumers (BPP) in hospitals in Melbourne, Australia. The BPP were developed as part of a co-produced Cochrane qualitative evidence synthesis exploring consumers' and health providers' experiences and perceptions of partnering. We use the findings of our study to develop strategies for evidence synthesis teams engaged in co-production to optimize the implementation of their review findings.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Methods</h3>\u0000 \u0000 <p>This exploratory, qualitative study was informed by cooperative inquiry and normalization process theory (NPT). A six-member panel, including researchers, policy makers and consumers, guided data collection and analysis. Data collection involved semi-structured interviews with eleven participants (including consumer engagement leads, consumer representatives, and a policymaker) about how to implement the BPP in Melbourne hospitals. Interviews were analyzed using framework analysis.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Results</h3>\u0000 \u0000 <p>Interview participants reported the BPP were relevant to practice, consumer-centered, practical, and flexible. There were several additional factors that could impact their uptake into practice. These included integration of the BPP into government policies and guidelines, evidence of the cost/benefit of BPP implementation, endorsement from health service leadership, involvement of consumers throughout the implementation process, a structured implementation, and flexible measurement of implementation success.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Conclusion</h3>\u0000 \u0000 <p>This exploratory study suggested that the BPP, a tool developed through a co-produced Cochrane qualitative evidence synthesis, promoted knowledge translation. Other factors at the macro- (political and economic), meso- (systems and organizations), and micro- (individual) levels could influence the implementation's success. Implications for evidence synthesis teams aiming to optimize the knowledge translation of their review results are discussed.</p>\u0000 </section>\u0000 </div>","PeriodicalId":100286,"journal":{"name":"Cochrane Evidence Synthesis and Methods","volume":"4 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12865661/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146121488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to “Sensitivity Analysis in Meta-Analysis: A Tutorial” 修正“meta分析中的敏感性分析:教程”。
Pub Date : 2026-01-28 DOI: 10.1002/cesm.70070

N. M. Aung, I. Jurak, S. Mehmood, and E. Axon, “Sensitivity Analysis in Meta-Analysis: A Tutorial,” Cochrane Evidence Synthesis and Methods 4 (2026): 1–7. https://doi.org/10.1002/cesm.70067.

The article category has been corrected from “METHODS ARTICLE” to “TUTORIAL.”

We apologize for this error.

[这更正了文章DOI: 10.1002/cesm.70067.]。
{"title":"Correction to “Sensitivity Analysis in Meta-Analysis: A Tutorial”","authors":"","doi":"10.1002/cesm.70070","DOIUrl":"10.1002/cesm.70070","url":null,"abstract":"<p>N. M. Aung, I. Jurak, S. Mehmood, and E. Axon, “Sensitivity Analysis in Meta-Analysis: A Tutorial,” <i>Cochrane Evidence Synthesis and Methods</i> 4 (2026): 1–7. https://doi.org/10.1002/cesm.70067.</p><p>The article category has been corrected from “METHODS ARTICLE” to “TUTORIAL.”</p><p>We apologize for this error.</p>","PeriodicalId":100286,"journal":{"name":"Cochrane Evidence Synthesis and Methods","volume":"4 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12850237/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146088712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Don't Stop Me Now, `Cause I'm Having a Good Time Screening: Evaluation of Stopping Methods for Safe Use of Priority Screening in Systematic Reviews 不要现在阻止我,因为我玩得很开心筛选:在系统评价中优先筛选安全使用的停止方法的评估。
Pub Date : 2026-01-21 DOI: 10.1002/cesm.70068
Tim Repke, Francesca Tinsdeall, Diana Danilenko, Sergio Graziosi, Finn Müller-Hansen, Lena Schmidt, James Thomas, Gert van Valkenhoef

Introduction

Priority screening has the potential to reduce the number of records that need to be annotated in systematic literature reviews. So-called technology-assisted reviews (TAR) use machine-learning with prior include/exclude annotations to continuously rank unseen records by their predicted relevance to find relevant records earlier. In this article, we present a systematic evaluation of methods to determine when it is safe to stop screening when using prioritization.

Methods

We implement an open-source evaluation framework that features a novel method to generate rankings and simulate priority screening processes for 81 real-world data sets. We use these simulations to evaluate 15 statistical or rule-based (heuristic) stopping methods, testing a range of hyperparameters for each.

Results

The work-saving potential and performance of stopping criteria heavily rely on “good” rankings, which are typically not achieved by a single ranking algorithm across the entire screening process. Our evaluation shows that almost all existing stopping methods either fail to reliably stop without missing relevant records or fail to utilize the full potential work-savings. Only one method reliably meets the set recall target, but stops conservatively.

Conclusions

Many digital evidence synthesis tools provide priority screening features that are already used in many research projects. However, the theoretical work-savings demonstrated in retrospective simulations of prioritization can only be unlocked with safe and reproducible stopping criteria. Our results highlight the need for improved stopping methods and guidelines on how to responsibly use priority screening. We also urge screening platforms to provide indicators and authors to transparently report metrics when automating (parts of) their synthesis.

优先筛选有可能减少系统文献综述中需要注释的记录数量。所谓的技术辅助审查(TAR)使用机器学习和预先包含/排除注释,根据预测的相关性对未见记录进行连续排序,以便更早地找到相关记录。在这篇文章中,我们提出了一个系统的评估方法,以确定何时是安全的停止筛选时,使用优先级。方法:我们实现了一个开源评估框架,该框架采用了一种新颖的方法来生成81个真实世界数据集的排名和模拟优先筛选过程。我们使用这些模拟来评估15种统计或基于规则的(启发式)停止方法,并为每种方法测试一系列超参数。结果:停止标准的节省工作潜力和性能严重依赖于“好”排名,这通常不是通过整个筛选过程中的单一排名算法实现的。我们的评估表明,几乎所有现有的停止方法要么不能可靠地停止而不丢失相关记录,要么不能充分利用潜在的工作节省。只有一种方法可靠地满足设定的召回目标,但会保守地停止。结论:许多数字证据合成工具提供了已经在许多研究项目中使用的优先筛选功能。然而,在回溯性的优先级模拟中,理论上的工作节省只能通过安全和可重复的停止标准来实现。我们的结果强调需要改进停止方法和指导方针,如何负责任地使用优先筛选。我们还敦促筛选平台在自动化(部分)其合成时提供指标和作者透明地报告指标。
{"title":"Don't Stop Me Now, `Cause I'm Having a Good Time Screening: Evaluation of Stopping Methods for Safe Use of Priority Screening in Systematic Reviews","authors":"Tim Repke,&nbsp;Francesca Tinsdeall,&nbsp;Diana Danilenko,&nbsp;Sergio Graziosi,&nbsp;Finn Müller-Hansen,&nbsp;Lena Schmidt,&nbsp;James Thomas,&nbsp;Gert van Valkenhoef","doi":"10.1002/cesm.70068","DOIUrl":"10.1002/cesm.70068","url":null,"abstract":"<div>\u0000 \u0000 \u0000 <section>\u0000 \u0000 <h3> Introduction</h3>\u0000 \u0000 <p>Priority screening has the potential to reduce the number of records that need to be annotated in systematic literature reviews. So-called technology-assisted reviews (TAR) use machine-learning with prior include/exclude annotations to continuously rank unseen records by their predicted relevance to find relevant records earlier. In this article, we present a systematic evaluation of methods to determine when it is safe to stop screening when using prioritization.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Methods</h3>\u0000 \u0000 <p>We implement an open-source evaluation framework that features a novel method to generate rankings and simulate priority screening processes for 81 real-world data sets. We use these simulations to evaluate 15 statistical or rule-based (heuristic) stopping methods, testing a range of hyperparameters for each.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Results</h3>\u0000 \u0000 <p>The work-saving potential and performance of stopping criteria heavily rely on “good” rankings, which are typically not achieved by a single ranking algorithm across the entire screening process. Our evaluation shows that almost all existing stopping methods either fail to reliably stop without missing relevant records or fail to utilize the full potential work-savings. Only one method reliably meets the set recall target, but stops conservatively.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Conclusions</h3>\u0000 \u0000 <p>Many digital evidence synthesis tools provide priority screening features that are already used in many research projects. However, the theoretical work-savings demonstrated in retrospective simulations of prioritization can only be unlocked with safe and reproducible stopping criteria. Our results highlight the need for improved stopping methods and guidelines on how to responsibly use priority screening. We also urge screening platforms to provide indicators and authors to transparently report metrics when automating (parts of) their synthesis.</p>\u0000 </section>\u0000 </div>","PeriodicalId":100286,"journal":{"name":"Cochrane Evidence Synthesis and Methods","volume":"4 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12825451/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146055756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Methods of Engaging Interest-Holders in Healthcare Evidence Syntheses: A Scoping Review 医疗证据综合中利益相关者参与的方法:范围综述
Pub Date : 2026-01-15 DOI: 10.1002/cesm.70066
Alex Todhunter-Brown, Jennifer Petkovic, Christine Chang, Ursula Griebler, Ailish Hannigan, Jennifer Hilgart, Basharat Hussain, Janet Jull, Christina Koscher-Kien, Dominic Ledinger, Barbara Nussbaumer-Streit, Oyekola Oloyede, Eve Tomlinson, Shoba Dawson, Omar Dewidar, Sean Grant, Lyubov Lytvyn, Thomas W. Concannon, Leonila Dans, Denny John, Zoe Jordan, Evan Mayo-Wilson, Chris McCutcheon, Francesco Nonino, Danielle Pollock, Karine Toupin April, Pauline Campbell, Joanne Khabsa, Olivia Magwood, Vivian Welch, Peter Tugwell
<div> <section> <h3> Introduction</h3> <p>Engaging interest-holders in health care evidence syntheses may make evidence syntheses more relevant, useful, and accessible. However, the best way(s) to engage interest-holders within the evidence synthesis process remain unknown. A previous scoping review collated 291 publications that reported interest-holder engagement in evidence syntheses, but conclusions were limited due to poor reporting. In the present scoping review, our aim was to identify and collate up-to-date publications focussed on interest-holder engagement in healthcare evidence syntheses, describe reported methods of engagement, and compare the results with those from the previous review.</p> </section> <section> <h3> Methods</h3> <p>We updated a scoping review, following JBI guidance, using a pre-published protocol that defined all key terminology in this field. We systematically searched five electronic databases (MEDLINE, CINAHL, EMBASE, PsycInfo, and SCOPUS). Searches were conducted from January 2016 to February 2024. Records were imported into Covidence and screened by pairs of independent reviewers, including any publications that reported engagement of interest-holders in evidence syntheses. We extracted and coded key data relating to the evidence synthesis topic and ACTIVE framework domains (who was engaged, when, and in what way). Two reviewers independently made a judgment of the comprehensiveness of the description of methods of engagement, using a “traffic-light” system, coding evidence syntheses with comprehensive descriptions as “green,” brief or partial descriptions as “amber,” and those with few details as “red”; disagreements were resolved through discussion. Additional detailed data relating to the engagement methods were extracted from “green” evidence syntheses. Any disagreements were resolved through discussion. Data were synthesized within tables, and narrative summaries were written to provide an overview of key methods of engaging interest-holders within the identified evidence syntheses.</p> </section> <section> <h3> Results</h3> <p>We identified 302 publications published since the previous review. Most (272/302, 90%) reported interest-holder engagement in a single evidence synthesis; of these, 74% (200/272) engaged patients and/or their carers, while 17% (46/272) engaged other interest-holders only, and the remainder (26/272, 9.6%) was unclear. Over three-quarters of the evidence syntheses were conducted either in the United Kingdom, United States, Canada, or Australia (215/272, 79%). Most often (113/272, 42%), interest-holders were engaged at both the initial (scope and question setting) <i>and</i> final (inte
让利益攸关方参与卫生保健证据综合可使证据综合更具相关性、实用性和可及性。然而,让利益相关者参与证据合成过程的最佳方式仍然未知。先前的范围审查整理了291份报告利益持有人参与证据合成的出版物,但由于报告不力,结论有限。在当前的范围综述中,我们的目的是识别和整理关注医疗保健证据综合中利益相关者参与的最新出版物,描述报道的参与方法,并将结果与先前综述的结果进行比较。方法:根据JBI指南,使用预先发布的协议更新了范围综述,该协议定义了该领域的所有关键术语。我们系统地检索了5个电子数据库(MEDLINE、CINAHL、EMBASE、PsycInfo和SCOPUS)。搜索于2016年1月至2024年2月进行。将记录输入到covid中,并由独立审稿人对其进行筛选,包括报告利益相关者参与证据合成的任何出版物。我们提取并编码了与证据合成主题和ACTIVE框架域(谁参与、何时参与、以何种方式参与)相关的关键数据。两名审稿人独立对参与方式描述的全面性作出判断,采用“红绿灯”制度,将描述全面的证据综合编码为“绿色”,将描述简短或部分的证据综合编码为“琥珀色”,将描述细节较少的证据综合编码为“红色”;分歧通过讨论得到解决。从“绿色”证据合成中提取了与审计业务方法有关的其他详细数据。任何分歧都是通过讨论解决的。在表格中对数据进行了综合,并编写了叙述性摘要,概述了在已确定的证据综合中吸引利益相关者的关键方法。结果:我们确定了自上次综述以来发表的302篇文献。大多数(272/302,90%)报告利益持有人参与了单一证据合成;其中,74%(200/272)涉及患者和/或其护理人员,17%(46/272)仅涉及其他利益相关者,其余(26/272,9.6%)不清楚。超过四分之三的证据合成是在英国、美国、加拿大或澳大利亚进行的(215/272,79%)。大多数情况下(113/272,42%),利益相关者参与了最初(范围和问题设置)和最终(结果解释)审查阶段(称为“顶部和尾部”方法)。19%(51/272)被判定为提供了一种或多种方法(s)或方法(es)的全面(“绿色”)描述,以参与证据合成,能够进行详细的数据提取和描述。大多数:参与患者/公众成员和其他利益相关者群体(30/51,59%);采用“封闭式”招聘策略(30/51,59%);在解释调查结果阶段让利益相关者参与(39/51,76%);至少有一个利息持有人作为共同作者(27/51,52%)。利益相关者一般出席不采用正式接触方式的会议。在整个审查过程中,让利益相关者参与多种活动是很常见的。我们来自MuSE联盟的国际团队更新了之前的范围审查,汇编了有关利益相关者参与证据合成的最新证据。我们整理了302份出版物,并描述了51份证据综合报告中利益相关者参与的方法,我们认为这些证据综合提供了最全面的信息。利益相关者参与了这一过程的所有阶段,使用了广泛的参与方法,但没有与证据综合的类型或重点相关的明确模式。最常见的是,患者/公众和专业利益相关者都参与其中,但我们的例子中约有四分之一只涉及患者/公众成员,少数只涉及专业利益相关者。我们确定了一些不同的参与策略,并利用这些策略为潜在的决策工具提供信息,以支持参与策略的选择。我们就利益相关者参与证据综合和未来研究的行为和报告提出建议,以推进这一领域的发展。
{"title":"Methods of Engaging Interest-Holders in Healthcare Evidence Syntheses: A Scoping Review","authors":"Alex Todhunter-Brown,&nbsp;Jennifer Petkovic,&nbsp;Christine Chang,&nbsp;Ursula Griebler,&nbsp;Ailish Hannigan,&nbsp;Jennifer Hilgart,&nbsp;Basharat Hussain,&nbsp;Janet Jull,&nbsp;Christina Koscher-Kien,&nbsp;Dominic Ledinger,&nbsp;Barbara Nussbaumer-Streit,&nbsp;Oyekola Oloyede,&nbsp;Eve Tomlinson,&nbsp;Shoba Dawson,&nbsp;Omar Dewidar,&nbsp;Sean Grant,&nbsp;Lyubov Lytvyn,&nbsp;Thomas W. Concannon,&nbsp;Leonila Dans,&nbsp;Denny John,&nbsp;Zoe Jordan,&nbsp;Evan Mayo-Wilson,&nbsp;Chris McCutcheon,&nbsp;Francesco Nonino,&nbsp;Danielle Pollock,&nbsp;Karine Toupin April,&nbsp;Pauline Campbell,&nbsp;Joanne Khabsa,&nbsp;Olivia Magwood,&nbsp;Vivian Welch,&nbsp;Peter Tugwell","doi":"10.1002/cesm.70066","DOIUrl":"https://doi.org/10.1002/cesm.70066","url":null,"abstract":"&lt;div&gt;\u0000 \u0000 \u0000 &lt;section&gt;\u0000 \u0000 &lt;h3&gt; Introduction&lt;/h3&gt;\u0000 \u0000 &lt;p&gt;Engaging interest-holders in health care evidence syntheses may make evidence syntheses more relevant, useful, and accessible. However, the best way(s) to engage interest-holders within the evidence synthesis process remain unknown. A previous scoping review collated 291 publications that reported interest-holder engagement in evidence syntheses, but conclusions were limited due to poor reporting. In the present scoping review, our aim was to identify and collate up-to-date publications focussed on interest-holder engagement in healthcare evidence syntheses, describe reported methods of engagement, and compare the results with those from the previous review.&lt;/p&gt;\u0000 &lt;/section&gt;\u0000 \u0000 &lt;section&gt;\u0000 \u0000 &lt;h3&gt; Methods&lt;/h3&gt;\u0000 \u0000 &lt;p&gt;We updated a scoping review, following JBI guidance, using a pre-published protocol that defined all key terminology in this field. We systematically searched five electronic databases (MEDLINE, CINAHL, EMBASE, PsycInfo, and SCOPUS). Searches were conducted from January 2016 to February 2024. Records were imported into Covidence and screened by pairs of independent reviewers, including any publications that reported engagement of interest-holders in evidence syntheses. We extracted and coded key data relating to the evidence synthesis topic and ACTIVE framework domains (who was engaged, when, and in what way). Two reviewers independently made a judgment of the comprehensiveness of the description of methods of engagement, using a “traffic-light” system, coding evidence syntheses with comprehensive descriptions as “green,” brief or partial descriptions as “amber,” and those with few details as “red”; disagreements were resolved through discussion. Additional detailed data relating to the engagement methods were extracted from “green” evidence syntheses. Any disagreements were resolved through discussion. Data were synthesized within tables, and narrative summaries were written to provide an overview of key methods of engaging interest-holders within the identified evidence syntheses.&lt;/p&gt;\u0000 &lt;/section&gt;\u0000 \u0000 &lt;section&gt;\u0000 \u0000 &lt;h3&gt; Results&lt;/h3&gt;\u0000 \u0000 &lt;p&gt;We identified 302 publications published since the previous review. Most (272/302, 90%) reported interest-holder engagement in a single evidence synthesis; of these, 74% (200/272) engaged patients and/or their carers, while 17% (46/272) engaged other interest-holders only, and the remainder (26/272, 9.6%) was unclear. Over three-quarters of the evidence syntheses were conducted either in the United Kingdom, United States, Canada, or Australia (215/272, 79%). Most often (113/272, 42%), interest-holders were engaged at both the initial (scope and question setting) &lt;i&gt;and&lt;/i&gt; final (inte","PeriodicalId":100286,"journal":{"name":"Cochrane Evidence Synthesis and Methods","volume":"4 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cesm.70066","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146016391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Systematic Reviews as Part of Doctoral Theses and for the Promotion to Associate Professor: A Descriptive Study of University Policies in Sweden 作为博士论文一部分的系统评论和副教授的晋升:瑞典大学政策的描述性研究。
Pub Date : 2026-01-14 DOI: 10.1002/cesm.70069
Martin Ringsten, Lea Styrmisdottir, Matilda Naesström, Minna Johansson, Matteo Bruschettini, Susanna M. Wallerstedt

Background

Almost a decade ago, about half of biomedical PhD programs across Europe specifically stated that systematic reviews could not be accepted as part of a doctoral thesis, illustrating limited merit value at that time. The aim of this study was to explore current Swedish university policies on this research design.

Methods

Policy documents for PhD theses and applications to associate professor positions were obtained from all medical faculties at universities in Sweden. Instructions regarding systematic reviews, with focus on their merit value and related aspects, were independently extracted and categorized by two authors, with discrepancies resolved in consensus discussions.

Results

All seven medical faculties accepted at least one systematic review within a PhD thesis, five restricted the number of such studies accepted, and five provided instructions regarding this study design. Regarding policies for promotion to associate professor, six medical faculties accepted at least one published systematic review to merit recognition―the remaining one required meta-analyses for acceptance―and three explicitly restricted the number of systematic reviews. No restrictions or guidance were provided for other designs intended to answer specific research questions.

Conclusion

As of 2025, systematic reviews appear to be generally recognized as contributing to authors' academic merit. For this research design exclusively, some universities impose restrictions that may limit their recognition, and some provide guidance which may help ensure quality in reporting. These findings may encourage research to evaluate the merit value of systematic reviews in other settings, and to examine potential implications of restrictions and guidance in policy documents.

背景:大约十年前,欧洲大约一半的生物医学博士课程特别声明,系统评论不能作为博士论文的一部分,这表明当时的价值有限。本研究的目的是探讨当前瑞典大学对这一研究设计的政策。方法:收集瑞典各大学医学院博士论文和副教授职位申请政策文件。关于系统评价的说明,重点是其优点和相关方面,由两位作者独立提取和分类,在一致讨论中解决差异。结果:所有七所医学院均接受博士论文中至少一篇系统评价,其中五所限制接受此类研究的数量,五所提供有关本研究设计的说明。在副教授晋升政策方面,6所医学院接受了至少一篇已发表的系统综述,以获得认可——其余一所需要荟萃分析才能接受——3所明确限制了系统综述的数量。对于旨在回答特定研究问题的其他设计,没有提供任何限制或指导。结论:截至2025年,系统综述似乎被普遍认为有助于作者的学术价值。对于这种专门的研究设计,有些大学会施加限制,这可能会限制他们的认可,有些大学会提供指导,这可能有助于确保报告的质量。这些发现可能鼓励研究在其他情况下评价系统审查的价值,并审查政策文件中限制和指导的潜在影响。
{"title":"Systematic Reviews as Part of Doctoral Theses and for the Promotion to Associate Professor: A Descriptive Study of University Policies in Sweden","authors":"Martin Ringsten,&nbsp;Lea Styrmisdottir,&nbsp;Matilda Naesström,&nbsp;Minna Johansson,&nbsp;Matteo Bruschettini,&nbsp;Susanna M. Wallerstedt","doi":"10.1002/cesm.70069","DOIUrl":"10.1002/cesm.70069","url":null,"abstract":"<div>\u0000 \u0000 \u0000 <section>\u0000 \u0000 <h3> Background</h3>\u0000 \u0000 <p>Almost a decade ago, about half of biomedical PhD programs across Europe specifically stated that systematic reviews could not be accepted as part of a doctoral thesis, illustrating limited merit value at that time. The aim of this study was to explore current Swedish university policies on this research design.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Methods</h3>\u0000 \u0000 <p>Policy documents for PhD theses and applications to associate professor positions were obtained from all medical faculties at universities in Sweden. Instructions regarding systematic reviews, with focus on their merit value and related aspects, were independently extracted and categorized by two authors, with discrepancies resolved in consensus discussions.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Results</h3>\u0000 \u0000 <p>All seven medical faculties accepted at least one systematic review within a PhD thesis, five restricted the number of such studies accepted, and five provided instructions regarding this study design. Regarding policies for promotion to associate professor, six medical faculties accepted at least one published systematic review to merit recognition―the remaining one required meta-analyses for acceptance―and three explicitly restricted the number of systematic reviews. No restrictions or guidance were provided for other designs intended to answer specific research questions.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Conclusion</h3>\u0000 \u0000 <p>As of 2025, systematic reviews appear to be generally recognized as contributing to authors' academic merit. For this research design exclusively, some universities impose restrictions that may limit their recognition, and some provide guidance which may help ensure quality in reporting. These findings may encourage research to evaluate the merit value of systematic reviews in other settings, and to examine potential implications of restrictions and guidance in policy documents.</p>\u0000 </section>\u0000 </div>","PeriodicalId":100286,"journal":{"name":"Cochrane Evidence Synthesis and Methods","volume":"4 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12806540/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146000321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Cochrane Evidence Synthesis and Methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1