首页 > 最新文献

Research Synthesis Methods最新文献

英文 中文
Translation and validation of a geographic search filter to identify studies about Germany in Embase (Ovid) and MEDLINE(R) ALL (Ovid). 翻译和验证地理搜索过滤器,以识别Embase (Ovid)和MEDLINE(R) ALL (Ovid)中关于德国的研究。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-07-01 Epub Date: 2025-06-09 DOI: 10.1017/rsm.2025.10016
Alexander Pachanov, Catharina Muente, Julian Hirt, Dawid Pieper

We developed a geographic search filter for retrieving studies about Germany from PubMed. In this study, we aimed to translate and validate it for use in Embase and MEDLINE(R) ALL via Ovid. Adjustments included aligning PubMed field tags with Ovid's syntax, adding a keyword heading field for both databases, and incorporating a correspondence address field for Embase. To validate the filters, we used systematic reviews (SRs) that included studies about Germany without imposing geographic restrictions on their search strategies. Subsequently, we conducted (i) case studies (CSs), applying the filters to the search strategies of the 17 eligible SRs; and (ii) aggregation studies, combining the SRs' search strategies with the 'OR' operator and applying the filters. In the CSs, the filters demonstrated a median sensitivity of 100% in both databases, with interquartile ranges (IQRs) of 100%-100% in Embase and 93.75%-100% in MEDLINE(R) ALL. Median precision improved from 0.11% (IQR: 0.05%-0.30%) to 1.65% (IQR: 0.78%-3.06%) and from 0.19% (IQR: 0.11%-0.60%) to 5.13% (IQR: 1.77%-6.85%), while the number needed to read (NNR) decreased from 893.40 (IQR: 354.81-2,219.58) to 60.44 (IQR: 33.94-128.97) and from 513.29 (IQR: 167.35-930.99) to 19.50 (IQR: 14.66-59.35) for Embase and MEDLINE(R) ALL, respectively. In the aggregation studies, the overall sensitivities were 98.19% and 97.14%, with NNRs of 83.29 and 33.34 in Embase and MEDLINE(R) ALL, respectively. The new Embase and MEDLINE(R) ALL filters for Ovid reliably retrieve studies about Germany, enhancing search precision. The approach described in our study can support search filter developers in translating filters for various topics and contexts.

我们开发了一个地理搜索过滤器,用于从PubMed检索有关德国的研究。在本研究中,我们旨在通过Ovid翻译并验证其在Embase和MEDLINE(R) ALL中的应用。调整包括将PubMed字段标记与Ovid的语法对齐,为两个数据库添加关键字标题字段,并为Embase合并通信地址字段。为了验证过滤器,我们使用了系统评价(SRs),其中包括关于德国的研究,而没有对其搜索策略施加地理限制。随后,我们进行了(i)案例研究(CSs),将过滤器应用于17个符合条件的sr的搜索策略;(ii)聚合研究,将sr的搜索策略与“或”运算符结合起来,并应用过滤器。在CSs中,过滤器在两个数据库中的中位灵敏度均为100%,Embase的四分位数范围(IQRs)为100%-100%,MEDLINE(R) ALL的四分位数范围(IQRs)为93.75%-100%。Embase和MEDLINE(R) ALL的中位精度从0.11% (IQR: 0.05% ~ 0.30%)提高到1.65% (IQR: 0.78% ~ 3.06%),从0.19% (IQR: 0.11% ~ 0.60%)提高到5.13% (IQR: 1.77% ~ 6.85%),所需读取数(NNR)分别从893.40 (IQR: 354.81 ~ 2219.58)降低到60.44 (IQR: 33.94 ~ 128.97),从513.29 (IQR: 167.35 ~ 930.99)降低到19.50 (IQR: 14.66 ~ 59.35)。在聚集研究中,Embase和MEDLINE(R) ALL的总体敏感性分别为98.19%和97.14%,NNRs分别为83.29和33.34。新的Embase和MEDLINE(R) ALL过滤器可靠地检索有关德国的研究,提高了搜索精度。我们研究中描述的方法可以支持搜索过滤器开发人员翻译各种主题和上下文的过滤器。
{"title":"Translation and validation of a geographic search filter to identify studies about Germany in Embase (Ovid) and MEDLINE(R) ALL (Ovid).","authors":"Alexander Pachanov, Catharina Muente, Julian Hirt, Dawid Pieper","doi":"10.1017/rsm.2025.10016","DOIUrl":"10.1017/rsm.2025.10016","url":null,"abstract":"<p><p>We developed a geographic search filter for retrieving studies about Germany from PubMed. In this study, we aimed to translate and validate it for use in Embase and MEDLINE(R) ALL via Ovid. Adjustments included aligning PubMed field tags with Ovid's syntax, adding a keyword heading field for both databases, and incorporating a correspondence address field for Embase. To validate the filters, we used systematic reviews (SRs) that included studies about Germany without imposing geographic restrictions on their search strategies. Subsequently, we conducted (i) case studies (CSs), applying the filters to the search strategies of the 17 eligible SRs; and (ii) aggregation studies, combining the SRs' search strategies with the 'OR' operator and applying the filters. In the CSs, the filters demonstrated a median sensitivity of 100% in both databases, with interquartile ranges (IQRs) of 100%-100% in Embase and 93.75%-100% in MEDLINE(R) ALL. Median precision improved from 0.11% (IQR: 0.05%-0.30%) to 1.65% (IQR: 0.78%-3.06%) and from 0.19% (IQR: 0.11%-0.60%) to 5.13% (IQR: 1.77%-6.85%), while the number needed to read (NNR) decreased from 893.40 (IQR: 354.81-2,219.58) to 60.44 (IQR: 33.94-128.97) and from 513.29 (IQR: 167.35-930.99) to 19.50 (IQR: 14.66-59.35) for Embase and MEDLINE(R) ALL, respectively. In the aggregation studies, the overall sensitivities were 98.19% and 97.14%, with NNRs of 83.29 and 33.34 in Embase and MEDLINE(R) ALL, respectively. The new Embase and MEDLINE(R) ALL filters for Ovid reliably retrieve studies about Germany, enhancing search precision. The approach described in our study can support search filter developers in translating filters for various topics and contexts.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 4","pages":"688-700"},"PeriodicalIF":6.1,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527497/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Same, same, but different: A method to harmonise and deduplicate study records from WHO ICTRP and ClinicalTrials.gov prior to screening. 相同,相同,但不同:一种在筛选前协调和消除WHO ICTRP和ClinicalTrials.gov研究记录的方法。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-07-01 Epub Date: 2025-04-25 DOI: 10.1017/rsm.2025.20
Zahra Premji, Chris Cooper

Trials registry records represent a challenge in deduplication compared to deduplicating studies reported in journals and exported from bibliographic databases such as MEDLINE. We demonstrate why this is the case and propose a method to deduplicate registry records from the WHO International Clinical Trials Registry Platform (ICTRP) and ClinicalTrials.gov (CTG) specifically in the reference management tool EndNote (desktop version). We believe that our method is not only more efficient but that it will minimise the risk of registry records being incorrectly removed as duplicates in automated deduplication. The method has seven steps and is detailed in this tutorial as a step-by-step guide.

与在期刊上报告并从MEDLINE等书目数据库导出的重复数据删除研究相比,试验注册记录在重复数据删除方面是一个挑战。我们论证了为什么会出现这种情况,并提出了一种方法来从WHO国际临床试验注册平台(ICTRP)和ClinicalTrials.gov (CTG)中删除重复的注册记录,特别是在参考管理工具EndNote(桌面版)中。我们相信,我们的方法不仅更有效,而且可以最大限度地降低注册表记录在自动重复数据删除中被错误删除的风险。该方法有七个步骤,在本教程中作为一步一步的指导详细介绍。
{"title":"Same, same, but different: A method to harmonise and deduplicate study records from WHO ICTRP and ClinicalTrials.gov prior to screening.","authors":"Zahra Premji, Chris Cooper","doi":"10.1017/rsm.2025.20","DOIUrl":"10.1017/rsm.2025.20","url":null,"abstract":"<p><p>Trials registry records represent a challenge in deduplication compared to deduplicating studies reported in journals and exported from bibliographic databases such as MEDLINE. We demonstrate why this is the case and propose a method to deduplicate registry records from the WHO International Clinical Trials Registry Platform (ICTRP) and ClinicalTrials.gov (CTG) specifically in the reference management tool EndNote (desktop version). We believe that our method is not only more efficient but that it will minimise the risk of registry records being incorrectly removed as duplicates in automated deduplication. The method has seven steps and is detailed in this tutorial as a step-by-step guide.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 4","pages":"587-600"},"PeriodicalIF":6.1,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527485/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative artificial intelligence use in evidence synthesis: A systematic review. 生成式人工智能在证据合成中的应用:系统综述。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-07-01 Epub Date: 2025-04-24 DOI: 10.1017/rsm.2025.16
Justin Clark, Belinda Barton, Loai Albarqouni, Oyungerel Byambasuren, Tanisha Jowsey, Justin Keogh, Tian Liang, Christian Moro, Hayley O'Neill, Mark Jones

Introduction: With the increasing accessibility of tools such as ChatGPT, Copilot, DeepSeek, Dall-E, and Gemini, generative artificial intelligence (GenAI) has been poised as a potential, research timesaving tool, especially for synthesising evidence. Our objective was to determine whether GenAI can assist with evidence synthesis by assessing its performance using its accuracy, error rates, and time savings compared to the traditional expert-driven approach.

Methods: To systematically review the evidence, we searched five databases on 17 January 2025, synthesised outcomes reporting on the accuracy, error rates, or time taken, and appraised the risk-of-bias using a modified version of QUADAS-2.

Results: We identified 3,071 unique records, 19 of which were included in our review. Most studies had a high or unclear risk-of-bias in Domain 1A: review selection, Domain 2A: GenAI conduct, and Domain 1B: applicability of results. When used for (1) searching GenAI missed 68% to 96% (median = 91%) of studies, (2) screening made incorrect inclusion decisions ranging from 0% to 29% (median = 10%); and incorrect exclusion decisions ranging from 1% to 83% (median = 28%), (3) incorrect data extractions ranging from 4% to 31% (median = 14%), (4) incorrect risk-of-bias assessments ranging from 10% to 56% (median = 27%).

Conclusion: Our review shows that the current evidence does not support GenAI use in evidence synthesis without human involvement or oversight. However, for most tasks other than searching, GenAI may have a role in assisting humans with evidence synthesis.

随着ChatGPT、Copilot、DeepSeek、Dall-E和Gemini等工具的日益普及,生成式人工智能(GenAI)已经成为一种潜在的、节省研究时间的工具,尤其是在合成证据方面。我们的目标是通过评估GenAI的准确性、错误率和与传统专家驱动方法相比节省的时间,来确定GenAI是否可以帮助证据合成。方法:为了系统地回顾证据,我们于2025年1月17日检索了5个数据库,综合了准确性、错误率或所需时间的结果报告,并使用改进版QUADAS-2评估了偏倚风险。结果:我们确定了3071条独特的记录,其中19条纳入了我们的综述。大多数研究在领域1A(综述选择)、领域2A(基因行为)和领域1B(结果的适用性)中存在较高或不明确的偏倚风险。当用于(1)搜索GenAI时,遗漏了68%至96%(中位数= 91%)的研究,(2)筛选错误的纳入决策范围为0%至29%(中位数= 10%);不正确的排除决策范围从1%到83%(中位数= 28%),(3)不正确的数据提取范围从4%到31%(中位数= 14%),(4)不正确的偏倚风险评估范围从10%到56%(中位数= 27%)。结论:我们的综述表明,目前的证据不支持在没有人类参与或监督的情况下将GenAI用于证据合成。然而,对于搜索以外的大多数任务,GenAI可能在协助人类合成证据方面发挥作用。
{"title":"Generative artificial intelligence use in evidence synthesis: A systematic review.","authors":"Justin Clark, Belinda Barton, Loai Albarqouni, Oyungerel Byambasuren, Tanisha Jowsey, Justin Keogh, Tian Liang, Christian Moro, Hayley O'Neill, Mark Jones","doi":"10.1017/rsm.2025.16","DOIUrl":"10.1017/rsm.2025.16","url":null,"abstract":"<p><strong>Introduction: </strong>With the increasing accessibility of tools such as ChatGPT, Copilot, DeepSeek, Dall-E, and Gemini, generative artificial intelligence (GenAI) has been poised as a potential, research timesaving tool, especially for synthesising evidence. Our objective was to determine whether GenAI can assist with evidence synthesis by assessing its performance using its accuracy, error rates, and time savings compared to the traditional expert-driven approach.</p><p><strong>Methods: </strong>To systematically review the evidence, we searched five databases on 17 January 2025, synthesised outcomes reporting on the accuracy, error rates, or time taken, and appraised the risk-of-bias using a modified version of QUADAS-2.</p><p><strong>Results: </strong>We identified 3,071 unique records, 19 of which were included in our review. Most studies had a high or unclear risk-of-bias in Domain 1A: review selection, Domain 2A: GenAI conduct, and Domain 1B: applicability of results. When used for (1) searching GenAI missed 68% to 96% (median = 91%) of studies, (2) screening made incorrect inclusion decisions ranging from 0% to 29% (median = 10%); and incorrect exclusion decisions ranging from 1% to 83% (median = 28%), (3) incorrect data extractions ranging from 4% to 31% (median = 14%), (4) incorrect risk-of-bias assessments ranging from 10% to 56% (median = 27%).</p><p><strong>Conclusion: </strong>Our review shows that the current evidence does not support GenAI use in evidence synthesis without human involvement or oversight. However, for most tasks other than searching, GenAI may have a role in assisting humans with evidence synthesis.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 4","pages":"601-619"},"PeriodicalIF":6.1,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527500/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weighted corrected covered area (wCCA): A measure of informational overlap among reviews. 加权修正覆盖面积(wCCA):评估之间信息重叠的度量。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-07-01 Epub Date: 2025-04-24 DOI: 10.1017/rsm.2025.19
Xiangji Ying, Konstantinos I Bougioukas, Dawid Pieper, Evan Mayo-Wilson

When conducting overviews of reviews, investigators must measure and describe the extent to which included systematic reviews (SRs) contain the same primary studies. The corrected covered area (CCA) quantifies overlap by counting primary studies included across a set of SRs. In this article, we introduce a modification to the CCA, the weighted CCA (wCCA), which accounts for differences in information contributed by primary studies. The wCCA adjusts the original CCA by weighting studies based on the square roots of their sample sizes. By weighting primary studies according to their precision, wCCA provides a useful and complementary representation of overlap in evidence syntheses .

当进行综述时,研究者必须测量和描述系统综述(SRs)包含相同的主要研究的程度。校正的覆盖面积(CCA)通过计算一组sr中包含的主要研究来量化重叠。在本文中,我们引入了一种修正的CCA,加权CCA (wCCA),它解释了原始研究提供的信息的差异。wCCA根据样本量的平方根对原始CCA进行加权。通过对原始研究的精度进行加权,wCCA在证据合成中提供了重叠的有用和互补的表示。
{"title":"Weighted corrected covered area (wCCA): A measure of informational overlap among reviews.","authors":"Xiangji Ying, Konstantinos I Bougioukas, Dawid Pieper, Evan Mayo-Wilson","doi":"10.1017/rsm.2025.19","DOIUrl":"10.1017/rsm.2025.19","url":null,"abstract":"<p><p>When conducting overviews of reviews, investigators must measure and describe the extent to which included systematic reviews (SRs) contain the same primary studies. The corrected covered area (CCA) quantifies overlap by counting primary studies included across a set of SRs. In this article, we introduce a modification to the CCA, the weighted CCA (wCCA), which accounts for differences in information contributed by primary studies. The wCCA adjusts the original CCA by weighting studies based on the square roots of their sample sizes. By weighting primary studies according to their precision, wCCA provides a useful and complementary representation of overlap in evidence syntheses .</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 4","pages":"701-708"},"PeriodicalIF":6.1,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527530/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A critical assessment of matching-adjusted indirect comparisons in relation to target populations. 对与目标人群相关的匹配调整间接比较的关键评估。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.10
Ziren Jiang, Jialing Liu, Demissie Alemayehu, Joseph C Cappelleri, Devin Abrahami, Yong Chen, Haitao Chu

Matching-adjusted indirect comparison (MAIC) has been increasingly applied in health technology assessments (HTA). By reweighting subjects from a trial with individual participant data (IPD) to match the summary statistics of covariates in another trial with aggregate data (AgD), MAIC enables a comparison of the interventions for the AgD trial population. However, when there are imbalances in effect modifiers with different magnitudes of modification across treatments, contradictory conclusions may arise if MAIC is performed with the IPD and AgD swapped between trials. This can lead to the "MAIC paradox," where different entities reach opposing conclusions about which treatment is more effective, despite analyzing the same data. In this paper, we use synthetic data to illustrate this paradox and emphasize the importance of clearly defining the target population in HTA submissions. Additionally, we recommend making de-identified IPD available to HTA agencies, enabling further indirect comparisons that better reflect the overall population represented by both IPD and AgD trials, as well as other relevant target populations for policy decisions. This would help ensure more accurate and consistent assessments of comparative effectiveness.

匹配调整间接比较(MAIC)在卫生技术评价(HTA)中的应用越来越广泛。MAIC通过对具有个体参与者数据(IPD)的试验中的受试者重新加权,使其与具有总体数据(AgD)的另一项试验的协变量汇总统计数据相匹配,从而能够对AgD试验人群的干预措施进行比较。然而,当不同治疗的效果调节剂不平衡时,如果在试验之间交换IPD和AgD进行MAIC,可能会产生矛盾的结论。这可能导致“MAIC悖论”,即尽管分析了相同的数据,但不同的实体对哪种治疗更有效得出了相反的结论。在本文中,我们使用综合数据来说明这一悖论,并强调在HTA提交中明确定义目标人群的重要性。此外,我们建议HTA机构可以使用去识别的IPD,以便进一步进行间接比较,更好地反映IPD和AgD试验所代表的总体人群,以及政策决策的其他相关目标人群。这将有助于确保对相对效力进行更准确和一致的评估。
{"title":"A critical assessment of matching-adjusted indirect comparisons in relation to target populations.","authors":"Ziren Jiang, Jialing Liu, Demissie Alemayehu, Joseph C Cappelleri, Devin Abrahami, Yong Chen, Haitao Chu","doi":"10.1017/rsm.2025.10","DOIUrl":"10.1017/rsm.2025.10","url":null,"abstract":"<p><p>Matching-adjusted indirect comparison (MAIC) has been increasingly applied in health technology assessments (HTA). By reweighting subjects from a trial with individual participant data (IPD) to match the summary statistics of covariates in another trial with aggregate data (AgD), MAIC enables a comparison of the interventions for the AgD trial population. However, when there are imbalances in effect modifiers with different magnitudes of modification across treatments, contradictory conclusions may arise if MAIC is performed with the IPD and AgD swapped between trials. This can lead to the \"MAIC paradox,\" where different entities reach opposing conclusions about which treatment is more effective, despite analyzing the same data. In this paper, we use synthetic data to illustrate this paradox and emphasize the importance of clearly defining the target population in HTA submissions. Additionally, we recommend making de-identified IPD available to HTA agencies, enabling further indirect comparisons that better reflect the overall population represented by both IPD and AgD trials, as well as other relevant target populations for policy decisions. This would help ensure more accurate and consistent assessments of comparative effectiveness.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"569-574"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527493/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sensitivity analysis for reporting bias on the time-dependent summary receiver operating characteristics curve in meta-analysis of prognosis studies with time-to-event outcomes. 在具有时间-事件结局预后研究的meta分析中,报告偏倚对时间相关的总接受者工作特征曲线的敏感性分析。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.14
Yi Zhou, Ao Huang, Satoshi Hattori

In prognosis studies with time-to-event outcomes, the survivals of groups with high/low biomarker expression are often estimated by the Kaplan-Meier method, and the difference between groups is measured by the hazard ratios (HRs). Since the high/low expressions are usually determined by study-specific cutoff values, synthesizing only HRs for summarizing the prognostic capacity of a biomarker brings heterogeneity in the meta-analysis. The time-dependent summary receiver operating characteristics (SROC) curve was proposed as a cutoff-free summary of the prognostic capacity, extended from the SROC curve in meta-analysis of diagnostic studies. However, estimates of the time-dependent SROC curve may be threatened by reporting bias in that studies with significant outcomes, such as HRs, are more likely to be published and selected in meta-analyses. Under this conjecture, this paper proposes a sensitivity analysis method for quantifying and adjusting reporting bias on the time-dependent SROC curve. We model the publication process determined by the significance of the HRs and introduce a sensitivity analysis method based on the conditional likelihood constrained by some expected proportions of published studies. Simulation studies showed that the proposed method could reduce reporting bias given the correctly-specified marginal selection probability. The proposed method is illustrated on the real-world meta-analysis of Ki67 for breast cancer.

在具有时间到事件结果的预后研究中,生物标志物高/低表达组的存活率通常通过Kaplan-Meier方法估计,组间差异通过风险比(hr)来衡量。由于高/低表达通常由研究特定的截止值决定,因此仅合成hr来总结生物标志物的预后能力会在meta分析中带来异质性。时间相关的总接受者工作特征(SROC)曲线被提出作为预后能力的无截止总结,从诊断研究的meta分析中的SROC曲线扩展而来。然而,对时间相关SROC曲线的估计可能会受到报告偏倚的威胁,因为具有显著结果(如hr)的研究更有可能被发表并被荟萃分析选中。在此假设下,本文提出了一种量化和调整随时间变化的SROC曲线报告偏差的敏感性分析方法。我们建立了由hr的显著性决定的发表过程模型,并引入了一种基于条件似然的敏感性分析方法,该方法受发表研究的一些预期比例的约束。仿真研究表明,在给定正确的边际选择概率的情况下,该方法可以减少报告偏差。提出的方法是在现实世界的荟萃分析Ki67乳腺癌说明。
{"title":"Sensitivity analysis for reporting bias on the time-dependent summary receiver operating characteristics curve in meta-analysis of prognosis studies with time-to-event outcomes.","authors":"Yi Zhou, Ao Huang, Satoshi Hattori","doi":"10.1017/rsm.2025.14","DOIUrl":"10.1017/rsm.2025.14","url":null,"abstract":"<p><p>In prognosis studies with time-to-event outcomes, the survivals of groups with high/low biomarker expression are often estimated by the Kaplan-Meier method, and the difference between groups is measured by the hazard ratios (HRs). Since the high/low expressions are usually determined by study-specific cutoff values, synthesizing only HRs for summarizing the prognostic capacity of a biomarker brings heterogeneity in the meta-analysis. The time-dependent summary receiver operating characteristics (SROC) curve was proposed as a cutoff-free summary of the prognostic capacity, extended from the SROC curve in meta-analysis of diagnostic studies. However, estimates of the time-dependent SROC curve may be threatened by reporting bias in that studies with significant outcomes, such as HRs, are more likely to be published and selected in meta-analyses. Under this conjecture, this paper proposes a sensitivity analysis method for quantifying and adjusting reporting bias on the time-dependent SROC curve. We model the publication process determined by the significance of the HRs and introduce a sensitivity analysis method based on the conditional likelihood constrained by some expected proportions of published studies. Simulation studies showed that the proposed method could reduce reporting bias given the correctly-specified marginal selection probability. The proposed method is illustrated on the real-world meta-analysis of Ki67 for breast cancer.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"528-549"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527534/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
metaConvert: an automatic suite for estimation of 11 different effect size measures and flexible conversion across them. metaConvert:一个自动套件,用于估计11种不同的效应大小措施和灵活的转换。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.11
Corentin J Gosling, Samuele Cortese, Marco Solmi, Belen Haza, Eduard Vieta, Richard Delorme, Paolo Fusar-Poli, Joaquim Radua

A fundamental pillar of science is the estimation of the effect size of associations. However, this task is sometimes difficult and error-prone. To facilitate this process, the R package metaConvert automatically calculates and flexibly converts multiple effect size measures. It applies more than 120 formulas to convert any relevant input data into Cohen's d, Hedges' g, mean difference, odds ratio, risk ratio, incidence rate ratio, correlation coefficient, Fisher's r-to-z transformed correlation coefficient, variability ratio, coefficient of variation ratio, or number needed to treat. Researchers unfamiliar with R can use this software through a browser-based graphical interface (https://metaconvert.org/). We hope this suite will help researchers in the life sciences and other disciplines estimate and convert effect sizes more easily and accurately.

科学的一个基本支柱是对关联效应大小的估计。然而,这项任务有时很困难,而且容易出错。为了方便这一过程,R包metaConvert自动计算并灵活转换多个效应大小度量。它使用120多个公式将任何相关输入数据转换为Cohen’s d、Hedges’g、均值差、优势比、风险比、发病率比、相关系数、Fisher’s r- z转换相关系数、变异性比、变异系数比或需要处理的数。不熟悉R的研究人员可以通过基于浏览器的图形界面(https://metaconvert.org/)使用该软件。我们希望这个套件能够帮助生命科学和其他学科的研究人员更容易、更准确地估计和转换效应大小。
{"title":"metaConvert: an automatic suite for estimation of 11 different effect size measures and flexible conversion across them.","authors":"Corentin J Gosling, Samuele Cortese, Marco Solmi, Belen Haza, Eduard Vieta, Richard Delorme, Paolo Fusar-Poli, Joaquim Radua","doi":"10.1017/rsm.2025.11","DOIUrl":"10.1017/rsm.2025.11","url":null,"abstract":"<p><p>A fundamental pillar of science is the estimation of the effect size of associations. However, this task is sometimes difficult and error-prone. To facilitate this process, the R package metaConvert automatically calculates and flexibly converts multiple effect size measures. It applies more than 120 formulas to convert any relevant input data into Cohen's <i>d</i>, Hedges' <i>g</i>, mean difference, odds ratio, risk ratio, incidence rate ratio, correlation coefficient, Fisher's r-to-z transformed correlation coefficient, variability ratio, coefficient of variation ratio, or number needed to treat. Researchers unfamiliar with R can use this software through a browser-based graphical interface (https://metaconvert.org/). We hope this suite will help researchers in the life sciences and other disciplines estimate and convert effect sizes more easily and accurately.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"575-586"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527507/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Treatment recommendations based on network meta-analysis: Rules for risk-averse decision-makers. 基于网络荟萃分析的治疗建议:风险规避决策者的规则。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.17
A E Ades, Annabel L Davies, David M Phillippo, Hugo Pedder, Howard Thom, Beatrice Downing, Deborah M Caldwell, Nicky J Welton

The treatment recommendation based on a network meta-analysis (NMA) is usually the single treatment with the highest expected value (EV) on an evaluative function. We explore approaches that recommend multiple treatments and that penalise uncertainty, making them suitable for risk-averse decision-makers. We introduce loss-adjusted EV (LaEV) and compare it to GRADE and three probability-based rankings. We define properties of a valid ranking under uncertainty and other desirable properties of ranking systems. A two-stage process is proposed: the first identifies treatments superior to the reference treatment; the second identifies those that are also within a minimal clinically important difference (MCID) of the best treatment. Decision rules and ranking systems are compared on stylised examples and 10 NMAs used in NICE (National Institute of Health and Care Excellence) guidelines. Only LaEV reliably delivers valid rankings under uncertainty and has all the desirable properties. In 10 NMAs comparing between 5 and 41 treatments, an EV decision maker would recommend 4-14 treatments, and LaEV 0-3 (median 2) fewer. GRADE rules give rise to anomalies, and, like the probability-based rankings, the number of treatments recommended depends on arbitrary probability cutoffs. Among treatments that are superior to the reference, GRADE privileges the more uncertain ones, and in 3/10 cases, GRADE failed to recommend the treatment with the highest EV and LaEV. A two-stage approach based on MCID ensures that EV- and LaEV-based rules recommend a clinically appropriate number of treatments. For a risk-averse decision maker, LaEV is conservative, simple to implement, and has an independent theoretical foundation.

基于网络元分析(NMA)的治疗建议通常是评价函数期望值(EV)最高的单一治疗。我们探索了推荐多种治疗方法并惩罚不确定性的方法,使其适合厌恶风险的决策者。我们引入损失调整EV (LaEV),并将其与GRADE和三种基于概率的排名进行比较。我们定义了不确定情况下有效排序的性质和排序系统的其他理想性质。提出了两阶段流程:首先确定优于参考处理的处理;第二种识别那些也在最小临床重要差异(MCID)内的最佳治疗。决策规则和排名系统在风格化的例子和NICE(国家健康和护理卓越研究所)指南中使用的10个nma进行了比较。只有LaEV在不确定的情况下可靠地提供有效的排名,并具有所有理想的属性。在10个nma中,比较5 - 41种治疗方法,EV决策者会推荐4-14种治疗方法,LaEV 0-3(中位数2)更少。GRADE规则会产生异常,并且,像基于概率的排名一样,推荐的治疗数量取决于任意的概率截止值。在优于参考的治疗方案中,GRADE优先于更不确定的治疗方案,在3/10的病例中,GRADE未能推荐EV和LaEV最高的治疗方案。基于MCID的两阶段方法确保基于EV和laev的规则推荐临床适当的治疗数量。对于风险厌恶型决策者而言,LaEV具有保守性、实现简单、独立的理论基础。
{"title":"Treatment recommendations based on network meta-analysis: Rules for risk-averse decision-makers.","authors":"A E Ades, Annabel L Davies, David M Phillippo, Hugo Pedder, Howard Thom, Beatrice Downing, Deborah M Caldwell, Nicky J Welton","doi":"10.1017/rsm.2025.17","DOIUrl":"10.1017/rsm.2025.17","url":null,"abstract":"<p><p>The treatment recommendation based on a network meta-analysis (NMA) is usually the single treatment with the highest expected value (EV) on an evaluative function. We explore approaches that recommend multiple treatments and that penalise uncertainty, making them suitable for risk-averse decision-makers. We introduce loss-adjusted EV (LaEV) and compare it to GRADE and three probability-based rankings. We define properties of a valid ranking under uncertainty and other desirable properties of ranking systems. A two-stage process is proposed: the first identifies treatments superior to the reference treatment; the second identifies those that are also within a minimal clinically important difference (MCID) of the best treatment. Decision rules and ranking systems are compared on stylised examples and 10 NMAs used in NICE (National Institute of Health and Care Excellence) guidelines. Only LaEV reliably delivers valid rankings under uncertainty and has all the desirable properties. In 10 NMAs comparing between 5 and 41 treatments, an EV decision maker would recommend 4-14 treatments, and LaEV 0-3 (median 2) fewer. GRADE rules give rise to anomalies, and, like the probability-based rankings, the number of treatments recommended depends on arbitrary probability cutoffs. Among treatments that are superior to the reference, GRADE privileges the more uncertain ones, and in 3/10 cases, GRADE failed to recommend the treatment with the highest EV and LaEV. A two-stage approach based on MCID ensures that EV- and LaEV-based rules recommend a clinically appropriate number of treatments. For a risk-averse decision maker, LaEV is conservative, simple to implement, and has an independent theoretical foundation.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"550-568"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527546/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalizable and scalable multistage biomedical concept normalization leveraging large language models. 利用大型语言模型的可泛化和可扩展的多阶段生物医学概念规范化。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.9
Nicholas J Dobbins

Background: Biomedical entity normalization is critical to biomedical research because the richness of free-text clinical data, such as progress notes, can often be fully leveraged only after translating words and phrases into structured and coded representations suitable for analysis. Large Language Models (LLMs), in turn, have shown great potential and high performance in a variety of natural language processing (NLP) tasks, but their application for normalization remains understudied.

Methods: We applied both proprietary and open-source LLMs in combination with several rule-based normalization systems commonly used in biomedical research. We used a two-step LLM integration approach, (1) using an LLM to generate alternative phrasings of a source utterance, and (2) to prune candidate UMLS concepts, using a variety of prompting methods. We measure results by $F_{beta }$, where we favor recall over precision, and F1.

Results: We evaluated a total of 5,523 concept terms and text contexts from a publicly available dataset of human-annotated biomedical abstracts. Incorporating GPT-3.5-turbo increased overall $F_{beta }$ and F1 in normalization systems +16.5 and +16.2 (OpenAI embeddings), +9.5 and +7.3 (MetaMapLite), +13.9 and +10.9 (QuickUMLS), and +10.5 and +10.3 (BM25), while the open-source Vicuna model achieved +20.2 and +21.7 (OpenAI embeddings), +10.8 and +12.2 (MetaMapLite), +14.7 and +15 (QuickUMLS), and +15.6 and +18.7 (BM25).

Conclusions: Existing general-purpose LLMs, both propriety and open-source, can be leveraged to greatly improve normalization performance using existing tools, with no fine-tuning.

背景:生物医学实体规范化对生物医学研究至关重要,因为自由文本临床数据(如进度记录)的丰富性通常只有在将单词和短语翻译成适合分析的结构化和编码表示后才能充分利用。反过来,大型语言模型(llm)在各种自然语言处理(NLP)任务中显示出巨大的潜力和高性能,但它们在规范化方面的应用仍有待研究。方法:我们结合生物医学研究中常用的几种基于规则的规范化系统,应用专有和开源法学硕士。我们使用了两步LLM集成方法,(1)使用LLM生成源话语的替代短语,(2)使用各种提示方法修剪候选的UMLS概念。我们用F_{beta}$和F1来衡量结果,其中我们倾向于召回率而不是精度。结果:我们从人类注释的生物医学摘要的公开数据集中评估了总共5523个概念术语和文本上下文。采用gpt -3.5 turbo的归一化系统在+16.5和+16.2 (OpenAI嵌入)、+9.5和+7.3 (MetaMapLite)、+13.9和+10.9 (QuickUMLS)、+10.5和+10.3 (BM25)中增加了总体$F_{beta}$和F1,而开源Vicuna模型在+20.2和+21.7 (OpenAI嵌入)、+10.8和+12.2 (MetaMapLite)、+14.7和+15 (QuickUMLS)、+15.6和+18.7 (BM25)中实现了+20.2和+21.7。结论:现有的通用llm,无论是专有的还是开源的,都可以利用现有的工具来极大地提高规范化性能,而无需进行微调。
{"title":"Generalizable and scalable multistage biomedical concept normalization leveraging large language models.","authors":"Nicholas J Dobbins","doi":"10.1017/rsm.2025.9","DOIUrl":"10.1017/rsm.2025.9","url":null,"abstract":"<p><strong>Background: </strong>Biomedical entity normalization is critical to biomedical research because the richness of free-text clinical data, such as progress notes, can often be fully leveraged only after translating words and phrases into structured and coded representations suitable for analysis. Large Language Models (LLMs), in turn, have shown great potential and high performance in a variety of natural language processing (NLP) tasks, but their application for normalization remains understudied.</p><p><strong>Methods: </strong>We applied both proprietary and open-source LLMs in combination with several rule-based normalization systems commonly used in biomedical research. We used a two-step LLM integration approach, (1) using an LLM to generate alternative phrasings of a source utterance, and (2) to prune candidate UMLS concepts, using a variety of prompting methods. We measure results by $F_{beta }$, where we favor recall over precision, and F1.</p><p><strong>Results: </strong>We evaluated a total of 5,523 concept terms and text contexts from a publicly available dataset of human-annotated biomedical abstracts. Incorporating GPT-3.5-turbo increased overall $F_{beta }$ and F1 in normalization systems +16.5 and +16.2 (OpenAI embeddings), +9.5 and +7.3 (MetaMapLite), +13.9 and +10.9 (QuickUMLS), and +10.5 and +10.3 (BM25), while the open-source Vicuna model achieved +20.2 and +21.7 (OpenAI embeddings), +10.8 and +12.2 (MetaMapLite), +14.7 and +15 (QuickUMLS), and +15.6 and +18.7 (BM25).</p><p><strong>Conclusions: </strong>Existing general-purpose LLMs, both propriety and open-source, can be leveraged to greatly improve normalization performance using existing tools, with no fine-tuning.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"479-490"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527512/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the potential of Claude 2 for risk of bias assessment: Using a large language model to assess randomized controlled trials with RoB 2. 探索Claude 2对偏倚风险评估的潜力:使用大型语言模型评估RoB 2的随机对照试验。
IF 6.1 2区 生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-05-01 DOI: 10.1017/rsm.2025.12
Angelika Eisele-Metzger, Judith-Lisa Lieberum, Markus Toews, Waldemar Siemens, Felix Heilmeyer, Christian Haverkamp, Daniel Boehringer, Joerg J Meerpohl

Systematic reviews are essential for evidence-based health care, but conducting them is time- and resource-consuming. To date, efforts have been made to accelerate and (semi-)automate various steps of systematic reviews through the use of artificial intelligence (AI) and the emergence of large language models (LLMs) promises further opportunities. One crucial but complex task within systematic review conduct is assessing the risk of bias (RoB) of included studies. Therefore, the aim of this study was to test the LLM Claude 2 for RoB assessment of 100 randomized controlled trials, published in English language from 2013 onwards, using the revised Cochrane risk of bias tool ('RoB 2'; involving judgements for five specific domains and an overall judgement). We assessed the agreement of RoB judgements by Claude with human judgements published in Cochrane reviews. The observed agreement between Claude and Cochrane authors ranged from 41% for the overall judgement to 71% for domain 4 ('outcome measurement'). Cohen's κ was lowest for domain 5 ('selective reporting'; 0.10 (95% confidence interval (CI): -0.10-0.31)) and highest for domain 3 ('missing data'; 0.31 (95% CI: 0.10-0.52)), indicating slight to fair agreement. Fair agreement was found for the overall judgement (Cohen's κ: 0.22 (95% CI: 0.06-0.38)). Sensitivity analyses using alternative prompting techniques or the more recent version Claude 3 did not result in substantial changes. Currently, Claude's RoB 2 judgements cannot replace human RoB assessment. However, the potential of LLMs to support RoB assessment should be further explored.

系统评价对循证卫生保健至关重要,但进行系统评价既费时又耗资源。迄今为止,通过使用人工智能(AI),已经努力加速和(半)自动化系统审查的各个步骤,并且大型语言模型(llm)的出现预示着更多的机会。系统评价中一个关键但复杂的任务是评估纳入研究的偏倚风险(RoB)。因此,本研究的目的是使用修订后的Cochrane偏倚风险工具(“RoB 2”,涉及对五个特定领域的判断和一个总体判断),对2013年以来发表的100项随机对照试验的RoB评估进行LLM Claude 2测试。我们评估了Claude的RoB判断与Cochrane评论中发表的人类判断的一致性。Claude和Cochrane作者之间观察到的一致性从总体判断的41%到领域4(“结果测量”)的71%不等。Cohen’s κ在域5(“选择性报告”,0.10(95%可信区间(CI): -0.10-0.31))中最低,在域3(“缺失数据”,0.31 (95% CI: 0.10-0.52)中最高,表明稍微一致。总体判断一致(Cohen’s κ: 0.22 (95% CI: 0.06-0.38))。使用替代提示技术或最新版本Claude 3的敏感性分析没有导致实质性的变化。目前,克劳德的RoB 2判断不能取代人类的RoB评估。然而,法学硕士支持RoB评估的潜力还有待进一步探索。
{"title":"Exploring the potential of Claude 2 for risk of bias assessment: Using a large language model to assess randomized controlled trials with RoB 2.","authors":"Angelika Eisele-Metzger, Judith-Lisa Lieberum, Markus Toews, Waldemar Siemens, Felix Heilmeyer, Christian Haverkamp, Daniel Boehringer, Joerg J Meerpohl","doi":"10.1017/rsm.2025.12","DOIUrl":"10.1017/rsm.2025.12","url":null,"abstract":"<p><p>Systematic reviews are essential for evidence-based health care, but conducting them is time- and resource-consuming. To date, efforts have been made to accelerate and (semi-)automate various steps of systematic reviews through the use of artificial intelligence (AI) and the emergence of large language models (LLMs) promises further opportunities. One crucial but complex task within systematic review conduct is assessing the risk of bias (RoB) of included studies. Therefore, the aim of this study was to test the LLM Claude 2 for RoB assessment of 100 randomized controlled trials, published in English language from 2013 onwards, using the revised Cochrane risk of bias tool ('RoB 2'; involving judgements for five specific domains and an overall judgement). We assessed the agreement of RoB judgements by Claude with human judgements published in Cochrane reviews. The observed agreement between Claude and Cochrane authors ranged from 41% for the overall judgement to 71% for domain 4 ('outcome measurement'). Cohen's κ was lowest for domain 5 ('selective reporting'; 0.10 (95% confidence interval (CI): -0.10-0.31)) and highest for domain 3 ('missing data'; 0.31 (95% CI: 0.10-0.52)), indicating slight to fair agreement. Fair agreement was found for the overall judgement (Cohen's κ: 0.22 (95% CI: 0.06-0.38)). Sensitivity analyses using alternative prompting techniques or the more recent version Claude 3 did not result in substantial changes. Currently, Claude's RoB 2 judgements cannot replace human RoB assessment. However, the potential of LLMs to support RoB assessment should be further explored.</p>","PeriodicalId":226,"journal":{"name":"Research Synthesis Methods","volume":"16 3","pages":"491-508"},"PeriodicalIF":6.1,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12527486/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Research Synthesis Methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1