Evaluating GPT Models for Automated Literature Screening in Wastewater-Based Epidemiology
Kaseba Chibwe, David Mantilla-Calderon, Fangqiong Ling
ACS Environmental Au, 5(1), 61-68 (2024). DOI: 10.1021/acsenvironau.4c00042
Open access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11741058/pdf/
Abstract
Methods to quantitatively synthesize findings across multiple studies are an emerging need in wastewater-based epidemiology (WBE), where disease tracking through wastewater analysis is performed across broad geographical locations using various techniques to facilitate public health responses. Meta-analysis provides a rigorous statistical procedure for research synthesis, yet the manual process of screening large volumes of literature remains a hurdle to its application in timely, evidence-based public health responses. Here, we evaluated the performance of GPT-3, GPT-3.5, and GPT-4 models in the automated screening of publications for meta-analysis in the WBE literature. We show that the GPT-4 chat completion model, given the abstract text as input, accurately differentiates papers that contain original data from those that do not, with a precision of 0.96 and a recall of 1.00, exceeding the current quality standard for manual screening (recall = 0.95) while costing less than $0.01 per paper. GPT models performed less accurately in detecting studies reporting a relevant sampling location, highlighting the value of maintaining human intervention in AI-assisted literature screening. Importantly, we show that certain formulations and model choices generated nonsensical answers to the screening tasks while others did not, underscoring the need for attention to robustness when employing AI-assisted literature screening. This study provides novel performance evaluation data on GPT models for document screening as a step in meta-analysis, suggesting that AI-assisted literature screening is a useful complementary technique to speed up research synthesis in WBE.
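The screening step evaluated here can be approximated in a few lines of code: each candidate abstract is passed to a chat completion model with a yes/no screening question, and the model's answers are compared against human labels to compute precision and recall. The sketch below is illustrative only; it assumes the OpenAI Python client (v1-style chat.completions API), and the prompt wording, model name, and data-loading helper are hypothetical rather than taken from the study.

    # Minimal sketch of abstract-based screening with a chat completion model.
    # Assumes the OpenAI Python client (>=1.0); the prompt text, model name, and
    # load_labeled_abstracts() helper are illustrative, not the paper's exact setup.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    def screen_abstract(abstract: str) -> bool:
        """Ask the model whether the paper reports original data; return True/False."""
        prompt = (
            "Does the following abstract describe a study that reports original "
            "wastewater surveillance data? Answer with a single word: yes or no.\n\n"
            f"Abstract: {abstract}"
        )
        response = client.chat.completions.create(
            model="gpt-4",
            messages=[{"role": "user", "content": prompt}],
            temperature=0,
        )
        return response.choices[0].message.content.strip().lower().startswith("yes")

    def precision_recall(predictions, labels):
        """Compare boolean model predictions against human screening labels."""
        tp = sum(p and l for p, l in zip(predictions, labels))
        fp = sum(p and not l for p, l in zip(predictions, labels))
        fn = sum(l and not p for p, l in zip(predictions, labels))
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        return precision, recall

    # Usage (hypothetical loader): screen a labeled validation set and report metrics.
    # abstracts, human_labels = load_labeled_abstracts()
    # predictions = [screen_abstract(a) for a in abstracts]
    # print(precision_recall(predictions, human_labels))

For literature screening, recall is usually the binding constraint, since a relevant paper missed at this stage cannot be recovered later, whereas false positives can still be discarded by a human reviewer; this is why the authors benchmark against the manual-screening recall standard of 0.95.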
Journal Description:
ACS Environmental Au is an open access journal that publishes experimental research and theoretical results in all aspects of environmental science and technology, both pure and applied. Short letters, comprehensive articles, reviews, and perspectives are welcome in the following areas: Alternative Energy; Anthropogenic Impacts on Atmosphere, Soil, or Water; Biogeochemical Cycling; Biomass or Wastes as Resources; Contaminants in Aquatic and Terrestrial Environments; Environmental Data Science; Ecotoxicology and Public Health; Energy and Climate; Environmental Modeling, Processes, and Measurement Methods and Technologies; Environmental Nanotechnology and Biotechnology; Green Chemistry; Green Manufacturing and Engineering; Risk Assessment, Regulatory Frameworks, and Life-Cycle Assessments; Treatment, Resource Recovery, and Waste Management.