Identifying abdominal aortic aneurysm size and presence using Natural Language Processing of radiology reports: a systematic review and meta-analysis
Seyed Mohammad Sajjadi, Alisa Mohebbi, Amirhossein Ehsani, Amir Marashi, Aida Azhdarimoghaddam, Shaghayegh Karami, Mohammad Amin Karimi, Mahsa Sadeghi, Kiana Firoozi, Amir Mohammad Zamani, Amirhossein Rigi, Melika Nayebagha, Mahsa Asadi Anar, Pooya Eini, Sadaf Salehi, Mahsa Rostami Ghezeljeh
Abdominal Radiology (JCR Q2, Radiology, Nuclear Medicine & Medical Imaging; impact factor 2.3). Journal article, published 2025-01-30. DOI: 10.1007/s00261-025-04810-5 (https://doi.org/10.1007/s00261-025-04810-5)
Abstract
Background and aim: Prior investigations of the natural history of abdominal aortic aneurysms (AAAs) have been constrained by small sample sizes or uneven assessments of aggregated data. Natural language processing (NLP) can significantly enhance the investigation and treatment of patients with AAAs by collecting imaging data from health records quickly and effectively. This meta-analysis aimed to evaluate how reliably NLP techniques identify the presence or absence of AAAs and measure the maximal abdominal aortic diameter in large datasets of radiology reports.
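To make the extraction task concrete, the following is a minimal rule-based sketch of pulling a maximal aortic diameter out of free-text report sentences. It is an illustration only, not the method of any study in the review: the function name `max_aortic_diameter_cm`, the keyword filter, and the measurement pattern are all assumptions, and the sketch ignores negation, prior-exam comparisons, and measurements of other structures mentioned in the same sentence.

```python
import re

# Assumed pattern: "4.2 cm", "42 mm", or a two-dimension form such as "5.1 x 4.8 cm".
_MEASUREMENT = re.compile(
    r"(\d+(?:\.\d+)?)(?:\s*[x×]\s*(\d+(?:\.\d+)?))?\s*(cm|mm)\b",
    re.IGNORECASE,
)
# Assumed keyword filter: only sentences that mention the aorta or an aneurysm.
_AORTA = re.compile(r"aort|aneurysm", re.IGNORECASE)


def max_aortic_diameter_cm(report):
    """Return the largest aorta-related measurement in cm, or None if none is found."""
    sizes = []
    # Naive sentence split on terminal punctuation followed by whitespace.
    for sentence in re.split(r"(?<=[.!?])\s+", report):
        if not _AORTA.search(sentence):
            continue
        for first, second, unit in _MEASUREMENT.findall(sentence):
            scale = 0.1 if unit.lower() == "mm" else 1.0
            for value in (first, second):
                if value:
                    sizes.append(float(value) * scale)
    return max(sizes) if sizes else None


if __name__ == "__main__":
    text = (
        "Infrarenal abdominal aortic aneurysm measuring 5.1 x 4.8 cm. "
        "Liver lesion of 12 mm is unchanged."
    )
    print(max_aortic_diameter_cm(text))
```

Restricting the search to sentences that mention the aorta is a deliberately crude way to avoid picking up unrelated measurements; a production system would need far more careful handling of report variability.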
Method: The PubMed, Scopus, Web of Science, Embase, and Science Direct databases were searched through March 2024 for pertinent papers. The Rayyan intelligent tool for systematic reviews was used to screen the studies. The meta-analysis was conducted using STATA v18 software. Egger's test was used to evaluate publication bias, and the Newcastle-Ottawa Scale was used to assess the quality of the included studies. A plot digitizer was used to extract data from plots.
Result: A total of 39,094 individuals with AAA were included in this analysis; 27,326 patients were male and 11,383 were female. The mean age of all participants was 73.1 ± 1.25 years. The pooled estimates of sensitivity, specificity, precision, and accuracy for the implemented NLP models were 0.89 (0.88-0.91), 0.88 (0.87-0.89), 0.92 (0.89-0.95), and 0.91 (0.89-0.93), respectively. Across the included studies, the aneurysm diameter reported at follow-up was 0.05 cm smaller after NLP implementation than before, a statistically significant difference.
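For readers less familiar with these performance measures, the sketch below shows the standard confusion-matrix definitions of sensitivity, specificity, precision, and accuracy. The function name `classification_metrics` and the counts in the example are invented for illustration; they are not data from the review, and the sketch does not reproduce the pooling performed in STATA.

```python
def classification_metrics(tp, fp, tn, fn):
    """Compute standard metrics from confusion-matrix counts.

    tp/fn: reports with an AAA that the model flagged / missed.
    tn/fp: reports without an AAA that the model cleared / wrongly flagged.
    """
    if min(tp + fn, tn + fp, tp + fp) == 0:
        raise ValueError("each metric needs a non-zero denominator")
    return {
        "sensitivity": tp / (tp + fn),   # share of true AAAs detected
        "specificity": tn / (tn + fp),   # share of non-AAAs correctly cleared
        "precision": tp / (tp + fp),     # share of flagged reports that are true AAAs
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
    }


if __name__ == "__main__":
    # Hypothetical counts, chosen only to make the arithmetic easy to follow.
    for name, value in classification_metrics(tp=90, fp=30, tn=170, fn=10).items():
        print(f"{name}: {value:.3f}")
```

Note that precision, unlike sensitivity and specificity, depends on how common AAAs are in the evaluated reports, so it is not directly comparable across datasets with different prevalence.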
Conclusion: NLP holds great potential for automating the detection of AAA size and presence in radiology reports, enhancing efficiency and scalability over manual review. However, challenges persist. Variability in report formats, terminology, and unstructured data can compromise accuracy. Additionally, NLP models rely on high-quality, annotated training datasets, which may be incomplete or unrepresentative. While NLP aids in identifying AAA-related data, human oversight is essential to ensure decisions are informed by the patient's broader clinical context. Ongoing algorithm refinement and seamless integration into clinical workflows are key to improving NLP's utility and reliability in this field.
About the journal:
Abdominal Radiology seeks to meet the professional needs of the abdominal radiologist by publishing clinically pertinent original, review and practice related articles on the gastrointestinal and genitourinary tracts and abdominal interventional and radiologic procedures. Case reports are generally not accepted unless they are the first report of a new disease or condition, or part of a special solicited section.