下载PDF
{"title":"Anatomy-specific Progression Classification in Chest Radiographs via Weakly Supervised Learning.","authors":"Ke Yu, Shantanu Ghosh, Zhexiong Liu, Christopher Deible, Clare B Poynton, Kayhan Batmanghelich","doi":"10.1148/ryai.230277","DOIUrl":null,"url":null,"abstract":"<p><p>Purpose To develop a machine learning approach for classifying disease progression in chest radiographs using weak labels automatically derived from radiology reports. Materials and Methods In this retrospective study, a twin neural network was developed to classify anatomy-specific disease progression into four categories: improved, unchanged, worsened, and new. A two-step weakly supervised learning approach was employed, pretraining the model on 243 008 frontal chest radiographs from 63 877 patients (mean age, 51.7 years ± 17.0 [SD]; 34 813 [55%] female) included in the MIMIC-CXR database and fine-tuning it on the subset with progression labels derived from consecutive studies. Model performance was evaluated for six pathologic observations on test datasets of unseen patients from the MIMIC-CXR database. Area under the receiver operating characteristic (AUC) analysis was used to evaluate classification performance. The algorithm is also capable of generating bounding-box predictions to localize areas of new progression. Recall, precision, and mean average precision were used to evaluate the new progression localization. One-tailed paired <i>t</i> tests were used to assess statistical significance. Results The model outperformed most baselines in progression classification, achieving macro AUC scores of 0.72 ± 0.004 for atelectasis, 0.75 ± 0.007 for consolidation, 0.76 ± 0.017 for edema, 0.81 ± 0.006 for effusion, 0.7 ± 0.032 for pneumonia, and 0.69 ± 0.01 for pneumothorax. For new observation localization, the model achieved mean average precision scores of 0.25 ± 0.03 for atelectasis, 0.34 ± 0.03 for consolidation, 0.33 ± 0.03 for edema, and 0.31 ± 0.03 for pneumothorax. Conclusion Disease progression classification models were developed on a large chest radiograph dataset, which can be used to monitor interval changes and detect new pathologic conditions on chest radiographs. <b>Keywords:</b> Prognosis, Unsupervised Learning, Transfer Learning, Convolutional Neural Network (CNN), Emergency Radiology, Named Entity Recognition <i>Supplemental material is available for this article.</i> © RSNA, 2024 See also commentary by Alves and Venkadesh in this issue.</p>","PeriodicalId":29787,"journal":{"name":"Radiology-Artificial Intelligence","volume":" ","pages":"e230277"},"PeriodicalIF":8.1000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11427915/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiology-Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1148/ryai.230277","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
引用
批量引用
Abstract
Purpose To develop a machine learning approach for classifying disease progression in chest radiographs using weak labels automatically derived from radiology reports. Materials and Methods In this retrospective study, a twin neural network was developed to classify anatomy-specific disease progression into four categories: improved, unchanged, worsened, and new. A two-step weakly supervised learning approach was employed, pretraining the model on 243 008 frontal chest radiographs from 63 877 patients (mean age, 51.7 years ± 17.0 [SD]; 34 813 [55%] female) included in the MIMIC-CXR database and fine-tuning it on the subset with progression labels derived from consecutive studies. Model performance was evaluated for six pathologic observations on test datasets of unseen patients from the MIMIC-CXR database. Area under the receiver operating characteristic (AUC) analysis was used to evaluate classification performance. The algorithm is also capable of generating bounding-box predictions to localize areas of new progression. Recall, precision, and mean average precision were used to evaluate the new progression localization. One-tailed paired t tests were used to assess statistical significance. Results The model outperformed most baselines in progression classification, achieving macro AUC scores of 0.72 ± 0.004 for atelectasis, 0.75 ± 0.007 for consolidation, 0.76 ± 0.017 for edema, 0.81 ± 0.006 for effusion, 0.7 ± 0.032 for pneumonia, and 0.69 ± 0.01 for pneumothorax. For new observation localization, the model achieved mean average precision scores of 0.25 ± 0.03 for atelectasis, 0.34 ± 0.03 for consolidation, 0.33 ± 0.03 for edema, and 0.31 ± 0.03 for pneumothorax. Conclusion Disease progression classification models were developed on a large chest radiograph dataset, which can be used to monitor interval changes and detect new pathologic conditions on chest radiographs. Keywords: Prognosis, Unsupervised Learning, Transfer Learning, Convolutional Neural Network (CNN), Emergency Radiology, Named Entity Recognition Supplemental material is available for this article. © RSNA, 2024 See also commentary by Alves and Venkadesh in this issue.
通过弱监督学习对胸部 X 光片进行特定解剖学进展分类
"刚刚接受 "的论文经过同行评审,已被接受在《放射学》上发表:人工智能》上发表。这篇文章在以最终版本发表之前,还将经过校对、排版和校对审核。请注意,在制作最终校对稿的过程中,可能会发现一些错误,从而影响文章内容。目的 开发一种机器学习方法,利用从放射学报告中自动提取的弱标签对胸片中的疾病进展进行分类。材料与方法 在这项回顾性研究中,开发了一种孪生神经网络,将特定解剖结构的疾病进展分为四类:好转、不变、恶化和新发。研究采用了两步弱监督学习法,在来自 63,877 名 MIMIC-CXR 患者(平均年龄 51.7 岁;女性 34,813 人)的 243,008 张正面胸部 X 光片上对模型进行预训练,并在从连续研究中获得的疾病进展标签子集上对模型进行微调。在未见过的 MIMIC-CXR 患者测试数据集上对六种病理观察结果进行了模型性能评估。接受者操作特征下面积(AUC)分析用于评估分类性能。该算法还能生成边界框预测,以定位新的进展区域。采用召回率、精确度和平均精确度(mAP)来评估新进展定位。采用单尾配对 t 检验来评估统计意义。结果 该模型在进展分类方面的表现优于大多数基线模型,其宏观AUC得分分别为:肺不张(0.72 ± 0.004)、肺不张(0.75 ± 0.007)、肺水肿(0.76 ± 0.017)、肺积液(0.81 ± 0.006)、肺炎(0.7 ± 0.032)和气胸(0.69 ± 0.01)。对于新的观察定位,该模型的 mAP 评分分别为:肺不张(0.25 ± 0.03)、肺不张(0.34 ± 0.03)、肺水肿(0.33 ± 0.03)和气胸(0.31 ± 0.03)。结论 在大型胸片数据集上开发了疾病进展分类模型,可用于监测间隔变化和检测胸片上的新病变。©RSNA,2024。
本文章由计算机程序翻译,如有差异,请以英文原文为准。