Nikolas Touloumes, Georgia Gagianas, James Bradley, Michael Muelly, Angad Kalra, Joshua Reicher
{"title":"ScreenDx, an artificial intelligence-based algorithm for the incidental detection of pulmonary fibrosis.","authors":"Nikolas Touloumes, Georgia Gagianas, James Bradley, Michael Muelly, Angad Kalra, Joshua Reicher","doi":"10.1016/j.amjms.2025.02.011","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Nonspecific symptoms and variability in radiographic reporting patterns contribute to a diagnostic delay of the diagnosis of pulmonary fibrosis. An attractive solution is the use of machine-learning algorithms to screen for radiographic features suggestive of pulmonary fibrosis. Thus, we developed and validated a machine learning classifier algorithm (ScreenDx) to screen computed tomography imaging and identify incidental cases of pulmonary fibrosis.</p><p><strong>Methods: </strong>ScreenDx is a deep learning convolutional neural network that was developed from a multi-source dataset (cohort A) of 3,658 cases of normal and abnormal CT's, including CT's from patients with COPD, emphysema, and community-acquired pneumonia. Cohort B, a US-based cohort (n=381) was used for tuning the algorithm, and external validation was performed on cohort C (n=683), a separate international dataset.</p><p><strong>Results: </strong>At the optimal threshold, the sensitivity and specificity for detection of pulmonary fibrosis in cohort B was 0.91 (95% CI 88-94%) and 0.95 (95% CI 93-97%), respectively, with AUC 0.98. In the external validation dataset (cohort C), the sensitivity and specificity were 1.0 (95% 99.9-100.0) and 0.98 (95% CI 97.9-99.6), respectively, with AUC 0.997. There were no significant differences in the ability of ScreenDx to identify pulmonary fibrosis based on CT manufacturer (Phillips, Toshiba, GE Healthcare, or Siemens) or slice thickness (2 mm vs 2-4 mm vs 4 mm).</p><p><strong>Conclusion: </strong>Regardless of CT manufacturer or slice thickness, ScreenDx demonstrated high performance across two, multi-site datasets for identifying incidental cases of pulmonary fibrosis. This suggest that the algorithm may be generalizable across patient populations and different healthcare systems.</p>","PeriodicalId":94223,"journal":{"name":"The American journal of the medical sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The American journal of the medical sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.amjms.2025.02.011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Nonspecific symptoms and variability in radiographic reporting patterns contribute to a diagnostic delay of the diagnosis of pulmonary fibrosis. An attractive solution is the use of machine-learning algorithms to screen for radiographic features suggestive of pulmonary fibrosis. Thus, we developed and validated a machine learning classifier algorithm (ScreenDx) to screen computed tomography imaging and identify incidental cases of pulmonary fibrosis.
Methods: ScreenDx is a deep learning convolutional neural network that was developed from a multi-source dataset (cohort A) of 3,658 cases of normal and abnormal CT's, including CT's from patients with COPD, emphysema, and community-acquired pneumonia. Cohort B, a US-based cohort (n=381) was used for tuning the algorithm, and external validation was performed on cohort C (n=683), a separate international dataset.
Results: At the optimal threshold, the sensitivity and specificity for detection of pulmonary fibrosis in cohort B was 0.91 (95% CI 88-94%) and 0.95 (95% CI 93-97%), respectively, with AUC 0.98. In the external validation dataset (cohort C), the sensitivity and specificity were 1.0 (95% 99.9-100.0) and 0.98 (95% CI 97.9-99.6), respectively, with AUC 0.997. There were no significant differences in the ability of ScreenDx to identify pulmonary fibrosis based on CT manufacturer (Phillips, Toshiba, GE Healthcare, or Siemens) or slice thickness (2 mm vs 2-4 mm vs 4 mm).
Conclusion: Regardless of CT manufacturer or slice thickness, ScreenDx demonstrated high performance across two, multi-site datasets for identifying incidental cases of pulmonary fibrosis. This suggest that the algorithm may be generalizable across patient populations and different healthcare systems.
背景:非特异性症状和放射影像报告模式的多变性导致肺纤维化的诊断延迟。一个有吸引力的解决方案是使用机器学习算法来筛查提示肺纤维化的影像学特征。因此,我们开发并验证了一种机器学习分类器算法(ScreenDx),用于筛查计算机断层扫描成像并识别肺纤维化的偶发病例:ScreenDx是一种深度学习卷积神经网络,它是从一个包含3658例正常和异常CT的多源数据集(队列A)中开发出来的,其中包括慢性阻塞性肺病、肺气肿和社区获得性肺炎患者的CT。队列 B 是一个基于美国的队列(n=381),用于调整算法;队列 C 是一个独立的国际数据集(n=683),用于外部验证:在最佳阈值下,队列 B 检测肺纤维化的灵敏度和特异度分别为 0.91(95% CI 88-94%)和 0.95(95% CI 93-97%),AUC 为 0.98。在外部验证数据集(队列 C)中,灵敏度和特异性分别为 1.0(95% CI 99.9-100.0)和 0.98(95% CI 97.9-99.6),AUC 为 0.997。根据 CT 生产商(飞利浦、东芝、通用电气医疗集团或西门子)或切片厚度(2 毫米 vs 2-4 毫米 vs 4 毫米),ScreenDx 识别肺纤维化的能力没有明显差异:无论 CT 生产商或切片厚度如何,ScreenDx 在两个多站点数据集中都表现出了很高的识别肺纤维化偶发病例的能力。这表明该算法可适用于不同的患者群体和不同的医疗系统。