首页 > 最新文献

Journal of Digital Imaging最新文献

英文 中文
Intra-Individual Reproducibility of Automated Abdominal Organ Segmentation—Performance of TotalSegmentator Compared to Human Readers and an Independent nnU-Net Model 腹部器官自动分割的个体内再现性--TotalSegmentator 与人类阅读器和独立 nnU-Net 模型的性能比较
IF 4.4 2区 工程技术 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2024-09-18 DOI: 10.1007/s10278-024-01265-w
Lorraine Abel, Jakob Wasserthal, Manfred T. Meyer, Jan Vosshenrich, Shan Yang, Ricardo Donners, Markus Obmann, Daniel Boll, Elmar Merkle, Hanns-Christian Breit, Martin Segeroth

The purpose of this study is to assess segmentation reproducibility of artificial intelligence-based algorithm, TotalSegmentator, across 34 anatomical structures using multiphasic abdominal CT scans comparing unenhanced, arterial, and portal venous phases in the same patients. A total of 1252 multiphasic abdominal CT scans acquired at our institution between January 1, 2012, and December 31, 2022, were retrospectively included. TotalSegmentator was used to derive volumetric measurements of 34 abdominal organs and structures from the total of 3756 CT series. Reproducibility was evaluated across three contrast phases per CT and compared to two human readers and an independent nnU-Net trained on the BTCV dataset. Relative deviation in segmented volumes and absolute volume deviations (AVD) were reported. Volume deviation within 5% was considered reproducible. Thus, non-inferiority testing was conducted using a 5% margin. Twenty-nine out of 34 structures had volume deviations within 5% and were considered reproducible. Volume deviations for the adrenal glands, gallbladder, spleen, and duodenum were above 5%. Highest reproducibility was observed for bones (− 0.58% [95% CI: − 0.58, − 0.57]) and muscles (− 0.33% [− 0.35, − 0.32]). Among abdominal organs, volume deviation was 1.67% (1.60, 1.74). TotalSegmentator outperformed the reproducibility of the nnU-Net trained on the BTCV dataset with an AVD of 6.50% (6.41, 6.59) vs. 10.03% (9.86, 10.20; p < 0.0001), most notably in cases with pathologic findings. Similarly, TotalSegmentator’s AVD between different contrast phases was superior compared to the interreader AVD for the same contrast phase (p = 0.036). TotalSegmentator demonstrated high intra-individual reproducibility for most abdominal structures in multiphasic abdominal CT scans. Although reproducibility was lower in pathologic cases, it outperforms both human readers and a nnU-Net trained on the BTCV dataset.

本研究旨在评估基于人工智能的算法 TotalSegmentator 在 34 个解剖结构中的分割再现性,该算法使用多相腹部 CT 扫描,比较同一患者的未增强、动脉和门静脉相。我们回顾性地纳入了本机构在 2012 年 1 月 1 日至 2022 年 12 月 31 日期间获得的 1252 份多相腹部 CT 扫描。使用 TotalSegmentator 从总共 3756 个 CT 系列中得出 34 个腹部器官和结构的容积测量值。对每台 CT 的三个对比阶段的再现性进行了评估,并与两名人类阅读器和在 BTCV 数据集上训练的独立 nnU-Net 进行了比较。报告了分割体积的相对偏差和绝对体积偏差(AVD)。体积偏差在 5% 以内被认为是可重复的。因此,非劣效性测试使用 5%的余量进行。在 34 个结构中,有 29 个结构的体积偏差在 5%以内,被认为是可重复的。肾上腺、胆囊、脾脏和十二指肠的体积偏差超过了 5%。骨骼(- 0.58% [95% CI: - 0.58, - 0.57])和肌肉(- 0.33% [- 0.35, - 0.32])的再现性最高。在腹部器官中,体积偏差为 1.67% (1.60, 1.74)。TotalSegmentator 的再现性优于在 BTCV 数据集上训练的 nnU-Net,其 AVD 为 6.50% (6.41, 6.59) vs. 10.03% (9.86, 10.20; p < 0.0001),在有病理结果的病例中尤为明显。同样,TotalSegmentator 在不同对比阶段之间的 AVD 也优于同一对比阶段的读片机间 AVD(p = 0.036)。在多相腹部 CT 扫描中,TotalSegmentator 对大多数腹部结构显示出较高的个体内再现性。虽然在病理病例中重现性较低,但它的表现优于人类阅读器和在 BTCV 数据集上训练的 nnU-Net。
{"title":"Intra-Individual Reproducibility of Automated Abdominal Organ Segmentation—Performance of TotalSegmentator Compared to Human Readers and an Independent nnU-Net Model","authors":"Lorraine Abel, Jakob Wasserthal, Manfred T. Meyer, Jan Vosshenrich, Shan Yang, Ricardo Donners, Markus Obmann, Daniel Boll, Elmar Merkle, Hanns-Christian Breit, Martin Segeroth","doi":"10.1007/s10278-024-01265-w","DOIUrl":"https://doi.org/10.1007/s10278-024-01265-w","url":null,"abstract":"<p>The purpose of this study is to assess segmentation reproducibility of artificial intelligence-based algorithm, TotalSegmentator, across 34 anatomical structures using multiphasic abdominal CT scans comparing unenhanced, arterial, and portal venous phases in the same patients. A total of 1252 multiphasic abdominal CT scans acquired at our institution between January 1, 2012, and December 31, 2022, were retrospectively included. TotalSegmentator was used to derive volumetric measurements of 34 abdominal organs and structures from the total of 3756 CT series. Reproducibility was evaluated across three contrast phases per CT and compared to two human readers and an independent nnU-Net trained on the BTCV dataset. Relative deviation in segmented volumes and absolute volume deviations (AVD) were reported. Volume deviation within 5% was considered reproducible. Thus, non-inferiority testing was conducted using a 5% margin. Twenty-nine out of 34 structures had volume deviations within 5% and were considered reproducible. Volume deviations for the adrenal glands, gallbladder, spleen, and duodenum were above 5%. Highest reproducibility was observed for bones (− 0.58% [95% CI: − 0.58, − 0.57]) and muscles (− 0.33% [− 0.35, − 0.32]). Among abdominal organs, volume deviation was 1.67% (1.60, 1.74). TotalSegmentator outperformed the reproducibility of the nnU-Net trained on the BTCV dataset with an AVD of 6.50% (6.41, 6.59) vs. 10.03% (9.86, 10.20; <i>p</i> &lt; 0.0001), most notably in cases with pathologic findings. Similarly, TotalSegmentator’s AVD between different contrast phases was superior compared to the interreader AVD for the same contrast phase (<i>p</i> = 0.036). TotalSegmentator demonstrated high intra-individual reproducibility for most abdominal structures in multiphasic abdominal CT scans. Although reproducibility was lower in pathologic cases, it outperforms both human readers and a nnU-Net trained on the BTCV dataset.</p>","PeriodicalId":50214,"journal":{"name":"Journal of Digital Imaging","volume":"33 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142259282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Teleradiology-Based Referrals for Patients with Gastroenterological Diseases Between Tertiary and Regional Hospitals: A Hospital-to-Hospital Approach 三级医院和地区医院之间基于远程放射学的肠胃病患者转诊:医院对医院的方法
IF 4.4 2区 工程技术 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2024-09-17 DOI: 10.1007/s10278-024-01264-x
Kosuke Suzuki, Hiroaki Saito, Yoshika Saito, Akashi Endo, Daichi Togo, Risa Hanada, Rie Iwaya, Toshinori Sato, Kei Niida, Ryuta Suzuki, Junichi Togashi, Satoshi Ito, Yukari Tanaka, Yoshitaka Nawata, Kimihiro Igarashi, Hidetaka Hamamoto, Akihiko Ozaki, Tetsuya Tanimoto, Yasuteru Shimamura, Shunichi Sugawara, Masaki Nakashima, Toru Okuzono, Masato Nakahori, Akimichi Chonan, Tomoki Matsuda

Teleradiology is recognized for fostering collaboration between regional and tertiary hospitals. However, its application in gastroenterological diseases remains underexplored. This study aimed to assess the effectiveness of teleradiology in improving gastroenterological care. This retrospective study analyzed patients with gastroenterological diseases in a tertiary hospital who were referred from a regional hospital using a cloud-based radiology image-sharing system between July 2020 and June 2023. Our primary focus was to conduct a descriptive statistical analysis to evaluate patient characteristics and the referral process and analyze the timeframes from referral to transfer and from the start of treatment to discharge and the outcomes. We analyzed 56 patients, with 45 (80.4%) presenting hepatobiliary pancreatic disease. The most frequent condition was common bile duct stones (17 cases). Forty-nine cases were transferred for inpatient treatments, four underwent endoscopic examinations as outpatients, and two had imaging consultation without subsequent hospital visits. On referral day, 16 patients were transferred, and the remaining 33 (67.3%) were placed on a waiting list starting from the subsequent day. The median time from referral to admission was 1 day (range: 0–14 days), and the median time from referral to treatment was 2 days (range: 0–14 days). Remote image-sharing systems ensure accurate imaging at referral, preventing care delays. In collaboration with regional and tertiary hospitals, teleradiology may also be useful for gastroenterological diseases.

远程放射学在促进地区医院和三级医院之间的合作方面已得到公认。然而,远程放射学在消化系统疾病中的应用仍未得到充分探索。本研究旨在评估远程放射学在改善肠胃病治疗方面的效果。这项回顾性研究分析了一家三级医院在2020年7月至2023年6月期间使用基于云的放射影像共享系统从一家地区医院转诊的肠胃病患者。我们的主要重点是进行描述性统计分析,评估患者特征和转诊流程,分析从转诊到转院、从开始治疗到出院的时间范围和结果。我们对 56 名患者进行了分析,其中 45 人(80.4%)患有肝胆胰疾病。最常见的疾病是胆总管结石(17 例)。49例患者转入住院治疗,4例患者在门诊接受了内窥镜检查,2例患者接受了影像学会诊,但随后未到医院就诊。在转诊当天,有16名患者被转院,其余33名患者(67.3%)从第二天起被列入候诊名单。从转诊到入院的中位时间为 1 天(范围:0-14 天),从转诊到治疗的中位时间为 2 天(范围:0-14 天)。远程图像共享系统确保了转诊时的准确成像,避免了护理延误。通过与地区医院和三级医院合作,远程放射学对肠胃疾病也很有用。
{"title":"Teleradiology-Based Referrals for Patients with Gastroenterological Diseases Between Tertiary and Regional Hospitals: A Hospital-to-Hospital Approach","authors":"Kosuke Suzuki, Hiroaki Saito, Yoshika Saito, Akashi Endo, Daichi Togo, Risa Hanada, Rie Iwaya, Toshinori Sato, Kei Niida, Ryuta Suzuki, Junichi Togashi, Satoshi Ito, Yukari Tanaka, Yoshitaka Nawata, Kimihiro Igarashi, Hidetaka Hamamoto, Akihiko Ozaki, Tetsuya Tanimoto, Yasuteru Shimamura, Shunichi Sugawara, Masaki Nakashima, Toru Okuzono, Masato Nakahori, Akimichi Chonan, Tomoki Matsuda","doi":"10.1007/s10278-024-01264-x","DOIUrl":"https://doi.org/10.1007/s10278-024-01264-x","url":null,"abstract":"<p>Teleradiology is recognized for fostering collaboration between regional and tertiary hospitals. However, its application in gastroenterological diseases remains underexplored. This study aimed to assess the effectiveness of teleradiology in improving gastroenterological care. This retrospective study analyzed patients with gastroenterological diseases in a tertiary hospital who were referred from a regional hospital using a cloud-based radiology image-sharing system between July 2020 and June 2023. Our primary focus was to conduct a descriptive statistical analysis to evaluate patient characteristics and the referral process and analyze the timeframes from referral to transfer and from the start of treatment to discharge and the outcomes. We analyzed 56 patients, with 45 (80.4%) presenting hepatobiliary pancreatic disease. The most frequent condition was common bile duct stones (17 cases). Forty-nine cases were transferred for inpatient treatments, four underwent endoscopic examinations as outpatients, and two had imaging consultation without subsequent hospital visits. On referral day, 16 patients were transferred, and the remaining 33 (67.3%) were placed on a waiting list starting from the subsequent day. The median time from referral to admission was 1 day (range: 0–14 days), and the median time from referral to treatment was 2 days (range: 0–14 days). Remote image-sharing systems ensure accurate imaging at referral, preventing care delays. In collaboration with regional and tertiary hospitals, teleradiology may also be useful for gastroenterological diseases.</p>","PeriodicalId":50214,"journal":{"name":"Journal of Digital Imaging","volume":"188 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142259283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multi-step Integrative Workflow Implementation to Improve Documentation of Point of Care Ultrasound in Medical Intensive Care Unit 实施多步骤综合工作流程,改进医疗重症监护室护理点超声检查记录
IF 4.4 2区 工程技术 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2024-09-16 DOI: 10.1007/s10278-024-01260-1
Vishal Deepak, Haroon Ahmed, Joseph Minardi, Sunil Sharma

Point of care ultrasound (POCUS) provides quick bedside assessment for diagnosing and managing life-threatening conditions in critical care medicine. There has been increasing interest in developing infrastructure to archive images, record clinical interpretation, assess quality, and recoup revenue for POCUS. We present a simple workflow by systems integration of electronic medical record, ultrasound machines, picture archiving, and communication system to facilitate POCUS documentation and billing. We recorded a trend on the number of POCUS performed before and after introduction of the structured integration. We observed and recorded a linear increase over time post-intervention. Our innovative and integrated POCUS workflow is an effective way to document and bill POCUS.

护理点超声检查(POCUS)为危重病医学中诊断和处理危及生命的病症提供快速床旁评估。人们对开发基础设施以存档图像、记录临床解释、评估质量和回收 POCUS 收入的兴趣与日俱增。我们通过系统集成电子病历、超声波机、图片存档和通信系统,介绍了一个简单的工作流程,以方便 POCUS 的记录和计费。我们记录了结构化整合前后实施 POCUS 的数量趋势。我们观察到并记录了干预后的线性增长。我们创新的一体化 POCUS 工作流程是记录和结算 POCUS 的有效方法。
{"title":"A Multi-step Integrative Workflow Implementation to Improve Documentation of Point of Care Ultrasound in Medical Intensive Care Unit","authors":"Vishal Deepak, Haroon Ahmed, Joseph Minardi, Sunil Sharma","doi":"10.1007/s10278-024-01260-1","DOIUrl":"https://doi.org/10.1007/s10278-024-01260-1","url":null,"abstract":"<p>Point of care ultrasound (POCUS) provides quick bedside assessment for diagnosing and managing life-threatening conditions in critical care medicine. There has been increasing interest in developing infrastructure to archive images, record clinical interpretation, assess quality, and recoup revenue for POCUS. We present a simple workflow by systems integration of electronic medical record, ultrasound machines, picture archiving, and communication system to facilitate POCUS documentation and billing. We recorded a trend on the number of POCUS performed before and after introduction of the structured integration. We observed and recorded a linear increase over time post-intervention. Our innovative and integrated POCUS workflow is an effective way to document and bill POCUS.</p>","PeriodicalId":50214,"journal":{"name":"Journal of Digital Imaging","volume":"1 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142259289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessment of Age-Related Differences in Lower Leg Muscles Quality Using Radiomic Features of Magnetic Resonance Images 利用磁共振成像的放射学特征评估与年龄相关的小腿肌肉质量差异
IF 4.4 2区 工程技术 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2024-09-16 DOI: 10.1007/s10278-024-01268-7
Takuro Shiiba, Suzumi Mori, Takuya Shimozono, Shuji Ito, Kazuki Takano

Sarcopenia, characterised by a decline in muscle mass and strength, affects the health of the elderly, leading to increased falls, hospitalisation, and mortality rates. Muscle quality, reflecting microscopic and macroscopic muscle changes, is a critical determinant of physical function. To utilise radiomic features extracted from magnetic resonance (MR) images to assess age-related changes in muscle quality, a dataset of 24 adults, divided into older (male/female: 6/6, 66–79 years) and younger (male/female: 6/6, 21–31 years) groups, was used to investigate the radiomics features of the dorsiflexor and plantar flexor muscles of the lower leg that are critical for mobility. MR images were processed using MaZda software for feature extraction. Dimensionality reduction was performed using principal component analysis and recursive feature elimination, followed by classification using machine learning models, such as support vector machine, extreme gradient boosting, and naïve Bayes. A leave-one-out validation test was used to train and test the classifiers, and the area under the receiver operating characteristic curve (AUC) was used to evaluate the classification performance. The analysis revealed that significant differences in radiomic feature distributions were found between age groups, with older adults showing higher complexity and variability in muscle texture. The plantar flexors showed similar or higher AUC than the dorsiflexors in all models. When the dorsiflexor muscles were combined with the plantar flexor muscles, they tended to have a higher AUC than when they were used alone. Radiomic features in lower-leg MR images reflect ageing, especially in the plantar flexor muscles. Radiomic analysis can offer a deeper understanding of age-related muscle quality than traditional muscle mass assessments.

以肌肉质量和力量下降为特征的 "肌肉疏松症 "影响着老年人的健康,导致跌倒率、住院率和死亡率上升。肌肉质量反映了肌肉的微观和宏观变化,是决定身体功能的关键因素。为了利用从磁共振(MR)图像中提取的放射组学特征来评估肌肉质量与年龄有关的变化,研究人员使用了一个由 24 名成年人组成的数据集,分为老年组(男/女:6/6,66-79 岁)和年轻组(男/女:6/6,21-31 岁),研究对行动能力至关重要的小腿背屈肌和跖屈肌的放射组学特征。磁共振图像使用 MaZda 软件进行特征提取处理。使用主成分分析和递归特征消除进行降维,然后使用机器学习模型进行分类,如支持向量机、极梯度提升和天真贝叶斯。在对分类器进行训练和测试时使用了留一验证测试,并使用接收者工作特征曲线下面积(AUC)来评估分类性能。分析结果表明,不同年龄组之间的放射学特征分布存在显著差异,老年人的肌肉纹理复杂性和变异性更高。在所有模型中,跖屈肌的 AUC 值与背屈肌相似或更高。当背屈肌与跖屈肌结合使用时,它们的 AUC 往往高于单独使用时。小腿核磁共振图像中的放射线学特征反映了老化,尤其是跖屈肌的老化。与传统的肌肉质量评估相比,放射线组学分析能更深入地了解与年龄相关的肌肉质量。
{"title":"Assessment of Age-Related Differences in Lower Leg Muscles Quality Using Radiomic Features of Magnetic Resonance Images","authors":"Takuro Shiiba, Suzumi Mori, Takuya Shimozono, Shuji Ito, Kazuki Takano","doi":"10.1007/s10278-024-01268-7","DOIUrl":"https://doi.org/10.1007/s10278-024-01268-7","url":null,"abstract":"<p>Sarcopenia, characterised by a decline in muscle mass and strength, affects the health of the elderly, leading to increased falls, hospitalisation, and mortality rates. Muscle quality, reflecting microscopic and macroscopic muscle changes, is a critical determinant of physical function. To utilise radiomic features extracted from magnetic resonance (MR) images to assess age-related changes in muscle quality, a dataset of 24 adults, divided into older (male/female: 6/6, 66–79 years) and younger (male/female: 6/6, 21–31 years) groups, was used to investigate the radiomics features of the dorsiflexor and plantar flexor muscles of the lower leg that are critical for mobility. MR images were processed using MaZda software for feature extraction. Dimensionality reduction was performed using principal component analysis and recursive feature elimination, followed by classification using machine learning models, such as support vector machine, extreme gradient boosting, and naïve Bayes. A leave-one-out validation test was used to train and test the classifiers, and the area under the receiver operating characteristic curve (AUC) was used to evaluate the classification performance. The analysis revealed that significant differences in radiomic feature distributions were found between age groups, with older adults showing higher complexity and variability in muscle texture. The plantar flexors showed similar or higher AUC than the dorsiflexors in all models. When the dorsiflexor muscles were combined with the plantar flexor muscles, they tended to have a higher AUC than when they were used alone. Radiomic features in lower-leg MR images reflect ageing, especially in the plantar flexor muscles. Radiomic analysis can offer a deeper understanding of age-related muscle quality than traditional muscle mass assessments.</p>","PeriodicalId":50214,"journal":{"name":"Journal of Digital Imaging","volume":"48 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142259286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated ASPECTS Segmentation and Scoring Tool: a Method Tailored for a Colombian Telestroke Network 自动 ASPECTS 分段和评分工具:为哥伦比亚远程中风网络量身定制的方法
IF 4.4 2区 工程技术 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2024-09-16 DOI: 10.1007/s10278-024-01258-9
Esteban Ortiz, Juan Rivera, Manuel Granja, Nelson Agudelo, Marcela Hernández Hoyos, Antonio Salazar

To evaluate our two non-machine learning (non-ML)-based algorithmic approaches for detecting early ischemic infarcts on brain CT images of patients with acute ischemic stroke symptoms, tailored to our local population, to be incorporated in our telestroke software. One-hundred and thirteen acute stroke patients, excluding hemorrhagic, subacute, and chronic patients, with accessible brain CT images were divided into calibration and test sets. The gold standard was determined through consensus among three neuroradiologist. Four neuroradiologist independently reported Alberta Stroke Program Early CT Scores (ASPECTSs). ASPECTSs were also obtained using a commercial ML solution (CMLS), and our two methods, namely the Mean Hounsfield Unit (HU) relative difference (RELDIF) and the density distribution equivalence test (DDET), which used statistical analyze the of the HUs of each region and its contralateral side. Automated segmentation was perfect for cortical regions, while minimal adjustment was required for basal ganglia regions. For dichotomized-ASPECTSs (ASPECTS < 6) in the test set, the area under the receiver operating characteristic curve (AUC) was 0.85 for the DDET method, 0.84 for the RELDIF approach, 0.64 for the CMLS, and ranged from 0.71–0.89 for the neuroradiologist. The accuracy was 0.85 for the DDET method, 0.88 for the RELDIF approach, and was ranged from 0.83 − 0.96 for the neuroradiologist. Equivalence at a margin of 5% was documented among the DDET, RELDIF, and gold standard on mean ASPECTSs. Noninferiority tests of the AUC and accuracy of infarct detection revealed similarities between both DDET and RELDIF, and the CMLS, and with at least one neuroradiologist. The alignment of our methods with the evaluations of neuroradiologist and the CMLS indicates the potential of our methods to serve as supportive tools in clinical settings, facilitating prompt and accurate stroke diagnosis, especially in health care settings, such as Colombia, where neuroradiologist are limited.

评估我们的两种基于非机器学习(non-ML)的算法,用于检测急性缺血性中风症状患者脑部 CT 图像上的早期缺血性梗死。113 名急性中风患者(不包括出血性、亚急性和慢性患者)的脑 CT 图像被分为校准集和测试集。金标准由三位神经放射学专家共同确定。四位神经放射学专家独立报告阿尔伯塔省卒中计划早期 CT 评分(ASPECTS)。ASPECTS 也是通过商业 ML 解决方案 (CMLS) 和我们的两种方法获得的,即平均 Hounsfield 单位 (HU) 相对差值 (RELDIF) 和密度分布等值测试 (DDET),后者用于统计分析每个区域及其对侧的 HU 值。大脑皮层区域的自动分割非常完美,而基底节区域的自动分割只需极少的调整。对于测试集中的二分法-ASPECTS(ASPECTS <6),DDET 方法的接收器操作特征曲线下面积(AUC)为 0.85,RELDIF 方法为 0.84,CMLS 为 0.64,而神经放射科医师的接收器操作特征曲线下面积为 0.71-0.89。DDET 方法的准确度为 0.85,RELDIF 方法为 0.88,神经放射科医生的准确度为 0.83 - 0.96。DDET 法、RELDIF 法和黄金标准在平均 ASPECTS 上的等效性为 5%。对 AUC 和梗死检测准确性的非劣效性测试表明,DDET 和 RELDIF 与 CMLS 以及至少一位神经放射学家的方法相似。我们的方法与神经放射科医生和 CMLS 的评估结果一致,这表明我们的方法有潜力成为临床环境中的辅助工具,促进及时准确的中风诊断,尤其是在哥伦比亚等神经放射科医生有限的医疗环境中。
{"title":"Automated ASPECTS Segmentation and Scoring Tool: a Method Tailored for a Colombian Telestroke Network","authors":"Esteban Ortiz, Juan Rivera, Manuel Granja, Nelson Agudelo, Marcela Hernández Hoyos, Antonio Salazar","doi":"10.1007/s10278-024-01258-9","DOIUrl":"https://doi.org/10.1007/s10278-024-01258-9","url":null,"abstract":"<p>To evaluate our two non-machine learning (non-ML)-based algorithmic approaches for detecting early ischemic infarcts on brain CT images of patients with acute ischemic stroke symptoms, tailored to our local population, to be incorporated in our telestroke software. One-hundred and thirteen acute stroke patients, excluding hemorrhagic, subacute, and chronic patients, with accessible brain CT images were divided into calibration and test sets. The gold standard was determined through consensus among three neuroradiologist. Four neuroradiologist independently reported Alberta Stroke Program Early CT Scores (ASPECTSs). ASPECTSs were also obtained using a commercial ML solution (CMLS), and our two methods, namely the Mean Hounsfield Unit (HU) relative difference (RELDIF) and the density distribution equivalence test (DDET), which used statistical analyze the of the HUs of each region and its contralateral side. Automated segmentation was perfect for cortical regions, while minimal adjustment was required for basal ganglia regions. For dichotomized-ASPECTSs (ASPECTS &lt; 6) in the test set, the area under the receiver operating characteristic curve (AUC) was 0.85 for the DDET method, 0.84 for the RELDIF approach, 0.64 for the CMLS, and ranged from 0.71–0.89 for the neuroradiologist. The accuracy was 0.85 for the DDET method, 0.88 for the RELDIF approach, and was ranged from 0.83 − 0.96 for the neuroradiologist. Equivalence at a margin of 5% was documented among the DDET, RELDIF, and gold standard on mean ASPECTSs. Noninferiority tests of the AUC and accuracy of infarct detection revealed similarities between both DDET and RELDIF, and the CMLS, and with at least one neuroradiologist. The alignment of our methods with the evaluations of neuroradiologist and the CMLS indicates the potential of our methods to serve as supportive tools in clinical settings, facilitating prompt and accurate stroke diagnosis, especially in health care settings, such as Colombia, where neuroradiologist are limited.</p>","PeriodicalId":50214,"journal":{"name":"Journal of Digital Imaging","volume":"55 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142259287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vital Characteristics Cellular Neural Network (VCeNN) for Melanoma Lesion Segmentation: A Biologically Inspired Deep Learning Approach 用于黑色素瘤病灶分割的生命特征细胞神经网络(VCeNN):受生物学启发的深度学习方法
IF 4.4 2区 工程技术 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2024-09-16 DOI: 10.1007/s10278-024-01257-w
Tongxin Yang, Qilin Huang, Fenglin Cai, Jie Li, Li Jiang, Yulong Xia

Cutaneous melanoma is a highly lethal form of cancer. Developing a medical image segmentation model capable of accurately delineating melanoma lesions with high robustness and generalization presents a formidable challenge. This study draws inspiration from cellular functional characteristics and natural selection, proposing a novel medical segmentation model named the vital characteristics cellular neural network. This model incorporates vital characteristics observed in multicellular organisms, including memory, adaptation, apoptosis, and division. Memory module enables the network to rapidly adapt to input data during the early stages of training, accelerating model convergence. Adaptation module allows neurons to select the appropriate activation function based on varying environmental conditions. Apoptosis module reduces the risk of overfitting by pruning neurons with low activation values. Division module enhances the network’s learning capacity by duplicating neurons with high activation values. Experimental evaluations demonstrate the efficacy of this model in enhancing the performance of neural networks for medical image segmentation. The proposed method achieves outstanding results across numerous publicly available datasets, indicating its potential to contribute significantly to the field of medical image analysis and facilitating accurate and efficient segmentation of medical imagery. The proposed method achieves outstanding results across numerous publicly available datasets, with an F1 score of 0.901, Intersection over Union of 0.841, and Dice coefficient of 0.913, indicating its potential to contribute significantly to the field of medical image analysis and facilitating accurate and efficient segmentation of medical imagery.

皮肤黑色素瘤是一种致死率极高的癌症。开发一种能够准确划分黑色素瘤病变、具有高鲁棒性和泛化能力的医学影像分割模型是一项艰巨的挑战。本研究从细胞功能特征和自然选择中汲取灵感,提出了一种名为 "生命特征细胞神经网络 "的新型医学图像分割模型。该模型结合了在多细胞生物体中观察到的重要特征,包括记忆、适应、凋亡和分裂。记忆模块可使网络在训练的早期阶段迅速适应输入数据,从而加速模型的收敛。适应模块允许神经元根据不同的环境条件选择适当的激活函数。凋亡模块通过修剪激活值较低的神经元来降低过度拟合的风险。分裂模块通过复制激活值高的神经元来增强网络的学习能力。实验评估证明了该模型在提高医学图像分割神经网络性能方面的功效。所提出的方法在众多公开数据集上都取得了优异的成绩,这表明它有望为医学图像分析领域做出重大贡献,并促进医学图像的准确、高效分割。所提出的方法在众多公开数据集上取得了优异的成绩,F1 得分为 0.901,Intersection over Union 为 0.841,Dice coefficient 为 0.913,这表明该方法有望在医学图像分析领域做出重大贡献,促进医学图像的准确、高效分割。
{"title":"Vital Characteristics Cellular Neural Network (VCeNN) for Melanoma Lesion Segmentation: A Biologically Inspired Deep Learning Approach","authors":"Tongxin Yang, Qilin Huang, Fenglin Cai, Jie Li, Li Jiang, Yulong Xia","doi":"10.1007/s10278-024-01257-w","DOIUrl":"https://doi.org/10.1007/s10278-024-01257-w","url":null,"abstract":"<p>Cutaneous melanoma is a highly lethal form of cancer. Developing a medical image segmentation model capable of accurately delineating melanoma lesions with high robustness and generalization presents a formidable challenge. This study draws inspiration from cellular functional characteristics and natural selection, proposing a novel medical segmentation model named the vital characteristics cellular neural network. This model incorporates vital characteristics observed in multicellular organisms, including memory, adaptation, apoptosis, and division. Memory module enables the network to rapidly adapt to input data during the early stages of training, accelerating model convergence. Adaptation module allows neurons to select the appropriate activation function based on varying environmental conditions. Apoptosis module reduces the risk of overfitting by pruning neurons with low activation values. Division module enhances the network’s learning capacity by duplicating neurons with high activation values. Experimental evaluations demonstrate the efficacy of this model in enhancing the performance of neural networks for medical image segmentation. The proposed method achieves outstanding results across numerous publicly available datasets, indicating its potential to contribute significantly to the field of medical image analysis and facilitating accurate and efficient segmentation of medical imagery. The proposed method achieves outstanding results across numerous publicly available datasets, with an F1 score of 0.901, Intersection over Union of 0.841, and Dice coefficient of 0.913, indicating its potential to contribute significantly to the field of medical image analysis and facilitating accurate and efficient segmentation of medical imagery.</p>","PeriodicalId":50214,"journal":{"name":"Journal of Digital Imaging","volume":"55 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142259284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Septic Arthritis Modeling Using Sonographic Fusion with Attention and Selective Transformation: a Preliminary Study 利用声像图融合注意和选择性变换建立化脓性关节炎模型:初步研究
IF 4.4 2区 工程技术 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2024-09-16 DOI: 10.1007/s10278-024-01259-8
Chung-Ming Lo, Kuo-Lung Lai

Conventionally diagnosing septic arthritis relies on detecting the causal pathogens in samples of synovial fluid, synovium, or blood. However, isolating these pathogens through cultures takes several days, thus delaying both diagnosis and treatment. Establishing a quantitative classification model from ultrasound images for rapid septic arthritis diagnosis is mandatory. For the study, a database composed of 342 images of non-septic arthritis and 168 images of septic arthritis produced by grayscale (GS) and power Doppler (PD) ultrasound was constructed. In the proposed architecture of fusion with attention and selective transformation (FAST), both groups of images were combined in a vision transformer (ViT) with the convolutional block attention module, which incorporates spatial, modality, and channel features. Fivefold cross-validation was applied to evaluate the generalized ability. The FAST architecture achieved the accuracy, sensitivity, specificity, and area under the curve (AUC) of 86.33%, 80.66%, 90.25%, and 0.92, respectively. These performances were higher than using conventional ViT (82.14%) and significantly better than using one modality alone (GS 73.88%, PD 72.02%), with the p-value being less than 0.01. Through the integration of multi-modality and the extraction of multiple channel features, the established model provided promising accuracy and AUC in septic arthritis classification. The end-to-end learning of ultrasound features can provide both rapid and objective assessment suggestions for future clinic use.

化脓性关节炎的传统诊断方法是在滑液、滑膜或血液样本中检测致病病原体。然而,通过培养分离这些病原体需要数天时间,从而延误了诊断和治疗。因此,必须从超声图像中建立一个定量分类模型,以快速诊断化脓性关节炎。为进行这项研究,我们建立了一个数据库,其中包括由灰度(GS)和功率多普勒(PD)超声波生成的 342 幅非化脓性关节炎图像和 168 幅化脓性关节炎图像。在所提出的注意力和选择性变换融合(FAST)架构中,两组图像都在视觉变换器(ViT)中与卷积块注意力模块相结合,后者结合了空间、模式和通道特征。五重交叉验证用于评估泛化能力。FAST 架构的准确率、灵敏度、特异性和曲线下面积(AUC)分别达到了 86.33%、80.66%、90.25% 和 0.92。这些性能均高于传统 ViT(82.14%),明显优于单独使用一种模式(GS 73.88%,PD 72.02%),P 值小于 0.01。通过多模态整合和多通道特征提取,所建立的模型在脓毒性关节炎分类中提供了良好的准确性和AUC。对超声特征的端到端学习可为今后的临床应用提供快速、客观的评估建议。
{"title":"Septic Arthritis Modeling Using Sonographic Fusion with Attention and Selective Transformation: a Preliminary Study","authors":"Chung-Ming Lo, Kuo-Lung Lai","doi":"10.1007/s10278-024-01259-8","DOIUrl":"https://doi.org/10.1007/s10278-024-01259-8","url":null,"abstract":"<p>Conventionally diagnosing septic arthritis relies on detecting the causal pathogens in samples of synovial fluid, synovium, or blood. However, isolating these pathogens through cultures takes several days, thus delaying both diagnosis and treatment. Establishing a quantitative classification model from ultrasound images for rapid septic arthritis diagnosis is mandatory. For the study, a database composed of 342 images of non-septic arthritis and 168 images of septic arthritis produced by grayscale (GS) and power Doppler (PD) ultrasound was constructed. In the proposed architecture of fusion with attention and selective transformation (FAST), both groups of images were combined in a vision transformer (ViT) with the convolutional block attention module, which incorporates spatial, modality, and channel features. Fivefold cross-validation was applied to evaluate the generalized ability. The FAST architecture achieved the accuracy, sensitivity, specificity, and area under the curve (AUC) of 86.33%, 80.66%, 90.25%, and 0.92, respectively. These performances were higher than using conventional ViT (82.14%) and significantly better than using one modality alone (GS 73.88%, PD 72.02%), with the <i>p</i>-value being less than 0.01. Through the integration of multi-modality and the extraction of multiple channel features, the established model provided promising accuracy and AUC in septic arthritis classification. The end-to-end learning of ultrasound features can provide both rapid and objective assessment suggestions for future clinic use.</p>","PeriodicalId":50214,"journal":{"name":"Journal of Digital Imaging","volume":"188 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142259285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigation of ComBat Harmonization on Radiomic and Deep Features from Multi-Center Abdominal MRI Data 从多中心腹部磁共振成像数据中研究 ComBat 对放射组学和深度特征的协调性
IF 4.4 2区 工程技术 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2024-09-16 DOI: 10.1007/s10278-024-01253-0
Wei Jia, Hailong Li, Redha Ali, Krishna P. Shanbhogue, William R. Masch, Anum Aslam, David T. Harris, Scott B. Reeder, Jonathan R. Dillman, Lili He

ComBat harmonization has been developed to remove non-biological variations for data in multi-center research applying artificial intelligence (AI). We investigated the effectiveness of ComBat harmonization on radiomic and deep features extracted from large, multi-center abdominal MRI data. A retrospective study was conducted on T2-weighted (T2W) abdominal MRI data retrieved from individual patients with suspected or known chronic liver disease at three study sites. MRI data were acquired using systems from three manufacturers and two field strengths. Radiomic features and deep features were extracted using the PyRadiomics pipeline and a Swin Transformer. ComBat was used to harmonize radiomic and deep features across different manufacturers and field strengths. Student’s t-test, ANOVA test, and Cohen’s F score were applied to assess the difference in individual features before and after ComBat harmonization. Between two field strengths, 76.7%, 52.9%, and 26.7% of radiomic features, and 89.0%, 56.5%, and 0.1% of deep features from three manufacturers were significantly different. Among the three manufacturers, 90.1% and 75.0% of radiomic features and 89.3% and 84.1% of deep features from two field strengths were significantly different. After ComBat harmonization, there were no significant differences in radiomic and deep features among manufacturers or field strengths based on t-tests or ANOVA tests. Reduced Cohen’s F scores were consistently observed after ComBat harmonization. ComBat harmonization effectively harmonizes radiomic and deep features by removing the non-biological variations due to system manufacturers and/or field strengths in large multi-center clinical abdominal MRI datasets.

ComBat 协调技术的开发是为了在应用人工智能(AI)的多中心研究中消除数据的非生物变异。我们研究了 ComBat 协调对从大型多中心腹部 MRI 数据中提取的放射学和深度特征的有效性。我们在三个研究机构对疑似或已知慢性肝病患者的 T2 加权(T2W)腹部 MRI 数据进行了回顾性研究。磁共振成像数据是使用三家制造商的系统和两种场强采集的。使用 PyRadiomics 管道和 Swin Transformer 提取了放射组学特征和深度特征。ComBat 用于协调不同制造商和不同场强的放射组学特征和深度特征。采用学生 t 检验、方差分析检验和 Cohen's F 评分来评估 ComBat 协调前后单个特征的差异。在两种场强之间,三家制造商分别有 76.7%、52.9% 和 26.7% 的放射学特征和 89.0%、56.5% 和 0.1% 的深部特征存在显著差异。在三家制造商中,两种场强的放射性原子特征分别为 90.1%和 75.0%,深层特征分别为 89.3%和 84.1%,差异显著。在 ComBat 协调后,根据 t 检验或方差分析检验,不同制造商或不同场强的放射性体征和深部特征没有明显差异。ComBat 协调后,Cohen's F 分数持续降低。在大型多中心临床腹部 MRI 数据集中,ComBat 协调通过消除系统制造商和/或磁场强度造成的非生物学差异,有效地协调了放射学和深部特征。
{"title":"Investigation of ComBat Harmonization on Radiomic and Deep Features from Multi-Center Abdominal MRI Data","authors":"Wei Jia, Hailong Li, Redha Ali, Krishna P. Shanbhogue, William R. Masch, Anum Aslam, David T. Harris, Scott B. Reeder, Jonathan R. Dillman, Lili He","doi":"10.1007/s10278-024-01253-0","DOIUrl":"https://doi.org/10.1007/s10278-024-01253-0","url":null,"abstract":"<p>ComBat harmonization has been developed to remove non-biological variations for data in multi-center research applying artificial intelligence (AI). We investigated the effectiveness of ComBat harmonization on radiomic and deep features extracted from large, multi-center abdominal MRI data. A retrospective study was conducted on T2-weighted (T2W) abdominal MRI data retrieved from individual patients with suspected or known chronic liver disease at three study sites. MRI data were acquired using systems from three manufacturers and two field strengths. Radiomic features and deep features were extracted using the PyRadiomics pipeline and a Swin Transformer. ComBat was used to harmonize radiomic and deep features across different manufacturers and field strengths. Student’s <i>t</i>-test, ANOVA test, and Cohen’s F score were applied to assess the difference in individual features before and after ComBat harmonization. Between two field strengths, 76.7%, 52.9%, and 26.7% of radiomic features, and 89.0%, 56.5%, and 0.1% of deep features from three manufacturers were significantly different. Among the three manufacturers, 90.1% and 75.0% of radiomic features and 89.3% and 84.1% of deep features from two field strengths were significantly different. After ComBat harmonization, there were no significant differences in radiomic and deep features among manufacturers or field strengths based on <i>t-</i>tests or ANOVA tests. Reduced Cohen’s F scores were consistently observed after ComBat harmonization. ComBat harmonization effectively harmonizes radiomic and deep features by removing the non-biological variations due to system manufacturers and/or field strengths in large multi-center clinical abdominal MRI datasets.</p>","PeriodicalId":50214,"journal":{"name":"Journal of Digital Imaging","volume":"55 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142259318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Out-of-Distribution Detection and Radiological Data Monitoring Using Statistical Process Control 利用统计过程控制进行配送外检测和辐射数据监测
IF 4.4 2区 工程技术 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2024-09-16 DOI: 10.1007/s10278-024-01212-9
Ghada Zamzmi, Kesavan Venkatesh, Brandon Nelson, Smriti Prathapan, Paul Yi, Berkman Sahiner, Jana G. Delfino

Machine learning (ML) models often fail with data that deviates from their training distribution. This is a significant concern for ML-enabled devices as data drift may lead to unexpected performance. This work introduces a new framework for out of distribution (OOD) detection and data drift monitoring that combines ML and geometric methods with statistical process control (SPC). We investigated different design choices, including methods for extracting feature representations and drift quantification for OOD detection in individual images and as an approach for input data monitoring. We evaluated the framework for both identifying OOD images and demonstrating the ability to detect shifts in data streams over time. We demonstrated a proof-of-concept via the following tasks: 1) differentiating axial vs. non-axial CT images, 2) differentiating CXR vs. other radiographic imaging modalities, and 3) differentiating adult CXR vs. pediatric CXR. For the identification of individual OOD images, our framework achieved high sensitivity in detecting OOD inputs: 0.980 in CT, 0.984 in CXR, and 0.854 in pediatric CXR. Our framework is also adept at monitoring data streams and identifying the time a drift occurred. In our simulations tracking drift over time, it effectively detected a shift from CXR to non-CXR instantly, a transition from axial to non-axial CT within few days, and a drift from adult to pediatric CXRs within a day—all while maintaining a low false positive rate. Through additional experiments, we demonstrate the framework is modality-agnostic and independent from the underlying model structure, making it highly customizable for specific applications and broadly applicable across different imaging modalities and deployed ML models.

机器学习 (ML) 模型在数据偏离其训练分布时经常会失败。这对于支持 ML 的设备来说是一个重大问题,因为数据漂移可能会导致意外的性能。这项工作引入了一个新的框架,用于偏离分布 (OOD) 检测和数据漂移监控,该框架将 ML 和几何方法与统计过程控制 (SPC) 相结合。我们研究了不同的设计选择,包括提取特征表示和漂移量化的方法,用于单个图像中的 OOD 检测,以及作为输入数据监控的一种方法。我们评估了识别 OOD 图像的框架,并展示了检测数据流随时间变化的能力。我们通过以下任务演示了概念验证:1)区分轴向与非轴向 CT 图像;2)区分 CXR 与其他放射成像模式;3)区分成人 CXR 与儿童 CXR。对于单个 OOD 图像的识别,我们的框架在检测 OOD 输入方面达到了很高的灵敏度:CT 为 0.980,CXR 为 0.984,儿科 CXR 为 0.854。我们的框架还善于监控数据流并识别漂移发生的时间。在我们跟踪随时间漂移的模拟中,它能有效地即时检测到从 CXR 到非 CXR 的转变,在几天内检测到从轴向 CT 到非轴向 CT 的转变,在一天内检测到从成人 CXR 到儿童 CXR 的漂移,同时保持较低的误报率。通过更多实验,我们证明了该框架与成像模式无关,也与底层模型结构无关,因此可针对特定应用进行高度定制,并广泛适用于不同成像模式和已部署的 ML 模型。
{"title":"Out-of-Distribution Detection and Radiological Data Monitoring Using Statistical Process Control","authors":"Ghada Zamzmi, Kesavan Venkatesh, Brandon Nelson, Smriti Prathapan, Paul Yi, Berkman Sahiner, Jana G. Delfino","doi":"10.1007/s10278-024-01212-9","DOIUrl":"https://doi.org/10.1007/s10278-024-01212-9","url":null,"abstract":"<p>Machine learning (ML) models often fail with data that deviates from their training distribution. This is a significant concern for ML-enabled devices as data drift may lead to unexpected performance. This work introduces a new framework for out of distribution (OOD) detection and data drift monitoring that combines ML and geometric methods with statistical process control (SPC). We investigated different design choices, including methods for extracting feature representations and drift quantification for OOD detection in individual images and as an approach for input data monitoring. We evaluated the framework for both identifying OOD images and demonstrating the ability to detect shifts in data streams over time. We demonstrated a proof-of-concept via the following tasks: 1) differentiating axial vs. non-axial CT images, 2) differentiating CXR vs. other radiographic imaging modalities, and 3) differentiating adult CXR vs. pediatric CXR. For the identification of individual OOD images, our framework achieved high sensitivity in detecting OOD inputs: 0.980 in CT, 0.984 in CXR, and 0.854 in pediatric CXR. Our framework is also adept at monitoring data streams and identifying the time a drift occurred. In our simulations tracking drift over time, it effectively detected a shift from CXR to non-CXR instantly, a transition from axial to non-axial CT within few days, and a drift from adult to pediatric CXRs within a day—all while maintaining a low false positive rate. Through additional experiments, we demonstrate the framework is modality-agnostic and independent from the underlying model structure, making it highly customizable for specific applications and broadly applicable across different imaging modalities and deployed ML models.</p>","PeriodicalId":50214,"journal":{"name":"Journal of Digital Imaging","volume":"37 9 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142259288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning for Automated Classification of Hip Hardware on Radiographs 利用深度学习对 X 光片上的髋关节硬件进行自动分类
IF 4.4 2区 工程技术 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2024-09-12 DOI: 10.1007/s10278-024-01263-y
Yuntong Ma, Justin L. Bauer, Acacia H. Yoon, Christopher F. Beaulieu, Luke Yoon, Bao H. Do, Charles X. Fang

Purpose: To develop a deep learning model for automated classification of orthopedic hardware on pelvic and hip radiographs, which can be clinically implemented to decrease radiologist workload and improve consistency among radiology reports.

Materials and Methods: Pelvic and hip radiographs from 4279 studies in 1073 patients were retrospectively obtained and reviewed by musculoskeletal radiologists. Two convolutional neural networks, EfficientNet-B4 and NFNet-F3, were trained to perform the image classification task into the following most represented categories: no hardware, total hip arthroplasty (THA), hemiarthroplasty, intramedullary nail, femoral neck cannulated screws, dynamic hip screw, lateral blade/plate, THA with additional femoral fixation, and post-infectious hip. Model performance was assessed on an independent test set of 851 studies from 262 patients and compared to individual performance of five subspecialty-trained radiologists using leave-one-out analysis against an aggregate gold standard label.

Results: For multiclass classification, the area under the receiver operating characteristic curve (AUC) for NFNet-F3 was 0.99 or greater for all classes, and EfficientNet-B4 0.99 or greater for all classes except post-infectious hip, with an AUC of 0.97. When compared with human observers, models achieved an accuracy of 97%, which is non-inferior to four out of five radiologists and outperformed one radiologist. Cohen’s kappa coefficient for both models ranged from 0.96 to 0.97, indicating excellent inter-reader agreement.

Conclusion: A deep learning model can be used to classify a range of orthopedic hip hardware with high accuracy and comparable performance to subspecialty-trained radiologists.

目的:开发一种深度学习模型,用于对骨盆和髋关节X光片上的矫形硬件进行自动分类,该模型可在临床上实施,以减少放射科医生的工作量并提高放射学报告的一致性:肌肉骨骼放射科医生回顾性地获取并审查了 1073 名患者的 4279 张骨盆和髋关节 X 光片。对两个卷积神经网络(EfficientNet-B4 和 NFNet-F3)进行了训练,以便将图像分类为以下最具代表性的类别:无硬件、全髋关节置换术(THA)、半髋关节置换术、髓内钉、股骨颈套管螺钉、动态髋关节螺钉、外侧刀片/钢板、带有额外股骨固定的全髋关节置换术以及感染后髋关节。对来自 262 名患者的 851 项研究的独立测试集进行了模型性能评估,并使用留一分析法将其与五名接受过亚专业培训的放射科医生的个人性能进行了比较:在多类分类中,NFNet-F3 的接收器工作特征曲线下面积 (AUC) 在所有类别中均达到或超过 0.99,EfficientNet-B4 的接收器工作特征曲线下面积 (AUC) 在所有类别中均达到或超过 0.99,感染后髋关节除外,其 AUC 为 0.97。与人类观察者相比,模型的准确率达到了 97%,不逊于五分之四的放射科医生,也优于一位放射科医生。两个模型的科恩卡帕系数在 0.96 到 0.97 之间,表明阅读者之间的一致性非常好:结论:深度学习模型可用于对一系列骨科髋关节硬件进行分类,准确率高,性能可与经过亚专业培训的放射科医生媲美。
{"title":"Deep Learning for Automated Classification of Hip Hardware on Radiographs","authors":"Yuntong Ma, Justin L. Bauer, Acacia H. Yoon, Christopher F. Beaulieu, Luke Yoon, Bao H. Do, Charles X. Fang","doi":"10.1007/s10278-024-01263-y","DOIUrl":"https://doi.org/10.1007/s10278-024-01263-y","url":null,"abstract":"<p>Purpose: To develop a deep learning model for automated classification of orthopedic hardware on pelvic and hip radiographs, which can be clinically implemented to decrease radiologist workload and improve consistency among radiology reports.</p><p>Materials and Methods: Pelvic and hip radiographs from 4279 studies in 1073 patients were retrospectively obtained and reviewed by musculoskeletal radiologists. Two convolutional neural networks, EfficientNet-B4 and NFNet-F3, were trained to perform the image classification task into the following most represented categories: no hardware, total hip arthroplasty (THA), hemiarthroplasty, intramedullary nail, femoral neck cannulated screws, dynamic hip screw, lateral blade/plate, THA with additional femoral fixation, and post-infectious hip. Model performance was assessed on an independent test set of 851 studies from 262 patients and compared to individual performance of five subspecialty-trained radiologists using leave-one-out analysis against an aggregate gold standard label.</p><p>Results: For multiclass classification, the area under the receiver operating characteristic curve (AUC) for NFNet-F3 was 0.99 or greater for all classes, and EfficientNet-B4 0.99 or greater for all classes except post-infectious hip, with an AUC of 0.97. When compared with human observers, models achieved an accuracy of 97%, which is non-inferior to four out of five radiologists and outperformed one radiologist. Cohen’s kappa coefficient for both models ranged from 0.96 to 0.97, indicating excellent inter-reader agreement.</p><p>Conclusion: A deep learning model can be used to classify a range of orthopedic hip hardware with high accuracy and comparable performance to subspecialty-trained radiologists.</p>","PeriodicalId":50214,"journal":{"name":"Journal of Digital Imaging","volume":"158 1","pages":""},"PeriodicalIF":4.4,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142216217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Digital Imaging
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1