Assistive AI in Lung Cancer Screening: A Retrospective Multinational Study in the United States and Japan.
Atilla P Kiraly, Corbin A Cunningham, Ryan Najafi, Zaid Nabulsi, Jie Yang, Charles Lau, Joseph R Ledsam, Wenxing Ye, Diego Ardila, Scott M McKinney, Rory Pilgrim, Yun Liu, Hiroaki Saito, Yasuteru Shimamura, Mozziyar Etemadi, David Melnick, Sunny Jansen, Greg S Corrado, Lily Peng, Daniel Tse, Shravya Shetty, Shruthi Prabhakara, David P Naidich, Neeral Beladia, Krish Eswaran
Radiology: Artificial Intelligence, e230079 (May 2024). DOI: 10.1148/ryai.230079
Purpose To evaluate the impact of an artificial intelligence (AI) assistant for lung cancer screening on multinational clinical workflows. Materials and Methods An AI assistant for lung cancer screening was evaluated in two retrospective randomized multireader multicase studies in which 627 low-dose chest CT cases (141 cancer positive) were each read twice (with and without AI assistance) by experienced thoracic radiologists (six U.S.-based or six Japan-based radiologists), for a total of 7524 interpretations. Positive cases were defined as those within 2 years before a pathology-confirmed lung cancer diagnosis. Negative cases were defined as those without any subsequent cancer diagnosis for at least 2 years and were enriched for a spectrum of diverse nodules. The studies measured the readers' level of suspicion (on a 0-100 scale), country-specific screening system scoring categories, and management recommendations. Evaluation metrics included the area under the receiver operating characteristic curve (AUC) for level of suspicion and the sensitivity and specificity of recall recommendations. Results With AI assistance, the radiologists' AUC increased by 0.023 (0.70 to 0.72; P = .02) in the U.S. study and by 0.023 (0.93 to 0.96; P = .18) in the Japan study. Scoring system specificity for actionable findings increased 5.5% (57% to 63%; P < .001) in the U.S. study and 6.7% (23% to 30%; P < .001) in the Japan study. There was no evidence of a difference in corresponding sensitivity between unassisted and AI-assisted reads in the U.S. (67.3% to 67.5%; P = .88) and Japan (98% to 100%; P > .99) studies. Stand-alone AI system AUC was 0.75 (95% CI: 0.70, 0.81) and 0.88 (95% CI: 0.78, 0.97) for the U.S.- and Japan-based datasets, respectively. Conclusion The concurrent AI interface improved lung cancer screening specificity in both U.S.- and Japan-based reader studies, meriting further study in additional international screening environments. Keywords: Assistive Artificial Intelligence, Lung Cancer Screening, CT. Supplemental material is available for this article. Published under a CC BY 4.0 license.
{"title":"2023 Manuscript Reviewers: A Note of Thanks.","authors":"Curtis P Langlotz, Charles E Kahn","doi":"10.1148/ryai.240138","DOIUrl":"10.1148/ryai.240138","url":null,"abstract":"","PeriodicalId":29787,"journal":{"name":"Radiology-Artificial Intelligence","volume":"6 2","pages":"e240138"},"PeriodicalIF":9.8,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10982905/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140294780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Curated and Annotated Dataset of Lung US Images in Zambian Children with Clinical Pneumonia.
Lauren Etter, Margrit Betke, Ingrid Y Camelo, Christopher J Gill, Rachel Pieciak, Russell Thompson, Libertario Demi, Umair Khan, Alyse Wheelock, Janet Katanga, Bindu N Setty, Ilse Castro-Aragon
Radiology: Artificial Intelligence, e230147 (March 2024). DOI: 10.1148/ryai.230147
See also the commentary by Sitek in this issue. Supplemental material is available for this article.
Generative Large Language Models for Detection of Speech Recognition Errors in Radiology Reports.
Reuben A Schmidt, Jarrel C Y Seah, Ke Cao, Lincoln Lim, Wei Lim, Justin Yeung
Radiology: Artificial Intelligence, e230205 (March 2024). DOI: 10.1148/ryai.230205
This study evaluated the ability of generative large language models (LLMs) to detect speech recognition errors in radiology reports. A dataset of 3233 CT and MRI reports was assessed by radiologists for speech recognition errors, which were categorized as clinically significant or not clinically significant. The performance of five generative LLMs (GPT-3.5-turbo, GPT-4, text-davinci-003, Llama-v2-70B-chat, and Bard) in detecting these errors was compared, with manual error detection as the reference standard. Prompt engineering was used to optimize model performance. GPT-4 demonstrated high accuracy in detecting clinically significant errors (precision, 76.9%; recall, 100%; F1 score, 86.9%) and errors that were not clinically significant (precision, 93.9%; recall, 94.7%; F1 score, 94.3%). Text-davinci-003 achieved F1 scores of 72% and 46.6% for clinically significant and not clinically significant errors, respectively; GPT-3.5-turbo obtained F1 scores of 59.1% and 32.2%, and Llama-v2-70B-chat scored 72.8% and 47.7%. Bard showed the lowest accuracy, with F1 scores of 47.5% and 20.9%. GPT-4 effectively identified challenging errors such as nonsense phrases and internally inconsistent statements. Longer reports, resident dictation, and overnight shifts were associated with higher error rates. In conclusion, advanced generative LLMs show potential for automatic detection of speech recognition errors in radiology reports. Keywords: CT, Large Language Model, Machine Learning, MRI, Natural Language Processing, Radiology Reports, Speech, Unsupervised Learning. Supplemental material is available for this article.
Multicenter Evaluation of a Weakly Supervised Deep Learning Model for Lymph Node Diagnosis in Rectal Cancer at MRI.
Wei Xia, Dandan Li, Wenguang He, Perry J Pickhardt, Junming Jian, Rui Zhang, Junjie Zhang, Ruirui Song, Tong Tong, Xiaotang Yang, Xin Gao, Yanfen Cui
Radiology: Artificial Intelligence, e230152 (March 2024). DOI: 10.1148/ryai.230152
Purpose To develop a Weakly supervISed model DevelOpment fraMework (WISDOM) for constructing a lymph node (LN) diagnosis model for patients with rectal cancer (RC) that uses preoperative MRI data coupled with postoperative patient-level pathologic information. Materials and Methods In this retrospective study, the WISDOM model was built using MRI (T2-weighted and diffusion-weighted imaging) and patient-level pathologic information (the number of postoperatively confirmed metastatic LNs and resected LNs), based on data from patients with RC treated between January 2016 and November 2017. The incremental value of the model in assisting radiologists was investigated. Performance in binary and ternary N staging was evaluated using the area under the receiver operating characteristic curve (AUC) and the concordance index (C index), respectively. Results A total of 1014 patients (median age, 62 years; IQR, 54-68 years; 590 male) were analyzed, comprising a training cohort (n = 589) and an internal test cohort (n = 146) from center 1 and two external test cohorts (n = 117 and n = 162) from centers 2 and 3. The WISDOM model yielded an overall AUC of 0.81 and a C index of 0.765, significantly outperforming junior radiologists (AUC = 0.69, P < .001; C index = 0.689, P < .001) and performing comparably with senior radiologists (AUC = 0.79, P = .21; C index = 0.788, P = .22). Moreover, the model significantly improved the performance of junior radiologists (AUC = 0.80, P < .001; C index = 0.798, P < .001) and senior radiologists (AUC = 0.88, P < .001; C index = 0.869, P < .001). Conclusion This study demonstrates the potential of WISDOM as a useful LN diagnosis method using routine rectal MRI data. The improved radiologist performance observed with model assistance highlights the potential clinical utility of WISDOM in practice. Keywords: MR Imaging, Abdomen/GI, Rectum, Computer Applications-Detection/Diagnosis. Supplemental material is available for this article. Published under a CC BY 4.0 license.
Olivia Prior, Carlos Macarro, Víctor Navarro, Camilo Monreal, Marta Ligero, Alonso Garcia-Ruiz, Garazi Serna, Sara Simonetti, Irene Braña, Maria Vieito, Manuel Escobar, Jaume Capdevila, Annette T Byrne, Rodrigo Dienstmann, Rodrigo Toledo, Paolo Nuciforo, Elena Garralda, Francesco Grussu, Kinga Bernatowicz, Raquel Perez-Lopez
{"title":"Artificial Intelligence in Radiology: Bridging Global Health Care Gaps through Innovation and Inclusion.","authors":"Arkadiusz Sitek","doi":"10.1148/ryai.240093","DOIUrl":"10.1148/ryai.240093","url":null,"abstract":"","PeriodicalId":29787,"journal":{"name":"Radiology-Artificial Intelligence","volume":"6 2","pages":"e240093"},"PeriodicalIF":9.8,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10982909/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140111551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Can AI Predict the Need for Surgery in Traumatic Brain Injury?","authors":"Sven Haller","doi":"10.1148/ryai.230587","DOIUrl":"10.1148/ryai.230587","url":null,"abstract":"","PeriodicalId":29787,"journal":{"name":"Radiology-Artificial Intelligence","volume":"6 2","pages":"e230587"},"PeriodicalIF":9.8,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10982907/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139730559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}