A Progressive and Cross-Domain Deep Transfer Learning Framework for Wrist Fracture Detection

IF 3.3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Journal of Artificial Intelligence and Soft Computing Research Pub Date : 2021-04-01 DOI:10.2478/jaiscr-2022-0007
Christophe Karam, Julia El Zini, M. Awad, C. Saade, L. Naffaa, Mohammad Ali K. El Amine
{"title":"A Progressive and Cross-Domain Deep Transfer Learning Framework for Wrist Fracture Detection","authors":"Christophe Karam, Julia El Zini, M. Awad, C. Saade, L. Naffaa, Mohammad Ali K. El Amine","doi":"10.2478/jaiscr-2022-0007","DOIUrl":null,"url":null,"abstract":"Abstract There has been an amplified focus on and benefit from the adoption of artificial intelligence (AI) in medical imaging applications. However, deep learning approaches involve training with massive amounts of annotated data in order to guarantee generalization and achieve high accuracies. Gathering and annotating large sets of training images require expertise which is both expensive and time-consuming, especially in the medical field. Furthermore, in health care systems where mistakes can have catastrophic consequences, there is a general mistrust in the black-box aspect of AI models. In this work, we focus on improving the performance of medical imaging applications when limited data is available while focusing on the interpretability aspect of the proposed AI model. This is achieved by employing a novel transfer learning framework, progressive transfer learning, an automated annotation technique and a correlation analysis experiment on the learned representations. Progressive transfer learning helps jump-start the training of deep neural networks while improving the performance by gradually transferring knowledge from two source tasks into the target task. It is empirically tested on the wrist fracture detection application by first training a general radiology network RadiNet and using its weights to initialize RadiNetwrist, that is trained on wrist images to detect fractures. Experiments show that RadiNetwrist achieves an accuracy of 87% and an AUC ROC of 94% as opposed to 83% and 92% when it is pre-trained on the ImageNet dataset. This improvement in performance is investigated within an explainable AI framework. More concretely, the learned deep representations of RadiNetwrist are compared to those learned by the baseline model by conducting a correlation analysis experiment. The results show that, when transfer learning is gradually applied, some features are learned earlier in the network. Moreover, the deep layers in the progressive transfer learning framework are shown to encode features that are not encountered when traditional transfer learning techniques are applied. In addition to the empirical results, a clinical study is conducted and the performance of RadiNetwrist is compared to that of an expert radiologist. We found that RadiNetwrist exhibited similar performance to that of radiologists with more than 20 years of experience. This motivates follow-up research to train on more data to feasibly surpass radiologists’ performance, and investigate the interpretability of AI models in the healthcare domain where the decision-making process needs to be credible and transparent.","PeriodicalId":48494,"journal":{"name":"Journal of Artificial Intelligence and Soft Computing Research","volume":"12 1","pages":"101 - 120"},"PeriodicalIF":3.3000,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Artificial Intelligence and Soft Computing Research","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.2478/jaiscr-2022-0007","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 3

Abstract

Abstract There has been an amplified focus on and benefit from the adoption of artificial intelligence (AI) in medical imaging applications. However, deep learning approaches involve training with massive amounts of annotated data in order to guarantee generalization and achieve high accuracies. Gathering and annotating large sets of training images require expertise which is both expensive and time-consuming, especially in the medical field. Furthermore, in health care systems where mistakes can have catastrophic consequences, there is a general mistrust in the black-box aspect of AI models. In this work, we focus on improving the performance of medical imaging applications when limited data is available while focusing on the interpretability aspect of the proposed AI model. This is achieved by employing a novel transfer learning framework, progressive transfer learning, an automated annotation technique and a correlation analysis experiment on the learned representations. Progressive transfer learning helps jump-start the training of deep neural networks while improving the performance by gradually transferring knowledge from two source tasks into the target task. It is empirically tested on the wrist fracture detection application by first training a general radiology network RadiNet and using its weights to initialize RadiNetwrist, that is trained on wrist images to detect fractures. Experiments show that RadiNetwrist achieves an accuracy of 87% and an AUC ROC of 94% as opposed to 83% and 92% when it is pre-trained on the ImageNet dataset. This improvement in performance is investigated within an explainable AI framework. More concretely, the learned deep representations of RadiNetwrist are compared to those learned by the baseline model by conducting a correlation analysis experiment. The results show that, when transfer learning is gradually applied, some features are learned earlier in the network. Moreover, the deep layers in the progressive transfer learning framework are shown to encode features that are not encountered when traditional transfer learning techniques are applied. In addition to the empirical results, a clinical study is conducted and the performance of RadiNetwrist is compared to that of an expert radiologist. We found that RadiNetwrist exhibited similar performance to that of radiologists with more than 20 years of experience. This motivates follow-up research to train on more data to feasibly surpass radiologists’ performance, and investigate the interpretability of AI models in the healthcare domain where the decision-making process needs to be credible and transparent.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
腕部骨折检测的渐进式跨域深度迁移学习框架
人工智能(AI)在医学成像应用中的应用得到了越来越多的关注和受益。然而,深度学习方法需要使用大量带注释的数据进行训练,以保证泛化和实现高精度。收集和注释大量训练图像集需要专业知识,这既昂贵又耗时,特别是在医学领域。此外,在医疗保健系统中,错误可能会造成灾难性的后果,人们普遍不信任人工智能模型的黑箱方面。在这项工作中,我们专注于在可用数据有限的情况下提高医学成像应用程序的性能,同时关注所提出的AI模型的可解释性方面。这是通过采用一种新的迁移学习框架、渐进式迁移学习、自动标注技术和对学习表征的相关性分析实验来实现的。渐进式迁移学习通过将两个源任务之间的知识逐步迁移到目标任务中来提高深度神经网络的性能,从而帮助深度神经网络快速启动训练。首先训练一个通用的放射学网络RadiNet,利用其权值初始化radinetw腕部图像,对该方法在腕部骨折检测中的应用进行了实证检验。实验表明,RadiNetwrist的准确率为87%,AUC ROC为94%,而在ImageNet数据集上进行预训练时,准确率为83%,ROC为92%。这种性能的改进是在一个可解释的人工智能框架内进行研究的。具体来说,通过相关分析实验,将学习到的radinetw腕上的深度表征与基线模型学习到的深度表征进行比较。结果表明,随着迁移学习的逐步应用,网络中的一些特征学习得更早。此外,渐进式迁移学习框架中的深层被证明编码了传统迁移学习技术应用时不会遇到的特征。除了实证结果外,还进行了临床研究,并将radinetwwrist的性能与放射科专家的性能进行了比较。我们发现RadiNetwrist的表现与拥有20年以上经验的放射科医生相似。这促使后续研究在更多数据上进行训练,以超越放射科医生的表现,并研究人工智能模型在医疗保健领域的可解释性,因为决策过程需要可信和透明。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Artificial Intelligence and Soft Computing Research
Journal of Artificial Intelligence and Soft Computing Research COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-
CiteScore
7.00
自引率
25.00%
发文量
10
审稿时长
24 weeks
期刊介绍: Journal of Artificial Intelligence and Soft Computing Research (available also at Sciendo (De Gruyter)) is a dynamically developing international journal focused on the latest scientific results and methods constituting traditional artificial intelligence methods and soft computing techniques. Our goal is to bring together scientists representing both approaches and various research communities.
期刊最新文献
Bending Path Understanding Based on Angle Projections in Field Environments Self-Organized Operational Neural Networks for The Detection of Atrial Fibrillation Interpreting Convolutional Layers in DNN Model Based on Time–Frequency Representation of Emotional Speech A Few-Shot Learning Approach for Covid-19 Diagnosis Using Quasi-Configured Topological Spaces Metrics for Assessing Generalization of Deep Reinforcement Learning in Parameterized Environments
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1