Semantic consistency generative adversarial network for cross-modality domain adaptation in ultrasound thyroid nodule classification

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Applied Intelligence Pub Date : 2022-01-13 DOI:10.1007/s10489-021-03025-7

Jun Zhao, Xiaosong Zhou, Guohua Shi, Ning Xiao, Kai Song, Juanjuan Zhao, Rui Hao, Keqin Li

{"title":"Semantic consistency generative adversarial network for cross-modality domain adaptation in ultrasound thyroid nodule classification","authors":"Jun Zhao, Xiaosong Zhou, Guohua Shi, Ning Xiao, Kai Song, Juanjuan Zhao, Rui Hao, Keqin Li","doi":"10.1007/s10489-021-03025-7","DOIUrl":null,"url":null,"abstract":"<div><p>Deep convolutional networks have been widely used for various medical image processing tasks. However, the performance of existing learning-based networks is still limited due to the lack of large training datasets. When a general deep model is directly deployed to a new dataset with heterogeneous features, the effect of domain shifts is usually ignored, and performance degradation problems occur. In this work, by designing the semantic consistency generative adversarial network (SCGAN), we propose a new multimodal domain adaptation method for medical image diagnosis. SCGAN performs cross-domain collaborative alignment of ultrasound images and domain knowledge. Specifically, we utilize a self-attention mechanism for adversarial learning between dual domains to overcome visual differences across modal data and preserve the domain invariance of the extracted semantic features. In particular, we embed nested metric learning in the semantic information space, thus enhancing the semantic consistency of cross-modal features. Furthermore, the adversarial learning of our network is guided by a discrepancy loss for encouraging the learning of semantic-level content and a regularization term for enhancing network generalization. We evaluate our method on a thyroid ultrasound image dataset for benign and malignant diagnosis of nodules. The experimental results of a comprehensive study show that the accuracy of the SCGAN method for the classification of thyroid nodules reaches 94.30%, and the AUC reaches 97.02%. These results are significantly better than the state-of-the-art methods.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"52 9","pages":"10369 - 10383"},"PeriodicalIF":3.4000,"publicationDate":"2022-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10489-021-03025-7.pdf","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-021-03025-7","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 12

Abstract

Deep convolutional networks have been widely used for various medical image processing tasks. However, the performance of existing learning-based networks is still limited due to the lack of large training datasets. When a general deep model is directly deployed to a new dataset with heterogeneous features, the effect of domain shifts is usually ignored, and performance degradation problems occur. In this work, by designing the semantic consistency generative adversarial network (SCGAN), we propose a new multimodal domain adaptation method for medical image diagnosis. SCGAN performs cross-domain collaborative alignment of ultrasound images and domain knowledge. Specifically, we utilize a self-attention mechanism for adversarial learning between dual domains to overcome visual differences across modal data and preserve the domain invariance of the extracted semantic features. In particular, we embed nested metric learning in the semantic information space, thus enhancing the semantic consistency of cross-modal features. Furthermore, the adversarial learning of our network is guided by a discrepancy loss for encouraging the learning of semantic-level content and a regularization term for enhancing network generalization. We evaluate our method on a thyroid ultrasound image dataset for benign and malignant diagnosis of nodules. The experimental results of a comprehensive study show that the accuracy of the SCGAN method for the classification of thyroid nodules reaches 94.30%, and the AUC reaches 97.02%. These results are significantly better than the state-of-the-art methods.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

超声甲状腺结节分类中用于跨模态域自适应的语义一致性生成对抗性网络

深度卷积网络已被广泛用于各种医学图像处理任务。然而，由于缺乏大型训练数据集，现有的基于学习的网络的性能仍然有限。当将通用深度模型直接部署到具有异构特征的新数据集时，通常会忽略域转移的影响，并出现性能下降问题。在这项工作中，通过设计语义一致性生成对抗性网络（SCGAN），我们提出了一种新的用于医学图像诊断的多模式域自适应方法。SCGAN执行超声图像和领域知识的跨领域协作对准。具体而言，我们利用自注意机制进行双域之间的对抗性学习，以克服模态数据之间的视觉差异，并保持提取的语义特征的域不变性。特别是，我们将嵌套度量学习嵌入到语义信息空间中，从而增强跨模态特征的语义一致性。此外，我们网络的对抗性学习是由用于鼓励语义级别内容学习的差异损失和用于增强网络泛化的正则化术语来指导的。我们在甲状腺超声图像数据集上评估了我们的方法，用于结节的良性和恶性诊断。综合研究的实验结果表明，SCGAN方法对甲状腺结节的分类准确率达到94.30%，AUC达到97.02%。这些结果明显优于最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Applied Intelligence 工程技术-计算机：人工智能

CiteScore

6.60

自引率

20.80%

发文量

1361

审稿时长

5.9 months

期刊介绍： With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance. The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.