Semi-supervised pre-training based multi-task network for thyroid-associated ophthalmopathy classification

IF 3.4 2区 工程技术 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Displays Pub Date : 2025-04-01 Epub Date: 2025-02-01 DOI:10.1016/j.displa.2025.102974
MingFei Yang , TianFeng Zhang , XueFei Song , YuZhong Zhang , Lei Zhou
{"title":"Semi-supervised pre-training based multi-task network for thyroid-associated ophthalmopathy classification","authors":"MingFei Yang ,&nbsp;TianFeng Zhang ,&nbsp;XueFei Song ,&nbsp;YuZhong Zhang ,&nbsp;Lei Zhou","doi":"10.1016/j.displa.2025.102974","DOIUrl":null,"url":null,"abstract":"<div><div>Thyroid-associated ophthalmopathy (TAO) is a blinding autoimmune disorder, and early diagnosis is crucial in preventing vision loss. Orbital CT imaging has emerged as a valuable tool for diagnosing and screening TAO. Radiomic is currently the most dominant technique for TAO diagnosis, however it is costly due to the need for manual image labeling by medical professionals. Convolutional Neural Network (CNN) is another promising technique for TAO diagnosis. However, the performance of CNN based classification may degrade due to the limited size of collected data or the complexity of designed model. Utilizing pretraining model is a crucial technique for boosting the performance of CNN based TAO classification. Therefore, a novel semi-supervised pretraining based multi-task network for TAO classification is proposed in this paper. Firstly, a multi-task network is designed, which consists of an encoder, a classification branch and two segmentation decoder. Then, the multi-task network is pretrained by minimizing the prediction difference between two segmentation decoders through a semi-supervised way. In this way, the pseudo voxel-level supervision can be generated for the unlabeled images. Finally, the encoder and one light-weighted decoder can be initialized by the pretrained weights, and then they are jointly optimized for TAO classification with the classification branch through multi-task learning. Our proposed network model was comprehensively evaluated on a private dataset which consists of 982 orbital CT scans for TAO diagnosis. We also tested the classification generalization performance using an external dataset. The experimental results demonstrate that our model significantly improves the classification performance when compared with current SOTA methods. The source code is publically available at <span><span>https://github.com/VLAD-KONATA/TAO_CT</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"87 ","pages":"Article 102974"},"PeriodicalIF":3.4000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225000113","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/1 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

Abstract

Thyroid-associated ophthalmopathy (TAO) is a blinding autoimmune disorder, and early diagnosis is crucial in preventing vision loss. Orbital CT imaging has emerged as a valuable tool for diagnosing and screening TAO. Radiomic is currently the most dominant technique for TAO diagnosis, however it is costly due to the need for manual image labeling by medical professionals. Convolutional Neural Network (CNN) is another promising technique for TAO diagnosis. However, the performance of CNN based classification may degrade due to the limited size of collected data or the complexity of designed model. Utilizing pretraining model is a crucial technique for boosting the performance of CNN based TAO classification. Therefore, a novel semi-supervised pretraining based multi-task network for TAO classification is proposed in this paper. Firstly, a multi-task network is designed, which consists of an encoder, a classification branch and two segmentation decoder. Then, the multi-task network is pretrained by minimizing the prediction difference between two segmentation decoders through a semi-supervised way. In this way, the pseudo voxel-level supervision can be generated for the unlabeled images. Finally, the encoder and one light-weighted decoder can be initialized by the pretrained weights, and then they are jointly optimized for TAO classification with the classification branch through multi-task learning. Our proposed network model was comprehensively evaluated on a private dataset which consists of 982 orbital CT scans for TAO diagnosis. We also tested the classification generalization performance using an external dataset. The experimental results demonstrate that our model significantly improves the classification performance when compared with current SOTA methods. The source code is publically available at https://github.com/VLAD-KONATA/TAO_CT.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于半监督预训练的甲状腺相关眼病分类多任务网络
甲状腺相关性眼病(TAO)是一种致盲性自身免疫性疾病,早期诊断对预防视力丧失至关重要。眼眶CT成像已成为诊断和筛查TAO的重要工具。放射组学是目前最主要的TAO诊断技术,但由于需要医疗专业人员手动标记图像,因此成本高昂。卷积神经网络(CNN)是另一种很有前途的TAO诊断技术。然而,由于收集数据的规模有限或设计模型的复杂性,基于CNN的分类性能可能会下降。利用预训练模型是提高基于CNN的TAO分类性能的关键技术。为此,本文提出了一种新的基于半监督预训练的多任务TAO分类网络。首先,设计了一个多任务网络,该网络由一个编码器、一个分类分支和两个分段解码器组成。然后,通过半监督的方式最小化两个分割解码器之间的预测差,对多任务网络进行预训练。通过这种方法,可以对未标记的图像生成伪体素级监督。最后,利用预训练的权值对编码器和一个轻量级解码器进行初始化,然后通过多任务学习,与分类分支一起对编码器和一个轻量级解码器进行TAO分类优化。我们提出的网络模型在一个私人数据集上进行了全面评估,该数据集由982个眼眶CT扫描组成,用于TAO诊断。我们还使用外部数据集测试了分类泛化性能。实验结果表明,与现有的SOTA方法相比,我们的模型显著提高了分类性能。源代码可在https://github.com/VLAD-KONATA/TAO_CT上公开获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Displays
Displays 工程技术-工程:电子与电气
CiteScore
4.60
自引率
25.60%
发文量
138
审稿时长
92 days
期刊介绍: Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface. Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.
期刊最新文献
An end-to-end Chinese-Braille translation method based on mT5: Vocabulary expansion and structural enhancement A degradation-adaptive deep-sea polymetallic nodule image segmentation framework with dual-frequency fusion for Jiaolong submersible Decoder-enhanced and semantic-aware layered image compression for human and machine Using siamese networks with transfer learning for dental identification on small-samples datasets Modeling epistemic uncertainty in 3D Gaussian Splatting for robust scene reconstruction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1