{"title":"MTPret: Improving X-Ray Image Analytics With Multitask Pretraining","authors":"Weibin Liao;Qingzhong Wang;Xuhong Li;Yi Liu;Zeyu Chen;Siyu Huang;Dejing Dou;Yanwu Xu;Haoyi Xiong","doi":"10.1109/TAI.2024.3400750","DOIUrl":null,"url":null,"abstract":"While deep neural networks (DNNs) have been widely used in various X-ray image analytics tasks such as classification, segmentation, detection, etc., there frequently needs to collect and annotate a huge amount of training data to train a model for every single task. In this work, we proposed a multitask self-supervised pretraining strategy \n<italic>MTPret</i>\n to improve the performance of DNNs in various X-ray analytics tasks. \n<italic>MTPret</i>\n first trains the backbone to learn visual representations from multiple datasets of different tasks through contrastive learning, then \n<italic>MTPret</i>\n leverages a multitask continual learning to learn discriminative features from various downstream tasks. To evaluate the performance of \n<italic>MTPret</i>\n, we collected eleven X-ray image datasets from different body parts, such as heads, chest, lungs, bones, and etc., for various tasks to pretrain backbones, and fine-tuned the networks on seven of the tasks. The evaluation results on top of the seven tasks showed \n<italic>MTPret</i>\n outperformed a large number of baseline methods, including other initialization strategies, pretrained models, and task-specific algorithms in recent studies. In addition, we also performed experiments based on two external tasks, where the datasets of external tasks have not been used in pretraining. The excellent performance of \n<italic>MTPret</i>\n further confirmed the generalizability and superiority of the proposed multitask self-supervised pretraining.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10531186/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
While deep neural networks (DNNs) have been widely used in various X-ray image analytics tasks such as classification, segmentation, detection, etc., there frequently needs to collect and annotate a huge amount of training data to train a model for every single task. In this work, we proposed a multitask self-supervised pretraining strategy
MTPret
to improve the performance of DNNs in various X-ray analytics tasks.
MTPret
first trains the backbone to learn visual representations from multiple datasets of different tasks through contrastive learning, then
MTPret
leverages a multitask continual learning to learn discriminative features from various downstream tasks. To evaluate the performance of
MTPret
, we collected eleven X-ray image datasets from different body parts, such as heads, chest, lungs, bones, and etc., for various tasks to pretrain backbones, and fine-tuned the networks on seven of the tasks. The evaluation results on top of the seven tasks showed
MTPret
outperformed a large number of baseline methods, including other initialization strategies, pretrained models, and task-specific algorithms in recent studies. In addition, we also performed experiments based on two external tasks, where the datasets of external tasks have not been used in pretraining. The excellent performance of
MTPret
further confirmed the generalizability and superiority of the proposed multitask self-supervised pretraining.