Yuchao Zheng , Chen Li , Xiaomin Zhou , Haoyuan Chen , Hao Xu , Yixin Li , Haiqing Zhang , Xiaoyan Li , Hongzan Sun , Xinyu Huang , Marcin Grzegorzek
{"title":"Application of transfer learning and ensemble learning in image-level classification for breast histopathology","authors":"Yuchao Zheng , Chen Li , Xiaomin Zhou , Haoyuan Chen , Hao Xu , Yixin Li , Haiqing Zhang , Xiaoyan Li , Hongzan Sun , Xinyu Huang , Marcin Grzegorzek","doi":"10.1016/j.imed.2022.05.004","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Breast cancer has the highest prevalence among all cancers in women globally. The classification of histopathological images in the diagnosis of breast cancers is an area of clinical concern. In computer-aided diagnosis, most traditional classification models use a single network to extract features, although this approach has significant limitations. Moreover, many networks are trained and optimized on patient-level datasets, ignoring lower-level data labels.</p></div><div><h3>Methods</h3><p>This paper proposed a deep ensemble model based on image-level labels for the binary classification of breast histopathological images of benign and malignant lesions. First, the BreaKHis dataset was randomly divided into training, validation, and test sets. Then, data augmentation techniques were used to balance the numbers of benign and malignant samples. Third, based on their transfer learning performance and the complementarity between networks, VGG16, Xception, ResNet50, and DenseNet201 were selected as base classifiers.</p></div><div><h3>Results</h3><p>In a ensemble network model with accuracy as the weight, the image-level binary classification achieved an accuracy of <span><math><mrow><mn>98.90</mn><mo>%</mo></mrow></math></span>. To verify the capabilities of our method, it was experimentally compared with the latest transformer and multilayer perception (MLP) models on the same dataset. Our ensemble model showed a <span><math><mrow><mn>5</mn><mo>%</mo></mrow></math></span>–<span><math><mrow><mn>20</mn><mo>%</mo></mrow></math></span> advantage, emphasizing its far-reaching abilities in classification tasks.</p></div><div><h3>Conclusions</h3><p>This research focuses on improving the performance of a classification model with an ensemble algorithm. Transfer learning has an essential role in classification of small datasets, improving training speed and accuracy. Our model may outperform many existing approaches with respect to accuracy and has applications in the field of auxiliary medical diagnosis.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"3 2","pages":"Pages 115-128"},"PeriodicalIF":4.4000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent medicine","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266710262200047X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Breast cancer has the highest prevalence among all cancers in women globally. The classification of histopathological images in the diagnosis of breast cancers is an area of clinical concern. In computer-aided diagnosis, most traditional classification models use a single network to extract features, although this approach has significant limitations. Moreover, many networks are trained and optimized on patient-level datasets, ignoring lower-level data labels.
Methods
This paper proposed a deep ensemble model based on image-level labels for the binary classification of breast histopathological images of benign and malignant lesions. First, the BreaKHis dataset was randomly divided into training, validation, and test sets. Then, data augmentation techniques were used to balance the numbers of benign and malignant samples. Third, based on their transfer learning performance and the complementarity between networks, VGG16, Xception, ResNet50, and DenseNet201 were selected as base classifiers.
Results
In a ensemble network model with accuracy as the weight, the image-level binary classification achieved an accuracy of . To verify the capabilities of our method, it was experimentally compared with the latest transformer and multilayer perception (MLP) models on the same dataset. Our ensemble model showed a – advantage, emphasizing its far-reaching abilities in classification tasks.
Conclusions
This research focuses on improving the performance of a classification model with an ensemble algorithm. Transfer learning has an essential role in classification of small datasets, improving training speed and accuracy. Our model may outperform many existing approaches with respect to accuracy and has applications in the field of auxiliary medical diagnosis.