{"title":"Medical Image Classification: A Comparison of Deep Pre-trained Neural Networks","authors":"D. O. Alebiosu, Fermi Pasha Muhammad","doi":"10.1109/SCORED.2019.8896277","DOIUrl":null,"url":null,"abstract":"Medical image classification is an important step in the effective and accurate retrieval of medical images from large digital database where they are stored. This paper examines the effectiveness of using domain transferred neural networks (DCNNs) for classification of medical X-ray images. We employed two different convolutional neural network (CNN) architectures. VGGNet-16 and AlexNet pre-trained on ImageNet, a non- medical image database consisting of over 1.2 million scenery images were used for the classification task. The pre-trained networks served both as feature extractors and as fine-tuned networks. The extracted feature vector was used to train a linear support vector machine (SVM) to generate a model for the classification task. The fine-tuning process was done by replacing and retraining the last fully connected layers through backward propagation. Our method was evaluated on ImageCLEF2007 medical database. The database consist of 11,000 medical X-ray images (training dataset) and 1,000 images (testing dataset) classified into 116 categories. We compared the performance of the two networks both as feature generators and as fine-tuned networks on our dataset. The overall classification accuracy across all the 116 image classes shows that VGGNet-16 + SVM produced 79.6% and 85.77% as fine-tuned network. AlexNet + SVM produced a total classification accuracy of 84.27% and as a fine-tuned network produced a total of 86.47% which is the highest among the four techniques across all the 116 image classes. This study shows that the employment of a shallower pre-trained neural network such as AlexNet learn features that are more generalizable compared to deeper networkers such as VGGNet-16 and has a greater capability of increasing classification accuracy of medical image database. Though the pre-trained AlexNet outperformed VGGNet-16 in both ways, it can be noted that some image classes from the same sub-body region are difficult to classify accurately. This is as a result of inter-class similarity that exists among the images.","PeriodicalId":231004,"journal":{"name":"2019 IEEE Student Conference on Research and Development (SCOReD)","volume":"43 7","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Student Conference on Research and Development (SCOReD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCORED.2019.8896277","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Medical image classification is an important step in the effective and accurate retrieval of medical images from large digital database where they are stored. This paper examines the effectiveness of using domain transferred neural networks (DCNNs) for classification of medical X-ray images. We employed two different convolutional neural network (CNN) architectures. VGGNet-16 and AlexNet pre-trained on ImageNet, a non- medical image database consisting of over 1.2 million scenery images were used for the classification task. The pre-trained networks served both as feature extractors and as fine-tuned networks. The extracted feature vector was used to train a linear support vector machine (SVM) to generate a model for the classification task. The fine-tuning process was done by replacing and retraining the last fully connected layers through backward propagation. Our method was evaluated on ImageCLEF2007 medical database. The database consist of 11,000 medical X-ray images (training dataset) and 1,000 images (testing dataset) classified into 116 categories. We compared the performance of the two networks both as feature generators and as fine-tuned networks on our dataset. The overall classification accuracy across all the 116 image classes shows that VGGNet-16 + SVM produced 79.6% and 85.77% as fine-tuned network. AlexNet + SVM produced a total classification accuracy of 84.27% and as a fine-tuned network produced a total of 86.47% which is the highest among the four techniques across all the 116 image classes. This study shows that the employment of a shallower pre-trained neural network such as AlexNet learn features that are more generalizable compared to deeper networkers such as VGGNet-16 and has a greater capability of increasing classification accuracy of medical image database. Though the pre-trained AlexNet outperformed VGGNet-16 in both ways, it can be noted that some image classes from the same sub-body region are difficult to classify accurately. This is as a result of inter-class similarity that exists among the images.