{"title":"Combining a forward supervised filter learning with a sparse NMF for breast cancer histopathological image classification","authors":"ArunaDevi Karuppasamy , Abdelhamid Abdesselam , Hamza zidoum , Rachid Hedjam , Maiya Al-Bahri","doi":"10.1016/j.ibmed.2024.100174","DOIUrl":null,"url":null,"abstract":"<div><div>Histopathological images play a important role in clinical diagnosis, particularly in identifying and assessing the severity of abnormal conditions like benign lesions and malignant tumors. Traditional machine learning techniques for processing histopathology images involve the extraction of manual features from these images, which is typically done with the assistance of industry experts. Recent advancements in Deep Learning (DL), especially with Convolutional Neural Networks (CNN), have enabled the automatic extraction of multi-level abstract features directly from raw data. This capability significantly enhances the performance of complex computer vision tasks. Classic CNN models like AlexNet and VggNet employ back-propagation algorithms to learn filters in the training phase. However, these algorithms demand large labeled datasets, resulting in extensive computational processing. Additionally, they often face the vanishing gradient problem, which can negatively impact the quality of the learning process. Besides, in many domains, acquiring enough labeled images for conducting properly the training phase is a real challenge. To address these challenges, a feed-forward propagation approach was proposed using Non-Negative Matrix Factorization(NMF). The NMF technique factorizes the input data into two latent factors (non-negative matrices). It has been shown that by enforcing constraints such as sparsity on the latent factors, dominant features that are mostly correlated with tumors types can be extracted. In this work, a novel model combining sparse NMF and Support Vector Machine (SVM) was developed for classifying histopathological images. We have derived a mathematical model of a novel feed-forward filter learning approach that combines sparse NMF (SNMF) and Support Vector Machine technique (SVM). The model was used to design and implement a feed-forward CNN classifier to classify histopathology images. This model has been evaluated on the histopathology images from Sultan Qaboos University Hospital (SQUH dataset) and the public BreaKHis dataset. The experiments we have conducted demonstrate the efficiency of the proposed model, especially on small-sized SQUH datasets achieving an AUC of 0.90, 0.89, 0.85, and 0.86 on 4x,10x, 20x, and 40x magnifications, respectively, and achieving an AUC of 0.95 BreaKHis dataset.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"10 ","pages":"Article 100174"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligence-based medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666521224000413","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Histopathological images play a important role in clinical diagnosis, particularly in identifying and assessing the severity of abnormal conditions like benign lesions and malignant tumors. Traditional machine learning techniques for processing histopathology images involve the extraction of manual features from these images, which is typically done with the assistance of industry experts. Recent advancements in Deep Learning (DL), especially with Convolutional Neural Networks (CNN), have enabled the automatic extraction of multi-level abstract features directly from raw data. This capability significantly enhances the performance of complex computer vision tasks. Classic CNN models like AlexNet and VggNet employ back-propagation algorithms to learn filters in the training phase. However, these algorithms demand large labeled datasets, resulting in extensive computational processing. Additionally, they often face the vanishing gradient problem, which can negatively impact the quality of the learning process. Besides, in many domains, acquiring enough labeled images for conducting properly the training phase is a real challenge. To address these challenges, a feed-forward propagation approach was proposed using Non-Negative Matrix Factorization(NMF). The NMF technique factorizes the input data into two latent factors (non-negative matrices). It has been shown that by enforcing constraints such as sparsity on the latent factors, dominant features that are mostly correlated with tumors types can be extracted. In this work, a novel model combining sparse NMF and Support Vector Machine (SVM) was developed for classifying histopathological images. We have derived a mathematical model of a novel feed-forward filter learning approach that combines sparse NMF (SNMF) and Support Vector Machine technique (SVM). The model was used to design and implement a feed-forward CNN classifier to classify histopathology images. This model has been evaluated on the histopathology images from Sultan Qaboos University Hospital (SQUH dataset) and the public BreaKHis dataset. The experiments we have conducted demonstrate the efficiency of the proposed model, especially on small-sized SQUH datasets achieving an AUC of 0.90, 0.89, 0.85, and 0.86 on 4x,10x, 20x, and 40x magnifications, respectively, and achieving an AUC of 0.95 BreaKHis dataset.