{"title":"A recommender system with multi-objective hybrid Harris Hawk optimization for feature selection and disease diagnosis","authors":"Madhusree Kuanr, Puspanjali Mohapatra","doi":"10.1016/j.health.2025.100384","DOIUrl":null,"url":null,"abstract":"<div><div>This study proposes a health recommender system to analyze health risk and disease prediction by identifying the most responsible disease-causing factors using a hybrid Genetic–Harris Hawk optimization multi-objective feature selection approach. The proposed recommender system uses the Tree-based Pipeline Optimization Tool (TPOT) automated machine learning model to recommend the most suitable machine learning prediction model with the best classifier in terms of classification accuracy for a disease with the selected features. It also recommends the top three disease-causing features for a particular disease that can be utilized to analyze a person’s health risk. The proposed system has also been compared with the competing prediction approaches using Principal Component Analysis (PCA), Singular Vector Decomposition (SVD), and Autoencoders. We show that the proposed system outperforms competing approaches in terms of classification accuracy.</div></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"7 ","pages":"Article 100384"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare analytics (New York, N.Y.)","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772442525000036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This study proposes a health recommender system to analyze health risk and disease prediction by identifying the most responsible disease-causing factors using a hybrid Genetic–Harris Hawk optimization multi-objective feature selection approach. The proposed recommender system uses the Tree-based Pipeline Optimization Tool (TPOT) automated machine learning model to recommend the most suitable machine learning prediction model with the best classifier in terms of classification accuracy for a disease with the selected features. It also recommends the top three disease-causing features for a particular disease that can be utilized to analyze a person’s health risk. The proposed system has also been compared with the competing prediction approaches using Principal Component Analysis (PCA), Singular Vector Decomposition (SVD), and Autoencoders. We show that the proposed system outperforms competing approaches in terms of classification accuracy.