Multimodal deep learning for chronic kidney disease prediction: leveraging feature selection algorithms and ensemble models

Q2 Computer Science International Journal of Computers and Applications Pub Date : 2023-10-03 DOI:10.1080/1206212x.2023.2262786

N. J. Subashini, K. Venkatesh

{"title":"Multimodal deep learning for chronic kidney disease prediction: leveraging feature selection algorithms and ensemble models","authors":"N. J. Subashini, K. Venkatesh","doi":"10.1080/1206212x.2023.2262786","DOIUrl":null,"url":null,"abstract":"ABSTRACTThis research presents an advanced approach to enhance disease diagnosis using imbalanced medical datasets. Feature selection techniques, LASSO and Relief, are applied to identify relevant features from the UCI dataset and missing values are handled appropriately. To address the class imbalance, SMOTEENN is used, creating a new combined dataset with selected features. Three deep learning models, FNNs, LSTMs, and GBMs, are employed and trained on the combined dataset, achieving remarkable accuracy (1.0). Evaluating the models on LASSO and Relief datasets independently, FNN/MLP obtains perfect accuracy, GBM performs well (0.9888 on LASSO and 1.0 on Relief), and LSTM shows good results (0.9663 on LASSO and 1.0 on Relief). This study demonstrates the effectiveness of combining LASSO and Relief for feature selection and highlights the impact of SMOTEENN on model performance. The achieved accuracy with all models on the combined dataset showcases deep learning's potential for accurate disease diagnosis even with imbalanced data, offering promising insights for robust medical diagnosis systems.KEYWORDS: Chronic kidney diseaseMultimodal deep learningLASSOReliefSMOTEENN Disclosure statementNo potential conflict of interest was reported by the author(s).Additional informationNotes on contributorsN. J. SubashiniN. J. Subashini is a Research scholar in Networking and Communications department, SRM Institute of Science and Technology. Her research interests include Data Mining, Artificial Intelligence, Deep Learning and Machine Learning.K. VenkateshK. Venkatesh is Associate Professor in Networking and Communications department, SRM Institute of Science and Technology. His research interests include Networking, Cloud Computing, Data Mining, Artificial Intelligence, and Machine Learning. He is the Program Coordinator for B. Tech CSE specialization with a focus on Computer Networking. Additionally, he serves as an Alumni Coordinator in the Department of Networking and Communications. He is a Cisco certified CCNA Lead Instructor and Academy Contact for SRM Institute of Science and Technology, formerly known as SRM University, Networking Academy.","PeriodicalId":39673,"journal":{"name":"International Journal of Computers and Applications","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computers and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/1206212x.2023.2262786","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

Abstract

ABSTRACTThis research presents an advanced approach to enhance disease diagnosis using imbalanced medical datasets. Feature selection techniques, LASSO and Relief, are applied to identify relevant features from the UCI dataset and missing values are handled appropriately. To address the class imbalance, SMOTEENN is used, creating a new combined dataset with selected features. Three deep learning models, FNNs, LSTMs, and GBMs, are employed and trained on the combined dataset, achieving remarkable accuracy (1.0). Evaluating the models on LASSO and Relief datasets independently, FNN/MLP obtains perfect accuracy, GBM performs well (0.9888 on LASSO and 1.0 on Relief), and LSTM shows good results (0.9663 on LASSO and 1.0 on Relief). This study demonstrates the effectiveness of combining LASSO and Relief for feature selection and highlights the impact of SMOTEENN on model performance. The achieved accuracy with all models on the combined dataset showcases deep learning's potential for accurate disease diagnosis even with imbalanced data, offering promising insights for robust medical diagnosis systems.KEYWORDS: Chronic kidney diseaseMultimodal deep learningLASSOReliefSMOTEENN Disclosure statementNo potential conflict of interest was reported by the author(s).Additional informationNotes on contributorsN. J. SubashiniN. J. Subashini is a Research scholar in Networking and Communications department, SRM Institute of Science and Technology. Her research interests include Data Mining, Artificial Intelligence, Deep Learning and Machine Learning.K. VenkateshK. Venkatesh is Associate Professor in Networking and Communications department, SRM Institute of Science and Technology. His research interests include Networking, Cloud Computing, Data Mining, Artificial Intelligence, and Machine Learning. He is the Program Coordinator for B. Tech CSE specialization with a focus on Computer Networking. Additionally, he serves as an Alumni Coordinator in the Department of Networking and Communications. He is a Cisco certified CCNA Lead Instructor and Academy Contact for SRM Institute of Science and Technology, formerly known as SRM University, Networking Academy.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

多模态深度学习用于慢性肾脏疾病预测:利用特征选择算法和集成模型

摘要本文提出了一种利用不平衡医疗数据集增强疾病诊断的先进方法。特征选择技术LASSO和Relief用于识别UCI数据集中的相关特征，并对缺失值进行适当处理。为了解决类不平衡问题，使用SMOTEENN，创建一个具有选定特征的新组合数据集。采用fnn、lstm和gbm三种深度学习模型对组合数据集进行训练，获得了显著的准确率(1.0)。在LASSO和Relief数据集上独立评估模型，FNN/MLP获得了较好的准确率，GBM表现良好(LASSO上0.9888,Relief上1.0)，LSTM表现良好(LASSO上0.9663,Relief上1.0)。本研究证明了LASSO和Relief相结合的特征选择的有效性，并强调了SMOTEENN对模型性能的影响。综合数据集上所有模型的准确性表明，即使在数据不平衡的情况下，深度学习也有可能准确诊断疾病，这为强大的医疗诊断系统提供了有希望的见解。关键词:慢性肾脏疾病多模式深度学习lassoreliefsmotenn披露声明作者未报告潜在的利益冲突。其他信息:贡献者说明j . SubashiniN。苏巴什尼是SRM科学技术研究所网络与通信系的研究学者。她的研究兴趣包括数据挖掘、人工智能、深度学习和机器学习。VenkateshK。文卡特什是SRM科学技术研究所网络与通信系副教授。他的研究兴趣包括网络、云计算、数据挖掘、人工智能和机器学习。他是B. Tech CSE专业的项目协调员，专注于计算机网络。此外，他还担任网络和通信部门的校友协调员。他是思科认证的CCNA首席讲师和SRM科学技术研究所(前身为SRM大学，网络学院)的学院联系人。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Journal of Computers and Applications Computer Science-Computer Graphics and Computer-Aided Design

CiteScore

4.70

自引率

0.00%

发文量

期刊介绍： The International Journal of Computers and Applications (IJCA) is a unique platform for publishing novel ideas, research outcomes and fundamental advances in all aspects of Computer Science, Computer Engineering, and Computer Applications. This is a peer-reviewed international journal with a vision to provide the academic and industrial community a platform for presenting original research ideas and applications. IJCA welcomes four special types of papers in addition to the regular research papers within its scope: (a) Papers for which all results could be easily reproducible. For such papers, the authors will be asked to upload "instructions for reproduction'''', possibly with the source codes or stable URLs (from where the codes could be downloaded). (b) Papers with negative results. For such papers, the experimental setting and negative results must be presented in detail. Also, why the negative results are important for the research community must be explained clearly. The rationale behind this kind of paper is that this would help researchers choose the correct approaches to solve problems and avoid the (already worked out) failed approaches. (c) Detailed report, case study and literature review articles about innovative software / hardware, new technology, high impact computer applications and future development with sufficient background and subject coverage. (d) Special issue papers focussing on a particular theme with significant importance or papers selected from a relevant conference with sufficient improvement and new material to differentiate from the papers published in a conference proceedings.