Lung Cancer Disease Prediction and Classification based on Feature Selection method using Bayesian Network, Logistic Regression, J48, Random Forest, and Naïve Bayes Algorithms
Authors: J. Viji Cripsy, T. Divya
Published in: 2023 3rd International Conference on Smart Data Intelligence (ICSMDI), 2023-03-01
DOI: 10.1109/ICSMDI57622.2023.00066 (https://doi.org/10.1109/ICSMDI57622.2023.00066)
Abstract
People who have never smoked can get lung cancer, but smokers have a higher risk than non-smokers. Lung cancer can start anywhere in the lungs and can affect any part of the respiratory system. Different classification methods are used for lung cancer prediction. This article uses five classification algorithms to predict lung cancer in patients using a Kaggle dataset: Bayesian Network, Logistic Regression, J48, Random Forest, and Naive Bayes. The quality of each result was measured from the correctly and incorrectly classified cases, using the evaluation facilities of the WEKA tool. The experimental results showed that Logistic Regression performed best (91.90%), followed by Random Forest (90.93%), Naive Bayes (90.29%), Bayesian Network (88.34%), and J48 (86.08%).
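The comparison in the abstract can be illustrated with a minimal Python sketch. It is not the authors' WEKA workflow. It assumes the reported percentages are classification accuracies, that is, correct cases divided by all classified cases, which the abstract implies but does not state. The names `reported`, `accuracy_percent`, and `ranking` are hypothetical, and the counts passed to `accuracy_percent` are illustrative only, not taken from the paper.

```python
# Scores exactly as reported in the abstract (percent).
# Assumption: these are classification accuracies.
reported = {
    "Logistic Regression": 91.90,
    "Naive Bayes": 90.29,
    "Bayesian Network": 88.34,
    "J48": 86.08,
    "Random Forest": 90.93,
}


def accuracy_percent(correct: int, incorrect: int) -> float:
    """Accuracy as a percentage of correctly classified cases."""
    total = correct + incorrect
    if total == 0:
        raise ValueError("no classified cases")
    return 100.0 * correct / total


# Rank the classifiers from highest to lowest reported score.
ranking = sorted(reported.items(), key=lambda item: item[1], reverse=True)

if __name__ == "__main__":
    for rank, (name, score) in enumerate(ranking, start=1):
        print(f"{rank}. {name}: {score:.2f}%")
    # Illustrative counts only (not from the paper): 90 correct, 10 incorrect.
    print(f"Example accuracy: {accuracy_percent(90, 10):.2f}%")
```

Sorting by score places Random Forest second, directly behind Logistic Regression, with J48 last.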