{"title":"Incorporating Feature Selection Methods into Machine Learning-Based Covid-19 Diagnosis","authors":"Çağla Danacı, S. Tuncer","doi":"10.2478/acss-2022-0002","DOIUrl":null,"url":null,"abstract":"Abstract The aim of the study is to diagnose Covid-19 by machine learning algorithms using biochemical parameters. In addition to the aim of the study, October selection was performed using 14 different feature selection methods based on the biochemical parameters available to us. As a result of the study, the performance of the algorithms and feature selection methods was evaluated using performance evaluation criteria. The dataset used in the study consists of 100 covid-negative and 121 covid-positive data from a total of 221 patients. The dataset includes 16 biochemical parameters used for the diagnosis of Covid-19. Feature selection methods were used to reduce the number of parameters and perform the classification process. The result of the study shows that the new feature set obtained using feature selection algorithms yields very similar results to the set containing all features. Overall, 5 features obtained from 16 features by feature selection methods yielded the best performance for the K-Nearest Neighbour algorithm with the FSVFS feature selection method of 86.4 %.","PeriodicalId":41960,"journal":{"name":"Applied Computer Systems","volume":"16 1","pages":"13 - 18"},"PeriodicalIF":0.5000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Computer Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/acss-2022-0002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract The aim of the study is to diagnose Covid-19 by machine learning algorithms using biochemical parameters. In addition to the aim of the study, October selection was performed using 14 different feature selection methods based on the biochemical parameters available to us. As a result of the study, the performance of the algorithms and feature selection methods was evaluated using performance evaluation criteria. The dataset used in the study consists of 100 covid-negative and 121 covid-positive data from a total of 221 patients. The dataset includes 16 biochemical parameters used for the diagnosis of Covid-19. Feature selection methods were used to reduce the number of parameters and perform the classification process. The result of the study shows that the new feature set obtained using feature selection algorithms yields very similar results to the set containing all features. Overall, 5 features obtained from 16 features by feature selection methods yielded the best performance for the K-Nearest Neighbour algorithm with the FSVFS feature selection method of 86.4 %.