S Selvakumara Samy, S Karthick, Meghna Ghosal, Sameer Singh, J S Sudarsan, S Nithiyanantham
{"title":"Adoption of machine learning algorithm for predicting the length of stay of patients (construction workers) during COVID pandemic.","authors":"S Selvakumara Samy, S Karthick, Meghna Ghosal, Sameer Singh, J S Sudarsan, S Nithiyanantham","doi":"10.1007/s41870-023-01296-6","DOIUrl":null,"url":null,"abstract":"<p><p>The construction sector in a rapidly developing country like India is a very unorganized sector. A large number of workers were affected and hospitalized during the pandemic. This situation is costing the sector heavily in several respects. This research study was conducted as part of using machine learning algorithms to improve construction company health and safety policies. LOS (length of stay) is used to predict how long a patient will stay in a hospital. Predicting LOS is very useful not only for hospitals, but also for construction companies to measure resources and reduce costs. Predicting LOS has become an important step in most hospitals before admitting patients. In this post, we used the Medical Information Mart for Intensive Care(MIMIC III) dataset and applied four different machine learning algorithms: decision tree classifier, random forest, Artificial Neural Network (ANN), and logistic regression. First, I performed data pre-processing to clean up the dataset. In the next step, we performed function selection using the Select Best algorithm with an evaluation function of chi2 to perform hot coding. We then performed a split between training and testing and applied a machine learning algorithm. The metric used for comparison was accuracy. After implementing the algorithms, the accuracy was compared. Random forest was found to perform best at 89%. Afterwards, we performed hyperparameter tuning using a grid search algorithm on a random forest to obtain higher accuracy. The final accuracy is 90%. This kind of research can help improve health security policies by introducing modern computational techniques, and can also help optimize resources.</p>","PeriodicalId":73455,"journal":{"name":"International journal of information technology : an official journal of Bharati Vidyapeeth's Institute of Computer Applications and Management","volume":" ","pages":"1-9"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10250170/pdf/","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of information technology : an official journal of Bharati Vidyapeeth's Institute of Computer Applications and Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s41870-023-01296-6","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The construction sector in a rapidly developing country like India is a very unorganized sector. A large number of workers were affected and hospitalized during the pandemic. This situation is costing the sector heavily in several respects. This research study was conducted as part of using machine learning algorithms to improve construction company health and safety policies. LOS (length of stay) is used to predict how long a patient will stay in a hospital. Predicting LOS is very useful not only for hospitals, but also for construction companies to measure resources and reduce costs. Predicting LOS has become an important step in most hospitals before admitting patients. In this post, we used the Medical Information Mart for Intensive Care(MIMIC III) dataset and applied four different machine learning algorithms: decision tree classifier, random forest, Artificial Neural Network (ANN), and logistic regression. First, I performed data pre-processing to clean up the dataset. In the next step, we performed function selection using the Select Best algorithm with an evaluation function of chi2 to perform hot coding. We then performed a split between training and testing and applied a machine learning algorithm. The metric used for comparison was accuracy. After implementing the algorithms, the accuracy was compared. Random forest was found to perform best at 89%. Afterwards, we performed hyperparameter tuning using a grid search algorithm on a random forest to obtain higher accuracy. The final accuracy is 90%. This kind of research can help improve health security policies by introducing modern computational techniques, and can also help optimize resources.