{"title":"Balancing of an imbalanced dataset by applying SMOTE variants and predicting neonatal mortality using ensemble learning techniques","authors":"Sivarajan A, Bala Aditya A, Sivasankar E","doi":"10.1109/ICITIIT54346.2022.9744204","DOIUrl":null,"url":null,"abstract":"Dynamic environment and imbalanced datasets are unavoidable challenges in developing medical diagnostic tools where incremental learning is a necessity. The prediction tools upon imbalanced data normally work with majority class bias, and it is not easy to recognize faulty classes. This work aims to solve the class imbalance problem by generating synthetic data using SMOTE variants to balance the dataset and predict the neonatal mortality by adopting different ensemble classification methods. This system will be applied to diagnose newborns, vulnerable to die in the initial period of 28 days after birth.","PeriodicalId":184353,"journal":{"name":"2022 International Conference on Innovative Trends in Information Technology (ICITIIT)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Innovative Trends in Information Technology (ICITIIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITIIT54346.2022.9744204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Dynamic environment and imbalanced datasets are unavoidable challenges in developing medical diagnostic tools where incremental learning is a necessity. The prediction tools upon imbalanced data normally work with majority class bias, and it is not easy to recognize faulty classes. This work aims to solve the class imbalance problem by generating synthetic data using SMOTE variants to balance the dataset and predict the neonatal mortality by adopting different ensemble classification methods. This system will be applied to diagnose newborns, vulnerable to die in the initial period of 28 days after birth.