Zarish, B. Wajid, Umar Rashid, Sajida Zahid, Faria Anwar, F. G. Awan, Abdul Rauf Anwar, Imran Wajid
{"title":"Survival Rate Prediction of Blood Cancer (Leukemia) Patients Using Machine Learning Algorithms","authors":"Zarish, B. Wajid, Umar Rashid, Sajida Zahid, Faria Anwar, F. G. Awan, Abdul Rauf Anwar, Imran Wajid","doi":"10.1109/ETECTE55893.2022.10007402","DOIUrl":null,"url":null,"abstract":"Survival rate prediction for medical diseases is a complex task that requires high precision. With a low survival rate among reported patients, leukemia is a type of cancer of blood which is caused by the abnormal growth of white blood cells. It is critical to numerically evaluate the rate of survivability of patients suffering from leukemia. To this end, this paper employs a comprehensive database, namely Surveillance, Epidemiology, and End Results (SEER) maintained by The National Cancer Institute in MD, USA, to construct a survivability model for leukemia patients. To accurately predict the survival months of the patients, we develop a multi-class classification problem by binning the target variable into four bins. The resulting accuracy is improved by utilizing a multi-tier classification framework. Although, the final numerical results hold significance from biological viewpoint, it is recommended that a clinically relevant model be drawn with caution.","PeriodicalId":131572,"journal":{"name":"2022 International Conference on Emerging Trends in Electrical, Control, and Telecommunication Engineering (ETECTE)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Emerging Trends in Electrical, Control, and Telecommunication Engineering (ETECTE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ETECTE55893.2022.10007402","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Survival rate prediction for medical diseases is a complex task that requires high precision. With a low survival rate among reported patients, leukemia is a type of cancer of blood which is caused by the abnormal growth of white blood cells. It is critical to numerically evaluate the rate of survivability of patients suffering from leukemia. To this end, this paper employs a comprehensive database, namely Surveillance, Epidemiology, and End Results (SEER) maintained by The National Cancer Institute in MD, USA, to construct a survivability model for leukemia patients. To accurately predict the survival months of the patients, we develop a multi-class classification problem by binning the target variable into four bins. The resulting accuracy is improved by utilizing a multi-tier classification framework. Although, the final numerical results hold significance from biological viewpoint, it is recommended that a clinically relevant model be drawn with caution.
医学疾病的生存率预测是一项复杂的任务,需要很高的精度。白血病是一种由白细胞异常生长引起的血癌,在报道的患者中存活率很低。对白血病患者的存活率进行数值评估是至关重要的。为此,本文利用美国MD国家癌症研究所(The National Cancer Institute in MD, USA)的监测、流行病学和最终结果(SEER)综合数据库,构建白血病患者的生存能力模型。为了准确预测患者的生存月数,我们通过将目标变量分为四个bin,开发了一个多类分类问题。通过使用多层分类框架,提高了结果的准确性。虽然最终的数值结果从生物学角度来看具有重要意义,但建议谨慎绘制临床相关模型。