Syed Abdullah Basit, R. Qureshi, A. Shahid, Sheheryar Khan
{"title":"Survival prediction of lung cancer patients by integration of clinical and molecular features using machine learning","authors":"Syed Abdullah Basit, R. Qureshi, A. Shahid, Sheheryar Khan","doi":"10.36227/TECHRXIV.12943469.V1","DOIUrl":null,"url":null,"abstract":"Among all kinds of cancer, lung cancer has the greatest fatality rate. The mutation in Epidermal growth factor receptor (EGFR) is a significant cause of cancer deaths. Lung cancer is often diagnosed at advanced cancer stages. In this work, we propose a model by integrating patient's personal information and molecular features using machine learning classifiers, and molecular dynamics simulation. The clinical information is taken from various published studies, and molecular features are extracted using the drug-protein interactions and binding free energy of the drug-protein complex. The proposed model achieves good accuracy with a random forest classifier and a deep neural network. We believe that the prediction can be a promising index, and may help physicians and oncologists to develop personalized therapies for lung cancer patients.","PeriodicalId":325357,"journal":{"name":"2021 15th International Conference on Open Source Systems and Technologies (ICOSST)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 15th International Conference on Open Source Systems and Technologies (ICOSST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36227/TECHRXIV.12943469.V1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Among all kinds of cancer, lung cancer has the greatest fatality rate. The mutation in Epidermal growth factor receptor (EGFR) is a significant cause of cancer deaths. Lung cancer is often diagnosed at advanced cancer stages. In this work, we propose a model by integrating patient's personal information and molecular features using machine learning classifiers, and molecular dynamics simulation. The clinical information is taken from various published studies, and molecular features are extracted using the drug-protein interactions and binding free energy of the drug-protein complex. The proposed model achieves good accuracy with a random forest classifier and a deep neural network. We believe that the prediction can be a promising index, and may help physicians and oncologists to develop personalized therapies for lung cancer patients.