Karen A Wong, Ankita Paul, Paige Fuentes, Diane C Lim, Anup Das, Miranda Tan
{"title":"Screening for Obstructive Sleep Apnea in Patients with Cancer – a Machine Learning Approach","authors":"Karen A Wong, Ankita Paul, Paige Fuentes, Diane C Lim, Anup Das, Miranda Tan","doi":"10.1093/sleepadvances/zpad042","DOIUrl":null,"url":null,"abstract":"Abstract Background Obstructive sleep apnea (OSA) is a highly prevalent sleep disorder associated with daytime sleepiness, fatigue, and increased all-cause mortality risk in cancer patients. Existing screening tools for OSA do not account for the interaction of cancer-related features that may increase OSA risk. Study Design and Methods This is a retrospective study of cancer patients at a single tertiary cancer institution who underwent home sleep apnea test (HSAT) to evaluate for OSA. Unsupervised machine learning (ML) was used to reduce the dimensions and extract significant features associated with OSA. ML classifiers were applied to principal components and model hyperparameters were optimized using k-fold cross validation. Training models for OSA were subsequently tested and compared with the STOP-Bang questionnaire on a prospective unseen test set of patients who underwent an HSAT. Results From a training dataset of 249 patients, kernel principal component analysis extracted 8 components through dimension reduction to explain the maximum variance with OSA at 98%. Predictors of OSA were smoking, asthma, chronic kidney disease, STOP-Bang score, race, diabetes, radiation to head/neck/thorax (RT-HNT), type of cancer, and cancer metastases. Of the ML models, PCA+RF had the highest sensitivity (96.8%), specificity (92.3%), negative predictive value (92%), F1 score (0.93), and ROC-AUC score (0.88). The PCA+RF screening algorithm also performed better than the STOP-Bang questionnaire alone when tested on a prospective unseen test set. Conclusion The PCA+RF ML model had the highest accuracy in screening for OSA in cancer patients. History of RT-HNT, cancer metastases, and type of cancer were identified as cancer-related risk factors for OSA.","PeriodicalId":21861,"journal":{"name":"SLEEP Advances","volume":"72 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SLEEP Advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/sleepadvances/zpad042","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract Background Obstructive sleep apnea (OSA) is a highly prevalent sleep disorder associated with daytime sleepiness, fatigue, and increased all-cause mortality risk in cancer patients. Existing screening tools for OSA do not account for the interaction of cancer-related features that may increase OSA risk. Study Design and Methods This is a retrospective study of cancer patients at a single tertiary cancer institution who underwent home sleep apnea test (HSAT) to evaluate for OSA. Unsupervised machine learning (ML) was used to reduce the dimensions and extract significant features associated with OSA. ML classifiers were applied to principal components and model hyperparameters were optimized using k-fold cross validation. Training models for OSA were subsequently tested and compared with the STOP-Bang questionnaire on a prospective unseen test set of patients who underwent an HSAT. Results From a training dataset of 249 patients, kernel principal component analysis extracted 8 components through dimension reduction to explain the maximum variance with OSA at 98%. Predictors of OSA were smoking, asthma, chronic kidney disease, STOP-Bang score, race, diabetes, radiation to head/neck/thorax (RT-HNT), type of cancer, and cancer metastases. Of the ML models, PCA+RF had the highest sensitivity (96.8%), specificity (92.3%), negative predictive value (92%), F1 score (0.93), and ROC-AUC score (0.88). The PCA+RF screening algorithm also performed better than the STOP-Bang questionnaire alone when tested on a prospective unseen test set. Conclusion The PCA+RF ML model had the highest accuracy in screening for OSA in cancer patients. History of RT-HNT, cancer metastases, and type of cancer were identified as cancer-related risk factors for OSA.