Janmajay Singh, Kentaro Oshiro, R. Krishnan, Masahiro Sato, T. Ohkuma, N. Kato
{"title":"Utilizing Informative Missingness for Early Prediction of Sepsis","authors":"Janmajay Singh, Kentaro Oshiro, R. Krishnan, Masahiro Sato, T. Ohkuma, N. Kato","doi":"10.23919/CinC49843.2019.9005809","DOIUrl":null,"url":null,"abstract":"Aims: Physicians have to routinely make crucial decisions about patients’ health in the ICU. Sepsis affects about 35% of ICU patients, killing approximately 25% of the afflicted. In this paper, we aim to predict the occurrence of sepsis early by studying the missingness of physiological variables and using it with the overall trends in data.Methods: We chose XGBoost as our base model and tried several variations by changing hyperparameters, window sizes and imputation methods. To further improve the model, we used masking vectors to represent the missingness of features in the dataset. Additional modifications include shifting the Sepsis Label to earlier time steps and tuning the classification probability threshold to further improve the model’s performance.Results: The XGBoost model with a sliding window of size 5, no imputation, utilizing informative missingness of all temporal variables and trained on labels shifted by 3 hours before toptimal, achieved a Utility Score of 0.337 on the full test set. We identified as \"CTL-Team\" in the challenge and were officially ranked 5th on the basis of this score.","PeriodicalId":6697,"journal":{"name":"2019 Computing in Cardiology (CinC)","volume":"43 1 1","pages":"1-4"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Computing in Cardiology (CinC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/CinC49843.2019.9005809","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Aims: Physicians have to routinely make crucial decisions about patients’ health in the ICU. Sepsis affects about 35% of ICU patients, killing approximately 25% of the afflicted. In this paper, we aim to predict the occurrence of sepsis early by studying the missingness of physiological variables and using it with the overall trends in data.Methods: We chose XGBoost as our base model and tried several variations by changing hyperparameters, window sizes and imputation methods. To further improve the model, we used masking vectors to represent the missingness of features in the dataset. Additional modifications include shifting the Sepsis Label to earlier time steps and tuning the classification probability threshold to further improve the model’s performance.Results: The XGBoost model with a sliding window of size 5, no imputation, utilizing informative missingness of all temporal variables and trained on labels shifted by 3 hours before toptimal, achieved a Utility Score of 0.337 on the full test set. We identified as "CTL-Team" in the challenge and were officially ranked 5th on the basis of this score.