{"title":"Automatic Extraction of Deep Phenotypes for Precision Medicine in Chronic Kidney Disease","authors":"Prerna Singh, V. Chandola, C. Fox","doi":"10.1145/3079452.3079489","DOIUrl":null,"url":null,"abstract":"Chronic Kidney Disease (CKD) is one of the deadliest diseases in the world, with 10% of the global population affected by the disease. Identifying subpopulations with characteristic disease progressions is important to find more efficient treatments for patients with this disease. The abundance of electronic health records (EHR) data can be used to find meaningful subtypes for CKD but comes with challenges during analysis, including irregular data sampling, and skewness in the data collected over time. In this paper, multiple regression techniques were used to fill in the missing estimated glomerular filtration rate (or eGFR -- a key measure for kidney function) trajectory data, so it can be clustered effectively. Clustering is applied to the enhanced data to obtain six subtypes, which capture crucial trends in the disease progression of patients. Moreover, the characteristics of patients in each of the subtypes had minor differences from others. These characteristics demonstrate risk factors and positive lifestyles choices of patients with CKD, which can help develop new treatments for CKD.","PeriodicalId":245682,"journal":{"name":"Proceedings of the 2017 International Conference on Digital Health","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 International Conference on Digital Health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3079452.3079489","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Chronic Kidney Disease (CKD) is one of the deadliest diseases in the world, with 10% of the global population affected by the disease. Identifying subpopulations with characteristic disease progressions is important to find more efficient treatments for patients with this disease. The abundance of electronic health records (EHR) data can be used to find meaningful subtypes for CKD but comes with challenges during analysis, including irregular data sampling, and skewness in the data collected over time. In this paper, multiple regression techniques were used to fill in the missing estimated glomerular filtration rate (or eGFR -- a key measure for kidney function) trajectory data, so it can be clustered effectively. Clustering is applied to the enhanced data to obtain six subtypes, which capture crucial trends in the disease progression of patients. Moreover, the characteristics of patients in each of the subtypes had minor differences from others. These characteristics demonstrate risk factors and positive lifestyles choices of patients with CKD, which can help develop new treatments for CKD.