P. Kroupa, Caroline Morton, K. L. Calvez, Matt Williams
{"title":"在一系列模拟数据集上评估简单的、可解释的时间到事件生存预测的k近邻算法","authors":"P. Kroupa, Caroline Morton, K. L. Calvez, Matt Williams","doi":"10.1109/CBMS.2019.00080","DOIUrl":null,"url":null,"abstract":"Survival prediction is a key task in medicine. Existing models are based on statistical techniques, such as the Cox models and there is limited work on the application of machine learning. In this paper we demonstrate that the K-Nearest Neighbour algorithm can be used for survival prediction. We show that its performance is as good as that of standard techniques, and that it provides a clear interpretation of the results. We show that pre-processing methods improve performance, and evaluate the performance across 20 different datasets with differing properties to show that the model performs well under various conditions. For low event rate datasets we show that KNN can outperform the Cox model.","PeriodicalId":311634,"journal":{"name":"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing K-Nearest Neighbours Algorithm for Simple, Interpretable Time-to-Event Survival Predictions Over a Range of Simulated Datasets\",\"authors\":\"P. Kroupa, Caroline Morton, K. L. Calvez, Matt Williams\",\"doi\":\"10.1109/CBMS.2019.00080\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Survival prediction is a key task in medicine. Existing models are based on statistical techniques, such as the Cox models and there is limited work on the application of machine learning. In this paper we demonstrate that the K-Nearest Neighbour algorithm can be used for survival prediction. We show that its performance is as good as that of standard techniques, and that it provides a clear interpretation of the results. We show that pre-processing methods improve performance, and evaluate the performance across 20 different datasets with differing properties to show that the model performs well under various conditions. For low event rate datasets we show that KNN can outperform the Cox model.\",\"PeriodicalId\":311634,\"journal\":{\"name\":\"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)\",\"volume\":\"71 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CBMS.2019.00080\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMS.2019.00080","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Assessing K-Nearest Neighbours Algorithm for Simple, Interpretable Time-to-Event Survival Predictions Over a Range of Simulated Datasets
Survival prediction is a key task in medicine. Existing models are based on statistical techniques, such as the Cox models and there is limited work on the application of machine learning. In this paper we demonstrate that the K-Nearest Neighbour algorithm can be used for survival prediction. We show that its performance is as good as that of standard techniques, and that it provides a clear interpretation of the results. We show that pre-processing methods improve performance, and evaluate the performance across 20 different datasets with differing properties to show that the model performs well under various conditions. For low event rate datasets we show that KNN can outperform the Cox model.