A simulation study and application of feature selection on survival least square support vector machines

H. A. Khoiri, D. Prastyo, S. W. Purnami
Published in: THE 4TH INNOVATION AND ANALYTICS CONFERENCE & EXHIBITION (IACE 2019)
Publication date: 2019-08-21
DOI: 10.1063/1.5121121 (https://doi.org/10.1063/1.5121121)
Citations: 1

Abstract

The Cox Proportional Hazard Model (Cox PHM) is commonly employed in survival analysis. It relies on the proportional hazard assumption, which is not always satisfied in real applications. In such cases, the survival data can be analyzed using non-parametric approaches, one of which is the recently developed Survival Least Square Support Vector Machines (SURLS-SVM). This approach does not require the proportional hazard assumption, and the distribution of the survival time can be unknown. Some papers apply SURLS-SVM to both simulation studies and real data without considering feature selection. The performance of a statistical method depends on choosing relevant features as input, so feature selection needs to be applied in SURLS-SVM. In this paper, the Cox PHM and the SURLS-SVM with feature selection are applied to simulated data and to clinical data, namely the survival of cervical cancer patients. The two approaches are compared using a prognostic index, the concordance index (c-index). For both data sets, the c-index obtained from SURLS-SVM, with or without feature selection, is much higher than the one obtained from the Cox PHM. On the cervical cancer data, SURLS-SVM with feature selection selects 10 relevant features out of 12. Feature selection also works with the Cox PHM.
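
The abstract compares the models by c-index but does not say how it is computed. The following is a minimal sketch of Harrell's concordance index, not the authors' implementation. The function name `concordance_index`, the toy data, and the convention that a higher score means higher risk (shorter expected survival) are all assumptions made for illustration. A model whose prognostic index points the other way would need its sign flipped first.

```python
from itertools import combinations


def concordance_index(times, events, risk_scores):
    """Share of comparable patient pairs that the risk score orders correctly.

    times       : observed follow-up times
    events      : 1 if the event was observed, 0 if censored
    risk_scores : model output; ASSUMED higher = higher risk (illustrative convention)
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that `a` is the patient with the shorter observed time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # Simplification: skip tied times, and skip pairs whose shorter time
        # is censored, because the true ordering of such pairs is unknown.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0   # earlier event has the higher risk: concordant
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5   # tied scores count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy data: four patients, the third one censored.
times = [2, 4, 6, 8]
events = [1, 1, 0, 1]
print(concordance_index(times, events, [0.9, 0.7, 0.4, 0.1]))
print(concordance_index(times, events, [0.9, 0.1, 0.4, 0.7]))
```

A value of 1.0 means every comparable pair is ordered correctly, 0.5 corresponds to random ordering, and values below 0.5 indicate a score that tends to rank patients in the wrong direction.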