X. Bry, Théo Simac, Salah Eddine El Ghachi, P. Antoine
{"title":"Bridging data exploration and modeling in event-history analysis: the supervised-component Cox regression","authors":"X. Bry, Théo Simac, Salah Eddine El Ghachi, P. Antoine","doi":"10.1080/08898480.2018.1553413","DOIUrl":null,"url":null,"abstract":"ABSTRACT In event-history analysis with many possibly collinear regressors, Cox’s proportional hazard model, like all generalized linear models, can fail to be identified. Dimension-reduction and regularization are therefore needed. Penalty-based methods such as the ridge and the least absolute shrinkage and selection operator (LASSO) provide a regularized linear predictor, but fail to highlight the predictive structures. This is the gap filled by the supervised-component Cox regression (SCCoxR). Its principle is to compute a sequence of orthogonal explanatory components, which both rely on the strong correlation structures of regressors and optimize the goodness-of-fit of the model. One of its parameters tunes the balance between component strength and goodness of fit, thus bridging the gap between classical Cox regression with Cox regression on principal components. A second parameter allows the focus on subsets of highly correlated explanatory variables. A third parameter tunes the regularization of the model coefficients, leading to more robust estimates. Simulations show how to tune the parameters. The method is applied to the case study of polygamy in Dakar, Senegal.","PeriodicalId":49859,"journal":{"name":"Mathematical Population Studies","volume":"27 1","pages":"139 - 174"},"PeriodicalIF":1.4000,"publicationDate":"2020-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/08898480.2018.1553413","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Population Studies","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1080/08898480.2018.1553413","RegionNum":3,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"DEMOGRAPHY","Score":null,"Total":0}
引用次数: 1
Abstract
ABSTRACT In event-history analysis with many possibly collinear regressors, Cox’s proportional hazard model, like all generalized linear models, can fail to be identified. Dimension-reduction and regularization are therefore needed. Penalty-based methods such as the ridge and the least absolute shrinkage and selection operator (LASSO) provide a regularized linear predictor, but fail to highlight the predictive structures. This is the gap filled by the supervised-component Cox regression (SCCoxR). Its principle is to compute a sequence of orthogonal explanatory components, which both rely on the strong correlation structures of regressors and optimize the goodness-of-fit of the model. One of its parameters tunes the balance between component strength and goodness of fit, thus bridging the gap between classical Cox regression with Cox regression on principal components. A second parameter allows the focus on subsets of highly correlated explanatory variables. A third parameter tunes the regularization of the model coefficients, leading to more robust estimates. Simulations show how to tune the parameters. The method is applied to the case study of polygamy in Dakar, Senegal.
期刊介绍:
Mathematical Population Studies publishes carefully selected research papers in the mathematical and statistical study of populations. The journal is strongly interdisciplinary and invites contributions by mathematicians, demographers, (bio)statisticians, sociologists, economists, biologists, epidemiologists, actuaries, geographers, and others who are interested in the mathematical formulation of population-related questions.
The scope covers both theoretical and empirical work. Manuscripts should be sent to Manuscript central for review. The editor-in-chief has final say on the suitability for publication.