Christoph Schultheiss, Claude Renaux, Peter Buhlmann
{"title":"Multicarving for high-dimensional post-selection inference","authors":"Christoph Schultheiss, Claude Renaux, Peter Buhlmann","doi":"10.1214/21-EJS1825","DOIUrl":null,"url":null,"abstract":"We consider post-selection inference for high-dimensional (generalized) linear models. Data carving (Fithian et al., 2014) is a promising technique to perform this task. However, it suffers from the instability of the model selector and hence may lead to poor replicability, especially in high-dimensional settings. We propose the multicarve method inspired by multisplitting, to improve upon stability and replicability. Furthermore, we extend existing concepts to group inference and illustrate the applicability of the methodology also for generalized linear models.","PeriodicalId":186390,"journal":{"name":"arXiv: Methodology","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv: Methodology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1214/21-EJS1825","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
We consider post-selection inference for high-dimensional (generalized) linear models. Data carving (Fithian et al., 2014) is a promising technique to perform this task. However, it suffers from the instability of the model selector and hence may lead to poor replicability, especially in high-dimensional settings. We propose the multicarve method inspired by multisplitting, to improve upon stability and replicability. Furthermore, we extend existing concepts to group inference and illustrate the applicability of the methodology also for generalized linear models.
我们考虑高维(广义)线性模型的后选择推理。数据雕刻(Fithian et al., 2014)是执行此任务的一种有前途的技术。然而,它受到模型选择器的不稳定性的影响,因此可能导致较差的可复制性,特别是在高维设置中。我们提出了受多重分裂启发的多重曲线方法,以提高稳定性和可复制性。此外,我们将现有的概念扩展到群推理,并说明该方法也适用于广义线性模型。