{"title":"混合整数非线性规划方法在LOGISTIC回归变量选择中的应用","authors":"K. Kimura","doi":"10.15807/JORSJ.62.15","DOIUrl":null,"url":null,"abstract":"Variable selection is the process of finding variables relevant to a given dataset in model construction. One of the techniques for variable selection is exponentially evaluating many models with a goodness-of-fit (GOF) measure, for example, Akaike information criterion (AIC). The model with the lowest GOF value is considered as the best model. We proposed a mixed integer nonlinear programming approach to AIC minimization for linear regression and showed that the approach outperformed existing approaches in terms of computational time [13]. In this study, we apply the approach in [13] to AIC minimization for logistic regression and explain that a few of the techniques developed previously [13], for example, relaxation and a branching rule, can be used for the AIC minimization. The proposed approach requires solving relaxation problems, which are unconstrained convex problems. We apply an iterative method with an effective initial guess to solve these problems. We implement the proposed approach via SCIP, which is a noncommercial optimization software and a branch-and-bound framework. We compare the proposed approach with a piecewise linear approximation approach developed by Sato and others [16]. The results of computational experiments show that the proposed approach finds the model with the lowest AIC value if the number of candidates for variables is 45 or lower.","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"APPLICATION OF A MIXED INTEGER NONLINEAR PROGRAMMING APPROACH TO VARIABLE SELECTION IN LOGISTIC REGRESSION\",\"authors\":\"K. Kimura\",\"doi\":\"10.15807/JORSJ.62.15\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Variable selection is the process of finding variables relevant to a given dataset in model construction. One of the techniques for variable selection is exponentially evaluating many models with a goodness-of-fit (GOF) measure, for example, Akaike information criterion (AIC). The model with the lowest GOF value is considered as the best model. We proposed a mixed integer nonlinear programming approach to AIC minimization for linear regression and showed that the approach outperformed existing approaches in terms of computational time [13]. In this study, we apply the approach in [13] to AIC minimization for logistic regression and explain that a few of the techniques developed previously [13], for example, relaxation and a branching rule, can be used for the AIC minimization. The proposed approach requires solving relaxation problems, which are unconstrained convex problems. We apply an iterative method with an effective initial guess to solve these problems. We implement the proposed approach via SCIP, which is a noncommercial optimization software and a branch-and-bound framework. We compare the proposed approach with a piecewise linear approximation approach developed by Sato and others [16]. The results of computational experiments show that the proposed approach finds the model with the lowest AIC value if the number of candidates for variables is 45 or lower.\",\"PeriodicalId\":51107,\"journal\":{\"name\":\"Journal of the Operations Research Society of Japan\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-01-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the Operations Research Society of Japan\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.15807/JORSJ.62.15\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Decision Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Operations Research Society of Japan","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15807/JORSJ.62.15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Decision Sciences","Score":null,"Total":0}
APPLICATION OF A MIXED INTEGER NONLINEAR PROGRAMMING APPROACH TO VARIABLE SELECTION IN LOGISTIC REGRESSION
Variable selection is the process of finding variables relevant to a given dataset in model construction. One of the techniques for variable selection is exponentially evaluating many models with a goodness-of-fit (GOF) measure, for example, Akaike information criterion (AIC). The model with the lowest GOF value is considered as the best model. We proposed a mixed integer nonlinear programming approach to AIC minimization for linear regression and showed that the approach outperformed existing approaches in terms of computational time [13]. In this study, we apply the approach in [13] to AIC minimization for logistic regression and explain that a few of the techniques developed previously [13], for example, relaxation and a branching rule, can be used for the AIC minimization. The proposed approach requires solving relaxation problems, which are unconstrained convex problems. We apply an iterative method with an effective initial guess to solve these problems. We implement the proposed approach via SCIP, which is a noncommercial optimization software and a branch-and-bound framework. We compare the proposed approach with a piecewise linear approximation approach developed by Sato and others [16]. The results of computational experiments show that the proposed approach finds the model with the lowest AIC value if the number of candidates for variables is 45 or lower.
期刊介绍:
The journal publishes original work and quality reviews in the field of operations research and management science to OR practitioners and researchers in two substantive categories: operations research methods; applications and practices of operations research in industry, public sector, and all areas of science and engineering.