A. Rodríguez-Sánchez, Román Salmerón-Gómez, Catalina García-García
{"title":"The coefficient of determination in the ridge regression","authors":"A. Rodríguez-Sánchez, Román Salmerón-Gómez, Catalina García-García","doi":"10.1080/03610918.2019.1649421","DOIUrl":null,"url":null,"abstract":"Abstract In a linear regression, the coefficient of determination, R 2, is a relevant measure that represents the percentage of variation in the dependent variable that is explained by a set of independent variables. Thus, it measures the predictive ability of the estimated model. For an ordinary least squares (OLS) estimator, this coefficient is calculated from the decomposition of the sum of squares. However, when the model presents collinearity problems (a strong linear relation between the independent variables), the OLS estimation is unstable, and other estimation methodologies are proposed, with the ridge estimation being the most widely applied. This paper shows that the decomposition of the sum of squares is not verified in the ridge regression and proposes how the coefficient of determination should be calculated in this case.","PeriodicalId":119237,"journal":{"name":"Commun. Stat. Simul. Comput.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Commun. Stat. Simul. Comput.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/03610918.2019.1649421","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 16
Abstract
Abstract In a linear regression, the coefficient of determination, R 2, is a relevant measure that represents the percentage of variation in the dependent variable that is explained by a set of independent variables. Thus, it measures the predictive ability of the estimated model. For an ordinary least squares (OLS) estimator, this coefficient is calculated from the decomposition of the sum of squares. However, when the model presents collinearity problems (a strong linear relation between the independent variables), the OLS estimation is unstable, and other estimation methodologies are proposed, with the ridge estimation being the most widely applied. This paper shows that the decomposition of the sum of squares is not verified in the ridge regression and proposes how the coefficient of determination should be calculated in this case.