{"title":"Coefficient of Determination","authors":"F. Pyrczak, Deborah M. Oh","doi":"10.4324/9781315179803-39","DOIUrl":null,"url":null,"abstract":"As with simple regression, in the multiple linear regression model, we can interpret ! SYY \" RSS SYY as the fraction of the variability in Y explained by including the terms u 1 , u 2 , … , u k-1 in the mean function (as compared to the constant mean function). In the multiple regression context, ! SYY \" RSS SYY is denoted as R 2 (with capital R). R 2 is called the coefficient of (multiple) determination or (misleadingly) the squared multiple correlation. • R alone (unsquared) has no meaning in multiple regression. • By convention, we use small r for the sample correlation in simple regression. • In multiple regression, we can talk about correlation between two variables (i.e,, just two at once). • In particular, in multiple regression, r ij is often used to denote the sample correlation coefficient between terms u i and u j. • R 2 is sometimes used for comparing models. But caution is needed: o It only makes sense to use for comparing models that are in the same units (e.g., submodels of the same full model). o A submodel of a model will always have a smaller R 2 than the larger model. o As discussed above and below, many other considerations should be taken to account in selecting a model.","PeriodicalId":196141,"journal":{"name":"Making Sense of Statistics","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"161","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Making Sense of Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4324/9781315179803-39","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 161
Abstract
As with simple regression, in the multiple linear regression model, we can interpret ! SYY " RSS SYY as the fraction of the variability in Y explained by including the terms u 1 , u 2 , … , u k-1 in the mean function (as compared to the constant mean function). In the multiple regression context, ! SYY " RSS SYY is denoted as R 2 (with capital R). R 2 is called the coefficient of (multiple) determination or (misleadingly) the squared multiple correlation. • R alone (unsquared) has no meaning in multiple regression. • By convention, we use small r for the sample correlation in simple regression. • In multiple regression, we can talk about correlation between two variables (i.e,, just two at once). • In particular, in multiple regression, r ij is often used to denote the sample correlation coefficient between terms u i and u j. • R 2 is sometimes used for comparing models. But caution is needed: o It only makes sense to use for comparing models that are in the same units (e.g., submodels of the same full model). o A submodel of a model will always have a smaller R 2 than the larger model. o As discussed above and below, many other considerations should be taken to account in selecting a model.