Dungang Liu, Xiaorui Zhu, Brandon Greenwell, Zewei Lin
{"title":"probit模型的一个新的拟合优度度量:代理R2","authors":"Dungang Liu, Xiaorui Zhu, Brandon Greenwell, Zewei Lin","doi":"10.1111/bmsp.12289","DOIUrl":null,"url":null,"abstract":"<p>Probit models are used extensively for inferential purposes in the social sciences as discrete data are prevalent in a vast body of social studies. Among many accompanying model inference problems, a critical question remains unsettled: how to develop a goodness-of-fit measure that resembles the ordinary least square (OLS) <i>R</i><sup>2</sup> used for linear models. Such a measure has long been sought to achieve ‘comparability’ of different empirical models across multiple samples addressing similar social questions. To this end, we propose a novel <i>R</i><sup>2</sup> measure for probit models using the notion of surrogacy – simulating a continuous variable <math>\n <semantics>\n <mrow>\n <mi>S</mi>\n </mrow>\n </semantics></math> as a <i>surrogate</i> of the original discrete response (Liu & Zhang, Journal of the American Statistical Association, 113, 845 and 2018). The proposed <i>R</i><sup>2</sup> is the proportion of the variance of the surrogate response explained by explanatory variables through a <i>linear model</i>, and we call it a surrogate <i>R</i><sup>2</sup>. This paper shows both theoretically and numerically that the surrogate <i>R</i><sup>2</sup> approximates the OLS <i>R</i><sup>2</sup> based on the latent continuous variable, preserves the interpretation of explained variation, and maintains monotonicity between nested models. As no other pseudo <i>R</i><sup>2</sup>, McKelvey and Zavoina's and McFadden's included, can meet all the three criteria simultaneously, our measure fills this crucial void in probit model inference.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"76 1","pages":"192-210"},"PeriodicalIF":1.5000,"publicationDate":"2022-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12289","citationCount":"4","resultStr":"{\"title\":\"A new goodness-of-fit measure for probit models: Surrogate R2\",\"authors\":\"Dungang Liu, Xiaorui Zhu, Brandon Greenwell, Zewei Lin\",\"doi\":\"10.1111/bmsp.12289\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Probit models are used extensively for inferential purposes in the social sciences as discrete data are prevalent in a vast body of social studies. Among many accompanying model inference problems, a critical question remains unsettled: how to develop a goodness-of-fit measure that resembles the ordinary least square (OLS) <i>R</i><sup>2</sup> used for linear models. Such a measure has long been sought to achieve ‘comparability’ of different empirical models across multiple samples addressing similar social questions. To this end, we propose a novel <i>R</i><sup>2</sup> measure for probit models using the notion of surrogacy – simulating a continuous variable <math>\\n <semantics>\\n <mrow>\\n <mi>S</mi>\\n </mrow>\\n </semantics></math> as a <i>surrogate</i> of the original discrete response (Liu & Zhang, Journal of the American Statistical Association, 113, 845 and 2018). The proposed <i>R</i><sup>2</sup> is the proportion of the variance of the surrogate response explained by explanatory variables through a <i>linear model</i>, and we call it a surrogate <i>R</i><sup>2</sup>. This paper shows both theoretically and numerically that the surrogate <i>R</i><sup>2</sup> approximates the OLS <i>R</i><sup>2</sup> based on the latent continuous variable, preserves the interpretation of explained variation, and maintains monotonicity between nested models. As no other pseudo <i>R</i><sup>2</sup>, McKelvey and Zavoina's and McFadden's included, can meet all the three criteria simultaneously, our measure fills this crucial void in probit model inference.</p>\",\"PeriodicalId\":55322,\"journal\":{\"name\":\"British Journal of Mathematical & Statistical Psychology\",\"volume\":\"76 1\",\"pages\":\"192-210\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2022-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12289\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"British Journal of Mathematical & Statistical Psychology\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/bmsp.12289\",\"RegionNum\":3,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"British Journal of Mathematical & Statistical Psychology","FirstCategoryId":"102","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/bmsp.12289","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 4
摘要
Probit模型在社会科学中被广泛用于推理目的,因为离散数据在大量社会研究中普遍存在。在许多伴随的模型推理问题中,一个关键问题仍未解决:如何开发一个类似于用于线性模型的普通最小二乘(OLS)R2的拟合优度度量。长期以来,人们一直在寻求这样一种衡量标准,以实现解决类似社会问题的多个样本中不同实证模型的“可比性”。为此,我们使用代孕的概念为probit模型提出了一种新的R2度量——模拟连续变量S作为原始离散响应的代孕(Liu&;Zhang,Journal of the American Statistical Association,113845 and 2018)。所提出的R2是通过线性模型由解释变量解释的替代响应的方差的比例,我们称之为替代R2。本文从理论和数值上表明,代理R2基于潜在连续变量近似OLS R2,保留了对解释变化的解释,并保持了嵌套模型之间的单调性。由于没有其他伪R2,包括McKelvey和Zavoina的以及McFadden的,能够同时满足所有三个标准,我们的度量填补了probit模型推理中的这一关键空白。
A new goodness-of-fit measure for probit models: Surrogate R2
Probit models are used extensively for inferential purposes in the social sciences as discrete data are prevalent in a vast body of social studies. Among many accompanying model inference problems, a critical question remains unsettled: how to develop a goodness-of-fit measure that resembles the ordinary least square (OLS) R2 used for linear models. Such a measure has long been sought to achieve ‘comparability’ of different empirical models across multiple samples addressing similar social questions. To this end, we propose a novel R2 measure for probit models using the notion of surrogacy – simulating a continuous variable as a surrogate of the original discrete response (Liu & Zhang, Journal of the American Statistical Association, 113, 845 and 2018). The proposed R2 is the proportion of the variance of the surrogate response explained by explanatory variables through a linear model, and we call it a surrogate R2. This paper shows both theoretically and numerically that the surrogate R2 approximates the OLS R2 based on the latent continuous variable, preserves the interpretation of explained variation, and maintains monotonicity between nested models. As no other pseudo R2, McKelvey and Zavoina's and McFadden's included, can meet all the three criteria simultaneously, our measure fills this crucial void in probit model inference.
期刊介绍:
The British Journal of Mathematical and Statistical Psychology publishes articles relating to areas of psychology which have a greater mathematical or statistical aspect of their argument than is usually acceptable to other journals including:
• mathematical psychology
• statistics
• psychometrics
• decision making
• psychophysics
• classification
• relevant areas of mathematics, computing and computer software
These include articles that address substantitive psychological issues or that develop and extend techniques useful to psychologists. New models for psychological processes, new approaches to existing data, critiques of existing models and improved algorithms for estimating the parameters of a model are examples of articles which may be favoured.