Recalibration of Predicted Probabilities Using the “Logit Shift”: Why Does It Work, and When Can It Be Expected to Work Well?

IF 4.7 2区社会学 Q1 POLITICAL SCIENCE Political Analysis Pub Date : 2023-01-09 DOI:10.1017/pan.2022.31

Evan T. R. Rosenman, Cory McCartan, Santiago Olivella

{"title":"Recalibration of Predicted Probabilities Using the “Logit Shift”: Why Does It Work, and When Can It Be Expected to Work Well?","authors":"Evan T. R. Rosenman, Cory McCartan, Santiago Olivella","doi":"10.1017/pan.2022.31","DOIUrl":null,"url":null,"abstract":"Abstract The output of predictive models is routinely recalibrated by reconciling low-level predictions with known quantities defined at higher levels of aggregation. For example, models predicting vote probabilities at the individual level in U.S. elections can be adjusted so that their aggregation matches the observed vote totals in each county, thus producing better-calibrated predictions. In this research note, we provide theoretical grounding for one of the most commonly used recalibration strategies, known colloquially as the “logit shift.” Typically cast as a heuristic adjustment strategy (whereby a constant correction on the logit scale is found, such that aggregated predictions match target totals), we show that the logit shift offers a fast and accurate approximation to a principled, but computationally impractical adjustment strategy: computing the posterior prediction probabilities, conditional on the observed totals. After deriving analytical bounds on the quality of the approximation, we illustrate its accuracy using Monte Carlo simulations. We also discuss scenarios in which the logit shift is less effective at recalibrating predictions: when the target totals are defined only for highly heterogeneous populations, and when the original predictions correctly capture the mean of true individual probabilities, but fail to capture the shape of their distribution.","PeriodicalId":48270,"journal":{"name":"Political Analysis","volume":"31 1","pages":"651 - 661"},"PeriodicalIF":4.7000,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Political Analysis","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1017/pan.2022.31","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"POLITICAL SCIENCE","Score":null,"Total":0}

引用次数: 3

Abstract

Abstract The output of predictive models is routinely recalibrated by reconciling low-level predictions with known quantities defined at higher levels of aggregation. For example, models predicting vote probabilities at the individual level in U.S. elections can be adjusted so that their aggregation matches the observed vote totals in each county, thus producing better-calibrated predictions. In this research note, we provide theoretical grounding for one of the most commonly used recalibration strategies, known colloquially as the “logit shift.” Typically cast as a heuristic adjustment strategy (whereby a constant correction on the logit scale is found, such that aggregated predictions match target totals), we show that the logit shift offers a fast and accurate approximation to a principled, but computationally impractical adjustment strategy: computing the posterior prediction probabilities, conditional on the observed totals. After deriving analytical bounds on the quality of the approximation, we illustrate its accuracy using Monte Carlo simulations. We also discuss scenarios in which the logit shift is less effective at recalibrating predictions: when the target totals are defined only for highly heterogeneous populations, and when the original predictions correctly capture the mean of true individual probabilities, but fail to capture the shape of their distribution.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用“Logit Shift”重新校准预测概率：为什么它有效，何时可以预期它有效？

摘要预测模型的输出通常通过将低水平的预测与在较高聚合水平下定义的已知量进行协调来重新校准。例如，预测美国选举中个人投票概率的模型可以进行调整，使其总和与每个县观察到的投票总数相匹配，从而产生更好的校准预测。在这篇研究报告中，我们为最常用的重新校准策略之一提供了理论基础，通俗地说就是“logit转移”。通常被视为启发式调整策略（即在logit量表上找到一个恒定的修正，使汇总预测与目标总数相匹配），我们证明了logit移位提供了一种快速而准确的近似于一种有原则但在计算上不切实际的调整策略：以观察到的总数为条件计算后验预测概率。在推导出近似质量的分析边界后，我们使用蒙特卡罗模拟来说明其准确性。我们还讨论了logit偏移在重新校准预测方面效果较差的情况：当目标总数仅针对高度异质的人群定义时，以及当原始预测正确地捕捉到真实个体概率的平均值，但未能捕捉到其分布的形状时。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Political Analysis POLITICAL SCIENCE-

CiteScore

8.80

自引率

3.70%

发文量

期刊介绍： Political Analysis chronicles these exciting developments by publishing the most sophisticated scholarship in the field. It is the place to learn new methods, to find some of the best empirical scholarship, and to publish your best research.