Scoring rules and performance, new analysis of expert judgment data

FUTURES & FORESIGHT SCIENCE Pub Date : 2024-07-01 DOI:10.1002/ffo2.189

Gabriela F. Nane, Roger M. Cooke

引用次数: 0

Abstract

A review of scoring rules highlights the distinction between rewarding honesty and rewarding quality. This motivates the introduction of a scale-invariant version of the Continuous Ranked Probability Score (CRPS) which enables statistical accuracy (SA) testing based on an exact rather than an asymptotic distribution of the density of convolutions. A recent data set of 6761 expert probabilistic forecasts for questions for which the actual values are known is used to compare performance. New insights include that (a) variance due to assessed variables dominates variance due to experts, (b) performance on mean absolute percentage error (MAPE) is weakly related to SA (c) scale-invariant CRPS combinations compete with the Classical Model (CM) on SA and MAPE, and (d) CRPS is more forgiving with regard to SA than the CM as CRPS is insensitive to location bias.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

评分规则和绩效，对专家判断数据的新分析

对评分规则的回顾强调了诚实奖励和质量奖励之间的区别。这促使我们引入了连续排名概率得分（CRPS）的尺度不变版本，该版本可根据卷积密度的精确分布而非渐近分布进行统计准确性（SA）测试。最近的数据集包含 6761 个专家对已知实际值的问题进行的概率预测，用于比较性能。新发现包括：(a) 评估变量引起的方差主导专家引起的方差；(b) 平均绝对百分比误差 (MAPE) 的性能与 SA 关系不大；(c) 在 SA 和 MAPE 方面，规模不变的 CRPS 组合与经典模型 (CM) 竞争；(d) CRPS 在 SA 方面比 CM 更宽容，因为 CRPS 对位置偏差不敏感。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

FUTURES & FORESIGHT SCIENCE

CiteScore

7.00

自引率

0.00%

发文量