{"title":"多基准现实检查","authors":"Ignacio Arbués , Mariano Matilla-García","doi":"10.1016/j.econmod.2024.106848","DOIUrl":null,"url":null,"abstract":"<div><p>Empirical economic modelers often have to choose between two classes of models, with each class containing multiple models. In many cases, this decision is based on the predictive ability of the considered models. This entails that multiple testing and/or p-hacking pose known risks. This study presents a new statistical approach for comparing all model in a single test, serving as a multi-benchmark reality check test. The behavior of the test is studied asymptotically and in small finite samples. We show how the new approach works by analyzing whether one family of linear bivariate models outperforms a univariate family in predicting commodity prices. This paper raises new questions for future research. From an empirical viewpoint, we present several open questions in economic modeling that can be tested with multi-benchmark tests. Meanwhile, from a theoretical viewpoint, further studies can investigate whether a more general method for approximating or simulating the test distribution can be developed.</p></div>","PeriodicalId":48419,"journal":{"name":"Economic Modelling","volume":"140 ","pages":"Article 106848"},"PeriodicalIF":4.2000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0264999324002050/pdfft?md5=8fe46243b7d8dd56e25a17cfa5c3b4f7&pid=1-s2.0-S0264999324002050-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Multibenchmark reality checks\",\"authors\":\"Ignacio Arbués , Mariano Matilla-García\",\"doi\":\"10.1016/j.econmod.2024.106848\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Empirical economic modelers often have to choose between two classes of models, with each class containing multiple models. In many cases, this decision is based on the predictive ability of the considered models. This entails that multiple testing and/or p-hacking pose known risks. This study presents a new statistical approach for comparing all model in a single test, serving as a multi-benchmark reality check test. The behavior of the test is studied asymptotically and in small finite samples. We show how the new approach works by analyzing whether one family of linear bivariate models outperforms a univariate family in predicting commodity prices. This paper raises new questions for future research. From an empirical viewpoint, we present several open questions in economic modeling that can be tested with multi-benchmark tests. Meanwhile, from a theoretical viewpoint, further studies can investigate whether a more general method for approximating or simulating the test distribution can be developed.</p></div>\",\"PeriodicalId\":48419,\"journal\":{\"name\":\"Economic Modelling\",\"volume\":\"140 \",\"pages\":\"Article 106848\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2024-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S0264999324002050/pdfft?md5=8fe46243b7d8dd56e25a17cfa5c3b4f7&pid=1-s2.0-S0264999324002050-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Economic Modelling\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0264999324002050\",\"RegionNum\":2,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Economic Modelling","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0264999324002050","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
摘要
实证经济建模人员经常需要在两类模型中做出选择,每类模型都包含多个模型。在许多情况下,这一决定基于所考虑模型的预测能力。这就意味着多重测试和/或 p 黑客会带来已知的风险。本研究提出了一种新的统计方法,用于在单个测试中对所有模型进行比较,作为多基准现实检验测试。我们对该测试的行为进行了渐进研究,并研究了少量有限样本。我们通过分析线性双变量模型族在预测商品价格方面是否优于单变量模型族,展示了新方法的工作原理。本文提出了未来研究的新问题。从实证的角度来看,我们提出了经济建模中的几个未决问题,这些问题可以通过多基准测试进行检验。同时,从理论角度来看,进一步的研究可以探讨是否可以开发出一种更通用的方法来近似或模拟测试分布。
Empirical economic modelers often have to choose between two classes of models, with each class containing multiple models. In many cases, this decision is based on the predictive ability of the considered models. This entails that multiple testing and/or p-hacking pose known risks. This study presents a new statistical approach for comparing all model in a single test, serving as a multi-benchmark reality check test. The behavior of the test is studied asymptotically and in small finite samples. We show how the new approach works by analyzing whether one family of linear bivariate models outperforms a univariate family in predicting commodity prices. This paper raises new questions for future research. From an empirical viewpoint, we present several open questions in economic modeling that can be tested with multi-benchmark tests. Meanwhile, from a theoretical viewpoint, further studies can investigate whether a more general method for approximating or simulating the test distribution can be developed.
期刊介绍:
Economic Modelling fills a major gap in the economics literature, providing a single source of both theoretical and applied papers on economic modelling. The journal prime objective is to provide an international review of the state-of-the-art in economic modelling. Economic Modelling publishes the complete versions of many large-scale models of industrially advanced economies which have been developed for policy analysis. Examples are the Bank of England Model and the US Federal Reserve Board Model which had hitherto been unpublished. As individual models are revised and updated, the journal publishes subsequent papers dealing with these revisions, so keeping its readers as up to date as possible.