{"title":"Predicting the deterrence effect of tax audits. A machine learning approach","authors":"Michele Rabasco, Pietro Battiston","doi":"10.1111/meca.12420","DOIUrl":null,"url":null,"abstract":"<p>We apply machine learning methods to the prediction of deterrence effects of tax audits. Based on tax declarations data, we predict the increase in future income declarations after being targeted by an audit. We find that flexible models, such as classification trees and ensemble methods based on them, outperform penalized linear models such as Lasso and ridge regression in predicting taxpayers more likely to increase their declarations after an audit. We show that despite the non-randomness of audits, their specific time structure and the distribution of changes in declared amounts suggest a causal interpretation of our results; that is, our approach detects a heterogeneity in the reaction to a tax audit, rather than just forecasting an unconditional future increase. We find that taxpayers identified by our model will on average increase their declared income by €14,461—the average among all audited taxpayers being €−205. Our approach allows the tax agency to yield significantly larger revenues by appropriately targeting tax audit.</p>","PeriodicalId":46885,"journal":{"name":"Metroeconomica","volume":"74 3","pages":"531-556"},"PeriodicalIF":1.0000,"publicationDate":"2023-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Metroeconomica","FirstCategoryId":"96","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/meca.12420","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
We apply machine learning methods to the prediction of deterrence effects of tax audits. Based on tax declarations data, we predict the increase in future income declarations after being targeted by an audit. We find that flexible models, such as classification trees and ensemble methods based on them, outperform penalized linear models such as Lasso and ridge regression in predicting taxpayers more likely to increase their declarations after an audit. We show that despite the non-randomness of audits, their specific time structure and the distribution of changes in declared amounts suggest a causal interpretation of our results; that is, our approach detects a heterogeneity in the reaction to a tax audit, rather than just forecasting an unconditional future increase. We find that taxpayers identified by our model will on average increase their declared income by €14,461—the average among all audited taxpayers being €−205. Our approach allows the tax agency to yield significantly larger revenues by appropriately targeting tax audit.