{"title":"机器学习中有效不确定性量化的保形预测方法比较研究","authors":"Nicolas Dewolf","doi":"arxiv-2405.02082","DOIUrl":null,"url":null,"abstract":"In the past decades, most work in the area of data analysis and machine\nlearning was focused on optimizing predictive models and getting better results\nthan what was possible with existing models. To what extent the metrics with\nwhich such improvements were measured were accurately capturing the intended\ngoal, whether the numerical differences in the resulting values were\nsignificant, or whether uncertainty played a role in this study and if it\nshould have been taken into account, was of secondary importance. Whereas\nprobability theory, be it frequentist or Bayesian, used to be the gold standard\nin science before the advent of the supercomputer, it was quickly replaced in\nfavor of black box models and sheer computing power because of their ability to\nhandle large data sets. This evolution sadly happened at the expense of\ninterpretability and trustworthiness. However, while people are still trying to\nimprove the predictive power of their models, the community is starting to\nrealize that for many applications it is not so much the exact prediction that\nis of importance, but rather the variability or uncertainty. The work in this dissertation tries to further the quest for a world where\neveryone is aware of uncertainty, of how important it is and how to embrace it\ninstead of fearing it. A specific, though general, framework that allows anyone\nto obtain accurate uncertainty estimates is singled out and analysed. Certain\naspects and applications of the framework -- dubbed `conformal prediction' --\nare studied in detail. Whereas many approaches to uncertainty quantification\nmake strong assumptions about the data, conformal prediction is, at the time of\nwriting, the only framework that deserves the title `distribution-free'. No\nparametric assumptions have to be made and the nonparametric results also hold\nwithout having to resort to the law of large numbers in the asymptotic regime.","PeriodicalId":501330,"journal":{"name":"arXiv - MATH - Statistics Theory","volume":"16 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A comparative study of conformal prediction methods for valid uncertainty quantification in machine learning\",\"authors\":\"Nicolas Dewolf\",\"doi\":\"arxiv-2405.02082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the past decades, most work in the area of data analysis and machine\\nlearning was focused on optimizing predictive models and getting better results\\nthan what was possible with existing models. To what extent the metrics with\\nwhich such improvements were measured were accurately capturing the intended\\ngoal, whether the numerical differences in the resulting values were\\nsignificant, or whether uncertainty played a role in this study and if it\\nshould have been taken into account, was of secondary importance. Whereas\\nprobability theory, be it frequentist or Bayesian, used to be the gold standard\\nin science before the advent of the supercomputer, it was quickly replaced in\\nfavor of black box models and sheer computing power because of their ability to\\nhandle large data sets. This evolution sadly happened at the expense of\\ninterpretability and trustworthiness. However, while people are still trying to\\nimprove the predictive power of their models, the community is starting to\\nrealize that for many applications it is not so much the exact prediction that\\nis of importance, but rather the variability or uncertainty. The work in this dissertation tries to further the quest for a world where\\neveryone is aware of uncertainty, of how important it is and how to embrace it\\ninstead of fearing it. A specific, though general, framework that allows anyone\\nto obtain accurate uncertainty estimates is singled out and analysed. Certain\\naspects and applications of the framework -- dubbed `conformal prediction' --\\nare studied in detail. Whereas many approaches to uncertainty quantification\\nmake strong assumptions about the data, conformal prediction is, at the time of\\nwriting, the only framework that deserves the title `distribution-free'. No\\nparametric assumptions have to be made and the nonparametric results also hold\\nwithout having to resort to the law of large numbers in the asymptotic regime.\",\"PeriodicalId\":501330,\"journal\":{\"name\":\"arXiv - MATH - Statistics Theory\",\"volume\":\"16 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - MATH - Statistics Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2405.02082\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - MATH - Statistics Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.02082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A comparative study of conformal prediction methods for valid uncertainty quantification in machine learning
In the past decades, most work in the area of data analysis and machine
learning was focused on optimizing predictive models and getting better results
than what was possible with existing models. To what extent the metrics with
which such improvements were measured were accurately capturing the intended
goal, whether the numerical differences in the resulting values were
significant, or whether uncertainty played a role in this study and if it
should have been taken into account, was of secondary importance. Whereas
probability theory, be it frequentist or Bayesian, used to be the gold standard
in science before the advent of the supercomputer, it was quickly replaced in
favor of black box models and sheer computing power because of their ability to
handle large data sets. This evolution sadly happened at the expense of
interpretability and trustworthiness. However, while people are still trying to
improve the predictive power of their models, the community is starting to
realize that for many applications it is not so much the exact prediction that
is of importance, but rather the variability or uncertainty. The work in this dissertation tries to further the quest for a world where
everyone is aware of uncertainty, of how important it is and how to embrace it
instead of fearing it. A specific, though general, framework that allows anyone
to obtain accurate uncertainty estimates is singled out and analysed. Certain
aspects and applications of the framework -- dubbed `conformal prediction' --
are studied in detail. Whereas many approaches to uncertainty quantification
make strong assumptions about the data, conformal prediction is, at the time of
writing, the only framework that deserves the title `distribution-free'. No
parametric assumptions have to be made and the nonparametric results also hold
without having to resort to the law of large numbers in the asymptotic regime.