Learning Whenever Learning is Possible: Universal Learning under General Stochastic Processes

2020 Information Theory and Applications Workshop (ITA) Pub Date : 2017-06-05 DOI:10.1109/ita50056.2020.9244938

Steve Hanneke

{"title":"Learning Whenever Learning is Possible: Universal Learning under General Stochastic Processes","authors":"Steve Hanneke","doi":"10.1109/ita50056.2020.9244938","DOIUrl":null,"url":null,"abstract":"This work initiates a general study of learning and generalization without the i.i.d. assumption, starting from first principles. While the standard approach to statistical learning theory is based on assumptions chosen largely for their convenience (e.g., i.i.d. or stationary ergodic), in this work we are interested in developing a theory of learning based only on the most fundamental and natural assumptions implicit in the requirements of the learning problem itself. We specifically study universally consistent function learning, where the objective is to obtain low long-run average loss for any target function, when the data follow a given stochastic process. We are then interested in the question of whether there exist learning rules guaranteed to be universally consistent given only the assumption that universally consistent learning is possible for the given data process. The reasoning that motivates this criterion emanates from a kind of optimist’s decision theory, and so we refer to such learning rules as being optimistically universal. We study this question in three natural learning settings: inductive, self-adaptive, and online. Remarkably, as our strongest positive result, we find that optimistically universal learning rules do indeed exist in the self-adaptive learning setting. Establishing this fact requires us to develop new approaches to the design of learning algorithms. Along the way, we also identify concise characterizations of the family of processes under which universally consistent learning is possible in the inductive and self-adaptive settings. We additionally pose a number of enticing open problems, particularly for the online learning setting.","PeriodicalId":137257,"journal":{"name":"2020 Information Theory and Applications Workshop (ITA)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2017-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"24","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Information Theory and Applications Workshop (ITA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ita50056.2020.9244938","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 24

Abstract

This work initiates a general study of learning and generalization without the i.i.d. assumption, starting from first principles. While the standard approach to statistical learning theory is based on assumptions chosen largely for their convenience (e.g., i.i.d. or stationary ergodic), in this work we are interested in developing a theory of learning based only on the most fundamental and natural assumptions implicit in the requirements of the learning problem itself. We specifically study universally consistent function learning, where the objective is to obtain low long-run average loss for any target function, when the data follow a given stochastic process. We are then interested in the question of whether there exist learning rules guaranteed to be universally consistent given only the assumption that universally consistent learning is possible for the given data process. The reasoning that motivates this criterion emanates from a kind of optimist’s decision theory, and so we refer to such learning rules as being optimistically universal. We study this question in three natural learning settings: inductive, self-adaptive, and online. Remarkably, as our strongest positive result, we find that optimistically universal learning rules do indeed exist in the self-adaptive learning setting. Establishing this fact requires us to develop new approaches to the design of learning algorithms. Along the way, we also identify concise characterizations of the family of processes under which universally consistent learning is possible in the inductive and self-adaptive settings. We additionally pose a number of enticing open problems, particularly for the online learning setting.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

学习只要有可能:一般随机过程下的通用学习

这项工作开创了一个一般性的学习和泛化的研究，没有i.i.d假设，从第一原理开始。虽然统计学习理论的标准方法主要基于方便选择的假设(例如，i.i.d或平稳遍历)，但在这项工作中，我们感兴趣的是开发一种仅基于学习问题本身要求中隐含的最基本和自然假设的学习理论。我们专门研究了普遍一致的函数学习，其目标是在数据遵循给定的随机过程时获得任何目标函数的低长期平均损失。然后，我们感兴趣的问题是，是否存在保证普遍一致的学习规则，只要假设对于给定的数据过程，普遍一致的学习是可能的。激励这一标准的推理源自一种乐观主义的决策理论，因此我们将这样的学习规则称为乐观普适性。我们在三种自然学习环境中研究这个问题:归纳、自适应和在线。值得注意的是，作为我们最强的积极结果，我们发现乐观通用学习规则在自适应学习环境中确实存在。确立这一事实要求我们开发新的方法来设计学习算法。在此过程中，我们还确定了过程族的简明特征，在归纳和自适应设置中，普遍一致的学习是可能的。此外，我们还提出了一些诱人的开放式问题，特别是针对在线学习设置。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 Information Theory and Applications Workshop (ITA)

自引率

0.00%

发文量

期刊最新文献

Massive MIMO is Very Useful for Pilot-Free Uplink Communications Simplified Ray Tracing for the Millimeter Wave Channel: A Performance Evaluation On Marton's Achievable Region: Local Tensorization for Product Channels with a Binary Component Improve Robustness of Deep Neural Networks by Coding On Nonnegative CP Tensor Decomposition Robustness to Noise