{"title":"Improve robustness of machine learning via efficient optimization and conformal prediction","authors":"Yan Yan","doi":"10.1002/aaai.12173","DOIUrl":null,"url":null,"abstract":"<p>The advance of machine learning (ML) systems in real-world scenarios usually expects safe deployment in high-stake applications (e.g., medical diagnosis) for critical decision-making process. To this end, provable robustness of ML is usually required to measure and understand how reliable the deployed ML system is and how trustworthy their predictions can be. Many studies have been done to enhance the robustness in recent years from different angles, such as variance-regularized robust objective functions and conformal prediction (CP) for uncertainty quantification on testing data. Although these tools provably improve the robustness of ML model, there is still an inevitable gap to integrate them into an <i>end-to-end</i> deployment. For example, robust objectives usually require carefully designed optimization algorithms, while CP treats ML models as black boxes. This paper is a brief introduction to our recent research focusing on filling this gap. Specifically, for learning robust objectives, we designed sample-efficient stochastic optimization algorithms that achieves the optimal (or faster compared to existing algorithms) convergence rates. Moreover, for CP-based uncertainty quantification, we established a framework to analyze the expected prediction set size (smaller size means more efficiency) of CP methods in both standard and adversarial settings. This paper elaborates the key challenges and our exploration towards efficient algorithms with details of background methods, notions for robustness measure, concepts of algorithmic efficiency, our proposed algorithms and results. All of them further motivate our future research on risk-aware ML that can be critical for AI–human collaborative systems. The future work mainly targets designing conformal robust objectives and their efficient optimization algorithms.</p>","PeriodicalId":7854,"journal":{"name":"Ai Magazine","volume":"45 2","pages":"270-279"},"PeriodicalIF":2.5000,"publicationDate":"2024-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aaai.12173","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ai Magazine","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/aaai.12173","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The advance of machine learning (ML) systems in real-world scenarios usually expects safe deployment in high-stake applications (e.g., medical diagnosis) for critical decision-making process. To this end, provable robustness of ML is usually required to measure and understand how reliable the deployed ML system is and how trustworthy their predictions can be. Many studies have been done to enhance the robustness in recent years from different angles, such as variance-regularized robust objective functions and conformal prediction (CP) for uncertainty quantification on testing data. Although these tools provably improve the robustness of ML model, there is still an inevitable gap to integrate them into an end-to-end deployment. For example, robust objectives usually require carefully designed optimization algorithms, while CP treats ML models as black boxes. This paper is a brief introduction to our recent research focusing on filling this gap. Specifically, for learning robust objectives, we designed sample-efficient stochastic optimization algorithms that achieves the optimal (or faster compared to existing algorithms) convergence rates. Moreover, for CP-based uncertainty quantification, we established a framework to analyze the expected prediction set size (smaller size means more efficiency) of CP methods in both standard and adversarial settings. This paper elaborates the key challenges and our exploration towards efficient algorithms with details of background methods, notions for robustness measure, concepts of algorithmic efficiency, our proposed algorithms and results. All of them further motivate our future research on risk-aware ML that can be critical for AI–human collaborative systems. The future work mainly targets designing conformal robust objectives and their efficient optimization algorithms.
机器学习(ML)系统在现实世界场景中的发展,通常期望在关键决策过程中的高风险应用(如医疗诊断)中安全部署。为此,通常需要证明 ML 的鲁棒性,以衡量和了解部署的 ML 系统的可靠性及其预测的可信度。近年来,人们从不同角度对增强鲁棒性进行了许多研究,例如方差规则化鲁棒目标函数和用于测试数据不确定性量化的保形预测(CP)。虽然这些工具都能有效提高 ML 模型的鲁棒性,但要将它们集成到端到端的部署中,仍存在不可避免的差距。例如,稳健目标通常需要精心设计的优化算法,而 CP 则将 ML 模型视为黑盒。本文简要介绍了我们最近为填补这一空白而开展的研究。具体来说,针对鲁棒目标的学习,我们设计了样本效率高的随机优化算法,以达到最佳收敛率(或与现有算法相比更快的收敛率)。此外,对于基于 CP 的不确定性量化,我们建立了一个框架,用于分析标准和对抗环境下 CP 方法的预期预测集规模(规模越小效率越高)。本文通过详细介绍背景方法、鲁棒性度量概念、算法效率概念、我们提出的算法和结果,阐述了关键挑战和我们对高效算法的探索。所有这些都进一步激发了我们对风险感知人工智能的未来研究,这对人工智能与人类协作系统至关重要。未来工作的主要目标是设计保形鲁棒目标及其高效优化算法。
期刊介绍:
AI Magazine publishes original articles that are reasonably self-contained and aimed at a broad spectrum of the AI community. Technical content should be kept to a minimum. In general, the magazine does not publish articles that have been published elsewhere in whole or in part. The magazine welcomes the contribution of articles on the theory and practice of AI as well as general survey articles, tutorial articles on timely topics, conference or symposia or workshop reports, and timely columns on topics of interest to AI scientists.