W-DOE: Wasserstein Distribution-Agnostic Outlier Exposure

IF 18.6 IEEE transactions on pattern analysis and machine intelligence Pub Date : 2025-01-17 DOI:10.1109/TPAMI.2025.3531000

Qizhou Wang;Bo Han;Yang Liu;Chen Gong;Tongliang Liu;Jiming Liu

{"title":"W-DOE: Wasserstein Distribution-Agnostic Outlier Exposure","authors":"Qizhou Wang;Bo Han;Yang Liu;Chen Gong;Tongliang Liu;Jiming Liu","doi":"10.1109/TPAMI.2025.3531000","DOIUrl":null,"url":null,"abstract":"In open-world environments, classification models should be adept at identifying out-of-distribution (OOD) data whose semantics differ from in-distribution (ID) data, leading to the emerging research in OOD detection. As a promising learning scheme, <italic>outlier exposure</i> (OE) enables the models to learn from <italic>auxiliary OOD data</i>, enhancing model representations in discerning between ID and OOD patterns. However, these auxiliary OOD data often do not fully represent real OOD scenarios, potentially biasing our models in practical OOD detection. Hence, we propose a novel OE-based learning method termed <italic>Wasserstein Distribution-agnostic Outlier Exposure</i> (W-DOE), which is both theoretically sound and experimentally superior to previous works. The intuition is that by expanding the coverage of training-time OOD data, the models will encounter fewer unseen OOD cases upon deployment. In W-DOE, we achieve additional OOD data to enlarge the OOD coverage, based on a new data synthesis approach called <italic>implicit data synthesis</i> (IDS). It is driven by our new insight that perturbing model parameters can lead to implicit data transformation, which is simple to implement yet effective to realize. Furthermore, we suggest a general learning framework to search for the synthesized OOD data that can benefit the models most, ensuring the OOD performance for the enlarged OOD coverage measured by the Wasserstein metric. Our approach comes with provable guarantees for open-world settings, demonstrating that broader OOD coverage ensures reduced estimation errors and thereby improved generalization for real OOD cases. We conduct extensive experiments across a series of representative OOD detection setups, further validating the superiority of W-DOE against state-of-the-art counterparts in the field.","PeriodicalId":94034,"journal":{"name":"IEEE transactions on pattern analysis and machine intelligence","volume":"47 5","pages":"3530-3545"},"PeriodicalIF":18.6000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10844561","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on pattern analysis and machine intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10844561/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In open-world environments, classification models should be adept at identifying out-of-distribution (OOD) data whose semantics differ from in-distribution (ID) data, leading to the emerging research in OOD detection. As a promising learning scheme, outlier exposure (OE) enables the models to learn from auxiliary OOD data, enhancing model representations in discerning between ID and OOD patterns. However, these auxiliary OOD data often do not fully represent real OOD scenarios, potentially biasing our models in practical OOD detection. Hence, we propose a novel OE-based learning method termed Wasserstein Distribution-agnostic Outlier Exposure (W-DOE), which is both theoretically sound and experimentally superior to previous works. The intuition is that by expanding the coverage of training-time OOD data, the models will encounter fewer unseen OOD cases upon deployment. In W-DOE, we achieve additional OOD data to enlarge the OOD coverage, based on a new data synthesis approach called implicit data synthesis (IDS). It is driven by our new insight that perturbing model parameters can lead to implicit data transformation, which is simple to implement yet effective to realize. Furthermore, we suggest a general learning framework to search for the synthesized OOD data that can benefit the models most, ensuring the OOD performance for the enlarged OOD coverage measured by the Wasserstein metric. Our approach comes with provable guarantees for open-world settings, demonstrating that broader OOD coverage ensures reduced estimation errors and thereby improved generalization for real OOD cases. We conduct extensive experiments across a series of representative OOD detection setups, further validating the superiority of W-DOE against state-of-the-art counterparts in the field.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

W-DOE: Wasserstein分布不可知论异常值暴露

在开放世界环境下，分类模型需要善于识别语义不同于分布内（ID）数据的分布外（out- distribution， OOD）数据，从而导致了分布外（out- distribution， OOD）数据检测研究的兴起。作为一种很有前途的学习方案，离群暴露（OE）使模型能够从辅助的OOD数据中学习，增强模型在识别ID和OOD模式方面的表征。然而，这些辅助的OOD数据通常不能完全代表真实的OOD场景，这可能会使我们的模型在实际的OOD检测中产生偏差。因此，我们提出了一种新的基于oe的学习方法，称为Wasserstein分布不可知论异常值暴露（W-DOE），该方法在理论上和实验上都优于以往的研究成果。直觉是，通过扩大训练时间OOD数据的覆盖范围，模型在部署时将遇到更少的未见过的OOD案例。在W-DOE中，我们基于一种新的数据合成方法，即隐式数据合成（IDS），获得额外的OOD数据以扩大OOD覆盖范围。这是由我们的新见解驱动的，即扰动模型参数可以导致隐式数据转换，该转换实现简单而有效。此外，我们提出了一个通用的学习框架来搜索最能使模型受益的合成OOD数据，以确保由Wasserstein度量的扩大的OOD覆盖范围的OOD性能。我们的方法具有开放世界设置的可证明保证，表明更广泛的OOD覆盖确保减少估计误差，从而提高对真实OOD案例的泛化。我们在一系列具有代表性的OOD检测设置中进行了广泛的实验，进一步验证了W-DOE与该领域最先进的同类产品相比的优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on pattern analysis and machine intelligence

自引率

0.00%

发文量