{"title":"Regional Ensemble for Improving Unsupervised Outlier Detectors","authors":"Jiawei Yang;Sylwan Rahardja;Susanto Rahardja","doi":"10.1109/TAI.2024.3381102","DOIUrl":null,"url":null,"abstract":"Outlier ensemble is an important methodology for improving outlier detection, but faces severe challenges in unsupervised settings. Unlike traditional outlier ensembles which revised scores by considering only the values of the scores from multiple detectors, we present a novel regional ensemble (RE). RE combines the scores from multiple objects and multiple detectors and simultaneously takes into consideration both the values and the distribution of these scores. RE specifically enhances the score of a given object by using the scores of neighboring objects of the given object, under the assumption that the scores of the majority of neighboring objects are reliable. RE provides many potential applications, particularly in data mining and machine learning. Compared to existing outlier ensembles with 30 real-world datasets tested, RE attained the best performance with 14 datasets, while the current standard achieves superior performance with only eight datasets. RE can significantly improve the best existing from 0.83 to 0.86 AUC on average.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10479168/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Outlier ensemble is an important methodology for improving outlier detection, but faces severe challenges in unsupervised settings. Unlike traditional outlier ensembles which revised scores by considering only the values of the scores from multiple detectors, we present a novel regional ensemble (RE). RE combines the scores from multiple objects and multiple detectors and simultaneously takes into consideration both the values and the distribution of these scores. RE specifically enhances the score of a given object by using the scores of neighboring objects of the given object, under the assumption that the scores of the majority of neighboring objects are reliable. RE provides many potential applications, particularly in data mining and machine learning. Compared to existing outlier ensembles with 30 real-world datasets tested, RE attained the best performance with 14 datasets, while the current standard achieves superior performance with only eight datasets. RE can significantly improve the best existing from 0.83 to 0.86 AUC on average.