{"title":"Energy-Based Learning for Polluted Outlier Detection in Backdoor","authors":"Xiangyu Gao, M. Qiu","doi":"10.1109/SmartCloud55982.2022.00014","DOIUrl":null,"url":null,"abstract":"Big data analysis has become an essential tool in a lot of fields. An increasing number of entities rely on different kinds of data analysis tools to formulate their strategy. However, the popularity of big data brings several problems as well because attackers might pollute the data set by adding negligible data points to make a negative effect on the final analysis results. Therefore, in this paper, we propose to leverage the energy-based learning method to detect outliers within a data set. Specifically, we iteratively rule out bad data points from the data set based on specific selection rules. The experiment result is promising, which shows that our algorithm can improve the accuracy in the linear regression by more than 20% on average.","PeriodicalId":104366,"journal":{"name":"2022 IEEE 7th International Conference on Smart Cloud (SmartCloud)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 7th International Conference on Smart Cloud (SmartCloud)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SmartCloud55982.2022.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Big data analysis has become an essential tool in a lot of fields. An increasing number of entities rely on different kinds of data analysis tools to formulate their strategy. However, the popularity of big data brings several problems as well because attackers might pollute the data set by adding negligible data points to make a negative effect on the final analysis results. Therefore, in this paper, we propose to leverage the energy-based learning method to detect outliers within a data set. Specifically, we iteratively rule out bad data points from the data set based on specific selection rules. The experiment result is promising, which shows that our algorithm can improve the accuracy in the linear regression by more than 20% on average.