{"title":"A learning-based anomaly detection framework for secure recommendation","authors":"Haolong Xiang , Wenhao Fei , Ruiyang Ni , Xuyun Zhang","doi":"10.1016/j.ins.2025.122071","DOIUrl":null,"url":null,"abstract":"<div><div>Recommendation systems are essential tools for suggesting items or information to users based on their preferences and behaviors, which have been widely applied in various online platforms and services to personalize user experiences, increase user engagement, and drive business growth. However, the security and efficacy of recommendation systems can be compromised if the input data is tainted by malicious users. One of the primary threats to recommendation systems is shilling attacks, which pose great challenges in handling various types of huge-volume data with anomaly detection techniques. In this paper, we propose a novel anomaly detection framework named LTHiForest with the use of the learning to hash based isolation forest. Then, we instantiate the generic framework with one concrete hashing mechanism, extended order preserving hashing, to illustrate the stages of our framework and verify its effectiveness in detecting various anomalies. The core idea of this instantiation is to learn from data to construct a better isolation forest structure than the state-of-the-art methods like iForest and LSHiForest, which can achieve robust detection of various anomaly types. Extensive experiments on both synthetic and real-world data sets demonstrate the robustness and effectiveness of our framework for recommendation systems.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"708 ","pages":"Article 122071"},"PeriodicalIF":8.1000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525002038","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Recommendation systems are essential tools for suggesting items or information to users based on their preferences and behaviors, which have been widely applied in various online platforms and services to personalize user experiences, increase user engagement, and drive business growth. However, the security and efficacy of recommendation systems can be compromised if the input data is tainted by malicious users. One of the primary threats to recommendation systems is shilling attacks, which pose great challenges in handling various types of huge-volume data with anomaly detection techniques. In this paper, we propose a novel anomaly detection framework named LTHiForest with the use of the learning to hash based isolation forest. Then, we instantiate the generic framework with one concrete hashing mechanism, extended order preserving hashing, to illustrate the stages of our framework and verify its effectiveness in detecting various anomalies. The core idea of this instantiation is to learn from data to construct a better isolation forest structure than the state-of-the-art methods like iForest and LSHiForest, which can achieve robust detection of various anomaly types. Extensive experiments on both synthetic and real-world data sets demonstrate the robustness and effectiveness of our framework for recommendation systems.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.