{"title":"K-Means Algorithm: Fraud Detection Based on Signaling Data","authors":"Xing Min, Rongheng Lin","doi":"10.1109/SERVICES.2018.00024","DOIUrl":null,"url":null,"abstract":"At present, the crime of telecom fraud, with advanced communications and Internet technologies, is growing rapidly and causing huge losses every year. The traditional fraud detection methods are less flexible. In this paper, we used the signaling data to train a clustering model, which can discover the hidden user characteristics of fraud phones. The paper puts forward the extraction method of behavior characteristics, reduce the dimension of features with principal component analysis and select the appropriate clustering parameters through grid search, then present the K-Means-based behavior identification system, which can help to distinguish the frauds and identify the fraud phone numbers. Finally, the feasibility of this model is verified by the actual sample dataset.","PeriodicalId":130225,"journal":{"name":"2018 IEEE World Congress on Services (SERVICES)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE World Congress on Services (SERVICES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SERVICES.2018.00024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
At present, the crime of telecom fraud, with advanced communications and Internet technologies, is growing rapidly and causing huge losses every year. The traditional fraud detection methods are less flexible. In this paper, we used the signaling data to train a clustering model, which can discover the hidden user characteristics of fraud phones. The paper puts forward the extraction method of behavior characteristics, reduce the dimension of features with principal component analysis and select the appropriate clustering parameters through grid search, then present the K-Means-based behavior identification system, which can help to distinguish the frauds and identify the fraud phone numbers. Finally, the feasibility of this model is verified by the actual sample dataset.