Machine learning-based approach for designing and implementing a collaborative fraud detection model through CDR and traffic analysis

Eric Michel DEUSSOM DJOMADJI, Bequerelle MATEMTSAP MBOU, Aurelle Tchagna Kouanou, M. Ekonde Sone, Parfait Bayonbog
{"title":"Machine learning-based approach for designing and implementing a collaborative fraud detection model through CDR and traffic analysis","authors":"Eric Michel DEUSSOM DJOMADJI, Bequerelle MATEMTSAP MBOU, Aurelle Tchagna Kouanou, M. Ekonde Sone, Parfait Bayonbog","doi":"10.14738/tmlai.104.12854","DOIUrl":null,"url":null,"abstract":"Fraud in telecommunications networks is a constantly growing phenomenon that causes enormous financial losses for both the individual user and the telecommunications operators. We can denote many researchers who have proposed various approaches to provide a solution to this problem, but still need to be improve to ensure the efficiency. Detecting fraud is difficult and, it's no surprise that many frauds schemes have serious limitations. Different types of fraud may require different systems, each with different procedures, parameter adjustments, database interfaces, and case management tools and capabilities. This article uses the K-Means algorithm to handle fraud detection based on Call Detail Record (CDR) and traffic analysis in a telecommunication industry. Our algorithm consists to compare traffic and CDR generated in the network and check if there is abnormal behavior and if yes, our model is used to confirm if users suspecting of fraud are really fraudster or not. To build our model we used real word CDR data collected in November 2021. Our model associates the Differential Privacy model to encrypt users' personal information, and the k-means algorithm to group users into different clusters. Those clusters represent non fraud users having similar characteristics based on criteria used to build the model. Users having abnormal behavior that can be assimilated to fraudsters are those who are far from the different clusters center. Thanks to a representation in a plan, we better visualize user’s behavior. We validated our model by evaluating our segmentation method. The interpretation of the results shows sufficiently that our approach allows to obtain better results. Our approach can be used by all telecommunications operator to reduce the impact of fraud on internet services.","PeriodicalId":119801,"journal":{"name":"Transactions on Machine Learning and Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transactions on Machine Learning and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14738/tmlai.104.12854","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Fraud in telecommunications networks is a constantly growing phenomenon that causes enormous financial losses for both the individual user and the telecommunications operators. We can denote many researchers who have proposed various approaches to provide a solution to this problem, but still need to be improve to ensure the efficiency. Detecting fraud is difficult and, it's no surprise that many frauds schemes have serious limitations. Different types of fraud may require different systems, each with different procedures, parameter adjustments, database interfaces, and case management tools and capabilities. This article uses the K-Means algorithm to handle fraud detection based on Call Detail Record (CDR) and traffic analysis in a telecommunication industry. Our algorithm consists to compare traffic and CDR generated in the network and check if there is abnormal behavior and if yes, our model is used to confirm if users suspecting of fraud are really fraudster or not. To build our model we used real word CDR data collected in November 2021. Our model associates the Differential Privacy model to encrypt users' personal information, and the k-means algorithm to group users into different clusters. Those clusters represent non fraud users having similar characteristics based on criteria used to build the model. Users having abnormal behavior that can be assimilated to fraudsters are those who are far from the different clusters center. Thanks to a representation in a plan, we better visualize user’s behavior. We validated our model by evaluating our segmentation method. The interpretation of the results shows sufficiently that our approach allows to obtain better results. Our approach can be used by all telecommunications operator to reduce the impact of fraud on internet services.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于机器学习的基于CDR和流量分析的协同欺诈检测模型的设计和实现方法
电信网络诈骗是一个日益严重的现象,给个人用户和电信运营商都造成了巨大的经济损失。我们可以指出,许多研究者已经提出了各种方法来解决这个问题,但仍需要改进以确保效率。检测欺诈是困难的,而且许多欺诈计划有严重的局限性也就不足为奇了。不同类型的欺诈可能需要不同的系统,每个系统都有不同的程序、参数调整、数据库接口以及案例管理工具和功能。本文利用K-Means算法处理电信行业基于话单和流量分析的欺诈检测。我们的算法包括比较网络中产生的流量和CDR,检查是否有异常行为,如果有,使用我们的模型来确认怀疑欺诈的用户是否真的是欺诈者。为了建立我们的模型,我们使用了2021年11月收集的真实单词CDR数据。我们的模型结合差分隐私模型对用户的个人信息进行加密,并结合k-means算法将用户分组到不同的集群中。这些聚类表示基于用于构建模型的标准具有相似特征的非欺诈用户。那些远离不同集群中心的用户具有可以被同化为欺诈者的异常行为。由于计划中的表示,我们可以更好地可视化用户的行为。我们通过评估我们的分割方法来验证我们的模型。对结果的解释充分表明,我们的方法可以获得更好的结果。我们的方法可以被所有电信运营商使用,以减少欺诈对互联网服务的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Addressing Challenges Encountered by English Language Teachers in Imparting Communication Skills among Higher Secondary Students: A Critical Overview Singing Voice Melody Detection Inquiring About The Memetic Relationships People Have with Societal Collapse Natural Ventilation in a Semi-Confined Enclosure Heated by a Linear Heat Source NMC: A Fast and Secure ARX Cipher
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1