Detecting anomalies in blockchain transactions using machine learning classifiers and explainability analysis

IF 6.9 3区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Blockchain-Research and Applications Pub Date : 2024-09-01 DOI:10.1016/j.bcra.2024.100207

Mohammad Hasan , Mohammad Shahriar Rahman , Helge Janicke , Iqbal H. Sarker

{"title":"Detecting anomalies in blockchain transactions using machine learning classifiers and explainability analysis","authors":"Mohammad Hasan , Mohammad Shahriar Rahman , Helge Janicke , Iqbal H. Sarker","doi":"10.1016/j.bcra.2024.100207","DOIUrl":null,"url":null,"abstract":"<div><div>As the use of blockchain for digital payments continues to rise, it becomes susceptible to various malicious attacks. Successfully detecting anomalies within blockchain transactions is essential for bolstering trust in digital payments. However, the task of anomaly detection in blockchain transaction data is challenging due to the infrequent occurrence of illicit transactions. Although several studies have been conducted in the field, a limitation persists: the lack of explanations for the model's predictions. This study seeks to overcome this limitation by integrating explainable artificial intelligence (XAI) techniques and anomaly rules into tree-based ensemble classifiers for detecting anomalous Bitcoin transactions. The shapley additive explanation (SHAP) method is employed to measure the contribution of each feature, and it is compatible with ensemble models. Moreover, we present rules for interpreting whether a Bitcoin transaction is anomalous or not. Additionally, we introduce an under-sampling algorithm named XGBCLUS, designed to balance anomalous and non-anomalous transaction data. This algorithm is compared against other commonly used under-sampling and over-sampling techniques. Finally, the outcomes of various tree-based single classifiers are compared with those of stacking and voting ensemble classifiers. Our experimental results demonstrate that: (i) XGBCLUS enhances true positive rate (TPR) and receiver operating characteristic-area under curve (ROC-AUC) scores compared to state-of-the-art under-sampling and over-sampling techniques, and (ii) our proposed ensemble classifiers outperform traditional single tree-based machine learning classifiers in terms of accuracy, TPR, and false positive rate (FPR) scores.</div></div>","PeriodicalId":53141,"journal":{"name":"Blockchain-Research and Applications","volume":"5 3","pages":"Article 100207"},"PeriodicalIF":6.9000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Blockchain-Research and Applications","FirstCategoryId":"1093","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2096720924000204","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

As the use of blockchain for digital payments continues to rise, it becomes susceptible to various malicious attacks. Successfully detecting anomalies within blockchain transactions is essential for bolstering trust in digital payments. However, the task of anomaly detection in blockchain transaction data is challenging due to the infrequent occurrence of illicit transactions. Although several studies have been conducted in the field, a limitation persists: the lack of explanations for the model's predictions. This study seeks to overcome this limitation by integrating explainable artificial intelligence (XAI) techniques and anomaly rules into tree-based ensemble classifiers for detecting anomalous Bitcoin transactions. The shapley additive explanation (SHAP) method is employed to measure the contribution of each feature, and it is compatible with ensemble models. Moreover, we present rules for interpreting whether a Bitcoin transaction is anomalous or not. Additionally, we introduce an under-sampling algorithm named XGBCLUS, designed to balance anomalous and non-anomalous transaction data. This algorithm is compared against other commonly used under-sampling and over-sampling techniques. Finally, the outcomes of various tree-based single classifiers are compared with those of stacking and voting ensemble classifiers. Our experimental results demonstrate that: (i) XGBCLUS enhances true positive rate (TPR) and receiver operating characteristic-area under curve (ROC-AUC) scores compared to state-of-the-art under-sampling and over-sampling techniques, and (ii) our proposed ensemble classifiers outperform traditional single tree-based machine learning classifiers in terms of accuracy, TPR, and false positive rate (FPR) scores.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用机器学习分类器和可解释性分析检测区块链交易中的异常情况

随着区块链在数字支付领域的应用不断增加，它也容易受到各种恶意攻击。成功检测区块链交易中的异常情况对于增强数字支付的信任度至关重要。然而，由于非法交易很少发生，在区块链交易数据中进行异常检测是一项具有挑战性的任务。虽然该领域已开展了多项研究，但仍存在一个局限性：缺乏对模型预测的解释。本研究试图通过将可解释人工智能（XAI）技术和异常规则整合到基于树的集合分类器中来克服这一局限，以检测异常比特币交易。我们采用夏普利加法解释（SHAP）方法来衡量每个特征的贡献，该方法与集合模型兼容。此外，我们还提出了解释比特币交易是否异常的规则。此外，我们还引入了一种名为 XGBCLUS 的低采样算法，旨在平衡异常和非异常交易数据。我们将该算法与其他常用的低采样和高采样技术进行了比较。最后，将各种基于树的单一分类器的结果与堆叠和投票集合分类器的结果进行了比较。实验结果表明(i) 与最先进的欠采样和过采样技术相比，XGBCLUS 提高了真阳性率（TPR）和接收者操作特征曲线下面积（ROC-AUC）分数；(ii) 我们提出的集合分类器在准确率、TPR 和假阳性率（FPR）分数方面优于传统的基于树的单一机器学习分类器。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Blockchain-Research and Applications

CiteScore

11.30

自引率

3.60%

发文量

期刊介绍： Blockchain: Research and Applications is an international, peer reviewed journal for researchers, engineers, and practitioners to present the latest advances and innovations in blockchain research. The journal publishes theoretical and applied papers in established and emerging areas of blockchain research to shape the future of blockchain technology.