XAI-PhD: Fortifying Trust of Phishing URL Detection Empowered by Shapley Additive Explanations

International Journal of Online and Biomedical Engineering (iJOE) Pub Date : 2024-08-08 DOI:10.3991/ijoe.v20i11.49533

Mustafa Al-Fayoumi, Bushra Alhijawi, Q. Abu Al-haija, Rakan Armoush

{"title":"XAI-PhD: Fortifying Trust of Phishing URL Detection Empowered by Shapley Additive Explanations","authors":"Mustafa Al-Fayoumi, Bushra Alhijawi, Q. Abu Al-haija, Rakan Armoush","doi":"10.3991/ijoe.v20i11.49533","DOIUrl":null,"url":null,"abstract":"The rapid growth of the Internet has led to an increased demand for online services. However, this surge in online activity has also brought about a new threat: phishing attacks. Phishing is a type of cyberattack that utilizes social engineering techniques and technological manipulations to steal crucial information from unsuspecting individuals. Consequently, there is a rising necessity to create dependable phishing URL detection models that can effectively identify phishing URLs with enhanced accuracy and reduced prediction overhead. This study introduces XAI-PhD, an innovative phishing detection method that utilizes machine learning (ML) and Shapley additive explanation (SHAP) capabilities. Specifically, XAI-PhD utilizes SHAP to thoroughly analyze the significance of each feature in influencing the decision-making process of the classifier. By selectively incorporating input characteristics based on their SHAP values, only the most crucial attributes are assessed, enabling the development of a highly adaptable and generalized model. XAI-PhD utilizes a lightweight gradient boosting machine as its classifier, and a series of rigorous tests are conducted to assess its performance compared to established baseline methods. The empirical findings unequivocally demonstrate the exceptional effectiveness of XAI-PhD, as evidenced by its remarkable accuracy and F1-score of 99.8% and 99%, respectively. Moreover, XAI-PhD exhibits high computational efficiency, requiring only 1.47 milliseconds and 18.5 microseconds per record to generate accurate predictions.","PeriodicalId":507997,"journal":{"name":"International Journal of Online and Biomedical Engineering (iJOE)","volume":"30 8","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Online and Biomedical Engineering (iJOE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3991/ijoe.v20i11.49533","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The rapid growth of the Internet has led to an increased demand for online services. However, this surge in online activity has also brought about a new threat: phishing attacks. Phishing is a type of cyberattack that utilizes social engineering techniques and technological manipulations to steal crucial information from unsuspecting individuals. Consequently, there is a rising necessity to create dependable phishing URL detection models that can effectively identify phishing URLs with enhanced accuracy and reduced prediction overhead. This study introduces XAI-PhD, an innovative phishing detection method that utilizes machine learning (ML) and Shapley additive explanation (SHAP) capabilities. Specifically, XAI-PhD utilizes SHAP to thoroughly analyze the significance of each feature in influencing the decision-making process of the classifier. By selectively incorporating input characteristics based on their SHAP values, only the most crucial attributes are assessed, enabling the development of a highly adaptable and generalized model. XAI-PhD utilizes a lightweight gradient boosting machine as its classifier, and a series of rigorous tests are conducted to assess its performance compared to established baseline methods. The empirical findings unequivocally demonstrate the exceptional effectiveness of XAI-PhD, as evidenced by its remarkable accuracy and F1-score of 99.8% and 99%, respectively. Moreover, XAI-PhD exhibits high computational efficiency, requiring only 1.47 milliseconds and 18.5 microseconds per record to generate accurate predictions.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

XAI-PhD：利用沙普利加法解释增强网络钓鱼 URL 检测的可信度

互联网的快速发展导致对在线服务的需求增加。然而，在线活动的激增也带来了新的威胁：网络钓鱼攻击。网络钓鱼是一种网络攻击，它利用社交工程技术和技术手段从毫无戒备的个人那里窃取关键信息。因此，越来越有必要创建可靠的网络钓鱼 URL 检测模型，以提高准确性并减少预测开销，从而有效识别网络钓鱼 URL。本研究介绍了 XAI-PhD，这是一种利用机器学习（ML）和夏普利加法解释（SHAP）功能的创新型网络钓鱼检测方法。具体来说，XAI-PhD 利用 SHAP 彻底分析每个特征在影响分类器决策过程中的重要性。通过根据 SHAP 值有选择地纳入输入特征，只对最关键的属性进行评估，从而开发出适应性强的通用模型。XAI-PhD 采用轻量级梯度提升机作为分类器，并进行了一系列严格的测试，以评估其与现有基准方法相比的性能。实证结果明确证明了 XAI-PhD 的卓越功效，其准确率和 F1 分数分别高达 99.8% 和 99%。此外，XAI-PhD 还具有很高的计算效率，每条记录仅需 1.47 毫秒和 18.5 微秒即可生成准确预测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Journal of Online and Biomedical Engineering (iJOE)

自引率

0.00%

发文量