Harnessing AI and analytics to enhance cybersecurity and privacy for collective intelligence systems.

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE PeerJ Computer Science Pub Date : 2024-09-20 eCollection Date: 2024-01-01 DOI:10.7717/peerj-cs.2264

Muhammad Rehan Naeem, Rashid Amin, Muhammad Farhan, Faiz Abdullah Alotaibi, Mrim M Alnfiai, Gabriel Avelino Sampedro, Vincent Karovič

{"title":"Harnessing AI and analytics to enhance cybersecurity and privacy for collective intelligence systems.","authors":"Muhammad Rehan Naeem, Rashid Amin, Muhammad Farhan, Faiz Abdullah Alotaibi, Mrim M Alnfiai, Gabriel Avelino Sampedro, Vincent Karovič","doi":"10.7717/peerj-cs.2264","DOIUrl":null,"url":null,"abstract":"<p><p>Collective intelligence systems like Chat Generative Pre-Trained Transformer (ChatGPT) have emerged. They have brought both promise and peril to cybersecurity and privacy protection. This study introduces novel approaches to harness the power of artificial intelligence (AI) and big data analytics to enhance security and privacy in this new era. Contributions could explore topics such as: leveraging natural language processing (NLP) in ChatGPT-like systems to strengthen information security; evaluating privacy-enhancing technologies to maximize data utility while minimizing personal data exposure; modeling human behavior and agency to build secure and ethical human-centric systems; applying machine learning to detect threats and vulnerabilities in a data-driven manner; using analytics to preserve privacy in large datasets while enabling value creation; crafting AI techniques that operate in a trustworthy and explainable manner. This article advances the state-of-the-art at the intersection of cybersecurity, privacy, human factors, ethics, and cutting-edge AI, providing impactful solutions to emerging challenges. Our research presents a revolutionary approach to malware detection that leverages deep learning (DL) based methodologies to automatically learn features from raw data. Our approach involves constructing a grayscale image from a malware file and extracting features to minimize its size. This process affords us the ability to discern patterns that might remain hidden from other techniques, enabling us to utilize convolutional neural networks (CNNs) to learn from these grayscale images and a stacking ensemble to classify malware. The goal is to model a highly complex nonlinear function with parameters that can be optimized to achieve superior performance. To test our approach, we ran it on over 6,414 malware variants and 2,050 benign files from the MalImg collection, resulting in an impressive 99.86 percent validation accuracy for malware detection. Furthermore, we conducted a classification experiment on 15 malware families and 13 tests with varying parameters to compare our model to other comparable research. Our model outperformed most of the similar research with detection accuracy ranging from 47.07% to 99.81% and a significant increase in detection performance. Our results demonstrate the efficacy of our approach, which unlocks the hidden patterns that underlie complex systems, advancing the frontiers of computational security.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2264"},"PeriodicalIF":3.5000,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11419604/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PeerJ Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.7717/peerj-cs.2264","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Collective intelligence systems like Chat Generative Pre-Trained Transformer (ChatGPT) have emerged. They have brought both promise and peril to cybersecurity and privacy protection. This study introduces novel approaches to harness the power of artificial intelligence (AI) and big data analytics to enhance security and privacy in this new era. Contributions could explore topics such as: leveraging natural language processing (NLP) in ChatGPT-like systems to strengthen information security; evaluating privacy-enhancing technologies to maximize data utility while minimizing personal data exposure; modeling human behavior and agency to build secure and ethical human-centric systems; applying machine learning to detect threats and vulnerabilities in a data-driven manner; using analytics to preserve privacy in large datasets while enabling value creation; crafting AI techniques that operate in a trustworthy and explainable manner. This article advances the state-of-the-art at the intersection of cybersecurity, privacy, human factors, ethics, and cutting-edge AI, providing impactful solutions to emerging challenges. Our research presents a revolutionary approach to malware detection that leverages deep learning (DL) based methodologies to automatically learn features from raw data. Our approach involves constructing a grayscale image from a malware file and extracting features to minimize its size. This process affords us the ability to discern patterns that might remain hidden from other techniques, enabling us to utilize convolutional neural networks (CNNs) to learn from these grayscale images and a stacking ensemble to classify malware. The goal is to model a highly complex nonlinear function with parameters that can be optimized to achieve superior performance. To test our approach, we ran it on over 6,414 malware variants and 2,050 benign files from the MalImg collection, resulting in an impressive 99.86 percent validation accuracy for malware detection. Furthermore, we conducted a classification experiment on 15 malware families and 13 tests with varying parameters to compare our model to other comparable research. Our model outperformed most of the similar research with detection accuracy ranging from 47.07% to 99.81% and a significant increase in detection performance. Our results demonstrate the efficacy of our approach, which unlocks the hidden patterns that underlie complex systems, advancing the frontiers of computational security.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用人工智能和分析技术加强集体智能系统的网络安全和隐私保护。

像聊天生成预训练转换器（ChatGPT）这样的集体智能系统已经出现。它们既给网络安全和隐私保护带来了希望，也带来了危险。本研究介绍了在这个新时代利用人工智能（AI）和大数据分析的力量来加强安全和隐私保护的新方法。投稿可探讨的主题包括：在类似 ChatGPT 的系统中利用自然语言处理（NLP）来加强信息安全；评估隐私增强技术，以最大限度地提高数据效用，同时最大限度地减少个人数据的暴露；模拟人类行为和代理，以建立安全、道德的以人为本的系统；应用机器学习，以数据驱动的方式检测威胁和漏洞；利用分析技术保护大型数据集中的隐私，同时实现价值创造；精心设计以可信和可解释的方式运行的人工智能技术。这篇文章推进了网络安全、隐私、人为因素、伦理和尖端人工智能交叉领域的最新进展，为新出现的挑战提供了有影响力的解决方案。我们的研究提出了一种革命性的恶意软件检测方法，它利用基于深度学习（DL）的方法自动学习原始数据中的特征。我们的方法包括从恶意软件文件中构建灰度图像，并提取特征以最小化其大小。这一过程使我们有能力识别其他技术可能无法识别的模式，从而利用卷积神经网络（CNN）从这些灰度图像中学习，并利用堆叠集合对恶意软件进行分类。我们的目标是建立一个高度复杂的非线性函数模型，并对参数进行优化，以实现卓越的性能。为了测试我们的方法，我们在 MalImg 收集的 6414 个恶意软件变体和 2050 个良性文件上运行了该方法，结果恶意软件检测的验证准确率达到了令人印象深刻的 99.86%。此外，我们还对 15 个恶意软件系列和 13 个不同参数的测试进行了分类实验，将我们的模型与其他同类研究进行了比较。我们的模型优于大多数同类研究，检测准确率从 47.07% 到 99.81%，检测性能显著提高。我们的结果证明了我们方法的有效性，它揭开了复杂系统背后隐藏的模式，推动了计算安全领域的前沿发展。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

PeerJ Computer Science Computer Science-General Computer Science

CiteScore

6.10

自引率

5.30%

发文量

332

审稿时长

10 weeks

期刊介绍： PeerJ Computer Science is the new open access journal covering all subject areas in computer science, with the backing of a prestigious advisory board and more than 300 academic editors.