Fractional-Order Optimal Control and FIOV-MASAC Reinforcement Learning for Combating Malware Spread in Internet of Vehicles

IF 6.4 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automation Science and Engineering Pub Date : 2025-01-01 DOI:10.1109/TASE.2024.3521614

Guiyun Liu;Hao Li;Lihao Xiong;Zhihao Tan;Zhongwei Liang;Xiaojing Zhong

{"title":"Fractional-Order Optimal Control and FIOV-MASAC Reinforcement Learning for Combating Malware Spread in Internet of Vehicles","authors":"Guiyun Liu;Hao Li;Lihao Xiong;Zhihao Tan;Zhongwei Liang;Xiaojing Zhong","doi":"10.1109/TASE.2024.3521614","DOIUrl":null,"url":null,"abstract":"Internet of Vehicles (IoV) is gradually becoming popular, but it also brings more opportunities for malware intrusion. The intrusion of malware into IoV will cause a series of security issues and increase the incidence of road accidents. Therefore, the suppressing measures to combat the spread of malware in IoV will be fundamental and urgent. To address this critical issue, this paper proposes a fractional-order IoV (FIOV) to investigate malware propagation patterns in Road Side Unit (RSU) and Vehicles. To accurately reflect the actual spread of malware, the traffic density, the channel fading and the actual connectivity are considered in mathematical model. Then, the model-based optimal treatment and quarantine control strategy is derived by optimal control theory. Additionally, a novel model-free FIOV multi-agent soft actor-critic (FIOV-MASAC) approach is first proposed to suppress the malware propagation in IoV. Simulation experiments demonstrate that the proposed FIOV-MASAC approach exhibits better learning ability compared to other reinforcement learning (RL) algorithms.Note to Practitioners—Frequent attacks by malware on IoV are recognized as being challenging to prevent, with these attacks posing threats to data security and potentially resulting in traffic accidents and vehicle malfunctions. In response, a novel mathematical model has been introduced within this study to better predict the propagation trends of malware in IoV, effectively managing its spread within the vehicular network systems. While RL methods have been extensively utilized in the domain of control systems, it is noted that current RL methods depend on rich experience pools, rendering them inapplicable to more complex systems without adaptation. To address this, an effective and pragmatic RL algorithm has been devised in this study. This algorithm, devoid of the requirement for complex model establishment, is capable of intelligently learning and adjusting to the sophisticated environment of IoV, thereby effectively countering the propagation of malware. It should be highlighted that the RL method proposed herein is applicable to the majority of epidemic systems, enabling the achievement of stable control while substantially minimizing control expenditures. The integration of this method is anticipated to augment the security and robustness of IoV in the face of malware attacks.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"10313-10332"},"PeriodicalIF":6.4000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10819631/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Internet of Vehicles (IoV) is gradually becoming popular, but it also brings more opportunities for malware intrusion. The intrusion of malware into IoV will cause a series of security issues and increase the incidence of road accidents. Therefore, the suppressing measures to combat the spread of malware in IoV will be fundamental and urgent. To address this critical issue, this paper proposes a fractional-order IoV (FIOV) to investigate malware propagation patterns in Road Side Unit (RSU) and Vehicles. To accurately reflect the actual spread of malware, the traffic density, the channel fading and the actual connectivity are considered in mathematical model. Then, the model-based optimal treatment and quarantine control strategy is derived by optimal control theory. Additionally, a novel model-free FIOV multi-agent soft actor-critic (FIOV-MASAC) approach is first proposed to suppress the malware propagation in IoV. Simulation experiments demonstrate that the proposed FIOV-MASAC approach exhibits better learning ability compared to other reinforcement learning (RL) algorithms.Note to Practitioners—Frequent attacks by malware on IoV are recognized as being challenging to prevent, with these attacks posing threats to data security and potentially resulting in traffic accidents and vehicle malfunctions. In response, a novel mathematical model has been introduced within this study to better predict the propagation trends of malware in IoV, effectively managing its spread within the vehicular network systems. While RL methods have been extensively utilized in the domain of control systems, it is noted that current RL methods depend on rich experience pools, rendering them inapplicable to more complex systems without adaptation. To address this, an effective and pragmatic RL algorithm has been devised in this study. This algorithm, devoid of the requirement for complex model establishment, is capable of intelligently learning and adjusting to the sophisticated environment of IoV, thereby effectively countering the propagation of malware. It should be highlighted that the RL method proposed herein is applicable to the majority of epidemic systems, enabling the achievement of stable control while substantially minimizing control expenditures. The integration of this method is anticipated to augment the security and robustness of IoV in the face of malware attacks.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

分数阶最优控制与FIOV-MASAC强化学习对抗车联网恶意软件传播

车联网（IoV）正在逐渐普及，但它也带来了更多恶意软件入侵的机会。恶意软件入侵车联网会引发一系列安全问题，增加道路交通事故的发生率。因此，打击恶意软件在车联网中的传播的抑制措施将是根本性的和紧迫的。为了解决这一关键问题，本文提出了一个分数阶IoV （FIOV）来研究恶意软件在路边单元（RSU）和车辆中的传播模式。为了准确反映恶意软件的实际传播情况，在数学模型中考虑了流量密度、信道衰落和实际连通性。然后，利用最优控制理论推导了基于模型的最优处理隔离控制策略。此外，首次提出了一种新的无模型FIOV多智能体软行为者批评（FIOV- masac）方法来抑制恶意软件在IoV中的传播。仿真实验表明，与其他强化学习（RL）算法相比，所提出的FIOV-MASAC方法具有更好的学习能力。从业人员注意：恶意软件对车联网的频繁攻击被认为是具有挑战性的，因为这些攻击会对数据安全构成威胁，并可能导致交通事故和车辆故障。为此，本研究引入了一种新的数学模型，以更好地预测恶意软件在车联网中的传播趋势，有效地管理其在车联网系统中的传播。虽然强化学习方法在控制系统领域得到了广泛的应用，但需要注意的是，当前的强化学习方法依赖于丰富的经验池，使得它们在没有自适应的情况下无法适用于更复杂的系统。为了解决这个问题，本研究设计了一种有效且实用的强化学习算法。该算法不需要建立复杂的模型，能够智能地学习和适应车联网的复杂环境，从而有效地对抗恶意软件的传播。需要强调的是，本文提出的RL方法适用于大多数流行病系统，能够实现稳定控制，同时大大减少控制支出。这种方法的集成有望在面对恶意软件攻击时增强车联网的安全性和鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统

CiteScore

12.50

自引率

14.30%

发文量

404

审稿时长

3.0 months

期刊介绍： The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.