Fractional-Order Optimal Control and FIOV-MASAC Reinforcement Learning for Combating Malware Spread in Internet of Vehicles

IF 6.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS IEEE Transactions on Automation Science and Engineering Pub Date : 2025-01-01 DOI:10.1109/TASE.2024.3521614
Guiyun Liu;Hao Li;Lihao Xiong;Zhihao Tan;Zhongwei Liang;Xiaojing Zhong
{"title":"Fractional-Order Optimal Control and FIOV-MASAC Reinforcement Learning for Combating Malware Spread in Internet of Vehicles","authors":"Guiyun Liu;Hao Li;Lihao Xiong;Zhihao Tan;Zhongwei Liang;Xiaojing Zhong","doi":"10.1109/TASE.2024.3521614","DOIUrl":null,"url":null,"abstract":"Internet of Vehicles (IoV) is gradually becoming popular, but it also brings more opportunities for malware intrusion. The intrusion of malware into IoV will cause a series of security issues and increase the incidence of road accidents. Therefore, the suppressing measures to combat the spread of malware in IoV will be fundamental and urgent. To address this critical issue, this paper proposes a fractional-order IoV (FIOV) to investigate malware propagation patterns in Road Side Unit (RSU) and Vehicles. To accurately reflect the actual spread of malware, the traffic density, the channel fading and the actual connectivity are considered in mathematical model. Then, the model-based optimal treatment and quarantine control strategy is derived by optimal control theory. Additionally, a novel model-free FIOV multi-agent soft actor-critic (FIOV-MASAC) approach is first proposed to suppress the malware propagation in IoV. Simulation experiments demonstrate that the proposed FIOV-MASAC approach exhibits better learning ability compared to other reinforcement learning (RL) algorithms.Note to Practitioners—Frequent attacks by malware on IoV are recognized as being challenging to prevent, with these attacks posing threats to data security and potentially resulting in traffic accidents and vehicle malfunctions. In response, a novel mathematical model has been introduced within this study to better predict the propagation trends of malware in IoV, effectively managing its spread within the vehicular network systems. While RL methods have been extensively utilized in the domain of control systems, it is noted that current RL methods depend on rich experience pools, rendering them inapplicable to more complex systems without adaptation. To address this, an effective and pragmatic RL algorithm has been devised in this study. This algorithm, devoid of the requirement for complex model establishment, is capable of intelligently learning and adjusting to the sophisticated environment of IoV, thereby effectively countering the propagation of malware. It should be highlighted that the RL method proposed herein is applicable to the majority of epidemic systems, enabling the achievement of stable control while substantially minimizing control expenditures. The integration of this method is anticipated to augment the security and robustness of IoV in the face of malware attacks.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"10313-10332"},"PeriodicalIF":6.4000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10819631/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Internet of Vehicles (IoV) is gradually becoming popular, but it also brings more opportunities for malware intrusion. The intrusion of malware into IoV will cause a series of security issues and increase the incidence of road accidents. Therefore, the suppressing measures to combat the spread of malware in IoV will be fundamental and urgent. To address this critical issue, this paper proposes a fractional-order IoV (FIOV) to investigate malware propagation patterns in Road Side Unit (RSU) and Vehicles. To accurately reflect the actual spread of malware, the traffic density, the channel fading and the actual connectivity are considered in mathematical model. Then, the model-based optimal treatment and quarantine control strategy is derived by optimal control theory. Additionally, a novel model-free FIOV multi-agent soft actor-critic (FIOV-MASAC) approach is first proposed to suppress the malware propagation in IoV. Simulation experiments demonstrate that the proposed FIOV-MASAC approach exhibits better learning ability compared to other reinforcement learning (RL) algorithms.Note to Practitioners—Frequent attacks by malware on IoV are recognized as being challenging to prevent, with these attacks posing threats to data security and potentially resulting in traffic accidents and vehicle malfunctions. In response, a novel mathematical model has been introduced within this study to better predict the propagation trends of malware in IoV, effectively managing its spread within the vehicular network systems. While RL methods have been extensively utilized in the domain of control systems, it is noted that current RL methods depend on rich experience pools, rendering them inapplicable to more complex systems without adaptation. To address this, an effective and pragmatic RL algorithm has been devised in this study. This algorithm, devoid of the requirement for complex model establishment, is capable of intelligently learning and adjusting to the sophisticated environment of IoV, thereby effectively countering the propagation of malware. It should be highlighted that the RL method proposed herein is applicable to the majority of epidemic systems, enabling the achievement of stable control while substantially minimizing control expenditures. The integration of this method is anticipated to augment the security and robustness of IoV in the face of malware attacks.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
分数阶最优控制与FIOV-MASAC强化学习对抗车联网恶意软件传播
车联网(IoV)正在逐渐普及,但它也带来了更多恶意软件入侵的机会。恶意软件入侵车联网会引发一系列安全问题,增加道路交通事故的发生率。因此,打击恶意软件在车联网中的传播的抑制措施将是根本性的和紧迫的。为了解决这一关键问题,本文提出了一个分数阶IoV (FIOV)来研究恶意软件在路边单元(RSU)和车辆中的传播模式。为了准确反映恶意软件的实际传播情况,在数学模型中考虑了流量密度、信道衰落和实际连通性。然后,利用最优控制理论推导了基于模型的最优处理隔离控制策略。此外,首次提出了一种新的无模型FIOV多智能体软行为者批评(FIOV- masac)方法来抑制恶意软件在IoV中的传播。仿真实验表明,与其他强化学习(RL)算法相比,所提出的FIOV-MASAC方法具有更好的学习能力。从业人员注意:恶意软件对车联网的频繁攻击被认为是具有挑战性的,因为这些攻击会对数据安全构成威胁,并可能导致交通事故和车辆故障。为此,本研究引入了一种新的数学模型,以更好地预测恶意软件在车联网中的传播趋势,有效地管理其在车联网系统中的传播。虽然强化学习方法在控制系统领域得到了广泛的应用,但需要注意的是,当前的强化学习方法依赖于丰富的经验池,使得它们在没有自适应的情况下无法适用于更复杂的系统。为了解决这个问题,本研究设计了一种有效且实用的强化学习算法。该算法不需要建立复杂的模型,能够智能地学习和适应车联网的复杂环境,从而有效地对抗恶意软件的传播。需要强调的是,本文提出的RL方法适用于大多数流行病系统,能够实现稳定控制,同时大大减少控制支出。这种方法的集成有望在面对恶意软件攻击时增强车联网的安全性和鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Automation Science and Engineering
IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统
CiteScore
12.50
自引率
14.30%
发文量
404
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.
期刊最新文献
CoVA-IL: Zero-Shot Imitation Learning via Contrastive Viewpoint Alignment on Object-Centric Representation Self-Organized Team Formation via Multi-Task Hedonic Games for Capability-Heterogeneous Human-Machine Agents Perceptive Locomotion and Navigation for Quadruped Robots via Depth-Based Representation Data-Driven Inverse Reinforcement Learning for Markov Multiplayer Tidal Turbine Systems A Hierarchical Path Planning Framework for Large-Scale Sparse Environments: Layered Grid Refinement and Bidirectional Shortcuts with Application to Offshore Wind Farm Inspection
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1