Highly transferable adversarial attack against deep-reinforcement-learning-based frequency control

Zhongwei Li, Yang Liu, Peng Qiu, Hongyan Yin, Xu Wan, Mingyang Sun
{"title":"Highly transferable adversarial attack against deep-reinforcement-learning-based frequency control","authors":"Zhongwei Li,&nbsp;Yang Liu,&nbsp;Peng Qiu,&nbsp;Hongyan Yin,&nbsp;Xu Wan,&nbsp;Mingyang Sun","doi":"10.1049/enc2.12086","DOIUrl":null,"url":null,"abstract":"<p>With the increase in inverter-based renewable energy resources, the complexity and uncertainty of low-carbon power systems have increased significantly. Deep reinforcement learning (DRL)–based approaches have been extensively studied for frequency control to overcome the limitations of traditional model-based approaches. The goal of DRL-based methods for primary frequency control is to minimise load shedding while satisfying frequency safety requirements, thereby reducing control costs. However, the vulnerabilities of DRL models pose new security threats to power systems. These threats have not been identified and addressed in the existing literature. Therefore, in this paper, a series of vulnerability assessment methods are proposed for DRL-based frequency control with a focus on the under-frequency load shedding (UFLS) problem. Three adversarial sample production methods are designed with different optimisation directions: Q-value-based FGSM (Q-FGSM), action-based JSMA (A-JSMA), and state-action-based CW (SA-CW). Furthermore, combining the advantages of the above three attack methods, a hybrid adversarial attack algorithm is designed, Q-value-state-action-based mix (QSA-MIX), to significantly affect the decision process of the DRL model. In case studies of the IEEE39 bus system, the proposed attack methods had a severe impact on system operation and control. In particular, the high attack transferability of the proposed attack algorithms in a black-box setting provides further evidence that the vulnerability of current DRL-based control schemes is prevalent.</p>","PeriodicalId":100467,"journal":{"name":"Energy Conversion and Economics","volume":"4 3","pages":"202-212"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/enc2.12086","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy Conversion and Economics","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/enc2.12086","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the increase in inverter-based renewable energy resources, the complexity and uncertainty of low-carbon power systems have increased significantly. Deep reinforcement learning (DRL)–based approaches have been extensively studied for frequency control to overcome the limitations of traditional model-based approaches. The goal of DRL-based methods for primary frequency control is to minimise load shedding while satisfying frequency safety requirements, thereby reducing control costs. However, the vulnerabilities of DRL models pose new security threats to power systems. These threats have not been identified and addressed in the existing literature. Therefore, in this paper, a series of vulnerability assessment methods are proposed for DRL-based frequency control with a focus on the under-frequency load shedding (UFLS) problem. Three adversarial sample production methods are designed with different optimisation directions: Q-value-based FGSM (Q-FGSM), action-based JSMA (A-JSMA), and state-action-based CW (SA-CW). Furthermore, combining the advantages of the above three attack methods, a hybrid adversarial attack algorithm is designed, Q-value-state-action-based mix (QSA-MIX), to significantly affect the decision process of the DRL model. In case studies of the IEEE39 bus system, the proposed attack methods had a severe impact on system operation and control. In particular, the high attack transferability of the proposed attack algorithms in a black-box setting provides further evidence that the vulnerability of current DRL-based control schemes is prevalent.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
针对基于深度强化学习的频率控制的高度可转移对抗性攻击
随着基于逆变器的可再生能源的增加,低碳电力系统的复杂性和不确定性显著增加。基于深度强化学习(DRL)的方法已被广泛研究用于频率控制,以克服传统基于模型的方法的局限性。基于DRL的一次频率控制方法的目标是在满足频率安全要求的同时最大限度地减少甩负荷,从而降低控制成本。然而,DRL模型的漏洞对电力系统构成了新的安全威胁。现有文献中尚未发现和解决这些威胁。因此,本文针对低频减载(UFLS)问题,提出了一系列基于DRL的频率控制脆弱性评估方法。设计了三种具有不同优化方向的对抗性样本生成方法:基于Q值的FGSM(Q-FGSM)、基于动作的JSMA(A-JSMA)和基于状态-动作的CW(SA-CW)。此外,结合上述三种攻击方法的优点,设计了一种混合对抗性攻击算法,即基于Q值状态的混合(QSA-mix),以显著影响DRL模型的决策过程。在IEEE39总线系统的案例研究中,所提出的攻击方法对系统的运行和控制产生了严重影响。特别是,所提出的攻击算法在黑匣子设置中的高攻击可转移性进一步证明了当前基于DRL的控制方案的漏洞普遍存在。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A novel online reinforcement learning-based linear quadratic regulator for three-level neutral-point clamped DC/AC inverter Artificial intelligence-driven insights: Precision tracking of power plant carbon emissions using satellite data Forecasting masked-load with invisible distributed energy resources based on transfer learning and Bayesian tuning Collaborative deployment of multiple reinforcement methods for network-loss reduction in distribution system with seasonal loads State-of-health estimation of lithium-ion batteries: A comprehensive literature review from cell to pack levels
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1