Strategic bidding with price-quantity pairs based on deep reinforcement learning considering competitors' behaviors

IF 11 1区 工程技术 Q1 ENERGY & FUELS Applied Energy Pub Date : 2025-08-01 Epub Date: 2025-04-14 DOI:10.1016/j.apenergy.2025.125874
Fei Hu , Yong Zhao , Yaowen Yu , Changshun Zhang , Yicheng Lian , Cheng Huang , Yuanzheng Li
{"title":"Strategic bidding with price-quantity pairs based on deep reinforcement learning considering competitors' behaviors","authors":"Fei Hu ,&nbsp;Yong Zhao ,&nbsp;Yaowen Yu ,&nbsp;Changshun Zhang ,&nbsp;Yicheng Lian ,&nbsp;Cheng Huang ,&nbsp;Yuanzheng Li","doi":"10.1016/j.apenergy.2025.125874","DOIUrl":null,"url":null,"abstract":"<div><div>In a smart electricity market, self-interested market participants may leverage a large amount of market data to bid strategically to maximize their profits. However, the existing studies in strategic bidding often ignore competitors' bidding behaviors and only consider strategic actions on prices without quantities. To bridge the gap, this paper develops a novel deep reinforcement learning-based framework to model and solve the strategic bidding problem of a producer. To capture competitors' historical bidding behaviors in the market environment, their demand-bid mappings are established based on a data-driven method combining K-medoids clustering and a deep neural network. To make full use of the bidding action space and increase the profit of the strategic producer, a bilevel optimization model considering bids in price-quantity pairs is formulated. To efficiently solve the problem with competitors' bidding behaviors, a twin delayed deep deterministic policy gradient-based algorithm is developed. Case studies on the IEEE 57-bus system show that the proposed framework obtains a 27.37 % higher expected value and a 47.60 % lower standard deviation of the profit compared to the existing approach, demonstrating its profitability and robustness under market dynamics. Another case on the IEEE 118-bus test system achieves a 33.34 % increase in the expected profit, further validating the advantages in profitability. These cases together demonstrate the effectiveness and scalability of our approach in systems of different sizes, as well as its potential application to strategic bidding in smart electricity markets.</div></div>","PeriodicalId":246,"journal":{"name":"Applied Energy","volume":"391 ","pages":""},"PeriodicalIF":11.0000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S030626192500604X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/14 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0

Abstract

In a smart electricity market, self-interested market participants may leverage a large amount of market data to bid strategically to maximize their profits. However, the existing studies in strategic bidding often ignore competitors' bidding behaviors and only consider strategic actions on prices without quantities. To bridge the gap, this paper develops a novel deep reinforcement learning-based framework to model and solve the strategic bidding problem of a producer. To capture competitors' historical bidding behaviors in the market environment, their demand-bid mappings are established based on a data-driven method combining K-medoids clustering and a deep neural network. To make full use of the bidding action space and increase the profit of the strategic producer, a bilevel optimization model considering bids in price-quantity pairs is formulated. To efficiently solve the problem with competitors' bidding behaviors, a twin delayed deep deterministic policy gradient-based algorithm is developed. Case studies on the IEEE 57-bus system show that the proposed framework obtains a 27.37 % higher expected value and a 47.60 % lower standard deviation of the profit compared to the existing approach, demonstrating its profitability and robustness under market dynamics. Another case on the IEEE 118-bus test system achieves a 33.34 % increase in the expected profit, further validating the advantages in profitability. These cases together demonstrate the effectiveness and scalability of our approach in systems of different sizes, as well as its potential application to strategic bidding in smart electricity markets.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
考虑竞争对手行为的基于深度强化学习的价量对策略投标
在智能电力市场中,自私自利的市场参与者可能会利用大量的市场数据进行策略性竞标,以实现利润最大化。然而,现有的战略投标研究往往忽略竞争对手的投标行为,只考虑价格上的战略行为,而不考虑数量上的战略行为。为了弥补这一差距,本文开发了一种新的基于深度强化学习的框架来建模和解决生产商的战略投标问题。为了捕捉市场环境中竞争对手的历史投标行为,基于k - medioids聚类和深度神经网络相结合的数据驱动方法,建立了竞争对手的需求-投标映射关系。为了充分利用投标行动空间,提高战略生产者的利润,建立了考虑价格-数量对投标的双层优化模型。为了有效地解决竞争对手的竞价行为问题,提出了一种基于双延迟深度确定性策略梯度的算法。对IEEE 57总线系统的实例研究表明,与现有方法相比,该框架的期望值提高了27.37%,利润标准差降低了47.60%,证明了其在市场动态下的盈利能力和鲁棒性。在IEEE 118总线测试系统的另一个案例中,预期利润提高了33.34%,进一步验证了盈利能力的优势。这些案例共同展示了我们的方法在不同规模的系统中的有效性和可扩展性,以及它在智能电力市场战略投标中的潜在应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Applied Energy
Applied Energy 工程技术-工程:化工
CiteScore
21.20
自引率
10.70%
发文量
1830
审稿时长
41 days
期刊介绍: Applied Energy serves as a platform for sharing innovations, research, development, and demonstrations in energy conversion, conservation, and sustainable energy systems. The journal covers topics such as optimal energy resource use, environmental pollutant mitigation, and energy process analysis. It welcomes original papers, review articles, technical notes, and letters to the editor. Authors are encouraged to submit manuscripts that bridge the gap between research, development, and implementation. The journal addresses a wide spectrum of topics, including fossil and renewable energy technologies, energy economics, and environmental impacts. Applied Energy also explores modeling and forecasting, conservation strategies, and the social and economic implications of energy policies, including climate change mitigation. It is complemented by the open-access journal Advances in Applied Energy.
期刊最新文献
Privacy-preserving transfer learning framework for building energy forecasting with fully anonymized data Robust Koopman EMPC for optimal frequency regulation of VSC-MTDC systems Operational decision of wind–photovoltaic–energy storage integrated system in day-ahead and ancillary service joint market considering weather variability Collaborative governance of carbon mitigation, energy transition, and material management: A factorial non-deterministic carbon-energy-metal nexus optimization model Systematic under-representation of ERA5 10 m wind speeds in the ERA5-land product undermines wind energy studies
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1