Optimizing deep reinforcement learning in data-scarce domains: a cross-domain evaluation of double DQN and dueling DQN

IF 1.6 Q2 ENGINEERING, MULTIDISCIPLINARY International Journal of System Assurance Engineering and Management Pub Date : 2024-05-02 DOI:10.1007/s13198-024-02344-5
Nusrat Mohi Ud Din, Assif Assad, Saqib Ul Sabha, Muzafar Rasool
{"title":"Optimizing deep reinforcement learning in data-scarce domains: a cross-domain evaluation of double DQN and dueling DQN","authors":"Nusrat Mohi Ud Din, Assif Assad, Saqib Ul Sabha, Muzafar Rasool","doi":"10.1007/s13198-024-02344-5","DOIUrl":null,"url":null,"abstract":"<p>The challenge of limited labeled data is a persistent concern across diverse domains, including healthcare, niche agricultural practices, astronomy and space exploration, anomaly detection, and many more. Limited data can lead to biased training, overfitting, and poor generalization in Artificial Intelligence (AI) models. In response to this ubiquitous problem, this research explores the potential of deep reinforcement learning (DRL) algorithms, specifically Double Deep Q-Network (Double DQN) and Dueling Deep Q-Network (Dueling DQN). The algorithms were trained on small training subsets generated by subsampling from the original training datasets. In this subsampling process, 10, 20, 30, and 40 instances were selected from each class to form the smaller training subsets. Subsequently, the performance of these algorithms was comprehensively assessed by evaluating them on the entire test set. We employed datasets from two different domains where this problem mainly exists to assess their performance in data-constrained scenarios. A comparative analysis was conducted against a transfer learning approach widely employed to tackle similar challenges. The comprehensive evaluation reveals compelling results. In the medical domain, Dueling DQN consistently outperformed Double DQN and transfer learning, while in the agriculture domain, Double DQN demonstrates superior performance compared to Dueling DQN and transfer learning. These findings underscore the remarkable effectiveness of DRL algorithms in addressing data scarcity across a spectrum of domains, positioning DRL as a potent tool for enhancing diverse applications with limited labeled data.</p>","PeriodicalId":14463,"journal":{"name":"International Journal of System Assurance Engineering and Management","volume":null,"pages":null},"PeriodicalIF":1.6000,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of System Assurance Engineering and Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s13198-024-02344-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

The challenge of limited labeled data is a persistent concern across diverse domains, including healthcare, niche agricultural practices, astronomy and space exploration, anomaly detection, and many more. Limited data can lead to biased training, overfitting, and poor generalization in Artificial Intelligence (AI) models. In response to this ubiquitous problem, this research explores the potential of deep reinforcement learning (DRL) algorithms, specifically Double Deep Q-Network (Double DQN) and Dueling Deep Q-Network (Dueling DQN). The algorithms were trained on small training subsets generated by subsampling from the original training datasets. In this subsampling process, 10, 20, 30, and 40 instances were selected from each class to form the smaller training subsets. Subsequently, the performance of these algorithms was comprehensively assessed by evaluating them on the entire test set. We employed datasets from two different domains where this problem mainly exists to assess their performance in data-constrained scenarios. A comparative analysis was conducted against a transfer learning approach widely employed to tackle similar challenges. The comprehensive evaluation reveals compelling results. In the medical domain, Dueling DQN consistently outperformed Double DQN and transfer learning, while in the agriculture domain, Double DQN demonstrates superior performance compared to Dueling DQN and transfer learning. These findings underscore the remarkable effectiveness of DRL algorithms in addressing data scarcity across a spectrum of domains, positioning DRL as a potent tool for enhancing diverse applications with limited labeled data.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
优化数据稀缺领域的深度强化学习:双DQN和决斗DQN的跨领域评估
标注数据有限的挑战是各个领域长期存在的问题,包括医疗保健、利基农业实践、天文学和太空探索、异常检测等。有限的数据会导致人工智能(AI)模型的训练偏差、过度拟合和泛化效果不佳。针对这一普遍问题,本研究探索了深度强化学习(DRL)算法的潜力,特别是双深度 Q 网络(Double DQN)和决斗深度 Q 网络(Dueling DQN)。这些算法是在原始训练数据集的子采样生成的小型训练子集上进行训练的。在子采样过程中,从每一类中分别选取 10、20、30 和 40 个实例,形成较小的训练子集。随后,我们在整个测试集上对这些算法的性能进行了全面评估。我们采用了主要存在这一问题的两个不同领域的数据集,以评估它们在数据受限情况下的性能。我们还与广泛用于应对类似挑战的迁移学习方法进行了比较分析。综合评估结果令人信服。在医疗领域,Dueling DQN 的性能始终优于 Double DQN 和迁移学习,而在农业领域,Double DQN 的性能则优于 Dueling DQN 和迁移学习。这些发现凸显了 DRL 算法在解决各领域数据匮乏问题方面的显著效果,从而使 DRL 成为一种强有力的工具,可用于增强标注数据有限的各种应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.30
自引率
10.00%
发文量
252
期刊介绍: This Journal is established with a view to cater to increased awareness for high quality research in the seamless integration of heterogeneous technologies to formulate bankable solutions to the emergent complex engineering problems. Assurance engineering could be thought of as relating to the provision of higher confidence in the reliable and secure implementation of a system’s critical characteristic features through the espousal of a holistic approach by using a wide variety of cross disciplinary tools and techniques. Successful realization of sustainable and dependable products, systems and services involves an extensive adoption of Reliability, Quality, Safety and Risk related procedures for achieving high assurancelevels of performance; also pivotal are the management issues related to risk and uncertainty that govern the practical constraints encountered in their deployment. It is our intention to provide a platform for the modeling and analysis of large engineering systems, among the other aforementioned allied goals of systems assurance engineering, leading to the enforcement of performance enhancement measures. Achieving a fine balance between theory and practice is the primary focus. The Journal only publishes high quality papers that have passed the rigorous peer review procedure of an archival scientific Journal. The aim is an increasing number of submissions, wide circulation and a high impact factor.
期刊最新文献
Vision-based gait analysis to detect Parkinson’s disease using hybrid Harris hawks and Arithmetic optimization algorithm with Random Forest classifier Zero crossing point detection in a distorted sinusoidal signal using random forest classifier FL-XGBTC: federated learning inspired with XG-boost tuned classifier for YouTube spam content detection A generalized product adoption model under random marketing conditions Assessing e-learning platforms in higher education with reference to student satisfaction: a PLS-SEM approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1