Nusrat Mohi Ud Din, Assif Assad, Saqib Ul Sabha, Muzafar Rasool
{"title":"Optimizing deep reinforcement learning in data-scarce domains: a cross-domain evaluation of double DQN and dueling DQN","authors":"Nusrat Mohi Ud Din, Assif Assad, Saqib Ul Sabha, Muzafar Rasool","doi":"10.1007/s13198-024-02344-5","DOIUrl":null,"url":null,"abstract":"<p>The challenge of limited labeled data is a persistent concern across diverse domains, including healthcare, niche agricultural practices, astronomy and space exploration, anomaly detection, and many more. Limited data can lead to biased training, overfitting, and poor generalization in Artificial Intelligence (AI) models. In response to this ubiquitous problem, this research explores the potential of deep reinforcement learning (DRL) algorithms, specifically Double Deep Q-Network (Double DQN) and Dueling Deep Q-Network (Dueling DQN). The algorithms were trained on small training subsets generated by subsampling from the original training datasets. In this subsampling process, 10, 20, 30, and 40 instances were selected from each class to form the smaller training subsets. Subsequently, the performance of these algorithms was comprehensively assessed by evaluating them on the entire test set. We employed datasets from two different domains where this problem mainly exists to assess their performance in data-constrained scenarios. A comparative analysis was conducted against a transfer learning approach widely employed to tackle similar challenges. The comprehensive evaluation reveals compelling results. In the medical domain, Dueling DQN consistently outperformed Double DQN and transfer learning, while in the agriculture domain, Double DQN demonstrates superior performance compared to Dueling DQN and transfer learning. These findings underscore the remarkable effectiveness of DRL algorithms in addressing data scarcity across a spectrum of domains, positioning DRL as a potent tool for enhancing diverse applications with limited labeled data.</p>","PeriodicalId":14463,"journal":{"name":"International Journal of System Assurance Engineering and Management","volume":null,"pages":null},"PeriodicalIF":1.6000,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of System Assurance Engineering and Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s13198-024-02344-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The challenge of limited labeled data is a persistent concern across diverse domains, including healthcare, niche agricultural practices, astronomy and space exploration, anomaly detection, and many more. Limited data can lead to biased training, overfitting, and poor generalization in Artificial Intelligence (AI) models. In response to this ubiquitous problem, this research explores the potential of deep reinforcement learning (DRL) algorithms, specifically Double Deep Q-Network (Double DQN) and Dueling Deep Q-Network (Dueling DQN). The algorithms were trained on small training subsets generated by subsampling from the original training datasets. In this subsampling process, 10, 20, 30, and 40 instances were selected from each class to form the smaller training subsets. Subsequently, the performance of these algorithms was comprehensively assessed by evaluating them on the entire test set. We employed datasets from two different domains where this problem mainly exists to assess their performance in data-constrained scenarios. A comparative analysis was conducted against a transfer learning approach widely employed to tackle similar challenges. The comprehensive evaluation reveals compelling results. In the medical domain, Dueling DQN consistently outperformed Double DQN and transfer learning, while in the agriculture domain, Double DQN demonstrates superior performance compared to Dueling DQN and transfer learning. These findings underscore the remarkable effectiveness of DRL algorithms in addressing data scarcity across a spectrum of domains, positioning DRL as a potent tool for enhancing diverse applications with limited labeled data.
期刊介绍:
This Journal is established with a view to cater to increased awareness for high quality research in the seamless integration of heterogeneous technologies to formulate bankable solutions to the emergent complex engineering problems.
Assurance engineering could be thought of as relating to the provision of higher confidence in the reliable and secure implementation of a system’s critical characteristic features through the espousal of a holistic approach by using a wide variety of cross disciplinary tools and techniques. Successful realization of sustainable and dependable products, systems and services involves an extensive adoption of Reliability, Quality, Safety and Risk related procedures for achieving high assurancelevels of performance; also pivotal are the management issues related to risk and uncertainty that govern the practical constraints encountered in their deployment. It is our intention to provide a platform for the modeling and analysis of large engineering systems, among the other aforementioned allied goals of systems assurance engineering, leading to the enforcement of performance enhancement measures. Achieving a fine balance between theory and practice is the primary focus. The Journal only publishes high quality papers that have passed the rigorous peer review procedure of an archival scientific Journal. The aim is an increasing number of submissions, wide circulation and a high impact factor.