Generalized reinforcement learning control algorithm for fully automated insulin delivery system

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Expert Systems with Applications Pub Date : 2025-05-15 Epub Date: 2025-02-19 DOI:10.1016/j.eswa.2025.126909

Vega Pradana Rachim , Junyoung Yoo , Jaeyeon Lee , Yein Lee , Sung-Min Park

{"title":"Generalized reinforcement learning control algorithm for fully automated insulin delivery system","authors":"Vega Pradana Rachim , Junyoung Yoo , Jaeyeon Lee , Yein Lee , Sung-Min Park","doi":"10.1016/j.eswa.2025.126909","DOIUrl":null,"url":null,"abstract":"<div><div>A fully automated insulin delivery (Fully-AID) system is expected to provide the ultimate safety, comfort, and a sense of freedom for people living with diabetes (PwD). Previous studies have shown the potential of a deep reinforcement learning (DRL) model for fully-AID control algorithm in simulation environment. However, the practical implementation is still challenging due to the domain gaps between simulation and real world scenario. In this manuscript, we proposed a novel generalized control algorithm, called xgDRL, to realize a DRL-driven fully-AID system. The generalization of the proposed algorithm is achieved by our two main contributions that are introducing a novel concept of fully-AID context called total daily insulin (TDI) into the input of DRL model, and a novel training environment named type 1 diabetes (T1D) simulation-to-reality (T1Dsim2real). Here, we conduct a stepwise validation experiment to validate the performance of the proposed control algorithm, which comprises <em>in silico</em>, retrospective-counterfactual studies, and preclinical studies using a T1D pig model. Results from the preclinical validation demonstrate the effectiveness of the proposed algorithm, with average time in target range of 70–180 mg/dL of 72.8 %, 73.8 %, 74.5 %, and 86.9 % across breakfast, lunch, dinner, and overnight fasting time, respectively. Thus, this study represents the first preclinical validation of a DRL-driven fully-AID algorithm in PwD, confirming the efficacy of the xgDRL model in preclinical settings.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"274 ","pages":"Article 126909"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425005317","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/19 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

A fully automated insulin delivery (Fully-AID) system is expected to provide the ultimate safety, comfort, and a sense of freedom for people living with diabetes (PwD). Previous studies have shown the potential of a deep reinforcement learning (DRL) model for fully-AID control algorithm in simulation environment. However, the practical implementation is still challenging due to the domain gaps between simulation and real world scenario. In this manuscript, we proposed a novel generalized control algorithm, called xgDRL, to realize a DRL-driven fully-AID system. The generalization of the proposed algorithm is achieved by our two main contributions that are introducing a novel concept of fully-AID context called total daily insulin (TDI) into the input of DRL model, and a novel training environment named type 1 diabetes (T1D) simulation-to-reality (T1Dsim2real). Here, we conduct a stepwise validation experiment to validate the performance of the proposed control algorithm, which comprises in silico, retrospective-counterfactual studies, and preclinical studies using a T1D pig model. Results from the preclinical validation demonstrate the effectiveness of the proposed algorithm, with average time in target range of 70–180 mg/dL of 72.8 %, 73.8 %, 74.5 %, and 86.9 % across breakfast, lunch, dinner, and overnight fasting time, respectively. Thus, this study represents the first preclinical validation of a DRL-driven fully-AID algorithm in PwD, confirming the efficacy of the xgDRL model in preclinical settings.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

全自动胰岛素输送系统的广义强化学习控制算法

全自动胰岛素输送（full - aid）系统有望为糖尿病患者提供终极的安全、舒适和自由感。以往的研究表明，深度强化学习（DRL）模型在全aid控制算法仿真环境中的潜力。然而，由于模拟和现实世界场景之间的领域差距，实际实现仍然具有挑战性。在本文中，我们提出了一种新的广义控制算法，称为xgDRL，以实现drl驱动的全aid系统。我们提出的算法的推广是通过我们的两个主要贡献来实现的，即在DRL模型的输入中引入了一种新的全aid上下文概念，称为每日总胰岛素（TDI），以及一种新的训练环境，称为1型糖尿病（T1D）模拟到现实（T1Dsim2real）。在这里，我们进行了一项逐步验证实验，以验证所提出的控制算法的性能，该实验包括计算机、回顾性反事实研究和使用T1D猪模型的临床前研究。临床前验证的结果证明了该算法的有效性，早餐、午餐、晚餐和夜间禁食时间在70-180 mg/dL目标范围内的平均时间分别为72.8%、73.8%、74.5%和86.9%。因此，本研究代表了drl驱动的全aid算法在PwD中的首次临床前验证，证实了xgDRL模型在临床前环境中的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Expert Systems with Applications 工程技术-工程：电子与电气

CiteScore

13.80

自引率

10.60%

发文量

2045

审稿时长

8.7 months

期刊介绍： Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.