Generalized reinforcement learning control algorithm for fully automated insulin delivery system

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Expert Systems with Applications Pub Date : 2025-05-15 Epub Date: 2025-02-19 DOI:10.1016/j.eswa.2025.126909
Vega Pradana Rachim , Junyoung Yoo , Jaeyeon Lee , Yein Lee , Sung-Min Park
{"title":"Generalized reinforcement learning control algorithm for fully automated insulin delivery system","authors":"Vega Pradana Rachim ,&nbsp;Junyoung Yoo ,&nbsp;Jaeyeon Lee ,&nbsp;Yein Lee ,&nbsp;Sung-Min Park","doi":"10.1016/j.eswa.2025.126909","DOIUrl":null,"url":null,"abstract":"<div><div>A fully automated insulin delivery (Fully-AID) system is expected to provide the ultimate safety, comfort, and a sense of freedom for people living with diabetes (PwD). Previous studies have shown the potential of a deep reinforcement learning (DRL) model for fully-AID control algorithm in simulation environment. However, the practical implementation is still challenging due to the domain gaps between simulation and real world scenario. In this manuscript, we proposed a novel generalized control algorithm, called xgDRL, to realize a DRL-driven fully-AID system. The generalization of the proposed algorithm is achieved by our two main contributions that are introducing a novel concept of fully-AID context called total daily insulin (TDI) into the input of DRL model, and a novel training environment named type 1 diabetes (T1D) simulation-to-reality (T1Dsim2real). Here, we conduct a stepwise validation experiment to validate the performance of the proposed control algorithm, which comprises <em>in silico</em>, retrospective-counterfactual studies, and preclinical studies using a T1D pig model. Results from the preclinical validation demonstrate the effectiveness of the proposed algorithm, with average time in target range of 70–180 mg/dL of 72.8 %, 73.8 %, 74.5 %, and 86.9 % across breakfast, lunch, dinner, and overnight fasting time, respectively. Thus, this study represents the first preclinical validation of a DRL-driven fully-AID algorithm in PwD, confirming the efficacy of the xgDRL model in preclinical settings.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"274 ","pages":"Article 126909"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425005317","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/19 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

A fully automated insulin delivery (Fully-AID) system is expected to provide the ultimate safety, comfort, and a sense of freedom for people living with diabetes (PwD). Previous studies have shown the potential of a deep reinforcement learning (DRL) model for fully-AID control algorithm in simulation environment. However, the practical implementation is still challenging due to the domain gaps between simulation and real world scenario. In this manuscript, we proposed a novel generalized control algorithm, called xgDRL, to realize a DRL-driven fully-AID system. The generalization of the proposed algorithm is achieved by our two main contributions that are introducing a novel concept of fully-AID context called total daily insulin (TDI) into the input of DRL model, and a novel training environment named type 1 diabetes (T1D) simulation-to-reality (T1Dsim2real). Here, we conduct a stepwise validation experiment to validate the performance of the proposed control algorithm, which comprises in silico, retrospective-counterfactual studies, and preclinical studies using a T1D pig model. Results from the preclinical validation demonstrate the effectiveness of the proposed algorithm, with average time in target range of 70–180 mg/dL of 72.8 %, 73.8 %, 74.5 %, and 86.9 % across breakfast, lunch, dinner, and overnight fasting time, respectively. Thus, this study represents the first preclinical validation of a DRL-driven fully-AID algorithm in PwD, confirming the efficacy of the xgDRL model in preclinical settings.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
全自动胰岛素输送系统的广义强化学习控制算法
全自动胰岛素输送(full - aid)系统有望为糖尿病患者提供终极的安全、舒适和自由感。以往的研究表明,深度强化学习(DRL)模型在全aid控制算法仿真环境中的潜力。然而,由于模拟和现实世界场景之间的领域差距,实际实现仍然具有挑战性。在本文中,我们提出了一种新的广义控制算法,称为xgDRL,以实现drl驱动的全aid系统。我们提出的算法的推广是通过我们的两个主要贡献来实现的,即在DRL模型的输入中引入了一种新的全aid上下文概念,称为每日总胰岛素(TDI),以及一种新的训练环境,称为1型糖尿病(T1D)模拟到现实(T1Dsim2real)。在这里,我们进行了一项逐步验证实验,以验证所提出的控制算法的性能,该实验包括计算机、回顾性反事实研究和使用T1D猪模型的临床前研究。临床前验证的结果证明了该算法的有效性,早餐、午餐、晚餐和夜间禁食时间在70-180 mg/dL目标范围内的平均时间分别为72.8%、73.8%、74.5%和86.9%。因此,本研究代表了drl驱动的全aid算法在PwD中的首次临床前验证,证实了xgDRL模型在临床前环境中的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Expert Systems with Applications
Expert Systems with Applications 工程技术-工程:电子与电气
CiteScore
13.80
自引率
10.60%
发文量
2045
审稿时长
8.7 months
期刊介绍: Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.
期刊最新文献
FairDiff: Masked condition diffusion for fairness-aware recommendation CTGAN-MNLIME: A CTGAN-boosted multidimensional nonlinear LIME method for corporate environmental indicators prediction An explainable machine learning-based scoring function using interpretable features and model explanation approaches for binding affinity prediction Hybrid fuzzy multi-criteria decision-making model for assessing sustainable waste management strategies MPGCF: Multi-objective and popularity-smoothing graph collaborative filtering for long-tail web API recommendation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1