Zheyan Tu, Sean Jeffries, Eric Pelletier, Oliver Cafferty, Joshua Morse, Avinash Sinha, Thomas Hemmerling
{"title":"Deep reinforcement learning for multi-targets propofol dosing.","authors":"Zheyan Tu, Sean Jeffries, Eric Pelletier, Oliver Cafferty, Joshua Morse, Avinash Sinha, Thomas Hemmerling","doi":"10.1007/s10877-025-01269-z","DOIUrl":null,"url":null,"abstract":"<p><p>The administration of propofol for sedation or general anesthesia presents challenges due to the complex relationship between patient factors and real-time physiological responses. This study explores the application of deep reinforcement learning (DRL) to automate propofol dosing, aiming to maintain multiple physiological parameters including bispectral index (BIS), heart rate (HR), respiratory rate (RR), and mean arterial pressure (MAP) within safe and desired ranges. A multi-variable pharmacokinetic-pharmacodynamic (PK/PD) simulation environment was developed to model the effects of propofol on the physiological parameters. An adjustable reward system was designed for multi-target anesthetic infusion. The DRL agent was trained using Twin Delayed Deep Deterministic Policy Gradient (TD3), interacting with the simulation environment and receiving rewards for maintaining physiological parameters close to their targets and above safety thresholds. The performance of the TD3 agent was compared to other DRL algorithms and traditional control methods. The TD3 algorithm demonstrated superior performance in achieving precise and safe control of multiple physiological parameters during propofol administration, outperforming other DRL algorithms and traditional control methods. The application of DRL, particularly TD3, offers a promising approach for automating propofol dosing, ensuring better management of physiological parameters and enhancing the safety and effectiveness of sedation and general anesthesia.</p>","PeriodicalId":15513,"journal":{"name":"Journal of Clinical Monitoring and Computing","volume":" ","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Clinical Monitoring and Computing","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10877-025-01269-z","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ANESTHESIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The administration of propofol for sedation or general anesthesia presents challenges due to the complex relationship between patient factors and real-time physiological responses. This study explores the application of deep reinforcement learning (DRL) to automate propofol dosing, aiming to maintain multiple physiological parameters including bispectral index (BIS), heart rate (HR), respiratory rate (RR), and mean arterial pressure (MAP) within safe and desired ranges. A multi-variable pharmacokinetic-pharmacodynamic (PK/PD) simulation environment was developed to model the effects of propofol on the physiological parameters. An adjustable reward system was designed for multi-target anesthetic infusion. The DRL agent was trained using Twin Delayed Deep Deterministic Policy Gradient (TD3), interacting with the simulation environment and receiving rewards for maintaining physiological parameters close to their targets and above safety thresholds. The performance of the TD3 agent was compared to other DRL algorithms and traditional control methods. The TD3 algorithm demonstrated superior performance in achieving precise and safe control of multiple physiological parameters during propofol administration, outperforming other DRL algorithms and traditional control methods. The application of DRL, particularly TD3, offers a promising approach for automating propofol dosing, ensuring better management of physiological parameters and enhancing the safety and effectiveness of sedation and general anesthesia.
期刊介绍:
The Journal of Clinical Monitoring and Computing is a clinical journal publishing papers related to technology in the fields of anaesthesia, intensive care medicine, emergency medicine, and peri-operative medicine.
The journal has links with numerous specialist societies, including editorial board representatives from the European Society for Computing and Technology in Anaesthesia and Intensive Care (ESCTAIC), the Society for Technology in Anesthesia (STA), the Society for Complex Acute Illness (SCAI) and the NAVAt (NAVigating towards your Anaestheisa Targets) group.
The journal publishes original papers, narrative and systematic reviews, technological notes, letters to the editor, editorial or commentary papers, and policy statements or guidelines from national or international societies. The journal encourages debate on published papers and technology, including letters commenting on previous publications or technological concerns. The journal occasionally publishes special issues with technological or clinical themes, or reports and abstracts from scientificmeetings. Special issues proposals should be sent to the Editor-in-Chief. Specific details of types of papers, and the clinical and technological content of papers considered within scope can be found in instructions for authors.