Delayed reinforcement learning converges to intermittent control for human quiet stance

IF 1.7 4区 医学 Q3 ENGINEERING, BIOMEDICAL Medical Engineering & Physics Pub Date : 2024-08-01 DOI:10.1016/j.medengphy.2024.104197
Yongkun Zhao , Balint K. Hodossy , Shibo Jing , Masahiro Todoh , Dario Farina
{"title":"Delayed reinforcement learning converges to intermittent control for human quiet stance","authors":"Yongkun Zhao ,&nbsp;Balint K. Hodossy ,&nbsp;Shibo Jing ,&nbsp;Masahiro Todoh ,&nbsp;Dario Farina","doi":"10.1016/j.medengphy.2024.104197","DOIUrl":null,"url":null,"abstract":"<div><p>The neural control of human quiet stance remains controversial, with classic views suggesting a limited role of the brain and recent findings conversely indicating direct cortical control of muscles during upright posture. Conceptual neural feedback control models have been proposed and tested against experimental evidence. The most renowned model is the continuous impedance control model. However, when time delays are included in this model to simulate neural transmission, the continuous controller becomes unstable. Another model, the intermittent control model, assumes that the central nervous system (CNS) activates muscles intermittently, and not continuously, to counteract gravitational torque. In this study, a delayed reinforcement learning algorithm was developed to seek optimal control policy to balance a one-segment inverted pendulum model representing the human body. According to this approach, there was no a-priori strategy imposed on the controller but rather the optimal strategy emerged from the reward-based learning. The simulation results indicated that the optimal neural controller exhibits intermittent, and not continuous, characteristics, in agreement with the possibility that the CNS intermittently provides neural feedback torque to maintain an upright posture.</p></div>","PeriodicalId":49836,"journal":{"name":"Medical Engineering & Physics","volume":"130 ","pages":"Article 104197"},"PeriodicalIF":1.7000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1350453324000985/pdfft?md5=f8324b603d1db550e3ab02402a64146c&pid=1-s2.0-S1350453324000985-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical Engineering & Physics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350453324000985","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

The neural control of human quiet stance remains controversial, with classic views suggesting a limited role of the brain and recent findings conversely indicating direct cortical control of muscles during upright posture. Conceptual neural feedback control models have been proposed and tested against experimental evidence. The most renowned model is the continuous impedance control model. However, when time delays are included in this model to simulate neural transmission, the continuous controller becomes unstable. Another model, the intermittent control model, assumes that the central nervous system (CNS) activates muscles intermittently, and not continuously, to counteract gravitational torque. In this study, a delayed reinforcement learning algorithm was developed to seek optimal control policy to balance a one-segment inverted pendulum model representing the human body. According to this approach, there was no a-priori strategy imposed on the controller but rather the optimal strategy emerged from the reward-based learning. The simulation results indicated that the optimal neural controller exhibits intermittent, and not continuous, characteristics, in agreement with the possibility that the CNS intermittently provides neural feedback torque to maintain an upright posture.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
延迟强化学习收敛于人类安静站姿的间歇控制
人类安静姿态的神经控制仍存在争议,传统观点认为大脑的作用有限,而最近的研究结果则相反,表明直立姿势时大脑皮层对肌肉的直接控制。神经反馈控制概念模型已被提出,并根据实验证据进行了检验。最著名的模型是连续阻抗控制模型。然而,当在该模型中加入时间延迟来模拟神经传输时,连续控制器就会变得不稳定。另一种模型是间歇控制模型,它假定中枢神经系统(CNS)会间歇性而非持续性地激活肌肉,以抵消重力扭矩。在这项研究中,开发了一种延迟强化学习算法来寻求最佳控制策略,以平衡代表人体的单节倒立摆模型。根据这种方法,控制器没有先验策略,而是通过基于奖励的学习获得最佳策略。模拟结果表明,最佳神经控制器表现出间歇性而非连续性的特点,这与中枢神经系统间歇性地提供神经反馈扭矩以保持直立姿势的可能性一致。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Medical Engineering & Physics
Medical Engineering & Physics 工程技术-工程:生物医学
CiteScore
4.30
自引率
4.50%
发文量
172
审稿时长
3.0 months
期刊介绍: Medical Engineering & Physics provides a forum for the publication of the latest developments in biomedical engineering, and reflects the essential multidisciplinary nature of the subject. The journal publishes in-depth critical reviews, scientific papers and technical notes. Our focus encompasses the application of the basic principles of physics and engineering to the development of medical devices and technology, with the ultimate aim of producing improvements in the quality of health care.Topics covered include biomechanics, biomaterials, mechanobiology, rehabilitation engineering, biomedical signal processing and medical device development. Medical Engineering & Physics aims to keep both engineers and clinicians abreast of the latest applications of technology to health care.
期刊最新文献
A patient-matched prosthesis for thumb amputations: Design, mechanical and functional evaluation Influence of surface type on outdoor gait parameters measured using an In-Shoe Motion Sensor System Assessment of pre- and post-operative gait dynamics in total knee arthroplasty by a wearable capture system A novel 3D lightweight model for COVID-19 lung CT Lesion Segmentation ResGloTBNet: An interpretable deep residual network with global long-range dependency for tuberculosis screening of sputum smear microscopy images
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1