探索马尔可夫决策过程:优化应用与技术综合概览

IgMin Research Pub Date : 2024-07-04 DOI:10.61927/igmin210
Khan Qazi Waqas
{"title":"探索马尔可夫决策过程:优化应用与技术综合概览","authors":"Khan Qazi Waqas","doi":"10.61927/igmin210","DOIUrl":null,"url":null,"abstract":"Markov decision process is a dynamic programming algorithm that can be used to solve an optimization problem. It was used in applications like robotics, radar tracking, medical treatments, and decision-making. In the existing literature, the researcher only targets a few applications area of MDP. However, this work surveyed the Markov decision process’s application in various regions for solving optimization problems. In a survey, we compared optimization techniques based on MDP. We performed a comparative analysis of past work of other researchers in the last few years based on a few parameters. These parameters are focused on the proposed problem, the proposed methodology for solving an optimization problem, and the results and outcomes of the optimization technique in solving a specific problem. Reinforcement learning is an emerging machine learning domain based on the Markov decision process. In this work, we conclude that the MDP-based approach is most widely used when deciding on the current state in some environments to move to the next state.","PeriodicalId":509147,"journal":{"name":"IgMin Research","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring Markov Decision Processes: A Comprehensive Survey of Optimization Applications and Techniques\",\"authors\":\"Khan Qazi Waqas\",\"doi\":\"10.61927/igmin210\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Markov decision process is a dynamic programming algorithm that can be used to solve an optimization problem. It was used in applications like robotics, radar tracking, medical treatments, and decision-making. In the existing literature, the researcher only targets a few applications area of MDP. However, this work surveyed the Markov decision process’s application in various regions for solving optimization problems. In a survey, we compared optimization techniques based on MDP. We performed a comparative analysis of past work of other researchers in the last few years based on a few parameters. These parameters are focused on the proposed problem, the proposed methodology for solving an optimization problem, and the results and outcomes of the optimization technique in solving a specific problem. Reinforcement learning is an emerging machine learning domain based on the Markov decision process. In this work, we conclude that the MDP-based approach is most widely used when deciding on the current state in some environments to move to the next state.\",\"PeriodicalId\":509147,\"journal\":{\"name\":\"IgMin Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IgMin Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.61927/igmin210\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IgMin Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.61927/igmin210","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

马尔可夫决策过程是一种动态编程算法,可用于解决优化问题。它被应用于机器人、雷达跟踪、医疗和决策等领域。在现有文献中,研究人员只针对 MDP 的几个应用领域进行了研究。然而,这项工作调查了马尔可夫决策过程在解决优化问题的各个领域中的应用。在调查中,我们比较了基于 MDP 的优化技术。我们根据几个参数对其他研究人员过去几年的工作进行了比较分析。这些参数主要集中在提出的问题、解决优化问题的方法以及优化技术解决特定问题的结果和成果。强化学习是基于马尔可夫决策过程的新兴机器学习领域。在这项工作中,我们得出结论,在某些环境中决定当前状态以进入下一状态时,基于马尔可夫决策过程的方法得到了最广泛的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Exploring Markov Decision Processes: A Comprehensive Survey of Optimization Applications and Techniques
Markov decision process is a dynamic programming algorithm that can be used to solve an optimization problem. It was used in applications like robotics, radar tracking, medical treatments, and decision-making. In the existing literature, the researcher only targets a few applications area of MDP. However, this work surveyed the Markov decision process’s application in various regions for solving optimization problems. In a survey, we compared optimization techniques based on MDP. We performed a comparative analysis of past work of other researchers in the last few years based on a few parameters. These parameters are focused on the proposed problem, the proposed methodology for solving an optimization problem, and the results and outcomes of the optimization technique in solving a specific problem. Reinforcement learning is an emerging machine learning domain based on the Markov decision process. In this work, we conclude that the MDP-based approach is most widely used when deciding on the current state in some environments to move to the next state.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Levetiracetam-induced Rhabdomyolysis - A Rare Complication Biomimetic Synthesis of Calcium Carbonate in Bile in the presence of Amino Acids The Influence of Low Pesticide Doses on Fusarium Molds Relationship between Sustainable Development, Economy and Poverty The Imperative for Modern Public Cloud Providers to Upgrade Their Data Centers
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1