Deterministic Discounted Markov Decision Processes with Fuzzy Rewards/Costs

IF 1.3 Q2 MATHEMATICS, APPLIED Fuzzy Information and Engineering Pub Date : 2023-09-01 DOI:10.26599/fie.2023.9270020
Hugo Cruz-Suárez, Raúl Montes-de-Oca, R. Israel Ortega-Gutiérrez
{"title":"Deterministic Discounted Markov Decision Processes with Fuzzy Rewards/Costs","authors":"Hugo Cruz-Suárez, Raúl Montes-de-Oca, R. Israel Ortega-Gutiérrez","doi":"10.26599/fie.2023.9270020","DOIUrl":null,"url":null,"abstract":"The article concerns a study of infinite-horizon deterministic Markov decision processes (MDPs) for which the fuzzy environment will be presented through considering these MDPs with both fuzzy rewards and fuzzy costs. Specifically, these rewards and costs will be assumed of a suitable trapezoidal type. For both classes of MDPs, i.e., MDPs with fuzzy rewards and MDPs with fuzzy costs, the fuzzy total discounted function will be taken into account as the objective function, and the corresponding optimal decision problems will be considered with respect to the max order of the fuzzy numbers. For each optimal decision problem, the optimal policy and the optimal value function are related and obtained as a solution of a convenient standard MDP (i.e., a standard MDP is an MDP with a non-fuzzy reward function or a non-fuzzy cost function). Moreover, an economic growth model (EGM), a deterministic version of the linear-quadratic model (LQM), and an optimal consumption model (OCM) in order to clarify the theory presented are given, and it is remarked that these models have uncountable state spaces, and the corresponding non-fuzzy version of both the EGM and the OCM has an unbounded reward function, and the corresponding non-fuzzy version of the LQM has an unbounded cost function.","PeriodicalId":37623,"journal":{"name":"Fuzzy Information and Engineering","volume":"11 1","pages":"0"},"PeriodicalIF":1.3000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuzzy Information and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26599/fie.2023.9270020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0

Abstract

The article concerns a study of infinite-horizon deterministic Markov decision processes (MDPs) for which the fuzzy environment will be presented through considering these MDPs with both fuzzy rewards and fuzzy costs. Specifically, these rewards and costs will be assumed of a suitable trapezoidal type. For both classes of MDPs, i.e., MDPs with fuzzy rewards and MDPs with fuzzy costs, the fuzzy total discounted function will be taken into account as the objective function, and the corresponding optimal decision problems will be considered with respect to the max order of the fuzzy numbers. For each optimal decision problem, the optimal policy and the optimal value function are related and obtained as a solution of a convenient standard MDP (i.e., a standard MDP is an MDP with a non-fuzzy reward function or a non-fuzzy cost function). Moreover, an economic growth model (EGM), a deterministic version of the linear-quadratic model (LQM), and an optimal consumption model (OCM) in order to clarify the theory presented are given, and it is remarked that these models have uncountable state spaces, and the corresponding non-fuzzy version of both the EGM and the OCM has an unbounded reward function, and the corresponding non-fuzzy version of the LQM has an unbounded cost function.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
具有模糊奖励/成本的确定性折现马尔可夫决策过程
本文研究了具有模糊回报和模糊代价的无限视界确定性马尔可夫决策过程的模糊环境。具体来说,这些回报和成本将被假设为一个合适的梯形。对于两类MDPs,即具有模糊奖励的MDPs和具有模糊代价的MDPs,将模糊总折现函数作为目标函数,并考虑相对于模糊数的最大阶的最优决策问题。对于每一个最优决策问题,最优策略和最优价值函数相互关联,并作为一个方便的标准MDP(即标准MDP是一个具有非模糊奖励函数或非模糊成本函数的MDP)的解得到。此外,为了阐明所提出的理论,给出了经济增长模型(EGM)、线性二次模型(LQM)的确定性版本和最优消费模型(OCM),并指出这些模型具有不可数的状态空间,EGM和OCM的非模糊版本都具有无界的奖励函数,LQM的非模糊版本具有无界的成本函数。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.30
自引率
0.00%
发文量
13
审稿时长
40 weeks
期刊介绍: Fuzzy Information and Engineering—An International Journal wants to provide a unified communication platform for researchers in a wide area of topics from pure and applied mathematics, computer science, engineering, and other related fields. While also accepting fundamental work, the journal focuses on applications. Research papers, short communications, and reviews are welcome. Technical topics within the scope include: (1) Fuzzy Information a. Fuzzy information theory and information systems b. Fuzzy clustering and classification c. Fuzzy information processing d. Hardware and software co-design e. Fuzzy computer f. Fuzzy database and data mining g. Fuzzy image processing and pattern recognition h. Fuzzy information granulation i. Knowledge acquisition and representation in fuzzy information (2) Fuzzy Sets and Systems a. Fuzzy sets b. Fuzzy analysis c. Fuzzy topology and fuzzy mapping d. Fuzzy equation e. Fuzzy programming and optimal f. Fuzzy probability and statistic g. Fuzzy logic and algebra h. General systems i. Fuzzy socioeconomic system j. Fuzzy decision support system k. Fuzzy expert system (3) Soft Computing a. Soft computing theory and foundation b. Nerve cell algorithms c. Genetic algorithms d. Fuzzy approximation algorithms e. Computing with words and Quantum computation (4) Fuzzy Engineering a. Fuzzy control b. Fuzzy system engineering c. Fuzzy knowledge engineering d. Fuzzy management engineering e. Fuzzy design f. Fuzzy industrial engineering g. Fuzzy system modeling (5) Fuzzy Operations Research [...] (6) Artificial Intelligence [...] (7) Others [...]
期刊最新文献
A Multi-Level Multi-Objective Integer Quadratic Programming Problem under Pentagonal Neutrosophic Environment Certain Concepts in Directed Rough Fuzzy Graphs and Application to Mergers of Companies A Study on Hutchinson-Barnsley Theory in Product Intuitionistic Fuzzy Fractal Space Existence and Uniqueness of Solutions for Fuzzy Boundary Value Problems Under Granular Differentiability Resolution of Fuzzy Relation Equations with Constraints
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1