Collaborative Search Model for Lost-Link Borrowers Information Based on Multi-Agent Q-Learning

IF 1.9 3区 数学 Q1 MATHEMATICS, APPLIED Axioms Pub Date : 2023-11-03 DOI:10.3390/axioms12111033
Ge You, Hao Guo, Abd Alwahed Dagestani, Ibrahim Alnafrah
{"title":"Collaborative Search Model for Lost-Link Borrowers Information Based on Multi-Agent Q-Learning","authors":"Ge You, Hao Guo, Abd Alwahed Dagestani, Ibrahim Alnafrah","doi":"10.3390/axioms12111033","DOIUrl":null,"url":null,"abstract":"To reduce the economic losses caused by debt evasion amongst lost-link borrowers (LBs) and improve the efficiency of finding information on LBs, this paper focuses on the cross-platform information collaborative search optimization problem for LBs. Given the limitations of platform/system heterogeneity, data type diversity, and the complexity of collaborative control in cross-platform information search for LBs, a collaborative search model for LBs’ information based on multi-agent technology is proposed. Additionally, a multi-agent Q-learning algorithm for the collaborative scheduling of multi-search subtasks is designed. We use the Q-learning algorithm based on function approximation to update the description model of the LBs. The multi-agent collaborative search problem is transformed into a reinforcement learning problem by defining search states, search actions, and reward functions. The results indicate that: (i) this model greatly improves the comprehensiveness and accuracy of the search for key information of LBs compared with traditional search engines; (ii) during searching for the information of LBs, the agent is more inclined to search on platforms and data types with larger environmental rewards, and the multi-agent Q-learning algorithm has a stronger ability to acquire information value than the transition probability matrix algorithm and the probability statistical algorithm for the same number of searches; (iii) the optimal search times of the multi-agent Q-learning algorithm are between 14 and 100. Users can flexibly set the number of searches within this range. It is significant for improving the efficiency of finding key information related to LBs.","PeriodicalId":53148,"journal":{"name":"Axioms","volume":"1 14","pages":"0"},"PeriodicalIF":1.9000,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Axioms","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/axioms12111033","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0

Abstract

To reduce the economic losses caused by debt evasion amongst lost-link borrowers (LBs) and improve the efficiency of finding information on LBs, this paper focuses on the cross-platform information collaborative search optimization problem for LBs. Given the limitations of platform/system heterogeneity, data type diversity, and the complexity of collaborative control in cross-platform information search for LBs, a collaborative search model for LBs’ information based on multi-agent technology is proposed. Additionally, a multi-agent Q-learning algorithm for the collaborative scheduling of multi-search subtasks is designed. We use the Q-learning algorithm based on function approximation to update the description model of the LBs. The multi-agent collaborative search problem is transformed into a reinforcement learning problem by defining search states, search actions, and reward functions. The results indicate that: (i) this model greatly improves the comprehensiveness and accuracy of the search for key information of LBs compared with traditional search engines; (ii) during searching for the information of LBs, the agent is more inclined to search on platforms and data types with larger environmental rewards, and the multi-agent Q-learning algorithm has a stronger ability to acquire information value than the transition probability matrix algorithm and the probability statistical algorithm for the same number of searches; (iii) the optimal search times of the multi-agent Q-learning algorithm are between 14 and 100. Users can flexibly set the number of searches within this range. It is significant for improving the efficiency of finding key information related to LBs.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于多智能体q -学习的失联借款人信息协同搜索模型
为了减少失联借款人逃债造成的经济损失,提高失联借款人的信息查找效率,本文重点研究了失联借款人跨平台信息协同搜索优化问题。针对LBs跨平台信息搜索存在平台/系统异构性、数据类型多样性和协同控制复杂性等局限性,提出了一种基于多智能体技术的LBs信息协同搜索模型。此外,设计了一种多智能体q -学习算法,用于多搜索子任务的协同调度。我们使用基于函数逼近的q -学习算法来更新lb的描述模型。通过定义搜索状态、搜索动作和奖励函数,将多智能体协同搜索问题转化为强化学习问题。结果表明:(1)与传统搜索引擎相比,该模型大大提高了LBs关键信息搜索的全面性和准确性;(ii)智能体在搜索LBs信息时,更倾向于在环境奖励较大的平台和数据类型上搜索,在相同搜索次数下,多智能体q -学习算法比转移概率矩阵算法和概率统计算法具有更强的获取信息价值的能力;(iii)多智能体Q-learning算法的最优搜索次数在14 ~ 100之间。用户可以在此范围内灵活设置搜索次数。这对于提高查找LBs相关关键信息的效率具有重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Axioms
Axioms Mathematics-Algebra and Number Theory
自引率
10.00%
发文量
604
审稿时长
11 weeks
期刊介绍: Axiomatic theories in physics and in mathematics (for example, axiomatic theory of thermodynamics, and also either the axiomatic classical set theory or the axiomatic fuzzy set theory) Axiomatization, axiomatic methods, theorems, mathematical proofs Algebraic structures, field theory, group theory, topology, vector spaces Mathematical analysis Mathematical physics Mathematical logic, and non-classical logics, such as fuzzy logic, modal logic, non-monotonic logic. etc. Classical and fuzzy set theories Number theory Systems theory Classical measures, fuzzy measures, representation theory, and probability theory Graph theory Information theory Entropy Symmetry Differential equations and dynamical systems Relativity and quantum theories Mathematical chemistry Automata theory Mathematical problems of artificial intelligence Complex networks from a mathematical viewpoint Reasoning under uncertainty Interdisciplinary applications of mathematical theory.
期刊最新文献
Modified Two-Parameter Liu Estimator for Addressing Multicollinearity in the Poisson Regression Model Results of Third-Order Strong Differential Subordinations A Probabilistic Physico-Chemical Diffusion Model of the Key Drifting Parameter of Measuring Equipment Finite-Time Passivity and Synchronization for a Class of Fuzzy Inertial Complex-Valued Neural Networks with Time-Varying Delays Integer-Valued Split-BREAK Process with a General Family of Innovations and Application to Accident Count Data Modeling
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1