交易主体在双边议价中的应用

S. Jamali, K. Faez
{"title":"交易主体在双边议价中的应用","authors":"S. Jamali, K. Faez","doi":"10.1109/UKSim.2012.39","DOIUrl":null,"url":null,"abstract":"In this research we use a learning method called SAQ-Learning to use for agents in a single-issue bargaining process. SAQ-Learning algorithm is an improved version of Q-Learning algorithm that benefits from the Metropolis criterion of Simulated Annealing (SA) algorithm to overcome the challenge of finding a balance between exploration and exploitation. Q-Learning is one the most important types of Reinforcement Learning (RL) because of the fact that it does not need the transition model of the environment. Artificial Intelligence (AI) approaches have attracted interest in solving bargaining problem. This is because Game Theory (GT) needs some unrealistic assumptions to solve bargaining problem. Presence of perfectly rational agents is an example of these assumptions. Therefore by designing SAQ-Learning agents to bargain with each other over price, we gained higher performance in case of settlement rate, average payoff, and the time an agent needs to find his optimal policy. This learning method can be a suitable learning algorithm for automated online bargaining agents in e-commerce.","PeriodicalId":405479,"journal":{"name":"2012 UKSim 14th International Conference on Computer Modelling and Simulation","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Applying SAQ-Learning Algorithm for Trading Agents in Bilateral Bargaining\",\"authors\":\"S. Jamali, K. Faez\",\"doi\":\"10.1109/UKSim.2012.39\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this research we use a learning method called SAQ-Learning to use for agents in a single-issue bargaining process. SAQ-Learning algorithm is an improved version of Q-Learning algorithm that benefits from the Metropolis criterion of Simulated Annealing (SA) algorithm to overcome the challenge of finding a balance between exploration and exploitation. Q-Learning is one the most important types of Reinforcement Learning (RL) because of the fact that it does not need the transition model of the environment. Artificial Intelligence (AI) approaches have attracted interest in solving bargaining problem. This is because Game Theory (GT) needs some unrealistic assumptions to solve bargaining problem. Presence of perfectly rational agents is an example of these assumptions. Therefore by designing SAQ-Learning agents to bargain with each other over price, we gained higher performance in case of settlement rate, average payoff, and the time an agent needs to find his optimal policy. This learning method can be a suitable learning algorithm for automated online bargaining agents in e-commerce.\",\"PeriodicalId\":405479,\"journal\":{\"name\":\"2012 UKSim 14th International Conference on Computer Modelling and Simulation\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-03-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 UKSim 14th International Conference on Computer Modelling and Simulation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/UKSim.2012.39\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 UKSim 14th International Conference on Computer Modelling and Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UKSim.2012.39","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在本研究中,我们使用了一种称为SAQ-Learning的学习方法,用于代理在单一问题的讨价还价过程中。SAQ-Learning算法是Q-Learning算法的改进版本,它利用了模拟退火算法的Metropolis准则来克服在探索和利用之间寻找平衡的挑战。Q-Learning是强化学习(RL)最重要的类型之一,因为它不需要环境的过渡模型。人工智能(AI)方法已引起人们对解决议价问题的兴趣。这是因为博弈论(GT)需要一些不切实际的假设来解决议价问题。完全理性主体的存在就是这些假设的一个例子。因此,通过设计SAQ-Learning agent进行价格讨价还价,我们在结算率、平均收益和agent找到最优策略所需时间的情况下获得了更高的性能。这种学习方法可以作为电子商务中自动在线议价代理的一种合适的学习算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Applying SAQ-Learning Algorithm for Trading Agents in Bilateral Bargaining
In this research we use a learning method called SAQ-Learning to use for agents in a single-issue bargaining process. SAQ-Learning algorithm is an improved version of Q-Learning algorithm that benefits from the Metropolis criterion of Simulated Annealing (SA) algorithm to overcome the challenge of finding a balance between exploration and exploitation. Q-Learning is one the most important types of Reinforcement Learning (RL) because of the fact that it does not need the transition model of the environment. Artificial Intelligence (AI) approaches have attracted interest in solving bargaining problem. This is because Game Theory (GT) needs some unrealistic assumptions to solve bargaining problem. Presence of perfectly rational agents is an example of these assumptions. Therefore by designing SAQ-Learning agents to bargain with each other over price, we gained higher performance in case of settlement rate, average payoff, and the time an agent needs to find his optimal policy. This learning method can be a suitable learning algorithm for automated online bargaining agents in e-commerce.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Optimal Method for Migration of Tasks with Duplication A Quantitative Evaluation Method of Landmark Effectiveness for Pedestrian Navigation Simulation of DPCM and ADM Systems A Genetic Algorithm Approach for Solving Group Technology Problem with Process Plan Flexibility Complexity Measure as a Feature to Classify Schizophrenic and Healthy Participants
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1