交易主体在双边议价中的应用

2012 UKSim 14th International Conference on Computer Modelling and Simulation Pub Date : 2012-03-28 DOI:10.1109/UKSim.2012.39

S. Jamali, K. Faez

{"title":"交易主体在双边议价中的应用","authors":"S. Jamali, K. Faez","doi":"10.1109/UKSim.2012.39","DOIUrl":null,"url":null,"abstract":"In this research we use a learning method called SAQ-Learning to use for agents in a single-issue bargaining process. SAQ-Learning algorithm is an improved version of Q-Learning algorithm that benefits from the Metropolis criterion of Simulated Annealing (SA) algorithm to overcome the challenge of finding a balance between exploration and exploitation. Q-Learning is one the most important types of Reinforcement Learning (RL) because of the fact that it does not need the transition model of the environment. Artificial Intelligence (AI) approaches have attracted interest in solving bargaining problem. This is because Game Theory (GT) needs some unrealistic assumptions to solve bargaining problem. Presence of perfectly rational agents is an example of these assumptions. Therefore by designing SAQ-Learning agents to bargain with each other over price, we gained higher performance in case of settlement rate, average payoff, and the time an agent needs to find his optimal policy. This learning method can be a suitable learning algorithm for automated online bargaining agents in e-commerce.","PeriodicalId":405479,"journal":{"name":"2012 UKSim 14th International Conference on Computer Modelling and Simulation","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Applying SAQ-Learning Algorithm for Trading Agents in Bilateral Bargaining\",\"authors\":\"S. Jamali, K. Faez\",\"doi\":\"10.1109/UKSim.2012.39\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this research we use a learning method called SAQ-Learning to use for agents in a single-issue bargaining process. SAQ-Learning algorithm is an improved version of Q-Learning algorithm that benefits from the Metropolis criterion of Simulated Annealing (SA) algorithm to overcome the challenge of finding a balance between exploration and exploitation. Q-Learning is one the most important types of Reinforcement Learning (RL) because of the fact that it does not need the transition model of the environment. Artificial Intelligence (AI) approaches have attracted interest in solving bargaining problem. This is because Game Theory (GT) needs some unrealistic assumptions to solve bargaining problem. Presence of perfectly rational agents is an example of these assumptions. Therefore by designing SAQ-Learning agents to bargain with each other over price, we gained higher performance in case of settlement rate, average payoff, and the time an agent needs to find his optimal policy. This learning method can be a suitable learning algorithm for automated online bargaining agents in e-commerce.\",\"PeriodicalId\":405479,\"journal\":{\"name\":\"2012 UKSim 14th International Conference on Computer Modelling and Simulation\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-03-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 UKSim 14th International Conference on Computer Modelling and Simulation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/UKSim.2012.39\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 UKSim 14th International Conference on Computer Modelling and Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UKSim.2012.39","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

在本研究中，我们使用了一种称为SAQ-Learning的学习方法，用于代理在单一问题的讨价还价过程中。SAQ-Learning算法是Q-Learning算法的改进版本，它利用了模拟退火算法的Metropolis准则来克服在探索和利用之间寻找平衡的挑战。Q-Learning是强化学习(RL)最重要的类型之一，因为它不需要环境的过渡模型。人工智能(AI)方法已引起人们对解决议价问题的兴趣。这是因为博弈论(GT)需要一些不切实际的假设来解决议价问题。完全理性主体的存在就是这些假设的一个例子。因此，通过设计SAQ-Learning agent进行价格讨价还价，我们在结算率、平均收益和agent找到最优策略所需时间的情况下获得了更高的性能。这种学习方法可以作为电子商务中自动在线议价代理的一种合适的学习算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Applying SAQ-Learning Algorithm for Trading Agents in Bilateral Bargaining

In this research we use a learning method called SAQ-Learning to use for agents in a single-issue bargaining process. SAQ-Learning algorithm is an improved version of Q-Learning algorithm that benefits from the Metropolis criterion of Simulated Annealing (SA) algorithm to overcome the challenge of finding a balance between exploration and exploitation. Q-Learning is one the most important types of Reinforcement Learning (RL) because of the fact that it does not need the transition model of the environment. Artificial Intelligence (AI) approaches have attracted interest in solving bargaining problem. This is because Game Theory (GT) needs some unrealistic assumptions to solve bargaining problem. Presence of perfectly rational agents is an example of these assumptions. Therefore by designing SAQ-Learning agents to bargain with each other over price, we gained higher performance in case of settlement rate, average payoff, and the time an agent needs to find his optimal policy. This learning method can be a suitable learning algorithm for automated online bargaining agents in e-commerce.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 UKSim 14th International Conference on Computer Modelling and Simulation

自引率

0.00%

发文量

期刊最新文献

Optimal Method for Migration of Tasks with Duplication A Quantitative Evaluation Method of Landmark Effectiveness for Pedestrian Navigation Simulation of DPCM and ADM Systems A Genetic Algorithm Approach for Solving Group Technology Problem with Process Plan Flexibility Complexity Measure as a Feature to Classify Schizophrenic and Healthy Participants