首页 > 最新文献

Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001最新文献

英文 中文
An improved representation of functions for partition based functional decomposition 基于分区的函数分解的一种改进的函数表示
M. Venkatesan, H. Selvaraj, R. Bignall
Summary form only given. Functional decomposition is a process of representing a complex function as a function of functions with fewer variables. Earlier partition based functional decomposition tools represent the functions using r-partition. The r-partition representation is an abstract representation of the function and their memory requirements are super-exponential. An improved functional representation called ir-partition is proposed. The ir-partition representation is a complete representation of the function and requires less memory to store the functions. The main idea behind the ir-partition representation is to incorporate the values of the minterms corresponding to the variables (cubes). Hence, repeated access of the truth table is not necessary to read the value of the minterms. The computational time to calculate the ir-partition operations are three times greater than the computational time and memory requirement to calculate r-partition. However, the memory requirements for representing the function using ir-partition is half the memory requirement using the r-partition representation (abstract representation). Their partition representation also allows us to perform certain Partition Calculus operations implicitly. The representation has been implemented and tested with the MCNC benchmarks.
只提供摘要形式。函数分解是将一个复杂函数表示为具有较少变量的函数的函数的过程。早期基于分区的功能分解工具使用r-分区表示函数。r-分区表示是函数的抽象表示,它们的内存需求是超指数的。提出了一种改进的函数表示,称为ir-partition。分区表示是函数的完整表示,需要更少的内存来存储函数。分区表示背后的主要思想是合并与变量(多维数据集)相对应的最小项的值。因此,不需要重复访问真值表来读取最小项的值。计算r-partition操作所需的计算时间是计算r-partition所需的计算时间和内存的三倍。然而,使用r-partition表示函数的内存需求是使用r-partition表示(抽象表示)的内存需求的一半。它们的分区表示也允许我们隐式地执行某些分区微积分操作。该表示已在MCNC基准测试中实现和测试。
{"title":"An improved representation of functions for partition based functional decomposition","authors":"M. Venkatesan, H. Selvaraj, R. Bignall","doi":"10.1109/ICCIMA.2001.970497","DOIUrl":"https://doi.org/10.1109/ICCIMA.2001.970497","url":null,"abstract":"Summary form only given. Functional decomposition is a process of representing a complex function as a function of functions with fewer variables. Earlier partition based functional decomposition tools represent the functions using r-partition. The r-partition representation is an abstract representation of the function and their memory requirements are super-exponential. An improved functional representation called ir-partition is proposed. The ir-partition representation is a complete representation of the function and requires less memory to store the functions. The main idea behind the ir-partition representation is to incorporate the values of the minterms corresponding to the variables (cubes). Hence, repeated access of the truth table is not necessary to read the value of the minterms. The computational time to calculate the ir-partition operations are three times greater than the computational time and memory requirement to calculate r-partition. However, the memory requirements for representing the function using ir-partition is half the memory requirement using the r-partition representation (abstract representation). Their partition representation also allows us to perform certain Partition Calculus operations implicitly. The representation has been implemented and tested with the MCNC benchmarks.","PeriodicalId":232504,"journal":{"name":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129331036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Macroscopic quantitative observation of multi-robot behavior 多机器人行为的宏观定量观察
M. Kinoshita, H. Yokoi, Y. Kakazu, Michiko Watanabe, T. Kawakami
It is very difficult to estimate behaviors of multiple autonomous robots or mutual interactions of them in real time. Therefore, we propose a quantitative observation approach of multiple robots behaviors. This approach introduces thermodynamic macroscopic state values to the multi-robot systems. The advantage of this approach is that it enables to observe the behaviors of autonomous robots in real world and can be mapped to characteristic values in another conceptual state space. Thermodynamic macroscopic state values, such as temperature, pressure and entropy, are defined in mobile robots systems. In our definition, each mobile robot is supposed to have a particle in thermodynamic systems. The experiment shows that the states of robots system can be classified by thermodynamic macroscopic state value. This verifies that the macroscopic quantitative observation is efficient and applicable to control multi-robot systems.
实时估计多个自主机器人的行为或它们之间的相互作用是非常困难的。因此,我们提出了一种多机器人行为的定量观察方法。该方法将热力学宏观状态值引入多机器人系统。这种方法的优点是可以观察到现实世界中自主机器人的行为,并可以映射到另一个概念状态空间中的特征值。在移动机器人系统中,定义了温度、压力和熵等热力学宏观状态值。在我们的定义中,每个移动机器人在热力学系统中应该有一个粒子。实验表明,机器人系统的状态可以通过热力学宏观状态值进行分类。验证了宏观定量观测方法在多机器人系统控制中的有效性和适用性。
{"title":"Macroscopic quantitative observation of multi-robot behavior","authors":"M. Kinoshita, H. Yokoi, Y. Kakazu, Michiko Watanabe, T. Kawakami","doi":"10.1109/ICCIMA.2001.970466","DOIUrl":"https://doi.org/10.1109/ICCIMA.2001.970466","url":null,"abstract":"It is very difficult to estimate behaviors of multiple autonomous robots or mutual interactions of them in real time. Therefore, we propose a quantitative observation approach of multiple robots behaviors. This approach introduces thermodynamic macroscopic state values to the multi-robot systems. The advantage of this approach is that it enables to observe the behaviors of autonomous robots in real world and can be mapped to characteristic values in another conceptual state space. Thermodynamic macroscopic state values, such as temperature, pressure and entropy, are defined in mobile robots systems. In our definition, each mobile robot is supposed to have a particle in thermodynamic systems. The experiment shows that the states of robots system can be classified by thermodynamic macroscopic state value. This verifies that the macroscopic quantitative observation is efficient and applicable to control multi-robot systems.","PeriodicalId":232504,"journal":{"name":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","volume":"116 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128082292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Evolving neural networks for chlorophyll-a prediction 用于叶绿素-a预测的进化神经网络
X. Yao, Yong Liu
The paper studies the application of evolutionary artificial neural networks to chlorophyll-a prediction in Lake Kasumigaura (in Japan). Unlike previous applications of artificial neural networks in this field, the architecture of the artificial neural network is evolved automatically rather than designed manually. The evolutionary system is able to find a near optimal architecture of the artificial neural network for the prediction task. Our experimental results have shown that evolved artificial neural networks are very compact and generalise well. The evolutionary system is able to explore a large space of possible artificial neural networks and discover novel artificial neural networks for solving a problem.
本文研究了进化人工神经网络在日本霞aura湖叶绿素a预测中的应用。与以往人工神经网络在该领域的应用不同,人工神经网络的架构是自动进化的,而不是人工设计的。进化系统能够为预测任务找到接近最优的人工神经网络结构。实验结果表明,进化后的人工神经网络结构紧凑,具有良好的泛化能力。进化系统能够探索大量可能的人工神经网络,并发现新的人工神经网络来解决问题。
{"title":"Evolving neural networks for chlorophyll-a prediction","authors":"X. Yao, Yong Liu","doi":"10.1109/ICCIMA.2001.970465","DOIUrl":"https://doi.org/10.1109/ICCIMA.2001.970465","url":null,"abstract":"The paper studies the application of evolutionary artificial neural networks to chlorophyll-a prediction in Lake Kasumigaura (in Japan). Unlike previous applications of artificial neural networks in this field, the architecture of the artificial neural network is evolved automatically rather than designed manually. The evolutionary system is able to find a near optimal architecture of the artificial neural network for the prediction task. Our experimental results have shown that evolved artificial neural networks are very compact and generalise well. The evolutionary system is able to explore a large space of possible artificial neural networks and discover novel artificial neural networks for solving a problem.","PeriodicalId":232504,"journal":{"name":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129185342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
PANDORA: a multi-agent system using paraconsistent logic 潘多拉:一个使用超一致逻辑的多代理系统
E. Angelotti, E. Scalabrin, B. C. Ávila
This work is part of the Multicheck Project that defines architecture of cognitive and independents agents for the automatic treatment of handwritten Brazilian bank checks. The concept of autonomous agents allows us to organize the application knowledge and brings several own benefits to the approach. The choice of this approach is supported in a triple hypothesis. First, the nature of the problem in question allows decomposition in well-defined tasks, and each of them can be encapsulated in an independent agent. Second, the natural capability of interaction of the agents makes the check treatment process more robust, solving situations apparently difficult. Third, the natural parallelism between the agents can contribute to implement an application with high performance.
这项工作是Multicheck项目的一部分,该项目定义了用于自动处理手写巴西银行支票的认知和独立代理的架构。自主代理的概念允许我们组织应用程序知识,并为该方法带来了几个好处。这种方法的选择在三重假设中得到支持。首先,所讨论的问题的性质允许分解为定义良好的任务,并且每个任务都可以封装在一个独立的代理中。其次,代理之间相互作用的自然能力使检查处理过程更加稳健,解决了明显困难的情况。第三,代理之间的自然并行性有助于实现具有高性能的应用程序。
{"title":"PANDORA: a multi-agent system using paraconsistent logic","authors":"E. Angelotti, E. Scalabrin, B. C. Ávila","doi":"10.1109/ICCIMA.2001.970493","DOIUrl":"https://doi.org/10.1109/ICCIMA.2001.970493","url":null,"abstract":"This work is part of the Multicheck Project that defines architecture of cognitive and independents agents for the automatic treatment of handwritten Brazilian bank checks. The concept of autonomous agents allows us to organize the application knowledge and brings several own benefits to the approach. The choice of this approach is supported in a triple hypothesis. First, the nature of the problem in question allows decomposition in well-defined tasks, and each of them can be encapsulated in an independent agent. Second, the natural capability of interaction of the agents makes the check treatment process more robust, solving situations apparently difficult. Third, the natural parallelism between the agents can contribute to implement an application with high performance.","PeriodicalId":232504,"journal":{"name":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117212265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Working on student models (really) based on mental states 研究基于心理状态的学生模型
L. Giraffa, M. Mora, A. Zamberlam
This paper shows some aspects of our work using a Multi-agent System applied to build interactive Intelligent Tutoring Systems (ITS) with student model based on mental states (BDI agents). We present some aspects about BDI agents' implementation, and the tool (E-BDI editor) under construction. We do believe that building BDI agents is not simple. This declarative paradigm and its implementation needs to be guided by a visual tool. This editor was designed to aid research to organise the basic set of mental states needed to model the cognitive agents.
本文介绍了我们利用多智能体系统(Multi-agent System)构建基于心理状态(BDI agent)学生模型的交互式智能辅导系统(ITS)的工作。我们介绍了BDI代理实现的一些方面,以及正在构建的工具(E-BDI编辑器)。我们相信建立BDI代理并不简单。这种声明性范例及其实现需要一个可视化工具来指导。这个编辑器的目的是帮助研究组织认知代理模型所需的基本心理状态集。
{"title":"Working on student models (really) based on mental states","authors":"L. Giraffa, M. Mora, A. Zamberlam","doi":"10.1109/ICCIMA.2001.970499","DOIUrl":"https://doi.org/10.1109/ICCIMA.2001.970499","url":null,"abstract":"This paper shows some aspects of our work using a Multi-agent System applied to build interactive Intelligent Tutoring Systems (ITS) with student model based on mental states (BDI agents). We present some aspects about BDI agents' implementation, and the tool (E-BDI editor) under construction. We do believe that building BDI agents is not simple. This declarative paradigm and its implementation needs to be guided by a visual tool. This editor was designed to aid research to organise the basic set of mental states needed to model the cognitive agents.","PeriodicalId":232504,"journal":{"name":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129746764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Effects of proportion of metanorm players on establishment of norm 规范参与者比例对规范建立的影响
T. Yamashita, H. Kawamura, M. Yamamoto, A. Ohuchi
The authors introduce a mutual choice mechanism into the norms game instead of direct penal regulation and reformulate the norms game with mutual choice. In order to examine how the proportion of metanorm players to all players influences the establishment of norm in the metanorms game with mutual choice, we perform simulation with several different proportions of metanorm players. We exclude the game theoretic assumption of uniformity as the metanorm player and observe the frequency of the establishment of the norm. As a result, we confirm the robustness of mutual choice against insufficiency of metanorm players.
作者在规范博弈中引入了一种相互选择机制来代替直接的刑罚规制,用相互选择机制来重新构建规范博弈。为了考察在相互选择的异态博弈中,异态玩家占所有玩家的比例如何影响规范的建立,我们对不同比例的异态玩家进行了模拟。我们排除了博弈论的均匀性假设,并观察了规范建立的频率。结果证明了互选择对变异参与人不足的鲁棒性。
{"title":"Effects of proportion of metanorm players on establishment of norm","authors":"T. Yamashita, H. Kawamura, M. Yamamoto, A. Ohuchi","doi":"10.1109/ICCIMA.2001.970467","DOIUrl":"https://doi.org/10.1109/ICCIMA.2001.970467","url":null,"abstract":"The authors introduce a mutual choice mechanism into the norms game instead of direct penal regulation and reformulate the norms game with mutual choice. In order to examine how the proportion of metanorm players to all players influences the establishment of norm in the metanorms game with mutual choice, we perform simulation with several different proportions of metanorm players. We exclude the game theoretic assumption of uniformity as the metanorm player and observe the frequency of the establishment of the norm. As a result, we confirm the robustness of mutual choice against insufficiency of metanorm players.","PeriodicalId":232504,"journal":{"name":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129265038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Boxed Economy Simulation Platform and foundation model 箱体经济仿真平台及基础模型
T. Iba, Y. Takabe, Y. Chubachi, Y. Takefuji
The authors propose a "Boxed Economy Simulation Platform", which is a sharable basis for agent-based economic simulations. By providing the basic design of the social model, which we call "Boxed Economy Foundation Model", it enables collaborative research more efficiently. Sharing and cumulating the model components can be promoted by domain-specific design at the level of social model rather than the level of abstract general purpose model. It will be able to contribute to remove factors that have been making it difficult for social scientists to participate in and conduct agent-based research.
作者提出了一个“盒式经济仿真平台”,这是基于主体的经济仿真的共享基础。通过提供社会模型的基本设计,我们称之为“盒装经济基础模型”,它使合作研究更有效。在社会模型的层次上,而不是抽象的通用模型的层次上,可以通过特定领域的设计来促进模型组件的共享和累积。它将有助于消除使社会科学家难以参与和开展基于主体的研究的因素。
{"title":"Boxed Economy Simulation Platform and foundation model","authors":"T. Iba, Y. Takabe, Y. Chubachi, Y. Takefuji","doi":"10.1109/ICCIMA.2001.970442","DOIUrl":"https://doi.org/10.1109/ICCIMA.2001.970442","url":null,"abstract":"The authors propose a \"Boxed Economy Simulation Platform\", which is a sharable basis for agent-based economic simulations. By providing the basic design of the social model, which we call \"Boxed Economy Foundation Model\", it enables collaborative research more efficiently. Sharing and cumulating the model components can be promoted by domain-specific design at the level of social model rather than the level of abstract general purpose model. It will be able to contribute to remove factors that have been making it difficult for social scientists to participate in and conduct agent-based research.","PeriodicalId":232504,"journal":{"name":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130147014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Interactive multiagent reinforcement learning with motivation rules 基于动机规则的交互式多智能体强化学习
T. Yamaguchi, Ryo Marukawa
Presents a new framework of multi-agent reinforcement learning to acquire cooperative behaviors by generating and coordinating each learning goal interactively among agents. One of the main goals of artificial intelligence is to realize an intelligent agent that behaves autonomously by its sense of values. Reinforcement learning (RL) is the major learning mechanism for the agent to adapt itself to various situations of an unknown environment flexibly. However, in a multi-agent system environment that has mutual dependency among agents, it is difficult for a human to set up suitable learning goals for each agent, and, in addition, the existing framework of RL that aims for egoistic optimality of each agent is inadequate. Therefore, an active and interactive learning mechanism is required to generate and coordinate each learning goal among the agents. To realize this, first we propose to treat each learning goal as a reinforcement signal (RS) that can be communicated among the agents. Second, we introduce motivation rules to integrate the RSs communicated among the agents into a reward value for RL of an agent. Then we define cooperative rewards as learning goals with mutual dependency. Learning experiments for two agents with various motivation rules are performed. The experimental results show that several combinations of motivation rules converge to cooperative behaviors.
提出了一种新的多智能体强化学习框架,通过智能体之间的交互生成和协调每个学习目标来获得合作行为。人工智能的主要目标之一是实现一个能够根据其价值观自主行为的智能体。强化学习(Reinforcement learning, RL)是智能体灵活适应未知环境各种情况的主要学习机制。然而,在智能体之间相互依赖的多智能体系统环境中,人类很难为每个智能体设定合适的学习目标,而且现有的以每个智能体的利己最优为目标的强化学习框架也存在不足。因此,需要一种主动和互动的学习机制来生成和协调智能体之间的每个学习目标。为了实现这一点,首先我们建议将每个学习目标视为一个可以在智能体之间传递的强化信号(RS)。其次,我们引入激励规则,将智能体之间沟通的RSs整合为智能体RL的奖励值。然后我们将合作奖励定义为相互依赖的学习目标。对两个具有不同动机规则的智能体进行了学习实验。实验结果表明,多种激励规则组合收敛为合作行为。
{"title":"Interactive multiagent reinforcement learning with motivation rules","authors":"T. Yamaguchi, Ryo Marukawa","doi":"10.1109/ICCIMA.2001.970456","DOIUrl":"https://doi.org/10.1109/ICCIMA.2001.970456","url":null,"abstract":"Presents a new framework of multi-agent reinforcement learning to acquire cooperative behaviors by generating and coordinating each learning goal interactively among agents. One of the main goals of artificial intelligence is to realize an intelligent agent that behaves autonomously by its sense of values. Reinforcement learning (RL) is the major learning mechanism for the agent to adapt itself to various situations of an unknown environment flexibly. However, in a multi-agent system environment that has mutual dependency among agents, it is difficult for a human to set up suitable learning goals for each agent, and, in addition, the existing framework of RL that aims for egoistic optimality of each agent is inadequate. Therefore, an active and interactive learning mechanism is required to generate and coordinate each learning goal among the agents. To realize this, first we propose to treat each learning goal as a reinforcement signal (RS) that can be communicated among the agents. Second, we introduce motivation rules to integrate the RSs communicated among the agents into a reward value for RL of an agent. Then we define cooperative rewards as learning goals with mutual dependency. Learning experiments for two agents with various motivation rules are performed. The experimental results show that several combinations of motivation rules converge to cooperative behaviors.","PeriodicalId":232504,"journal":{"name":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122348762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
On the rationality of profit sharing in multi-agent reinforcement learning 论多智能体强化学习中利润分配的合理性
K. Miyazaki, S. Kobayashi
Reinforcement learning is a kind of machine learning. It aims to adapt an agent to an unknown environment according to rewards. Traditionally, from a theoretical point of view, many reinforcement learning systems assume that the environment has Markovian properties. However, it is important to treat non-Markovian environments in multi-agent reinforcement learning systems. The authors use Profit Sharing (PS) as a reinforcement learning system and discuss the rationality of PS in multi-agent environments. In particular, we classify non-Markovian environments and discuss how to share a reward among reinforcement learning agents. Through a crane control problem, we confirm the effectiveness of PS in multi-agent environments.
强化学习是机器学习的一种。它旨在根据奖励使智能体适应未知环境。传统上,从理论的角度来看,许多强化学习系统假设环境具有马尔可夫性质。然而,在多智能体强化学习系统中处理非马尔可夫环境是很重要的。将利润分享作为一种强化学习系统,讨论了多智能体环境下利润分享的合理性。特别是,我们对非马尔可夫环境进行分类,并讨论如何在强化学习代理之间共享奖励。通过一个起重机控制问题,验证了多智能体环境下PS算法的有效性。
{"title":"On the rationality of profit sharing in multi-agent reinforcement learning","authors":"K. Miyazaki, S. Kobayashi","doi":"10.1109/ICCIMA.2001.970455","DOIUrl":"https://doi.org/10.1109/ICCIMA.2001.970455","url":null,"abstract":"Reinforcement learning is a kind of machine learning. It aims to adapt an agent to an unknown environment according to rewards. Traditionally, from a theoretical point of view, many reinforcement learning systems assume that the environment has Markovian properties. However, it is important to treat non-Markovian environments in multi-agent reinforcement learning systems. The authors use Profit Sharing (PS) as a reinforcement learning system and discuss the rationality of PS in multi-agent environments. In particular, we classify non-Markovian environments and discuss how to share a reward among reinforcement learning agents. Through a crane control problem, we confirm the effectiveness of PS in multi-agent environments.","PeriodicalId":232504,"journal":{"name":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124240286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Data visualization tools for 3SAT instances 3SAT实例的数据可视化工具
J.M. Garcia-Ortegon, J. Torres-Jiménez
We present a set of data visualization tools that could lead to a better understanding of the conditions that make of a SAT instance a hard one. The visualization techniques included in this work are used to make evident the relationships between the SAT-variables in function of their distribution, signs and combinations of signs in the clauses. Using this information, the user may identify patterns associated with the hardness of a SAT instance which gives a more flexible measurement of the instance's hardness than just the relationship between the number of clauses (M) and the number of variables (N). It could be even possible to use the developed data visualization tools to determine if a specific solution method is best suitable for a specific SAT instance.
我们提供了一套数据可视化工具,可以更好地理解使SAT实例变得困难的条件。本工作中使用的可视化技术用于明确sat变量的分布、符号和子句中符号组合的函数之间的关系。使用这些信息,用户可以识别与SAT实例硬度相关的模式,这提供了一个更灵活的实例硬度测量,而不仅仅是子句数量(M)和变量数量(N)之间的关系。甚至可以使用开发的数据可视化工具来确定特定的解决方法是否最适合特定的SAT实例。
{"title":"Data visualization tools for 3SAT instances","authors":"J.M. Garcia-Ortegon, J. Torres-Jiménez","doi":"10.1109/ICCIMA.2001.970486","DOIUrl":"https://doi.org/10.1109/ICCIMA.2001.970486","url":null,"abstract":"We present a set of data visualization tools that could lead to a better understanding of the conditions that make of a SAT instance a hard one. The visualization techniques included in this work are used to make evident the relationships between the SAT-variables in function of their distribution, signs and combinations of signs in the clauses. Using this information, the user may identify patterns associated with the hardness of a SAT instance which gives a more flexible measurement of the instance's hardness than just the relationship between the number of clauses (M) and the number of variables (N). It could be even possible to use the developed data visualization tools to determine if a specific solution method is best suitable for a specific SAT instance.","PeriodicalId":232504,"journal":{"name":"Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001","volume":"234 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114257825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings Fourth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2001
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1