首页 > 最新文献

2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)最新文献

英文 中文
Pseudo- Randomization in Automating Robot Behaviour during Human-Robot Interaction 人机交互过程中自动化机器人行为的伪随机化
S. Paplu, Chinmaya Mishra, K. Berns
Automating robot behavior in a specific situation is an active area of research. There are several approaches available in the literature of robotics to cater for the automatic behavior of a robot. However, when it comes to humanoids or human-robot interaction in general, the area has been less explored. In this paper, a pseudo-randomization approach has been introduced to automatize the gestures and facial expressions of an interactive humanoid robot called ROBIN based on its mental state. A significant number of gestures and facial expressions have been implemented to allow the robot more options to perform a relevant action or reaction based on visual stimuli. There is a display of noticeable differences in the behaviour of the robot for the same stimuli perceived from an interaction partner. This slight autonomous behavioural change in the robot clearly shows a notion of automation in behaviour. The results from experimental scenarios and human-centered evaluation of the system help validate the approach.
自动化机器人在特定情况下的行为是一个活跃的研究领域。在机器人的文献中有几种方法可以满足机器人的自动行为。然而,当涉及到类人或人机交互时,这一领域的探索却很少。本文介绍了一种基于心理状态的拟随机化方法来实现交互式人形机器人ROBIN的手势和面部表情的自动化。大量的手势和面部表情已经被实现,让机器人有更多的选择来执行基于视觉刺激的相关动作或反应。对于来自互动伙伴的相同刺激,机器人的行为表现出明显的差异。机器人的这种轻微的自主行为变化清楚地表明了行为自动化的概念。实验场景和以人为中心的系统评估结果有助于验证该方法。
{"title":"Pseudo- Randomization in Automating Robot Behaviour during Human-Robot Interaction","authors":"S. Paplu, Chinmaya Mishra, K. Berns","doi":"10.1109/ICDL-EpiRob48136.2020.9278115","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278115","url":null,"abstract":"Automating robot behavior in a specific situation is an active area of research. There are several approaches available in the literature of robotics to cater for the automatic behavior of a robot. However, when it comes to humanoids or human-robot interaction in general, the area has been less explored. In this paper, a pseudo-randomization approach has been introduced to automatize the gestures and facial expressions of an interactive humanoid robot called ROBIN based on its mental state. A significant number of gestures and facial expressions have been implemented to allow the robot more options to perform a relevant action or reaction based on visual stimuli. There is a display of noticeable differences in the behaviour of the robot for the same stimuli perceived from an interaction partner. This slight autonomous behavioural change in the robot clearly shows a notion of automation in behaviour. The results from experimental scenarios and human-centered evaluation of the system help validate the approach.","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122709413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Tactile-based curiosity maximizes tactile-rich object-oriented actions even without any extrinsic rewards 基于触觉的好奇心最大化了触觉丰富的面向对象行为,即使没有任何外部奖励
Hiroki Mori, Masayuki Masuda, T. Ogata
This study proposed a hypothesis regarding the emergence of object-oriented action via tactile-based curiosity. The hypothesis is such that a curious exploration driven by tactile sensation leads tactile-rich object-oriented actions, while there are no explicit rewards or other designated intentional purposes. Experiments were with the curiosity model named the disagreement model from the reinforcement learning research field and with a simple physics robotic simulation with visual and tactile sensory information. The experimental results indicated that the tactile sensation induces object-oriented actions such as hitting and pecking by the body parts that have tactile sensors. We deduced that the hypothesis could be extended to discussions regarding the acquisition of dexterous skillful object manipulation in human development.
本研究提出了一个关于通过触觉好奇心产生面向对象行为的假设。假设是,由触觉驱动的好奇探索会导致触觉丰富的面向对象行为,而没有明确的奖励或其他指定的故意目的。实验采用强化学习研究领域的好奇心模型(即分歧模型)和具有视觉和触觉感官信息的简单物理机器人仿真。实验结果表明,触觉可以通过具有触觉传感器的身体部位诱导出击打、啄啄等面向对象的动作。我们推断,这一假设可以扩展到关于在人类发展中获得灵巧灵巧的物体操纵的讨论。
{"title":"Tactile-based curiosity maximizes tactile-rich object-oriented actions even without any extrinsic rewards","authors":"Hiroki Mori, Masayuki Masuda, T. Ogata","doi":"10.1109/ICDL-EpiRob48136.2020.9278082","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278082","url":null,"abstract":"This study proposed a hypothesis regarding the emergence of object-oriented action via tactile-based curiosity. The hypothesis is such that a curious exploration driven by tactile sensation leads tactile-rich object-oriented actions, while there are no explicit rewards or other designated intentional purposes. Experiments were with the curiosity model named the disagreement model from the reinforcement learning research field and with a simple physics robotic simulation with visual and tactile sensory information. The experimental results indicated that the tactile sensation induces object-oriented actions such as hitting and pecking by the body parts that have tactile sensors. We deduced that the hypothesis could be extended to discussions regarding the acquisition of dexterous skillful object manipulation in human development.","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134455373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Self-Calibrating Active Binocular Vision via Active Efficient Coding with Deep Autoencoders 基于深度自编码器的主动高效编码自校准主动双目视觉
Charles Wilmot, Bertram E. Shi, J. Triesch
We present a model of the self-calibration of active binocular vision comprising the simultaneous learning of visual representations, vergence, and pursuit eye movements. The model follows the principle of Active Efficient Coding (AEC), a recent extension of the classic Efficient Coding Hypothesis to active perception. In contrast to previous AEC models, the present model uses deep autoencoders to learn sensory representations. We also propose a new formulation of the intrinsic motivation signal that guides the learning of behavior. We demonstrate the performance of the model in simulations.
我们提出了一个主动双眼视觉的自校准模型,包括视觉表征,收敛和追求眼球运动的同时学习。该模型遵循主动有效编码(AEC)原则,这是经典有效编码假说对主动感知的最新扩展。与以前的AEC模型相比,本模型使用深度自编码器来学习感官表征。我们还提出了指导行为学习的内在动机信号的新公式。通过仿真验证了该模型的性能。
{"title":"Self-Calibrating Active Binocular Vision via Active Efficient Coding with Deep Autoencoders","authors":"Charles Wilmot, Bertram E. Shi, J. Triesch","doi":"10.1109/ICDL-EpiRob48136.2020.9278037","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278037","url":null,"abstract":"We present a model of the self-calibration of active binocular vision comprising the simultaneous learning of visual representations, vergence, and pursuit eye movements. The model follows the principle of Active Efficient Coding (AEC), a recent extension of the classic Efficient Coding Hypothesis to active perception. In contrast to previous AEC models, the present model uses deep autoencoders to learn sensory representations. We also propose a new formulation of the intrinsic motivation signal that guides the learning of behavior. We demonstrate the performance of the model in simulations.","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126934203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Modeling robot co-representation: state-of-the-art, open issues, and predictive learning as a possible framework 机器人共同表示建模:最先进的、开放的问题,以及作为可能框架的预测学习
M. Kirtay, Olga A. Wudarczyk, D. Pischedda, A. Kuhlen, R. A. Rahman, J. Haynes, V. Hafner
Robots are getting increasingly more present in many spheres of human life, making the need for robots that can successfully engage in natural social interactions with humans paramount. Successful human-robot interaction could be achieved more effectively if robots could act predictably and could predict the humans' actions. If robots could represent human partners and generate behaviors that are in line with the partners' expectations based on human's mental models of interdependent action, human agents would be able to apply predictive and adaptive mechanisms acquired in human interactions to interact with robots effectively. How could robots be predictable and be capable of predicting human behavior? We propose that this could be achieved by having an internal representation of both oneself and the other agent, that is by equipping the robot with the ability to co-represent. Here, co-representation refers to the representation of the partner's actions alongside one's own actions. Although co-representation constitutes an essential process for successful human social interaction, as it supports understanding of others' actions, to date co-representation processes have only scarcely been integrated into robotic platforms. We highlight the state-of-the-art findings on co-representation in social robotics, discuss current research limitations and open issues for creating computational models of co-representation in robots, and put forward the idea that predictive learning might constitute a particularly promising framework to build models of co-representing robots. Overall, in this article, we offer an integrated view of the state-of-the-art findings in robotics literature on co-representation and outline directions for future research, with the aim to boost success in building robots equipped with co-representation models fit for smooth social interactions.
机器人越来越多地出现在人类生活的许多领域,因此对能够成功地与人类进行自然社会互动的机器人的需求至关重要。如果机器人可以预测人类的行为,并且能够预测人类的行为,那么成功的人机交互就可以更有效地实现。如果机器人能够代表人类伴侣,并基于人类相互依存的心理模型产生符合伴侣期望的行为,那么人类代理将能够应用在人类互动中获得的预测和自适应机制来有效地与机器人互动。机器人如何能够被预测,并能够预测人类的行为?我们建议,这可以通过拥有自己和其他代理的内部表示来实现,也就是说,通过为机器人配备共同表示的能力。在这里,共同表征指的是将合作伙伴的行为与自己的行为一起表征。虽然共同代表是成功的人类社会互动的必要过程,因为它支持对他人行为的理解,但迄今为止,共同代表过程几乎没有集成到机器人平台中。我们强调了社交机器人中共同表征的最新研究成果,讨论了当前研究的局限性和在机器人中创建共同表征计算模型的开放问题,并提出了预测学习可能构成一个特别有前途的框架来构建共同表征机器人模型的想法。总的来说,在这篇文章中,我们提供了一个关于共同表示的机器人文献中最先进的发现的综合观点,并概述了未来研究的方向,目的是促进成功构建具有适合顺利社会互动的共同表示模型的机器人。
{"title":"Modeling robot co-representation: state-of-the-art, open issues, and predictive learning as a possible framework","authors":"M. Kirtay, Olga A. Wudarczyk, D. Pischedda, A. Kuhlen, R. A. Rahman, J. Haynes, V. Hafner","doi":"10.1109/ICDL-EpiRob48136.2020.9278031","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278031","url":null,"abstract":"Robots are getting increasingly more present in many spheres of human life, making the need for robots that can successfully engage in natural social interactions with humans paramount. Successful human-robot interaction could be achieved more effectively if robots could act predictably and could predict the humans' actions. If robots could represent human partners and generate behaviors that are in line with the partners' expectations based on human's mental models of interdependent action, human agents would be able to apply predictive and adaptive mechanisms acquired in human interactions to interact with robots effectively. How could robots be predictable and be capable of predicting human behavior? We propose that this could be achieved by having an internal representation of both oneself and the other agent, that is by equipping the robot with the ability to co-represent. Here, co-representation refers to the representation of the partner's actions alongside one's own actions. Although co-representation constitutes an essential process for successful human social interaction, as it supports understanding of others' actions, to date co-representation processes have only scarcely been integrated into robotic platforms. We highlight the state-of-the-art findings on co-representation in social robotics, discuss current research limitations and open issues for creating computational models of co-representation in robots, and put forward the idea that predictive learning might constitute a particularly promising framework to build models of co-representing robots. Overall, in this article, we offer an integrated view of the state-of-the-art findings in robotics literature on co-representation and outline directions for future research, with the aim to boost success in building robots equipped with co-representation models fit for smooth social interactions.","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117056967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Involuntary movement suppression filter for electric wheelchair with athetosis-type cerebral palsy 电动轮椅伴手足动型脑瘫的不自主运动抑制过滤器
Motoyu Katsumura, K. Yano, T. Nakao, Atsushi Hamada, Katsuhiko Torii
Individuals with cerebral palsy use electric wheelchairs due to their abnormal gait caused by paralysis and other symptoms. However, it is difficult for them to operate the wheelchair joystick because of their suddenly occurring uncontrollable, involuntary movements and the difficulty they have maintaining their posture. In this study, we developed a control system, which suppresses the effects of involuntary movement. This system is capable of controlling electric wheelchairs as intended by individuals with tension-athetosis-type cerebral palsy. We demonstrated the experiments to compare the stability of operation by normal system and the proposed system. Finally, we showed the effectiveness of the proposed system in the straight running experiment.
脑瘫患者因瘫痪等症状导致步态异常,使用电动轮椅。然而,他们很难操作轮椅操纵杆,因为他们突然发生不可控的、不自主的运动,他们很难保持姿势。在这项研究中,我们开发了一个控制系统,可以抑制不自主运动的影响。该系统能够控制电动轮椅为个人与紧张-手足动型脑瘫。通过实验比较了正常系统和所提系统的运行稳定性。最后,在直跑实验中验证了该系统的有效性。
{"title":"Involuntary movement suppression filter for electric wheelchair with athetosis-type cerebral palsy","authors":"Motoyu Katsumura, K. Yano, T. Nakao, Atsushi Hamada, Katsuhiko Torii","doi":"10.1109/ICDL-EpiRob48136.2020.9278065","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278065","url":null,"abstract":"Individuals with cerebral palsy use electric wheelchairs due to their abnormal gait caused by paralysis and other symptoms. However, it is difficult for them to operate the wheelchair joystick because of their suddenly occurring uncontrollable, involuntary movements and the difficulty they have maintaining their posture. In this study, we developed a control system, which suppresses the effects of involuntary movement. This system is capable of controlling electric wheelchairs as intended by individuals with tension-athetosis-type cerebral palsy. We demonstrated the experiments to compare the stability of operation by normal system and the proposed system. Finally, we showed the effectiveness of the proposed system in the straight running experiment.","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133876982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The Need for MORE: Need Systems as Non-Linear Multi-Objective Reinforcement Learning 需要更多:需要系统作为非线性多目标强化学习
Matthias Rolf
Both biological and artificial agents need to coordinate their behavior to suit various needs at the same time. Reconciling conflicts of different needs and contradictory interests such as self-preservation and curiosity is the central difficulty arising in the design and modelling of need and value systems. Current models of multi-objective reinforcement learning do either not provide satisfactory power to describe such conflicts, or lack the power to actually resolve them. This paper aims to promote a clear understanding of these limitations, and to overcome them with a theory-driven approach rather than ad hoc solutions. The first contribution of this paper is the development of an example that demonstrates previous approaches' limitations concisely. The second contribution is a new, non-linear objective function design, MORE, that addresses these and leads to a practical algorithm. Experiments show that standard RL methods fail to grasp the nature of the problem and ad-hoc solutions struggle to describe consistent preferences. MORE consistently learns a highly satisfactory solution that balances contradictory needs based on a consistent notion of optimality.
生物制剂和人工制剂都需要同时协调它们的行为以适应不同的需要。在需求和价值系统的设计和建模过程中,调和不同需求和相互矛盾的利益(如自我保护和好奇心)之间的冲突是最主要的困难。当前的多目标强化学习模型要么不能提供令人满意的能力来描述这些冲突,要么缺乏实际解决这些冲突的能力。本文旨在促进对这些限制的清晰理解,并通过理论驱动的方法而不是临时解决方案来克服它们。本文的第一个贡献是开发了一个示例,简明地展示了以前方法的局限性。第二个贡献是一个新的非线性目标函数设计,MORE,它解决了这些问题,并导致了一个实用的算法。实验表明,标准的强化学习方法无法把握问题的本质,临时解决方案难以描述一致的偏好。更一致地学习一个高度满意的解决方案,平衡矛盾的需求,基于一致的最优概念。
{"title":"The Need for MORE: Need Systems as Non-Linear Multi-Objective Reinforcement Learning","authors":"Matthias Rolf","doi":"10.1109/ICDL-EpiRob48136.2020.9278062","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278062","url":null,"abstract":"Both biological and artificial agents need to coordinate their behavior to suit various needs at the same time. Reconciling conflicts of different needs and contradictory interests such as self-preservation and curiosity is the central difficulty arising in the design and modelling of need and value systems. Current models of multi-objective reinforcement learning do either not provide satisfactory power to describe such conflicts, or lack the power to actually resolve them. This paper aims to promote a clear understanding of these limitations, and to overcome them with a theory-driven approach rather than ad hoc solutions. The first contribution of this paper is the development of an example that demonstrates previous approaches' limitations concisely. The second contribution is a new, non-linear objective function design, MORE, that addresses these and leads to a practical algorithm. Experiments show that standard RL methods fail to grasp the nature of the problem and ad-hoc solutions struggle to describe consistent preferences. MORE consistently learns a highly satisfactory solution that balances contradictory needs based on a consistent notion of optimality.","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128188179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
From Reward to Histone: Combining Temporal-Difference Learning and Epigenetic Inheritance for Swarm's Coevolving Decision Making 从奖励到组蛋白:结合时间差异学习和表观遗传的群体协同进化决策
F. Mukhlish, J. Page, Michael Bain
Applying intelligence to a group of simple robots known as swarm robots has become an exciting technology in assisting or replacing humans to fulfil complex, dangerous and harsh missions. However, building a strategy for a swarm to thrive in a dynamic environment is challenging because of control decentralisation and interactions between agents. The decision-making process in a robotic task commonly takes place in sequential stages. By understanding the subsequent action-reaction process, a strategy to make optimal decisions in a respective environment can be learnt. Hence, using the concept of epigenetic inheritance, novel evolutionary-learning mechanisms for a swarm will be discussed in this paper. Reinforcement evolutionary learning using epigenetic inheritance (RELEpi) is proposed in this article. This method utilizes reward, temporal difference and epigenetic inheritance to approximate optimal action and behaviour policies. The proposed method opens possibilities to combine reward-based learning and evolutionary methods as a stacked process where histone value is used rather than fitness function. The formulation consists of methylation and epigenetic mechanisms, inspired by the epigenome studies. The methylation process helps the accumulation of the reward to histone value of the gene. Epigenetic mechanisms give the ability to mate genetic information along with their histone value.
将智能应用于一组简单的机器人,即群机器人,已经成为一项令人兴奋的技术,可以帮助或取代人类完成复杂、危险和严酷的任务。然而,由于控制分散和代理之间的相互作用,为群体在动态环境中茁壮成长制定策略是具有挑战性的。机器人任务的决策过程通常是在连续的阶段进行的。通过了解随后的行动-反应过程,可以学习在各自环境中做出最佳决策的策略。因此,本文将利用表观遗传的概念,讨论新的群体进化学习机制。本文提出了一种基于表观遗传的强化进化学习方法。该方法利用奖励、时间差异和表观遗传来近似最优行动和行为策略。提出的方法打开了将基于奖励的学习和进化方法结合起来的可能性,作为一个堆叠过程,使用组蛋白值而不是适应度函数。该配方由甲基化和表观遗传机制组成,受到表观基因组研究的启发。甲基化过程有助于基因对组蛋白价值的奖励积累。表观遗传机制赋予了将遗传信息与其组蛋白价值结合在一起的能力。
{"title":"From Reward to Histone: Combining Temporal-Difference Learning and Epigenetic Inheritance for Swarm's Coevolving Decision Making","authors":"F. Mukhlish, J. Page, Michael Bain","doi":"10.1109/ICDL-EpiRob48136.2020.9278049","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278049","url":null,"abstract":"Applying intelligence to a group of simple robots known as swarm robots has become an exciting technology in assisting or replacing humans to fulfil complex, dangerous and harsh missions. However, building a strategy for a swarm to thrive in a dynamic environment is challenging because of control decentralisation and interactions between agents. The decision-making process in a robotic task commonly takes place in sequential stages. By understanding the subsequent action-reaction process, a strategy to make optimal decisions in a respective environment can be learnt. Hence, using the concept of epigenetic inheritance, novel evolutionary-learning mechanisms for a swarm will be discussed in this paper. Reinforcement evolutionary learning using epigenetic inheritance (RELEpi) is proposed in this article. This method utilizes reward, temporal difference and epigenetic inheritance to approximate optimal action and behaviour policies. The proposed method opens possibilities to combine reward-based learning and evolutionary methods as a stacked process where histone value is used rather than fitness function. The formulation consists of methylation and epigenetic mechanisms, inspired by the epigenome studies. The methylation process helps the accumulation of the reward to histone value of the gene. Epigenetic mechanisms give the ability to mate genetic information along with their histone value.","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130156473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-level representations through unconstrained sensorimotor learning 无约束感觉运动学习的高级表征
Ozgur Baran Ozturkcu, Emre Ugur, Erhan Öztop
How the sensorimotor experience of an agent can be organized into abstract symbol-like structures to enable effective planning and control is an open question. In the literature, there are many studies that start by assuming the existence of some symbols and ‘ground’ those onto continuous sensorimotor signals. There are also works that aim to facilitate the emergence of symbol-like representations by using specially designed machine learning architectures. In this paper, we investigate whether a deep reinforcement learning system that learns a dynamic task would facilitate the formation of high-level neural representations that might be considered as precursors of symbolic representation, which could be exploited by higher level neural circuits for better control and planning. The results indicate that without even explicit design to promote such representations, neural responses emerge that may serve as the basis of abstract symbol-like representations.
如何将智能体的感觉运动经验组织成抽象的类似符号的结构以实现有效的规划和控制是一个悬而未决的问题。在文献中,有许多研究首先假设一些符号的存在,并将这些符号与连续的感觉运动信号“联系”起来。还有一些作品旨在通过使用专门设计的机器学习架构来促进类似符号的表示的出现。在本文中,我们研究了学习动态任务的深度强化学习系统是否会促进高级神经表征的形成,这些表征可能被认为是符号表征的先驱,可以被更高级别的神经回路利用,以更好地控制和规划。结果表明,即使没有明确的设计来促进这种表征,神经反应也可能作为抽象符号表征的基础。
{"title":"High-level representations through unconstrained sensorimotor learning","authors":"Ozgur Baran Ozturkcu, Emre Ugur, Erhan Öztop","doi":"10.1109/ICDL-EpiRob48136.2020.9278100","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278100","url":null,"abstract":"How the sensorimotor experience of an agent can be organized into abstract symbol-like structures to enable effective planning and control is an open question. In the literature, there are many studies that start by assuming the existence of some symbols and ‘ground’ those onto continuous sensorimotor signals. There are also works that aim to facilitate the emergence of symbol-like representations by using specially designed machine learning architectures. In this paper, we investigate whether a deep reinforcement learning system that learns a dynamic task would facilitate the formation of high-level neural representations that might be considered as precursors of symbolic representation, which could be exploited by higher level neural circuits for better control and planning. The results indicate that without even explicit design to promote such representations, neural responses emerge that may serve as the basis of abstract symbol-like representations.","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114466023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Integration of Vergence, Cyclovergence, and Saccades through Active Efficient Coding 通过主动高效编码集成收敛、循环收敛和扫视
Qingpeng Zhu, J. Triesch, Bertram E. Shi
This paper describes a unified computational model for the joint development of early visual representations and the control of three types of eye movements, i.e., vergence, cyclovergence, and saccades. The model is based on the Active Efficient Coding (AEC) framework, an extension of Barlow's efficient coding hypothesis to active perception. AEC describes the joint learning of sensory encoding and behavioral control. The present work relaxes the assumptions made in our previous work by learning vergence, cyclovergence, and saccades all from random initialization. Our results also demonstrate the importance of the interaction between the learning of these eye movements in terms of learning speed and accuracy. Overall, we find that AEC provides a parsimonious framework to account for the simultaneous learning of active vision skills.
本文描述了一个统一的计算模型,用于早期视觉表征的联合发展和三种类型的眼球运动的控制,即收敛,回旋收敛和扫视。该模型基于主动有效编码(AEC)框架,将巴洛的有效编码假说扩展到主动感知。AEC描述了感觉编码和行为控制的联合学习。本工作通过学习随机初始化的收敛、循环收敛和跳变,放宽了我们以前工作中所做的假设。我们的研究结果也证明了学习这些眼动之间的相互作用在学习速度和准确性方面的重要性。总的来说,我们发现AEC提供了一个简约的框架来解释主动视觉技能的同时学习。
{"title":"Integration of Vergence, Cyclovergence, and Saccades through Active Efficient Coding","authors":"Qingpeng Zhu, J. Triesch, Bertram E. Shi","doi":"10.1109/ICDL-EpiRob48136.2020.9278126","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278126","url":null,"abstract":"This paper describes a unified computational model for the joint development of early visual representations and the control of three types of eye movements, i.e., vergence, cyclovergence, and saccades. The model is based on the Active Efficient Coding (AEC) framework, an extension of Barlow's efficient coding hypothesis to active perception. AEC describes the joint learning of sensory encoding and behavioral control. The present work relaxes the assumptions made in our previous work by learning vergence, cyclovergence, and saccades all from random initialization. Our results also demonstrate the importance of the interaction between the learning of these eye movements in terms of learning speed and accuracy. Overall, we find that AEC provides a parsimonious framework to account for the simultaneous learning of active vision skills.","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128191558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Fast Developmental Stereo-Disparity Detectors 快速发展立体视差探测器
J. A. Knoll, Van-Nam Hoang, Jacob Honer, Samuel Church, Thanh-Hai Tran, J. Weng
Traditional methods for stereo-disparity detection use explicit search between the left and right images. Although such methods are simple and intuitive for understanding, they suffer from degeneracies when the search window contains weak texture. Developmental Networks (DNs) are task-nonspecific and modality-nonspecific learning engines. Because they are general-purpose learners, they have a potential to deal with many types of degeneracies in intelligent systems. This work presents two novel mechanisms to deal with degeneracies: volume dimension and subwindow voting. While developmental stereo-disparity detection has been tested on simulated stereo images in our prior publications, it has never been tested on the real world. This paper reports our system, $3mathrm{DEye}$, which is the first to have filled this void. The algorithm, software, graphical user interface, training, performance, and update rates on CPU and GPU, respectively, on a Sony G8142 mobile phone are reported. Many deep learning methods that use error back-propagation suffer from the controversy of “post-selection” using the test set [1], to select one from many networks to report. In contrast, all randomly initialized DNs are performance-equivalent, no “post-selection” using test set. Possible future improvements for practical real-world and real-time applications are discussed.
传统的立体视差检测方法是在左右图像之间进行显式搜索。虽然这些方法简单直观,但当搜索窗口包含弱纹理时,这些方法存在退化问题。发展性网络(dn)是任务非特异性和模式非特异性的学习引擎。因为它们是通用的学习器,它们有潜力处理智能系统中的许多类型的退化。本文提出了两种处理简并性的新机制:体积维数和子窗口投票。虽然在我们之前的出版物中已经在模拟立体图像上测试了发育立体视差检测,但从未在现实世界中进行过测试。本文报告了我们的系统$3 mathm {DEye}$,这是第一个填补这个空白的系统。本文报道了索尼G8142手机的算法、软件、图形用户界面、训练、CPU和GPU的性能和更新率。许多使用误差反向传播的深度学习方法都存在“后选择”的争议,即使用测试集[1]从许多网络中选择一个进行报告。相比之下,所有随机初始化的DNs都是性能相等的,没有使用测试集的“后选择”。讨论了在现实世界和实时应用中可能的未来改进。
{"title":"Fast Developmental Stereo-Disparity Detectors","authors":"J. A. Knoll, Van-Nam Hoang, Jacob Honer, Samuel Church, Thanh-Hai Tran, J. Weng","doi":"10.1109/ICDL-EpiRob48136.2020.9278056","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278056","url":null,"abstract":"Traditional methods for stereo-disparity detection use explicit search between the left and right images. Although such methods are simple and intuitive for understanding, they suffer from degeneracies when the search window contains weak texture. Developmental Networks (DNs) are task-nonspecific and modality-nonspecific learning engines. Because they are general-purpose learners, they have a potential to deal with many types of degeneracies in intelligent systems. This work presents two novel mechanisms to deal with degeneracies: volume dimension and subwindow voting. While developmental stereo-disparity detection has been tested on simulated stereo images in our prior publications, it has never been tested on the real world. This paper reports our system, $3mathrm{DEye}$, which is the first to have filled this void. The algorithm, software, graphical user interface, training, performance, and update rates on CPU and GPU, respectively, on a Sony G8142 mobile phone are reported. Many deep learning methods that use error back-propagation suffer from the controversy of “post-selection” using the test set [1], to select one from many networks to report. In contrast, all randomly initialized DNs are performance-equivalent, no “post-selection” using test set. Possible future improvements for practical real-world and real-time applications are discussed.","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116871399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1