首页 > 最新文献

2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)最新文献

英文 中文
Development of compositional and contextual communication of robots by using the multiple timescales dynamic neural network 基于多时间尺度动态神经网络的机器人组成和上下文通信研究
Gibeom Park, J. Tani
The current paper introduces neurorobotics experiment on acquisition of complex communicative skills with human via learning. A dynamic neural network model which is characterized by its multiple timescale dynamics characteristics was utilized as a neuronal model for controlling a humanoid robot. In the experimental task, the humanoid robot was trained to generate specific sequential movement patterns as responding to various sequences of imperative gesture patterns demonstrated by the human subjects by following predefined compositional semantic rules. The experimental results showed that (1) the MTRNN can learn to extract compositional semantic rules with generalization in the higher cognitive level, (2) the MTRNN can develop further higher-order cognition capability for controlling the internal contextual processes as situated to on-going task sequences without being provided with cues for explicitly indicating task segmentation points. The analysis on the dynamic characteristics developed in the MTRNN through learning indicated that the aforementioned cognitive mechanisms were achieved by developing adequate functional hierarchy by utilizing the constraint of the multiple timescale property and the topological connectivity imposed on the network configuration.
本文介绍了通过学习习得与人类复杂交流技能的神经机器人实验。利用具有多时间尺度动力学特性的动态神经网络模型作为控制仿人机器人的神经元模型。在实验任务中,人形机器人被训练生成特定的顺序运动模式,以响应人类受试者所展示的各种命令式手势模式序列,并遵循预定义的组合语义规则。实验结果表明:(1)MTRNN可以在更高的认知层次上学习提取具有泛化特征的组合语义规则;(2)在不提供明确指示任务分割点的线索的情况下,MTRNN可以进一步发展高阶认知能力,以控制位于正在进行的任务序列的内部上下文过程。通过学习对MTRNN动态特性的分析表明,上述认知机制是通过利用多时间尺度特性的约束和网络结构的拓扑连通性来建立适当的功能层次来实现的。
{"title":"Development of compositional and contextual communication of robots by using the multiple timescales dynamic neural network","authors":"Gibeom Park, J. Tani","doi":"10.1109/DEVLRN.2015.7346137","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346137","url":null,"abstract":"The current paper introduces neurorobotics experiment on acquisition of complex communicative skills with human via learning. A dynamic neural network model which is characterized by its multiple timescale dynamics characteristics was utilized as a neuronal model for controlling a humanoid robot. In the experimental task, the humanoid robot was trained to generate specific sequential movement patterns as responding to various sequences of imperative gesture patterns demonstrated by the human subjects by following predefined compositional semantic rules. The experimental results showed that (1) the MTRNN can learn to extract compositional semantic rules with generalization in the higher cognitive level, (2) the MTRNN can develop further higher-order cognition capability for controlling the internal contextual processes as situated to on-going task sequences without being provided with cues for explicitly indicating task segmentation points. The analysis on the dynamic characteristics developed in the MTRNN through learning indicated that the aforementioned cognitive mechanisms were achieved by developing adequate functional hierarchy by utilizing the constraint of the multiple timescale property and the topological connectivity imposed on the network configuration.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128485470","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Active learning strategies and active control of complexity growth in naming games 命名游戏中主动学习策略与复杂性增长的主动控制
William Schueller, Pierre-Yves Oudeyer
Naming Games are models of the dynamic formation of lexical conventions in populations of agents. In this work we introduce new Naming Game strategies, using developmental and active learning mechanisms to control the growth of complexity. An information theoretical measure to compare those strategies is introduced, and used to study their impact on the dynamics of the Naming Game.
命名博弈是智能体群体中词汇约定动态形成的模型。在这项工作中,我们引入了新的命名游戏策略,使用发展性和主动学习机制来控制复杂性的增长。引入了一种信息理论度量来比较这些策略,并用于研究它们对命名博弈动态的影响。
{"title":"Active learning strategies and active control of complexity growth in naming games","authors":"William Schueller, Pierre-Yves Oudeyer","doi":"10.1109/DEVLRN.2015.7346144","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346144","url":null,"abstract":"Naming Games are models of the dynamic formation of lexical conventions in populations of agents. In this work we introduce new Naming Game strategies, using developmental and active learning mechanisms to control the growth of complexity. An information theoretical measure to compare those strategies is introduced, and used to study their impact on the dynamics of the Naming Game.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126233838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Biologically inspired incremental learning for high-dimensional spaces 生物学启发的高维空间增量学习
A. Gepperth, Thomas Hecht, Mathieu Lefort, Ursula Körner
We propose an incremental, highly parallelizable, and constant-time complexity neural learning architecture for multi-class classification (and regression) problems that remains resource-efficient even when the number of input dimensions is very high (≥ 1000). This so-called projection-prediction (PRO-PRE) architecture is strongly inspired by biological information processing in that it uses a prototype-based, topologically organized hidden layer that updates hidden layer weights whenever an error occurs. The employed self-organizing map (SOM) learning adapts only the weights of localized neural sub-populations that are similar to the input, which explicitly avoids the catastrophic forgetting effect of MLPs in case new input statistics are presented. The readout layer applies linear regression to hidden layer activities subjected to a transfer function, making the whole system capable of representing strongly non-linear decision boundaries. The resource-efficiency of the algorithm stems from approximating similarity in the input space by proximity in the SOM layer due to the topological SOM projection property. This avoids the storage of inter-cluster distances (quadratic in number of hidden layer elements) or input space covariance matrices (quadratic in input dimensions) as other incremental algorithms typically do. Tests on the popular MNIST handwritten digit benchmark show that the algorithm compares favorably to state-of-the-art results, and parallelizability is demonstrated by analyzing the efficiency of a parallel GPU implementation of the architecture.
我们提出了一种增量的、高度并行的、恒定时间复杂度的神经学习架构,用于多类分类(和回归)问题,即使在输入维数非常高(≥1000)的情况下,也能保持资源效率。这种所谓的投影预测(PRO-PRE)架构受到生物信息处理的强烈启发,因为它使用基于原型的拓扑组织的隐藏层,每当发生错误时更新隐藏层的权重。所采用的自组织映射(SOM)学习只适应与输入相似的局部神经亚群的权重,这明显避免了mlp在出现新输入统计时的灾难性遗忘效应。读出层将线性回归应用于受传递函数影响的隐藏层活动,使整个系统能够表示强非线性决策边界。由于拓扑SOM的投影特性,该算法的资源效率源于通过SOM层的接近性来近似输入空间中的相似性。这避免了存储簇间距离(隐藏层元素数量为二次元)或输入空间协方差矩阵(输入维数为二次元),而其他增量算法通常会这样做。在流行的MNIST手写数字基准测试上的测试表明,该算法优于最先进的结果,并且通过分析该架构的并行GPU实现的效率来证明并行性。
{"title":"Biologically inspired incremental learning for high-dimensional spaces","authors":"A. Gepperth, Thomas Hecht, Mathieu Lefort, Ursula Körner","doi":"10.1109/DEVLRN.2015.7346155","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346155","url":null,"abstract":"We propose an incremental, highly parallelizable, and constant-time complexity neural learning architecture for multi-class classification (and regression) problems that remains resource-efficient even when the number of input dimensions is very high (≥ 1000). This so-called projection-prediction (PRO-PRE) architecture is strongly inspired by biological information processing in that it uses a prototype-based, topologically organized hidden layer that updates hidden layer weights whenever an error occurs. The employed self-organizing map (SOM) learning adapts only the weights of localized neural sub-populations that are similar to the input, which explicitly avoids the catastrophic forgetting effect of MLPs in case new input statistics are presented. The readout layer applies linear regression to hidden layer activities subjected to a transfer function, making the whole system capable of representing strongly non-linear decision boundaries. The resource-efficiency of the algorithm stems from approximating similarity in the input space by proximity in the SOM layer due to the topological SOM projection property. This avoids the storage of inter-cluster distances (quadratic in number of hidden layer elements) or input space covariance matrices (quadratic in input dimensions) as other incremental algorithms typically do. Tests on the popular MNIST handwritten digit benchmark show that the algorithm compares favorably to state-of-the-art results, and parallelizability is demonstrated by analyzing the efficiency of a parallel GPU implementation of the architecture.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122816628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Diversity-driven selection of exploration strategies in multi-armed bandits 多武装盗匪的多样性驱动勘探策略选择
Fabien C. Y. Benureau, Pierre-Yves Oudeyer
We consider a scenario where an agent has multiple available strategies to explore an unknown environment. For each new interaction with the environment, the agent must select which exploration strategy to use. We provide a new strategy-agnostic method that treat the situation as a Multi-Armed Bandits problem where the reward signal is the diversity of effects that each strategy produces. We test the method empirically on a simulated planar robotic arm, and establish that the method is both able discriminate between strategies of dissimilar quality, even when the differences are tenuous, and that the resulting performance is competitive with the best fixed mixture of strategies.
我们考虑这样一个场景:智能体有多种可用策略来探索未知环境。对于与环境的每次新交互,智能体必须选择使用哪种探索策略。我们提供了一种新的策略不可知论方法,该方法将情况视为多武装强盗问题,其中奖励信号是每种策略产生的效果的多样性。我们在一个模拟的平面机械臂上对该方法进行了实证测试,并证明该方法既能够区分不同质量的策略,即使差异很小,而且最终的性能与最佳固定策略混合具有竞争力。
{"title":"Diversity-driven selection of exploration strategies in multi-armed bandits","authors":"Fabien C. Y. Benureau, Pierre-Yves Oudeyer","doi":"10.1109/devlrn.2015.7346130","DOIUrl":"https://doi.org/10.1109/devlrn.2015.7346130","url":null,"abstract":"We consider a scenario where an agent has multiple available strategies to explore an unknown environment. For each new interaction with the environment, the agent must select which exploration strategy to use. We provide a new strategy-agnostic method that treat the situation as a Multi-Armed Bandits problem where the reward signal is the diversity of effects that each strategy produces. We test the method empirically on a simulated planar robotic arm, and establish that the method is both able discriminate between strategies of dissimilar quality, even when the differences are tenuous, and that the resulting performance is competitive with the best fixed mixture of strategies.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117323250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Incremental grounded language learning in robot-robot interactions — Examples from spatial language 机器人-机器人互动中的增量基础语言学习-来自空间语言的例子
Michael Spranger
This paper reports on models of the grounded co-acquisition of syntax and semantics of locative spatial language in developmental robots. We instantiate theories from Cognitive Linguistics and Developmental Psychology and show how a learner robot can learn to produce and interpret spatial utterances in guided-learning interactions with a tutor robot. Particular emphasis is put on the role of the tutor. Our experiments show that the learner rapidly becomes successful in communication given the right tutoring strategy and learning operators.
本文报道了发育型机器人定位空间语言句法和语义的基础协同习得模型。我们实例化了认知语言学和发展心理学的理论,并展示了学习机器人如何在与辅导机器人的引导学习互动中学习产生和解释空间话语。特别强调导师的作用。我们的实验表明,在正确的辅导策略和学习操作的指导下,学习者在沟通方面迅速取得成功。
{"title":"Incremental grounded language learning in robot-robot interactions — Examples from spatial language","authors":"Michael Spranger","doi":"10.1109/DEVLRN.2015.7346140","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346140","url":null,"abstract":"This paper reports on models of the grounded co-acquisition of syntax and semantics of locative spatial language in developmental robots. We instantiate theories from Cognitive Linguistics and Developmental Psychology and show how a learner robot can learn to produce and interpret spatial utterances in guided-learning interactions with a tutor robot. Particular emphasis is put on the role of the tutor. Our experiments show that the learner rapidly becomes successful in communication given the right tutoring strategy and learning operators.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115005764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
To hear and to hold: Maternal naming and infant object exploration 听和抱:母亲的命名和婴儿的对象探索
Lucas Chang, K. D. Barbaro, G. Deák
To acquire language, infants must associate the language they hear with concurrent nonlinguistic experiences - the word-world mapping problem. Caregivers help structure the infant's environment by monitoring infants' attention and producing speech at informative times. In particular, children's learning of object names depends on their sensory experiences at times when objects are named. At 18 months, children's learning of novel words is predicted by the size of the object in their visual field when it is named [1]. However, there is not a direct relationship between infant's attention to objects in the world and speech produced by caregivers. Infant's multimodal experiences unfold in interactions with caregivers where both partners' behavior, including vocalizations, gaze, and manual activity, dynamically structure the visual and auditory scene [2,3].
为了习得语言,婴儿必须将他们听到的语言与同时发生的非语言经验联系起来——这就是单词-世界映射问题。照顾者通过监控婴儿的注意力和在信息时间说话来帮助构建婴儿的环境。特别是,儿童对物体名称的学习取决于他们在物体命名时的感官体验。在18个月时,儿童对新单词的学习可以通过物体命名时在其视野中的大小来预测[1]。然而,婴儿对世界上物体的注意力与照顾者的言语之间并没有直接的关系。婴儿的多模态体验在与照顾者的互动中展开,其中双方的行为,包括发声、凝视和手动活动,动态地构建了视觉和听觉场景[2,3]。
{"title":"To hear and to hold: Maternal naming and infant object exploration","authors":"Lucas Chang, K. D. Barbaro, G. Deák","doi":"10.1109/DEVLRN.2015.7346125","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346125","url":null,"abstract":"To acquire language, infants must associate the language they hear with concurrent nonlinguistic experiences - the word-world mapping problem. Caregivers help structure the infant's environment by monitoring infants' attention and producing speech at informative times. In particular, children's learning of object names depends on their sensory experiences at times when objects are named. At 18 months, children's learning of novel words is predicted by the size of the object in their visual field when it is named [1]. However, there is not a direct relationship between infant's attention to objects in the world and speech produced by caregivers. Infant's multimodal experiences unfold in interactions with caregivers where both partners' behavior, including vocalizations, gaze, and manual activity, dynamically structure the visual and auditory scene [2,3].","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"199 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121629854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Babybot challenge: Motor skills 婴儿挑战:运动技能
Patricia Shaw, Daniel Lewkowicz, Alexandros Giagkos, J. Law, Suresh Kumar, Mark H. Lee, Q. Shen
In 1984, von Hofsten performed a longitudinal study of early reaching in infants between the ages of 1 week and 19 weeks. This paper proposes a possible model using excitation of various subsystems to reproduce the longitudinal study. The model is then implemented and tested on an iCub humanoid robot, and the results compared to the original study. The resulting model shares interesting similarities to the data presented by von Hofsten, in particular a slight dip in the quantity of reaching. However, the dip is shifted along by a few weeks, and the analysis of hand behaviour is inconclusive based on the data recorded.
1984年,von Hofsten对1周至19周的婴儿进行了一项纵向研究。本文提出了一种利用各子系统的激励来再现纵向研究的可能模型。然后在iCub人形机器人上对该模型进行了实现和测试,并将结果与原始研究进行了比较。由此得出的模型与冯·霍夫斯滕提供的数据有一些有趣的相似之处,尤其是接触量略有下降。然而,下降幅度会随着几个星期的变化而变化,而且根据记录的数据,对手的行为的分析是不确定的。
{"title":"Babybot challenge: Motor skills","authors":"Patricia Shaw, Daniel Lewkowicz, Alexandros Giagkos, J. Law, Suresh Kumar, Mark H. Lee, Q. Shen","doi":"10.1109/DEVLRN.2015.7346114","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346114","url":null,"abstract":"In 1984, von Hofsten performed a longitudinal study of early reaching in infants between the ages of 1 week and 19 weeks. This paper proposes a possible model using excitation of various subsystems to reproduce the longitudinal study. The model is then implemented and tested on an iCub humanoid robot, and the results compared to the original study. The resulting model shares interesting similarities to the data presented by von Hofsten, in particular a slight dip in the quantity of reaching. However, the dip is shifted along by a few weeks, and the analysis of hand behaviour is inconclusive based on the data recorded.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125965149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Using spatial representations in gesture to facilitate early word learning: A priming process model 使用手势中的空间表征促进早期单词学习:启动过程模型
J. Trafton, Anthony M. Harrison, W. Lawson
As children learn to speak, they also gesture; previous empirical work has suggested that there is a direct link between the two. In this paper, we propose a priming process model that uses gesture to facilitate language. Our model uses the ACT-R/E cognitive architecture and uses a combination of repetition naming and priming from gesture spatial representations to increase the probability that a word will be remembered. Our model simulates 11 months of learning and runs on an embodied platform.
当孩子们学会说话时,他们也会做手势;先前的实证研究表明,两者之间存在直接联系。在本文中,我们提出了一个使用手势促进语言的启动过程模型。我们的模型使用ACT-R/E认知架构,并使用重复命名和从手势空间表征中启动的组合来增加单词被记住的概率。我们的模型模拟了11个月的学习,并在一个具体化的平台上运行。
{"title":"Using spatial representations in gesture to facilitate early word learning: A priming process model","authors":"J. Trafton, Anthony M. Harrison, W. Lawson","doi":"10.1109/DEVLRN.2015.7346117","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346117","url":null,"abstract":"As children learn to speak, they also gesture; previous empirical work has suggested that there is a direct link between the two. In this paper, we propose a priming process model that uses gesture to facilitate language. Our model uses the ACT-R/E cognitive architecture and uses a combination of repetition naming and priming from gesture spatial representations to increase the probability that a word will be remembered. Our model simulates 11 months of learning and runs on an embodied platform.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125993309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards hierarchical curiosity-driven exploration of sensorimotor models 对层次好奇驱动的感觉运动模型探索
Sébastien Forestier, Pierre-Yves Oudeyer
Curiosity-driven exploration mechanisms have been proposed to allow robots to actively explore high dimensional sensorimotor spaces in an open-ended manner [1], [2]. In such setups, competence-based intrinsic motivations show better results than knowledge-based exploration mechanisms which only monitor the learner's prediction performance [2], [3]. With competence-based intrinsic motivations, the learner explores its sensor space with a bias toward regions which are predicted to yield a high competence progress. Also, throughout its life, a developmental robot has to incrementally explore skills that add up to the hierarchy of previously learned skills, with a constraint being the cost of experimentation. Thus, a hierarchical exploration architecture could allow to reuse the sensorimotor models previously explored and to combine them to explore more efficiently new complex sensorimotor models. Here, we rely more specifically on the R-IAC and SAGG-RIAC series of architectures [3]. These architectures allow the learning of a single mapping between a motor and a sensor space with a competence-based intrinsic motivation. We describe some ways to extend these architectures with different tasks spaces that can be explored in a hierarchical manner, and mechanisms to handle this hierarchy of sensorimotor models that all need to be explored with an adequate amount of trials. We also describe an interactive task to evaluate the hierarchical learning mechanisms, where a robot has to explore its motor space in order to push an object to different locations. The robot can first explore how to make movements with its hand and then reuse this skill to explore the task of pushing an object.
已经提出了好奇心驱动的探索机制,允许机器人以开放式的方式主动探索高维感觉运动空间[1],[2]。在这样的设置中,基于能力的内在动机比基于知识的探索机制表现出更好的效果,后者只监控学习者的预测表现[2],[3]。在基于能力的内在动机下,学习者在探索其感知空间时,会偏向于预期能产生高能力进步的区域。此外,在其整个生命周期中,一个正在发育的机器人必须逐步探索技能,这些技能加起来会形成先前学习技能的层次结构,这是实验成本的限制。因此,分层探索架构可以重用以前探索过的感觉运动模型,并将它们结合起来更有效地探索新的复杂感觉运动模型。在这里,我们更具体地依赖于R-IAC和SAGG-RIAC系列架构[3]。这些架构允许学习基于能力的内在动机的电机和传感器空间之间的单个映射。我们描述了一些用不同的任务空间扩展这些架构的方法,这些空间可以以分层方式进行探索,以及处理这种感觉运动模型层次的机制,这些都需要通过足够数量的试验进行探索。我们还描述了一个交互式任务来评估分层学习机制,其中机器人必须探索其运动空间,以便将物体推到不同的位置。机器人可以先探索如何用手做运动,然后再利用这一技能来探索推动物体的任务。
{"title":"Towards hierarchical curiosity-driven exploration of sensorimotor models","authors":"Sébastien Forestier, Pierre-Yves Oudeyer","doi":"10.1109/DEVLRN.2015.7346146","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346146","url":null,"abstract":"Curiosity-driven exploration mechanisms have been proposed to allow robots to actively explore high dimensional sensorimotor spaces in an open-ended manner [1], [2]. In such setups, competence-based intrinsic motivations show better results than knowledge-based exploration mechanisms which only monitor the learner's prediction performance [2], [3]. With competence-based intrinsic motivations, the learner explores its sensor space with a bias toward regions which are predicted to yield a high competence progress. Also, throughout its life, a developmental robot has to incrementally explore skills that add up to the hierarchy of previously learned skills, with a constraint being the cost of experimentation. Thus, a hierarchical exploration architecture could allow to reuse the sensorimotor models previously explored and to combine them to explore more efficiently new complex sensorimotor models. Here, we rely more specifically on the R-IAC and SAGG-RIAC series of architectures [3]. These architectures allow the learning of a single mapping between a motor and a sensor space with a competence-based intrinsic motivation. We describe some ways to extend these architectures with different tasks spaces that can be explored in a hierarchical manner, and mechanisms to handle this hierarchy of sensorimotor models that all need to be explored with an adequate amount of trials. We also describe an interactive task to evaluate the hierarchical learning mechanisms, where a robot has to explore its motor space in order to push an object to different locations. The robot can first explore how to make movements with its hand and then reuse this skill to explore the task of pushing an object.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131892016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Seeing [u] aids vocal learning: Babbling and imitation of vowels using a 3D vocal tract model, reinforcement learning, and reservoir computing Seeing [u]帮助语音学习:使用3D声道模型、强化学习和存储库计算来咿呀学语和模仿元音
M. Murakami, B. Kröger, P. Birkholz, J. Triesch
We present a model of imitative vocal learning consisting of two stages. First, the infant is exposed to the ambient language and forms auditory knowledge of the speech items to be acquired. Second, the infant attempts to imitate these speech items and thereby learns to control the articulators for speech production. We model these processes using a recurrent neural network and a realistic vocal tract model. We show that vowel production can be successfully learnt by imitation. Moreover, we find that acquisition of [u] is impaired if visual information is discarded during imitation. This might give sighted infants an advantage over blind infants during vocal learning, which is in agreement with experimental evidence.
我们提出了一个由两个阶段组成的模仿声乐学习模型。首先,婴儿暴露在环境语言中,形成待习得言语项目的听觉知识。其次,婴儿试图模仿这些言语项目,从而学会控制发音器来产生言语。我们使用一个循环神经网络和一个真实的声道模型来模拟这些过程。我们证明元音的产生可以通过模仿成功地学习。此外,我们发现,如果在模仿过程中丢弃视觉信息,则对[u]的习得会受到损害。这可能使视力正常的婴儿在声乐学习方面比失明的婴儿有优势,这与实验证据是一致的。
{"title":"Seeing [u] aids vocal learning: Babbling and imitation of vowels using a 3D vocal tract model, reinforcement learning, and reservoir computing","authors":"M. Murakami, B. Kröger, P. Birkholz, J. Triesch","doi":"10.1109/DEVLRN.2015.7346142","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346142","url":null,"abstract":"We present a model of imitative vocal learning consisting of two stages. First, the infant is exposed to the ambient language and forms auditory knowledge of the speech items to be acquired. Second, the infant attempts to imitate these speech items and thereby learns to control the articulators for speech production. We model these processes using a recurrent neural network and a realistic vocal tract model. We show that vowel production can be successfully learnt by imitation. Moreover, we find that acquisition of [u] is impaired if visual information is discarded during imitation. This might give sighted infants an advantage over blind infants during vocal learning, which is in agreement with experimental evidence.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121425487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
期刊
2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1