首页 > 最新文献

2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)最新文献

英文 中文
The sequential organization of movement is critical to the development of reaching: A neural dynamics account 运动的顺序组织对伸展的发展是至关重要的:一个神经动力学的解释
Stephan K. U. Zibner, Jan Tekülve, G. Schöner
We present a neuro-dynamic model of looking, reaching, and grasping movements in infants in three pre-reaching phases. We attribute the evolution from pre-reaches to their suppression and subsequent re-emergence reported in a longitudinal study of von Hofsten [1] to the development of the sequential organization of movements, through which a set of elementary movements (visual fixation, reaching, opening the hand) are coordinated in time. The spatial precision hypothesis, which has emerged from work on spatial, visual, and action working memory, characterizes developmental changes as a change from strongly input-driven to more strongly interaction-dominated neural dynamics. Applying this hypothesis to reaching, we propose that the intention to reach is increasingly able to suppress competing movement behaviors, enabling object-oriented reaches. We evaluate three versions of the model that capture the three phases reported by von Hofsten and illustrate the properties of the movement model in simulations and in demonstration on a NAO robot.
我们提出了一个神经动力学模型的看,伸手,和抓取运动在三个前伸手阶段的婴儿。在von Hofsten[1]的纵向研究中,我们将从预伸手到其抑制和随后的重新出现的演变归因于连续运动组织的发展,通过该发展,一系列基本运动(视觉注视、伸手、张开手)在时间上协调一致。空间精度假说来源于对空间、视觉和动作工作记忆的研究,它将发育变化描述为从强烈的输入驱动到更强烈的相互作用主导的神经动力学的变化。将这一假设应用于到达,我们提出到达的意图越来越能够抑制竞争运动行为,从而实现面向对象的到达。我们评估了捕获von Hofsten报告的三个阶段的模型的三个版本,并在NAO机器人的仿真和演示中说明了运动模型的特性。
{"title":"The sequential organization of movement is critical to the development of reaching: A neural dynamics account","authors":"Stephan K. U. Zibner, Jan Tekülve, G. Schöner","doi":"10.1109/DEVLRN.2015.7346113","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346113","url":null,"abstract":"We present a neuro-dynamic model of looking, reaching, and grasping movements in infants in three pre-reaching phases. We attribute the evolution from pre-reaches to their suppression and subsequent re-emergence reported in a longitudinal study of von Hofsten [1] to the development of the sequential organization of movements, through which a set of elementary movements (visual fixation, reaching, opening the hand) are coordinated in time. The spatial precision hypothesis, which has emerged from work on spatial, visual, and action working memory, characterizes developmental changes as a change from strongly input-driven to more strongly interaction-dominated neural dynamics. Applying this hypothesis to reaching, we propose that the intention to reach is increasingly able to suppress competing movement behaviors, enabling object-oriented reaches. We evaluate three versions of the model that capture the three phases reported by von Hofsten and illustrate the properties of the movement model in simulations and in demonstration on a NAO robot.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114135066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Active maintenance of binocular correspondence leads to orientation alignment of visual receptive fields 双眼对应的主动维持导致视觉感受野的定向对齐
T. Chandrapala, Bertram E. Shi, J. Triesch
Neural development in the visual cortex depends on the visual experience during the so-called critical period. Recent experiments have shown that under normal conditions rodents develop binocular receptive fields which have similar orientation preferences for the left and right eyes. In contrast, under conditions of monocular deprivation during the critical period, this orientation alignment does not happen. Here we propose a computational model to explain the process of orientation alignment, its underlying mechanisms, and its failure in case of monocular deprivation or uncorrelated binocular inputs. Our model is based on the recently proposed Active Efficient Coding framework that jointly develops eye movement control and sensory representations. Our model suggests that the active maintenance of a binocular visual field, which leads to correlated visual inputs from the two eyes, is essential for the process of orientation alignment. This behavior is analogous to vergence control in primates. However, due to the fact that rodents have large receptive fields with low spatial frequency tuning, the coordination of the eyes need not be very precise. The model also suggests that it is not necessary that coordinated binocular vision be maintained continuously in order for orientation alignment to develop.
在所谓的关键时期,视觉皮层的神经发育取决于视觉经验。最近的实验表明,在正常情况下,啮齿类动物的双眼接受区具有相似的左眼和右眼取向偏好。相反,在关键时期的单眼剥夺条件下,这种定向对齐不会发生。在这里,我们提出了一个计算模型来解释定向对准的过程,其潜在的机制,以及在单眼剥夺或不相关双目输入的情况下定向对准的失败。我们的模型是基于最近提出的主动高效编码框架,共同开发眼动控制和感觉表征。我们的模型表明,双眼视野的主动维持,导致来自两只眼睛的相关视觉输入,对于定向对齐过程至关重要。这种行为类似于灵长类动物的收敛控制。然而,由于啮齿类动物具有大的接受野和低空间频率调谐,因此眼睛的协调不需要非常精确。该模型还表明,定向对准的发展并不需要持续保持双目协调视觉。
{"title":"Active maintenance of binocular correspondence leads to orientation alignment of visual receptive fields","authors":"T. Chandrapala, Bertram E. Shi, J. Triesch","doi":"10.1109/DEVLRN.2015.7346122","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346122","url":null,"abstract":"Neural development in the visual cortex depends on the visual experience during the so-called critical period. Recent experiments have shown that under normal conditions rodents develop binocular receptive fields which have similar orientation preferences for the left and right eyes. In contrast, under conditions of monocular deprivation during the critical period, this orientation alignment does not happen. Here we propose a computational model to explain the process of orientation alignment, its underlying mechanisms, and its failure in case of monocular deprivation or uncorrelated binocular inputs. Our model is based on the recently proposed Active Efficient Coding framework that jointly develops eye movement control and sensory representations. Our model suggests that the active maintenance of a binocular visual field, which leads to correlated visual inputs from the two eyes, is essential for the process of orientation alignment. This behavior is analogous to vergence control in primates. However, due to the fact that rodents have large receptive fields with low spatial frequency tuning, the coordination of the eyes need not be very precise. The model also suggests that it is not necessary that coordinated binocular vision be maintained continuously in order for orientation alignment to develop.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"121 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123054292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Do infants' gaze sequences predict their looking time? Testing the sequential-learnability model 婴儿的凝视序列能预测他们的注视时间吗?测试顺序学习性模型
M. Schlesinger, Scott P. Johnson, Dima Amso
We recently demonstrated that the gaze sequences produced by infants during an habituation event predict their looking times to both the habituation and (one of two) posthabituation test events [1]. Specifically, we trained a simple recurrent network (SRN) to predict infants' habituation gaze sequences. Sequences that were easier for the SRN to learn were associated with shorter looking times at the end of habituation, as well as longer looking times to one of two posthabituation test events. In the current study, we extended these findings by applying the sequential-learnability model to a new set of looking-time data, in which an important visual cue was removed from the habituation and test events. Following our previous work, we predicted that “learnability” of infants' habituation gaze sequences would predict their habituation looking time. However, unlike the previous study, we also predicted that habituation gaze sequences would not predict looking time to either of the posthabituation test events. The results were consistent with both of these predictions.
我们最近证明,婴儿在习惯化事件中产生的凝视序列预测了他们对习惯化和(两个中的一个)习惯化后测试事件的注视时间[1]。具体来说,我们训练了一个简单的循环网络(SRN)来预测婴儿的习惯注视序列。对于SRN来说,更容易学习的序列与习惯化结束时更短的观察时间有关,同时与两个适应后测试事件之一的更长时间有关。在当前的研究中,我们将顺序可学习性模型应用于一组新的观看时间数据,其中从习惯化和测试事件中删除了重要的视觉线索,从而扩展了这些发现。根据我们之前的研究,我们预测婴儿习惯凝视序列的“可学习性”将预测他们的习惯观看时间。然而,与之前的研究不同的是,我们还预测,习惯化凝视序列不能预测对任何一种情境后测试事件的注视时间。结果与这两种预测一致。
{"title":"Do infants' gaze sequences predict their looking time? Testing the sequential-learnability model","authors":"M. Schlesinger, Scott P. Johnson, Dima Amso","doi":"10.1109/DEVLRN.2015.7346135","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346135","url":null,"abstract":"We recently demonstrated that the gaze sequences produced by infants during an habituation event predict their looking times to both the habituation and (one of two) posthabituation test events [1]. Specifically, we trained a simple recurrent network (SRN) to predict infants' habituation gaze sequences. Sequences that were easier for the SRN to learn were associated with shorter looking times at the end of habituation, as well as longer looking times to one of two posthabituation test events. In the current study, we extended these findings by applying the sequential-learnability model to a new set of looking-time data, in which an important visual cue was removed from the habituation and test events. Following our previous work, we predicted that “learnability” of infants' habituation gaze sequences would predict their habituation looking time. However, unlike the previous study, we also predicted that habituation gaze sequences would not predict looking time to either of the posthabituation test events. The results were consistent with both of these predictions.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123348146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Which criteria for autonomously shifting between goal-directed and habitual behaviors in robots? 机器人在目标导向行为和习惯行为之间自主转换的标准是什么?
Erwan Renaudo, Benoît Girard, R. Chatila, M. Khamassi
Research in the fields of Psychology and Neuroscience have provided strong evidence that mammals can adaptively switch between goal-directed behaviors - i.e. deliberative decisions based on costly but flexible planned long-term consequences of actions - and habitual behaviors - i.e. reactive behaviors that are efficient when the environment is stable but inflexible in the case of environmental changes. However, the computational principles underlying this switching ability are not yet understood, and several alternative criteria have been proposed, each tested on specific subsets of experimental datasets. Here we present a neurorobotic implementation and comparison of such type of criteria, plus some new ones imported from the field of ensemble reinforcement learning, with a twofold objective: on the one hand exploring the possible efficiency of such bio-inspired principles to enable robots to have more behavioral flexibility during autonomous development and learning; on the other hand, analyzing whether an asynchronous continuous robotic simulation and comparison of these criteria in a common task can feed current debates in the Psychological and Neuroscience fields. We evaluate these methods in an apparently simple repetitive cube-pushing task on a simulated conveyor belt, but which imposes to the robot constant trade-offs between speed and accuracy and between stability and abrupt changes. Our results show that if overall performance is not improved by using multiple behavioral systems in a stable environment, these methods allow for a better adaptation to environmental changes. The Voting methods and Boltzmann addition, from ensemble reinforcement learning, give the best performance, providing an interesting alternative to Expert selection.
心理学和神经科学领域的研究提供了强有力的证据,表明哺乳动物可以在目标导向的行为(即基于代价高昂但灵活的行动长期后果的审慎决策)和习惯行为(即在环境稳定时有效的反应性行为,但在环境变化的情况下缺乏灵活性)之间进行适应性转换。然而,这种切换能力背后的计算原理尚未被理解,并且已经提出了几个替代标准,每个标准都在实验数据集的特定子集上进行了测试。在这里,我们提出了这类标准的神经机器人实现和比较,以及从集成强化学习领域引入的一些新标准,具有双重目标:一方面,探索这种仿生原理的可能效率,使机器人在自主发展和学习过程中具有更多的行为灵活性;另一方面,分析在一个共同的任务中异步连续机器人模拟和这些标准的比较是否可以为当前心理学和神经科学领域的争论提供信息。我们在模拟传送带上的一个看似简单的重复立方体推动任务中评估了这些方法,但这对机器人施加了速度和精度之间以及稳定性和突变之间的不断权衡。我们的研究结果表明,如果在稳定的环境中使用多种行为系统不能提高整体性能,那么这些方法可以更好地适应环境变化。来自集成强化学习的投票方法和玻尔兹曼加法给出了最好的性能,为专家选择提供了一个有趣的选择。
{"title":"Which criteria for autonomously shifting between goal-directed and habitual behaviors in robots?","authors":"Erwan Renaudo, Benoît Girard, R. Chatila, M. Khamassi","doi":"10.1109/DEVLRN.2015.7346152","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346152","url":null,"abstract":"Research in the fields of Psychology and Neuroscience have provided strong evidence that mammals can adaptively switch between goal-directed behaviors - i.e. deliberative decisions based on costly but flexible planned long-term consequences of actions - and habitual behaviors - i.e. reactive behaviors that are efficient when the environment is stable but inflexible in the case of environmental changes. However, the computational principles underlying this switching ability are not yet understood, and several alternative criteria have been proposed, each tested on specific subsets of experimental datasets. Here we present a neurorobotic implementation and comparison of such type of criteria, plus some new ones imported from the field of ensemble reinforcement learning, with a twofold objective: on the one hand exploring the possible efficiency of such bio-inspired principles to enable robots to have more behavioral flexibility during autonomous development and learning; on the other hand, analyzing whether an asynchronous continuous robotic simulation and comparison of these criteria in a common task can feed current debates in the Psychological and Neuroscience fields. We evaluate these methods in an apparently simple repetitive cube-pushing task on a simulated conveyor belt, but which imposes to the robot constant trade-offs between speed and accuracy and between stability and abrupt changes. Our results show that if overall performance is not improved by using multiple behavioral systems in a stable environment, these methods allow for a better adaptation to environmental changes. The Voting methods and Boltzmann addition, from ensemble reinforcement learning, give the best performance, providing an interesting alternative to Expert selection.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"15 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131133233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Goal-directed learning of hand-eye coordination in a humanoid robot 仿人机器人手眼协调的目标导向学习
Mathias Schmerling, G. Schillaci, V. Hafner
Visuo-motor coordination is known to be highly important for the development of a broader range of cognitive and motor skills in human infants and can thus be considered one of the key skills for robots to master. In this paper, we investigate how a recent concept in developmental robotics, referred to as goal babbling, relates to a visuo-motor coordination task in the humanoid robot Aldebaran Nao that requires coordinated control of two subsystems of motors, namely head and arm motors. The idea of goal babbling builds on findings in developmental psychology showing that human infants attempt goal-directed movements early on in their development enabling them to rapidly and efficiently bootstrap their motor system. Goal babbling has been shown to be superior to the classical idea of random motor babbling for the learning of body kinematics in robotic systems, in particular for systems with many degrees of freedom. Our results not only support the utility of goal babbling for the acquisition of visuo-motor coordination skills but also suggest that goal babbling is particularly effective in the case where two separate motor sub-systems, head and arm, need to be coordinated.
众所周知,视觉运动协调对于人类婴儿更广泛的认知和运动技能的发展非常重要,因此可以被认为是机器人需要掌握的关键技能之一。在本文中,我们研究了发展机器人中的一个新概念,即目标咿呀学语,如何与人形机器人Aldebaran Nao的视觉-运动协调任务相关,该任务需要协调控制两个电机子系统,即头部和手臂电机。目标咿呀学语的想法建立在发展心理学的发现之上,这些发现表明,人类婴儿在发育的早期就尝试目标导向的运动,使他们能够快速有效地引导他们的运动系统。在机器人系统中,特别是在多自由度系统中,目标牙牙学已被证明优于经典的随机运动牙牙学思想。我们的研究结果不仅支持了目标咿呀学语对视觉运动协调技能习得的作用,而且表明目标咿呀学语在需要协调头部和手臂这两个独立的运动子系统的情况下特别有效。
{"title":"Goal-directed learning of hand-eye coordination in a humanoid robot","authors":"Mathias Schmerling, G. Schillaci, V. Hafner","doi":"10.1109/DEVLRN.2015.7346136","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346136","url":null,"abstract":"Visuo-motor coordination is known to be highly important for the development of a broader range of cognitive and motor skills in human infants and can thus be considered one of the key skills for robots to master. In this paper, we investigate how a recent concept in developmental robotics, referred to as goal babbling, relates to a visuo-motor coordination task in the humanoid robot Aldebaran Nao that requires coordinated control of two subsystems of motors, namely head and arm motors. The idea of goal babbling builds on findings in developmental psychology showing that human infants attempt goal-directed movements early on in their development enabling them to rapidly and efficiently bootstrap their motor system. Goal babbling has been shown to be superior to the classical idea of random motor babbling for the learning of body kinematics in robotic systems, in particular for systems with many degrees of freedom. Our results not only support the utility of goal babbling for the acquisition of visuo-motor coordination skills but also suggest that goal babbling is particularly effective in the case where two separate motor sub-systems, head and arm, need to be coordinated.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133174419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A learning model for essentialist concepts 本质主义概念的学习模型
Iris Oved, Shaun Nichols, D. Barner
Many cognitive scientists take it for granted that concepts like CAT (mental terms that are expressed with single nouns) can be learned by observing a co-occurrence in superficial properties, such as having fur, being 4-legged, and tending to purr, and then building a complex category representation from representations for those superficial properties. A less popular account, known as Psychological Essentialism, claims that concepts like CAT pick out deep, hidden properties (essences) that are causal explanations for observable co-occurrences in superficial properties. The trouble is, Psychological Essentialism lacks an account of how such essentialist concepts could be learned, and often adopt the unpalatable conclusion that such concepts are innate. Developmental roboticists have recently started implementing systems that employ learned hidden/latent variables. The present paper spells out a learning theory for essentialist concepts, and presents two psychology experiments that help support the account over the associationist alternative.
许多认知科学家理所当然地认为,像CAT(用单个名词表达的心理术语)这样的概念可以通过观察表面特征(如有毛、四条腿、倾向于咕噜叫)的共发生来学习,然后从这些表面特征的表征中构建一个复杂的类别表征。一个不太受欢迎的说法,被称为心理本质主义,声称像CAT这样的概念挑选出深层的、隐藏的属性(本质),这些属性是对表面属性中可观察到的共同现象的因果解释。问题是,心理本质主义缺乏对这些本质主义概念是如何习得的解释,并经常得出令人不快的结论,即这些概念是天生的。发展型机器人专家最近开始实施使用学习到的隐藏/潜在变量的系统。本文阐述了本质主义概念的学习理论,并提出了两个心理学实验,以帮助支持该理论而不是联想理论。
{"title":"A learning model for essentialist concepts","authors":"Iris Oved, Shaun Nichols, D. Barner","doi":"10.1109/DEVLRN.2015.7346121","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346121","url":null,"abstract":"Many cognitive scientists take it for granted that concepts like CAT (mental terms that are expressed with single nouns) can be learned by observing a co-occurrence in superficial properties, such as having fur, being 4-legged, and tending to purr, and then building a complex category representation from representations for those superficial properties. A less popular account, known as Psychological Essentialism, claims that concepts like CAT pick out deep, hidden properties (essences) that are causal explanations for observable co-occurrences in superficial properties. The trouble is, Psychological Essentialism lacks an account of how such essentialist concepts could be learned, and often adopt the unpalatable conclusion that such concepts are innate. Developmental roboticists have recently started implementing systems that employ learned hidden/latent variables. The present paper spells out a learning theory for essentialist concepts, and presents two psychology experiments that help support the account over the associationist alternative.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115212977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforcement learning and instance-based learning approaches to modeling human decision making in a prognostic foraging task 预测觅食任务中人类决策建模的强化学习和基于实例的学习方法
Suhas E. Chelian, Jaehyon Paik, P. Pirolli, C. Lebiere, Rajan Bhattacharyya
Procedural memory and episodic memory are known to be distinct and both underlie the performance of many tasks. Reinforcement learning (RL) and instance-based learning (IBL) represent common approaches to modeling procedural and episodic memory in that order. In this work, we present a neural model utilizing RL dynamics and an ACT-R model utilizing IBL productions to the task of modeling human decision making in a prognostic foraging task. The task performed was derived from a geospatial intelligence domain wherein agents must choose among information sources to more accurately predict the actions of an adversary. Results from both models are compared to human data and suggest that information gain is an important component in modeling decision-making behavior using either memory system; with respect to the episodic memory approach, the procedural memory approach has a small but significant advantage in fitting human data. Finally, we discuss the interactions of multi-memory systems in complex decision-making tasks.
程序记忆和情景记忆是截然不同的,它们都是许多任务表现的基础。强化学习(RL)和基于实例的学习(IBL)是程序记忆和情景记忆建模的常用方法。在这项工作中,我们提出了一个利用RL动力学的神经模型和一个利用IBL产品的ACT-R模型来模拟人类在预测觅食任务中的决策。执行的任务来自地理空间情报领域,其中代理必须在信息源中进行选择,以更准确地预测对手的行动。将这两种模型的结果与人类数据进行了比较,并表明信息增益是使用任何一种记忆系统建模决策行为的重要组成部分;相对于情景记忆方法,程序记忆方法在拟合人类数据方面有一个小而显著的优势。最后,我们讨论了多记忆系统在复杂决策任务中的相互作用。
{"title":"Reinforcement learning and instance-based learning approaches to modeling human decision making in a prognostic foraging task","authors":"Suhas E. Chelian, Jaehyon Paik, P. Pirolli, C. Lebiere, Rajan Bhattacharyya","doi":"10.1109/DEVLRN.2015.7346127","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346127","url":null,"abstract":"Procedural memory and episodic memory are known to be distinct and both underlie the performance of many tasks. Reinforcement learning (RL) and instance-based learning (IBL) represent common approaches to modeling procedural and episodic memory in that order. In this work, we present a neural model utilizing RL dynamics and an ACT-R model utilizing IBL productions to the task of modeling human decision making in a prognostic foraging task. The task performed was derived from a geospatial intelligence domain wherein agents must choose among information sources to more accurately predict the actions of an adversary. Results from both models are compared to human data and suggest that information gain is an important component in modeling decision-making behavior using either memory system; with respect to the episodic memory approach, the procedural memory approach has a small but significant advantage in fitting human data. Finally, we discuss the interactions of multi-memory systems in complex decision-making tasks.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123920047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Autonomous development of turn-taking behaviors in agent populations: A computational study 智能体群体中轮流行为的自主发展:一个计算研究
Clément Moulin-Frier, Martí Sánchez-Fibla, P. Verschure
We provide a computational model showing how turn-taking behaviors can self-organize out of sensorimotor interactions between vocalizing agents. Recent hypotheses propose that turn-taking behaviors in certain primate species emerge from a need to maintain vocal contact in a group (e.g. in dense environments preventing visual contact). In this context, vocalizations can convey information about the presence of each group member and taking turns allow to minimize the vocal signal interferences. We consider agents equipped with a cognitive architecture based on two coupled control loops: a reactive one implementing a basic regulatory behavior to maintain vocal listening and an adaptive one learning an action policy to maximize vocal contact among group members. We show that the reactive process bootstraps the adaptive learning to converge toward a collective turn-taking strategy. This model provides a computational support to the hypothesis that turn-taking can emerge from functional constraints related to group cohesion and inter-individual vocal signal interferences. We suggest future directions of research to understand how social behaviors can result from sensorimotor interactions.
我们提供了一个计算模型,展示了发声代理之间的感觉运动相互作用如何自我组织轮流行为。最近的假设提出,某些灵长类物种的轮流行为源于在群体中保持声音接触的需要(例如,在密集的环境中防止视觉接触)。在这种情况下,发声可以传达关于每个群体成员存在的信息,并且轮流允许将声音信号干扰降到最低。我们认为智能体具有基于两个耦合控制回路的认知架构:一个反应性控制回路实现基本的调节行为以维持声音倾听,一个适应性控制回路学习行动策略以最大限度地提高群体成员之间的声音接触。我们表明,反应过程引导自适应学习向集体轮流策略收敛。该模型为轮替可以从与群体凝聚力和个体间声音信号干扰相关的功能约束中出现的假设提供了计算支持。我们建议未来的研究方向,以了解社会行为是如何从感觉运动相互作用中产生的。
{"title":"Autonomous development of turn-taking behaviors in agent populations: A computational study","authors":"Clément Moulin-Frier, Martí Sánchez-Fibla, P. Verschure","doi":"10.1109/DEVLRN.2015.7346139","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346139","url":null,"abstract":"We provide a computational model showing how turn-taking behaviors can self-organize out of sensorimotor interactions between vocalizing agents. Recent hypotheses propose that turn-taking behaviors in certain primate species emerge from a need to maintain vocal contact in a group (e.g. in dense environments preventing visual contact). In this context, vocalizations can convey information about the presence of each group member and taking turns allow to minimize the vocal signal interferences. We consider agents equipped with a cognitive architecture based on two coupled control loops: a reactive one implementing a basic regulatory behavior to maintain vocal listening and an adaptive one learning an action policy to maximize vocal contact among group members. We show that the reactive process bootstraps the adaptive learning to converge toward a collective turn-taking strategy. This model provides a computational support to the hypothesis that turn-taking can emerge from functional constraints related to group cohesion and inter-individual vocal signal interferences. We suggest future directions of research to understand how social behaviors can result from sensorimotor interactions.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116433595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Infants gaze pattern analyzing using contrast entropy minimization 利用对比熵最小化分析婴儿注视模式
Ali Mahdi, M. Schlesinger, Dima Amso, Jun Qin
Statistical relationships between human vision and natural scene statistics have been addressed. Contrast is fundamental image property. We proposed that infant visual exploration and learning development could be analyzed using contrast entropy minimization. Ten infants viewed 16 naturalistic images. Fixations of the infants were recorded. Infant gaze patterns are compared between individual infants. Infants are compared to the fixations predicted by the contrast entropy minimization. The results show that infant tends to have different gaze patterns, and infant entropy drops gradually as fixations are made.
研究了人类视觉与自然场景统计之间的统计关系。对比是图像的基本属性。我们提出可以使用对比熵最小化来分析婴儿的视觉探索和学习发展。10名婴儿观看了16张自然主义图片。记录婴儿的注视。婴儿的凝视模式在个体婴儿之间进行比较。将婴儿与对比熵最小化预测的注视进行比较。结果表明,婴儿倾向于具有不同的注视模式,并且随着注视的进行,婴儿熵逐渐下降。
{"title":"Infants gaze pattern analyzing using contrast entropy minimization","authors":"Ali Mahdi, M. Schlesinger, Dima Amso, Jun Qin","doi":"10.1109/DEVLRN.2015.7346124","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346124","url":null,"abstract":"Statistical relationships between human vision and natural scene statistics have been addressed. Contrast is fundamental image property. We proposed that infant visual exploration and learning development could be analyzed using contrast entropy minimization. Ten infants viewed 16 naturalistic images. Fixations of the infants were recorded. Infant gaze patterns are compared between individual infants. Infants are compared to the fixations predicted by the contrast entropy minimization. The results show that infant tends to have different gaze patterns, and infant entropy drops gradually as fixations are made.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124097014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
U-shaped motor development emerges from Goal Babbling with intrinsic motor noise u型运动发育是由目标咿呀学语产生的,具有内在的运动噪声
Kenichi Narioka, Jochen J. Steil
The development of skills in human infants sometimes proceeds so-called U-shaped form, an sequence of disappearance and reappearance of a skill, which has been reported in [1] for pre-reaching to balls presented to infants of early age. We use Goal Babbling, a computational model for exploratory learning of motor skills, to model this U-shaped learning dynamics and furthermore also the qualitative differences caused by presentation modes of the goals. To this aim, we introduce developmentally plausible motor noise and adaptive learning rates in Goal Babbling and show through extensive simulation that U-shaped motor development emerges, thereby reproducing the findings of [1] in our computational model. The results suggest that the disappearance of the skill is caused by a combination of improving the skill and reducing the motor noise in the course of the maturing process, while the reappearance of the skill and its precision is directly reflecting the progress in learning the motor coordination.
人类婴儿的技能发展有时是所谓的u型,一种技能的消失和再现的序列,在[1]中有报道称,早期婴儿可以预先接触到球。我们使用运动技能探索性学习的计算模型“目标咿呀学语”(Goal Babbling)来模拟这种u型学习动态,以及目标呈现模式造成的质的差异。为此,我们在Goal Babbling中引入了发展上合理的运动噪声和自适应学习率,并通过广泛的模拟显示了u形运动发展的出现,从而在我们的计算模型中再现了[1]的发现。结果表明,该技能的消失是在成熟过程中提高技能和减少运动噪声相结合的结果,而该技能的再现及其精度直接反映了运动协调学习的进展。
{"title":"U-shaped motor development emerges from Goal Babbling with intrinsic motor noise","authors":"Kenichi Narioka, Jochen J. Steil","doi":"10.1109/DEVLRN.2015.7346115","DOIUrl":"https://doi.org/10.1109/DEVLRN.2015.7346115","url":null,"abstract":"The development of skills in human infants sometimes proceeds so-called U-shaped form, an sequence of disappearance and reappearance of a skill, which has been reported in [1] for pre-reaching to balls presented to infants of early age. We use Goal Babbling, a computational model for exploratory learning of motor skills, to model this U-shaped learning dynamics and furthermore also the qualitative differences caused by presentation modes of the goals. To this aim, we introduce developmentally plausible motor noise and adaptive learning rates in Goal Babbling and show through extensive simulation that U-shaped motor development emerges, thereby reproducing the findings of [1] in our computational model. The results suggest that the disappearance of the skill is caused by a combination of improving the skill and reducing the motor noise in the course of the maturing process, while the reappearance of the skill and its precision is directly reflecting the progress in learning the motor coordination.","PeriodicalId":164756,"journal":{"name":"2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"105 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126705100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1