Simple and complex behavior learning using behavior hidden Markov model and CobART

IF 6.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Neurocomputing Pub Date : 2013-03-01 DOI:10.1016/j.neucom.2012.09.013
Seyit Sabri Seyhan, Ferda Nur Alpaslan, Mustafa Yavaş
{"title":"Simple and complex behavior learning using behavior hidden Markov model and CobART","authors":"Seyit Sabri Seyhan,&nbsp;Ferda Nur Alpaslan,&nbsp;Mustafa Yavaş","doi":"10.1016/j.neucom.2012.09.013","DOIUrl":null,"url":null,"abstract":"<div><p><span>This paper proposes behavior learning and generation models for simple and complex behaviors of robots using </span>unsupervised learning<span> methods. While the simple behaviors are modeled by simple-behavior learning model (SBLM), complex behaviors are modeled by complex-behavior learning model (CBLM) which uses previously learned simple or complex behaviors. Both models include behavior categorization, behavior modeling, and behavior generation phases. In the behavior categorization phase, sensory data are categorized using correlation based adaptive resonance theory (CobART) network that generates motion primitives corresponding to robot's base abilities. In the behavior modeling phase, a modified version of hidden Markov model<span> (HMM), is called Behavior-HMM, is used to model the relationships among the motion primitives in a finite state stochastic network. At the same time, a motion generator which is an artificial neural network (ANN) is trained for each motion primitive to learn essential robot motor commands. In the behavior generation phase, a motion primitive sequence that can perform the desired task is generated according to the previously learned Behavior-HMMs at the higher level. Then, in the lower level, these motion primitives are executed by the motion generator which is specifically trained for the corresponding motion primitive. The transitions between the motion primitives are done according to observed sensory data and probabilistic weights assigned to each transition during the learning phase. The proposed models are not constructed for one specific behavior, but are intended to be bases for all behaviors. The behavior learning capabilities of the model is extended by integrating previously learned behaviors hierarchically which is referred as CBLM. Hence, new behaviors can take advantage of already discovered behaviors. Performed experiments on a robot simulator show that simple and complex-behavior learning models can generate requested behaviors effectively.</span></span></p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"103 ","pages":"Pages 121-131"},"PeriodicalIF":6.5000,"publicationDate":"2013-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.neucom.2012.09.013","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231212007631","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 16

Abstract

This paper proposes behavior learning and generation models for simple and complex behaviors of robots using unsupervised learning methods. While the simple behaviors are modeled by simple-behavior learning model (SBLM), complex behaviors are modeled by complex-behavior learning model (CBLM) which uses previously learned simple or complex behaviors. Both models include behavior categorization, behavior modeling, and behavior generation phases. In the behavior categorization phase, sensory data are categorized using correlation based adaptive resonance theory (CobART) network that generates motion primitives corresponding to robot's base abilities. In the behavior modeling phase, a modified version of hidden Markov model (HMM), is called Behavior-HMM, is used to model the relationships among the motion primitives in a finite state stochastic network. At the same time, a motion generator which is an artificial neural network (ANN) is trained for each motion primitive to learn essential robot motor commands. In the behavior generation phase, a motion primitive sequence that can perform the desired task is generated according to the previously learned Behavior-HMMs at the higher level. Then, in the lower level, these motion primitives are executed by the motion generator which is specifically trained for the corresponding motion primitive. The transitions between the motion primitives are done according to observed sensory data and probabilistic weights assigned to each transition during the learning phase. The proposed models are not constructed for one specific behavior, but are intended to be bases for all behaviors. The behavior learning capabilities of the model is extended by integrating previously learned behaviors hierarchically which is referred as CBLM. Hence, new behaviors can take advantage of already discovered behaviors. Performed experiments on a robot simulator show that simple and complex-behavior learning models can generate requested behaviors effectively.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用行为隐马尔可夫模型和coart进行简单和复杂的行为学习
利用无监督学习方法,提出了机器人简单和复杂行为的行为学习和生成模型。简单行为的建模采用简单行为学习模型(simple-behavior learning model, SBLM),复杂行为的建模采用复杂行为学习模型(complex-behavior learning model, CBLM)。这两个模型都包括行为分类、行为建模和行为生成阶段。在行为分类阶段,使用基于关联的自适应共振理论(CobART)网络对感知数据进行分类,该网络生成与机器人基本能力相对应的运动原语。在行为建模阶段,使用隐马尔可夫模型(HMM)的改进版本behavior -HMM对有限状态随机网络中运动基元之间的关系进行建模。同时,对每个运动原语训练一个运动生成器,即人工神经网络来学习机器人的基本运动指令。在行为生成阶段,根据先前学习到的更高层次的behavior - hmm,生成能够执行期望任务的运动原语序列。然后,在较低的层次上,这些运动原语由运动生成器执行,该生成器专门针对相应的运动原语进行训练。运动基元之间的转换是根据观察到的感官数据和在学习阶段分配给每个转换的概率权重来完成的。建议的模型不是为一个特定的行为构建的,而是打算作为所有行为的基础。该模型的行为学习能力是通过层次化地整合先前学习的行为来扩展的,称为CBLM。因此,新的行为可以利用已经发现的行为。在机器人模拟器上进行的实验表明,简单和复杂的行为学习模型都能有效地生成所要求的行为。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Neurocomputing
Neurocomputing 工程技术-计算机:人工智能
CiteScore
13.10
自引率
10.00%
发文量
1382
审稿时长
70 days
期刊介绍: Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.
期刊最新文献
Dynamic hypergraph structure learning for spatio-temporal time series forecasting ASHSR: Enhancing query-based occupancy prediction via anti-occlusion sampling and hard sample reweighting CDNE: Community deception from node and edge perspectives 3MU-Net: A multi-layer, multi-view and multi-modal segmentation model for PET/CT images of lung tumors Editorial Board
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1