Bayesian Optimization for Developmental Robotics with Meta-Learning by Parameters Bounds Reduction
Pub Date: 2020-07-30 | DOI: 10.1109/ICDL-EpiRob48136.2020.9278071
Maxime Petit, E. Dellandréa, Liming Chen
In robotics, methods and software usually require hyperparameter optimization to perform well on specific tasks, for instance industrial bin-picking from homogeneous heaps of different objects. We present a developmental framework based on long-term memory and reasoning modules (Bayesian optimization, visual similarity and parameter bounds reduction) that allows a robot to use a meta-learning mechanism to increase the efficiency of such continuous and constrained parameter optimizations. The new optimization, viewed as a learning episode for the robot, can take advantage of past experiences (stored in the episodic and procedural memories) to shrink the search space, using reduced parameter bounds computed from the best optimizations the robot achieved on tasks similar to the new one (e.g. bin-picking from a homogeneous heap of a similar object, based on visual similarity of objects stored in the semantic memory). As an example, we applied the system to the constrained optimization of 9 continuous hyperparameters of a professional software package (Kamido) for industrial robotic-arm bin-picking tasks, a step that is needed each time a new object has to be handled correctly. We used a simulator to create bin-picking tasks for 8 different objects (7 in simulation and one on a real setup), run both without and with meta-learning based on experiences from other, similar objects. The system achieved good results despite a very small budget of 30 iterations per optimization, and performed better with meta-learning for every object tested (84.3% vs 78.9% overall success, p-value = 0.036).
{"title":"Bayesian Optimization for Developmental Robotics with Meta-Learning by Parameters Bounds Reduction","authors":"Maxime Petit, E. Dellandréa, Liming Chen","doi":"10.1109/ICDL-EpiRob48136.2020.9278071","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278071","url":null,"abstract":"In robotics, methods and softwares usually require optimizations of hyperparameters in order to be efficient for specific tasks, for instance industrial bin-picking from homogeneous heaps of different objects. We present a developmental framework based on long-term memory and reasoning modules (Bayesian Optimisation, visual similarity and parameters bounds reduction) allowing a robot to use meta-learning mechanism increasing the efficiency of such continuous and constrained parameters optimizations. The new optimization, viewed as a learning for the robot, can take advantage of past experiences (stored in the episodic and procedural memories) to shrink the search space by using reduced parameters bounds computed from the best optimizations realized by the robot with similar tasks of the new one (e.g. bin-picking from an homogenous heap of a similar object, based on visual similarity of objects stored in the semantic memory). As example, we have confronted the system to the constrained optimizations of 9 continuous hyperparameters for a professional software (Kamido) in industrial robotic arm bin-picking tasks, a step that is needed each time to handle correctly new object. We used a simulator to create bin-picking tasks for 8 different objects (7 in simulation and one with real setup, without and with meta-learning with experiences coming from other similar objects) achieving goods results despite a very small optimization budget, with a better performance reached when meta-learning is used (84.3 % vs 78.9 % of success overall, with a small budget of 30 iterations for each optimization) for every object tested (p-value=0.036).","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"123 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122486399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tracking Emotions: Intrinsic Motivation Grounded on Multi-Level Prediction Error Dynamics
Pub Date: 2020-07-29 | DOI: 10.1109/ICDL-EpiRob48136.2020.9278106
G. Schillaci, Alejandra Ciria, B. Lara
We present an intrinsic motivation architecture that generates behaviors towards self-generated and dynamic goals and that regulates goal selection and the balance between exploitation and exploration through multi-level monitoring of prediction error dynamics. The architecture modulates exploration noise and allocates computational resources according to the dynamics of the overall performance of the learning system. Results show that it outperforms intrinsic motivation approaches in which exploratory noise and goals are fixed. We suggest that tracking prediction error dynamics allows an artificial agent to be intrinsically motivated to seek new experiences while remaining constrained to those that generate reducible prediction error. Finally, we discuss the potential relationship between emotional valence and the rate of progress toward a goal.
{"title":"Tracking Emotions: Intrinsic Motivation Grounded on Multi - Level Prediction Error Dynamics","authors":"G. Schillaci, Alejandra Ciria, B. Lara","doi":"10.1109/ICDL-EpiRob48136.2020.9278106","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278106","url":null,"abstract":"We present an intrinsic motivation architecture that generates behaviors towards self-generated and dynamic goals and that regulates goal selection and the balance between exploitation and exploration through multi-level monitoring of prediction error dynamics. This architecture modulates exploration noise and leverages computational resources according to the dynamics of the overall performance of the learning system. Results show that this architecture outperforms intrinsic motivation approaches where exploratory noise and goals are fixed. We suggest that the tracking of prediction error dynamics allows an artificial agent to be intrinsically motivated to seek new experiences but constrained to those that generate reducible prediction error. We argue about the potential relationship between emotional valence and rates of progress toward a goal.","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"89 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126027656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
End-to-End Pixel-Based Deep Active Inference for Body Perception and Action
Pub Date: 2019-12-28 | DOI: 10.1109/ICDL-EpiRob48136.2020.9278105
Cansu Sancaktar, Pablo Lanillos
We present a pixel-based deep active inference algorithm (PixelAI) inspired by human body perception and action. Our algorithm combines the free energy principle from neuroscience, rooted in variational inference, with deep convolutional decoders to scale the algorithm to directly deal with raw visual input and provide online adaptive inference. Our approach is validated by studying body perception and action in a simulated and a real Nao robot. Results show that our approach allows the robot to perform 1) dynamical body estimation of its arm using only monocular camera images and 2) autonomous reaching to “imagined” arm poses in visual space. This suggests that robot and human body perception and action can be efficiently solved by viewing both as an active inference problem guided by ongoing sensory input.
{"title":"End-to-End Pixel-Based Deep Active Inference for Body Perception and Action","authors":"Cansu Sancaktar, Pablo Lanillos","doi":"10.1109/ICDL-EpiRob48136.2020.9278105","DOIUrl":"https://doi.org/10.1109/ICDL-EpiRob48136.2020.9278105","url":null,"abstract":"We present a pixel-based deep active inference algorithm (PixelAI) inspired by human body perception and action. Our algorithm combines the free energy principle from neuroscience, rooted in variational inference, with deep convolutional decoders to scale the algorithm to directly deal with raw visual input and provide online adaptive inference. Our approach is validated by studying body perception and action in a simulated and a real Nao robot. Results show that our approach allows the robot to perform 1) dynamical body estimation of its arm using only monocular camera images and 2) autonomous reaching to “imagined” arm poses in visual space. This suggests that robot and human body perception and action can be efficiently solved by viewing both as an active inference problem guided by ongoing sensory input.","PeriodicalId":114948,"journal":{"name":"2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125169152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}