International Journal of Humanoid Robotics最新文献

英文中文

Task-Oriented Self-Imitation Learning for Robotic Autonomous Skill Acquisition 面向任务的机器人自主技能学习

IF 1.5 4区计算机科学 Q4 ROBOTICS

International Journal of Humanoid Robotics

Pub Date : 2024-03-13 DOI: 10.1142/s0219843624500014

Chenyang Ran, Jianbo Su

The inferior sample efficiency of reinforcement learning (RL) and the requirement for high-quality demonstrations in imitation learning (IL) will hinder their application in real-world robots. To address this challenge, a novel self-evolution framework, named task-oriented self-imitation learning (TOSIL), is proposed. To circumvent external demonstrations, the top-K self-generated trajectories are chosen as expert data from both per-episode exploration and long-term return perspectives. Each transition is assigned a guide reward, which is formulated by these trajectories. The guide rewards update as the agent evolves, encouraging good exploration behaviors. This methodology guarantees that the agent explores in the direction relevant to the task, improving sample efficiency and asymptotic performance. The experimental results on locomotion and manipulation tasks indicate that the proposed framework outperforms other state-of-the-art RL methods. Furthermore, the integration of suboptimal trajectories has the potential to improve the sample efficiency while maintaining performance. This is a significant advancement in autonomous skill acquisition for robots.

强化学习（RL）的样本效率较低，而模仿学习（IL）需要高质量的示范，这将阻碍它们在现实世界机器人中的应用。为了应对这一挑战，我们提出了一个新颖的自我进化框架，名为面向任务的自我模仿学习（TOSIL）。为避免外部示范，从每集探索和长期回报的角度出发，选择前 K 个自生成轨迹作为专家数据。每个过渡都有一个指导奖励，由这些轨迹制定。指导奖励随着代理的发展而更新，以鼓励良好的探索行为。这种方法保证了机器人在与任务相关的方向上进行探索，提高了样本效率和渐进性能。运动和操纵任务的实验结果表明，所提出的框架优于其他最先进的 RL 方法。此外，整合次优轨迹有可能在保持性能的同时提高采样效率。这是机器人自主技能获取领域的一大进步。

{"title":"Task-Oriented Self-Imitation Learning for Robotic Autonomous Skill Acquisition","authors":"Chenyang Ran, Jianbo Su","doi":"10.1142/s0219843624500014","DOIUrl":"https://doi.org/10.1142/s0219843624500014","url":null,"abstract":"<p>The inferior sample efficiency of reinforcement learning (RL) and the requirement for high-quality demonstrations in imitation learning (IL) will hinder their application in real-world robots. To address this challenge, a novel self-evolution framework, named task-oriented self-imitation learning (TOSIL), is proposed. To circumvent external demonstrations, the top-K self-generated trajectories are chosen as expert data from both per-episode exploration and long-term return perspectives. Each transition is assigned a guide reward, which is formulated by these trajectories. The guide rewards update as the agent evolves, encouraging good exploration behaviors. This methodology guarantees that the agent explores in the direction relevant to the task, improving sample efficiency and asymptotic performance. The experimental results on locomotion and manipulation tasks indicate that the proposed framework outperforms other state-of-the-art RL methods. Furthermore, the integration of suboptimal trajectories has the potential to improve the sample efficiency while maintaining performance. This is a significant advancement in autonomous skill acquisition for robots.</p>","PeriodicalId":50319,"journal":{"name":"International Journal of Humanoid Robotics","volume":"21 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140153545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Celebrating 20th Anniversary of IJHR 庆祝 IJHR 20 周年

IF 1.5 4区计算机科学 Q4 ROBOTICS

International Journal of Humanoid Robotics

Pub Date : 2024-02-29 DOI: 10.1142/s021984362402002x

Ming Xie, John Weng, Giulio Sandini

In this editorial dedicated to the special issue which celebrates the 20th anniversary of the International Journal of Humanoid Robotics (IJHR), we outline the history of IJHR.

本期社论旨在庆祝《国际仿人机器人学杂志》（IJHR）创刊 20 周年，我们将概述《国际仿人机器人学杂志》的历史。

引用次数: 0

Anthology: Cognitive Developmental Humanoids Robotics 选集：认知发展型人机机器人

IF 1.5 4区计算机科学 Q4 ROBOTICS

International Journal of Humanoid Robotics

Pub Date : 2024-02-24 DOI: 10.1142/s0219843624500026

Minoru Asada

This paper explores the confluence of physical embodiment and social interaction in the context of Cognitive Developmental Humanoid Robotics (CDHR). By classifying varied interactions through developmental stages of the “self” and their interaction spheres, the discussion unearths profound insights into the composite nature of developmental processes. It presents a multi-dimensional exploration through different interaction scenarios, ranging from fetus-mother interactions to advanced large language models like ChatGPT, revealing the intrinsic connection between the physical and social realms of existence. In conclusion, this paper broadens the horizon of our understanding of the intricate interplays between physical embodiment and social interaction, setting the stage for more nuanced, ethically sound approaches and explorations in the realm of humanoid robotics and artificial intelligence.

本文以认知发展型仿人机器人（CDHR）为背景，探讨了物理体现与社会互动的融合。通过对 "自我 "发展阶段的各种互动及其互动领域进行分类，讨论揭示了发展过程的复合本质的深刻见解。本文通过从胎儿与母亲的互动到先进的大型语言模型（如 ChatGPT）等不同互动场景的多维探索，揭示了存在的物理和社会领域之间的内在联系。总之，本文拓宽了我们对物理体现与社会互动之间错综复杂的相互作用的理解，为仿人机器人和人工智能领域更细致入微、更符合伦理道德的方法和探索奠定了基础。

引用次数: 0

Learning an Image-Based Visual Servoing Controller for Object Grasping 学习基于图像的物体抓取视觉伺服控制器

IF 1.5 4区计算机科学 Q4 ROBOTICS

International Journal of Humanoid Robotics

Pub Date : 2024-01-26 DOI: 10.1142/s0219843623500330

Shuaijun Wang, Lining Sun, Mantian Li, Pengfei Wang, Fusheng Zha, Wei Guo, Qiang Li

Adaptive and cooperative control of arms and fingers for natural object reaching and grasping, without explicit 3D geometric pose information, is observed in humans. In this study, an image-based visual servoing controller, inspired by human grasping behavior, is proposed for an arm-gripper system. A large-scale dataset is constructed using Pybullet simulation, comprising paired images and arm-gripper control signals mimicking expert grasping behavior. Leveraging this dataset, a network is directly trained to derive a control policy that maps images to cooperative grasp control. Subsequently, the learned synergy grasping policy from the network is directly applied to a real robot with the same configuration. Experimental results demonstrate the effectiveness of the algorithm. Videos can be found at https://www.bilibili.com/video/BV1tg4y1b7Qe/.

在没有明确的三维几何姿势信息的情况下，可以观察到人类对手臂和手指的自适应协同控制，以实现自然物体的伸展和抓取。在本研究中，受人类抓握行为的启发，针对手臂抓取系统提出了一种基于图像的视觉伺服控制器。利用 Pybullet 仿真构建了一个大规模数据集，其中包括成对图像和模仿专家抓取行为的手臂抓取器控制信号。利用这个数据集，可以直接训练一个网络，推导出将图像映射到协同抓取控制的控制策略。随后，从网络中学到的协同抓取策略被直接应用到具有相同配置的真实机器人上。实验结果证明了该算法的有效性。视频请访问 https://www.bilibili.com/video/BV1tg4y1b7Qe/。

引用次数: 0

Conscious Learning without Post-Selection Misconduct 自觉学习，杜绝选举后的不当行为

IF 1.5 4区计算机科学 Q4 ROBOTICS

International Journal of Humanoid Robotics

Pub Date : 2023-12-15 DOI: 10.1142/s0219843623500317

Juyang Weng

引用次数: 0

C-SQG: Cosine-Law-Based Spatially Quantized Gait Generation for Knee-Stretched Biped Walking C-SQG：基于余弦法的空间量化步态生成，用于膝关节伸展的双足行走

IF 1.5 4区计算机科学 Q4 ROBOTICS

International Journal of Humanoid Robotics

Pub Date : 2023-12-15 DOI: 10.1142/s0219843623500329

Chaobin Zou, Rui Huang, Guangkui Song, Kecheng Shi, Zhinan Peng, Hong Cheng

引用次数: 0

ZMP - where are we after fifty-five years? ZMP - 五十五年后，我们在哪里？

IF 1.5 4区计算机科学 Q4 ROBOTICS

International Journal of Humanoid Robotics

Pub Date : 2023-12-08 DOI: 10.1142/s0219843623500305

B. Borovac, M. Nikolic, M. Raković, S. Savic

引用次数: 0

From embodiment to super-embodiment: an approach to open-ended and human aligned intelligence/mind 从化身到超级化身：一种实现开放式和与人类一致的智能/心灵的方法

IF 1.5 4区计算机科学 Q4 ROBOTICS

International Journal of Humanoid Robotics

Pub Date : 2023-12-01 DOI: 10.1142/s0219843623500299

Yasuo Kuniyoshi

引用次数: 0

Avatar 阿凡达

IF 1.5 4区计算机科学 Q4 ROBOTICS

International Journal of Humanoid Robotics

Pub Date : 2023-11-23 DOI: 10.1142/s0219843623500287

Hiroshi Ishiguro

引用次数: 0

Stories of qrio and pino, and beyond: lessons learned from small humanoid projects from R&D to business qrio 和 pino 的故事及其他：小型仿人项目从研发到商业的经验教训

IF 1.5 4区计算机科学 Q4 ROBOTICS

International Journal of Humanoid Robotics

Pub Date : 2023-11-17 DOI: 10.1142/s0219843623500275

Masahiro Fujita, Yasunori Kawanami, Kiyokazu Miyazawa, Masaya Kinoshita, Kunihito Sawai, F. Yamasaki, Tatsuya Matsui, Ken Endo, Shu Ishiguro, Hiroaki Kitano

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

International Journal of Humanoid Robotics

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀