Multimodal zero-shot learning for tactile texture recognition
Guanqun Cao, Jiaqi Jiang, Danushka Bollegala, Min Li, Shan Luo
Robotics and Autonomous Systems, 2024. DOI: 10.1016/j.robot.2024.104688
Abstract
Tactile sensing plays an irreplaceable role in robotic material recognition. It enables robots to distinguish material properties such as local geometry and texture, which is especially important for materials like textiles. However, most tactile recognition methods can only classify known materials, i.e., materials that have been touched and for which tactile training data exist, and cannot classify unknown materials for which no tactile data are available. To solve this problem, we propose a tactile Zero-Shot Learning framework that recognises materials the first time they are touched, using their visual and semantic information, without requiring tactile training samples. The biggest challenge in tactile Zero-Shot Learning is to recognise classes that are disjoint between training and testing, i.e., test materials that do not appear among the training materials. To bridge this gap, the visual modality, which provides tactile cues from sight, and semantic attributes, which give high-level characteristics, are combined and act as a link that exposes the model to these disjoint classes. Specifically, a generative model is learnt to synthesise tactile features from the corresponding visual images and semantic embeddings, and a classifier is then trained on the synthesised tactile features for zero-shot recognition. Extensive experiments demonstrate that our proposed multimodal generative model achieves a recognition accuracy of 83.06% in classifying materials that were not touched before. The robotic experiment demo and the FabricVST dataset are available at https://sites.google.com/view/multimodalzsl.
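To make the two-stage idea in the abstract concrete, the following is a minimal PyTorch sketch of a conditional generator that synthesises tactile features from visual and semantic embeddings, followed by a classifier trained on the synthetic features. All names, feature dimensions, network sizes, and the simple reconstruction loss here are illustrative assumptions standing in for the paper's actual generative objective; the tensors are random placeholders rather than FabricVST data.

```python
# Hypothetical sketch of the two-stage zero-shot pipeline.
# Dimensions, architectures, and losses are assumptions, not the
# authors' implementation.
import torch
import torch.nn as nn

VIS_DIM, SEM_DIM, NOISE_DIM, TAC_DIM, NUM_CLASSES = 512, 300, 64, 512, 10

class TactileFeatureGenerator(nn.Module):
    """Maps a visual embedding + semantic embedding (+ noise) to a
    synthetic tactile feature vector."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(VIS_DIM + SEM_DIM + NOISE_DIM, 1024),
            nn.ReLU(),
            nn.Linear(1024, TAC_DIM),
        )

    def forward(self, vis, sem, noise):
        return self.net(torch.cat([vis, sem, noise], dim=1))

# Stage 1: fit the generator on *seen* classes, where real tactile
# features are available (MSE used here as a stand-in objective).
gen = TactileFeatureGenerator()
opt = torch.optim.Adam(gen.parameters(), lr=1e-4)
vis, sem = torch.randn(32, VIS_DIM), torch.randn(32, SEM_DIM)
real_tactile = torch.randn(32, TAC_DIM)  # placeholder tactile features
for _ in range(100):
    noise = torch.randn(32, NOISE_DIM)
    loss = nn.functional.mse_loss(gen(vis, sem, noise), real_tactile)
    opt.zero_grad(); loss.backward(); opt.step()

# Stage 2: synthesise tactile features for *unseen* classes from their
# visual/semantic descriptions alone, then train an ordinary classifier
# on the synthetic features for zero-shot recognition.
clf = nn.Linear(TAC_DIM, NUM_CLASSES)
clf_opt = torch.optim.Adam(clf.parameters(), lr=1e-3)
unseen_vis, unseen_sem = torch.randn(64, VIS_DIM), torch.randn(64, SEM_DIM)
labels = torch.randint(0, NUM_CLASSES, (64,))
with torch.no_grad():
    synth = gen(unseen_vis, unseen_sem, torch.randn(64, NOISE_DIM))
for _ in range(100):
    ce = nn.functional.cross_entropy(clf(synth), labels)
    clf_opt.zero_grad(); ce.backward(); clf_opt.step()
```

The key design point the sketch illustrates is that no real tactile data for the unseen classes is ever required: once the generator has learnt the vision-plus-semantics-to-touch mapping on seen materials, the classifier for new materials is trained entirely on generated features.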
About the journal
Robotics and Autonomous Systems carries articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of the journal is to extend the state of the art in both symbolic and sensory-based robot control and learning in the context of autonomous systems. The journal also carries articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.