应对工业取放挑战:基于深度学习的 6 自由度姿态估计解决方案

IF 8.2 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computers in Industry Pub Date : 2024-07-25 DOI:10.1016/j.compind.2024.104130
{"title":"应对工业取放挑战:基于深度学习的 6 自由度姿态估计解决方案","authors":"","doi":"10.1016/j.compind.2024.104130","DOIUrl":null,"url":null,"abstract":"<div><p>Object picking is a fundamental, long-lasting, and yet unsolved problem in industrial applications. To complete it, 6 Degrees-of-Freedom pose estimation can be crucial. This task, easy for humans, is a challenge for machines as it involves multiple intelligent processes (for example object detection, recognition, pose prediction). Pose estimation has recently made huge steps forward, due to the advent of Deep Learning. However, in real-world applications it is not trivial to compute it: each use-case needs an annotated dataset and a model robust enough to face its specific challenges. In this paper, we present a comprehensive investigation focused on a specific use-case: the picking of four industrial objects by a collaborative robot’s arm, addressing challenges related to reflective textures and pose ambiguities of heterogeneous shapes. Thus, Artificial Intelligence is crucial in this process, utilizing Convolutional Neural Networks to discern an object’s pose by extracting hierarchical features from a single image. In detail, we propose a new synthetic dataset of industrial objects and a fine-tuning method to close the sim-to-real domain gap. In addition, we improved an existing pipeline for pose estimation and introduced a new version of an existing method, based on Convolutional Neural Networks. Finally, extensive experiments were conducted with a Universal Robot UR5e. Results show our strategy achieves good performances with an average successful picking rate of 75% on these new objects. Considering the lack of available datasets for pose estimation, coupled with the significant time and labor required for annotating new images, we contribute to the scientific community by providing a comprehensive dataset, and the associated generation and estimation pipelines.<span><span><sup>1</sup></span></span></p></div>","PeriodicalId":55219,"journal":{"name":"Computers in Industry","volume":null,"pages":null},"PeriodicalIF":8.2000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0166361524000587/pdfft?md5=5784c120c97e9ce7f729edd31cc45d22&pid=1-s2.0-S0166361524000587-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Addressing challenges in industrial pick and place: A deep learning-based 6 Degrees-of-Freedom pose estimation solution\",\"authors\":\"\",\"doi\":\"10.1016/j.compind.2024.104130\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Object picking is a fundamental, long-lasting, and yet unsolved problem in industrial applications. To complete it, 6 Degrees-of-Freedom pose estimation can be crucial. This task, easy for humans, is a challenge for machines as it involves multiple intelligent processes (for example object detection, recognition, pose prediction). Pose estimation has recently made huge steps forward, due to the advent of Deep Learning. However, in real-world applications it is not trivial to compute it: each use-case needs an annotated dataset and a model robust enough to face its specific challenges. In this paper, we present a comprehensive investigation focused on a specific use-case: the picking of four industrial objects by a collaborative robot’s arm, addressing challenges related to reflective textures and pose ambiguities of heterogeneous shapes. Thus, Artificial Intelligence is crucial in this process, utilizing Convolutional Neural Networks to discern an object’s pose by extracting hierarchical features from a single image. In detail, we propose a new synthetic dataset of industrial objects and a fine-tuning method to close the sim-to-real domain gap. In addition, we improved an existing pipeline for pose estimation and introduced a new version of an existing method, based on Convolutional Neural Networks. Finally, extensive experiments were conducted with a Universal Robot UR5e. Results show our strategy achieves good performances with an average successful picking rate of 75% on these new objects. Considering the lack of available datasets for pose estimation, coupled with the significant time and labor required for annotating new images, we contribute to the scientific community by providing a comprehensive dataset, and the associated generation and estimation pipelines.<span><span><sup>1</sup></span></span></p></div>\",\"PeriodicalId\":55219,\"journal\":{\"name\":\"Computers in Industry\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":8.2000,\"publicationDate\":\"2024-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S0166361524000587/pdfft?md5=5784c120c97e9ce7f729edd31cc45d22&pid=1-s2.0-S0166361524000587-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers in Industry\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0166361524000587\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in Industry","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166361524000587","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

物体拾取是工业应用中一个基本的、长期存在但尚未解决的问题。要完成这一任务,6 自由度姿态估计至关重要。这项任务对人类来说很容易,但对机器来说却是一项挑战,因为它涉及多个智能过程(如物体检测、识别、姿态预测)。由于深度学习技术的出现,姿态估计最近取得了巨大进步。然而,在现实世界的应用中,计算姿势估计并非易事:每个用例都需要有注释的数据集和足够强大的模型来应对其特定挑战。在本文中,我们介绍了一项侧重于特定用例的综合调查:协作机器人手臂拾取四个工业物体,解决与反光纹理和异质形状的姿势模糊性有关的挑战。因此,人工智能在这一过程中至关重要,它利用卷积神经网络,通过从单张图像中提取分层特征来辨别物体的姿态。具体而言,我们提出了一种新的工业物体合成数据集和微调方法,以缩小模拟与真实领域的差距。此外,我们还改进了现有的姿态估计管道,并引入了基于卷积神经网络的现有方法的新版本。最后,我们使用通用机器人 UR5e 进行了大量实验。结果表明,我们的策略取得了良好的效果,在这些新物体上的平均拾取成功率达到 75%。考虑到姿势估计缺乏可用的数据集,加上标注新图像需要大量的时间和人力,我们提供了一个全面的数据集以及相关的生成和估计管道,为科学界做出了贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Addressing challenges in industrial pick and place: A deep learning-based 6 Degrees-of-Freedom pose estimation solution

Object picking is a fundamental, long-lasting, and yet unsolved problem in industrial applications. To complete it, 6 Degrees-of-Freedom pose estimation can be crucial. This task, easy for humans, is a challenge for machines as it involves multiple intelligent processes (for example object detection, recognition, pose prediction). Pose estimation has recently made huge steps forward, due to the advent of Deep Learning. However, in real-world applications it is not trivial to compute it: each use-case needs an annotated dataset and a model robust enough to face its specific challenges. In this paper, we present a comprehensive investigation focused on a specific use-case: the picking of four industrial objects by a collaborative robot’s arm, addressing challenges related to reflective textures and pose ambiguities of heterogeneous shapes. Thus, Artificial Intelligence is crucial in this process, utilizing Convolutional Neural Networks to discern an object’s pose by extracting hierarchical features from a single image. In detail, we propose a new synthetic dataset of industrial objects and a fine-tuning method to close the sim-to-real domain gap. In addition, we improved an existing pipeline for pose estimation and introduced a new version of an existing method, based on Convolutional Neural Networks. Finally, extensive experiments were conducted with a Universal Robot UR5e. Results show our strategy achieves good performances with an average successful picking rate of 75% on these new objects. Considering the lack of available datasets for pose estimation, coupled with the significant time and labor required for annotating new images, we contribute to the scientific community by providing a comprehensive dataset, and the associated generation and estimation pipelines.1

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computers in Industry
Computers in Industry 工程技术-计算机:跨学科应用
CiteScore
18.90
自引率
8.00%
发文量
152
审稿时长
22 days
期刊介绍: The objective of Computers in Industry is to present original, high-quality, application-oriented research papers that: • Illuminate emerging trends and possibilities in the utilization of Information and Communication Technology in industry; • Establish connections or integrations across various technology domains within the expansive realm of computer applications for industry; • Foster connections or integrations across diverse application areas of ICT in industry.
期刊最新文献
Rapid quality control for recycled coarse aggregates (RCA) streams: Multi-sensor integration for advanced contaminant detection Apple varieties and growth prediction with time series classification based on deep learning to impact the harvesting decisions Maximum subspace transferability discriminant analysis: A new cross-domain similarity measure for wind-turbine fault transfer diagnosis Dual channel visible graph convolutional neural network for microleakage monitoring of pipeline weld homalographic cracks Video-based automatic people counting for public transport: On-bus versus off-bus deployment
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1