首页 > 最新文献

IEEE Robotics and Automation Letters最新文献

英文 中文
Virtual Obstacles Regulation for Multi-Agent Path Finding 多代理路径搜索的虚拟障碍调节
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-11-08 DOI: 10.1109/LRA.2024.3494653
Sike Zeng;Xi Chen;Li Chai
Multi-agent path finding (MAPF) involves finding collision-free paths for multiple agents while minimizing the total path costs. Explicit estimation conflict-based search (EECBS) represents a state-of-the-art variant of the widely used conflict-based search (CBS) method, offering bounded-suboptimal solutions. However, both CBS and its variants rely on pairwise conflict resolution methods. A conflict boom means many conflicts occur at one location, which frequently exists in scenarios that a large number of agents operate in small space, and usually leads to heavy computational burden. The location that conflict boom occurs is regarded as conflict boom vertex. This letter proposes a novel method, the Virtual Obstacles Regulation, to expedite algorithmic solving processes (such as EECBS) for MAPF. The proposed method identifies conflicts boom vertices and strategically regulates them as global or local virtual obstacles to circumvent concentrated conflicts. Then, the pairwise conflict resolution processes on conflicts boom vertices are significantly simplified, hence accelerating overall algorithm runtime–often dominated by conflict resolution. Numerical studies validate the efficacy of this approach.
多代理路径搜索(MAPF)涉及为多个代理寻找无碰撞路径,同时使路径总成本最小化。基于冲突的显式估计搜索(EECBS)是广泛使用的基于冲突的搜索(CBS)方法的最新变体,可提供有界次优解。然而,CBS 及其变体都依赖于成对冲突解决方法。冲突繁荣指的是在一个地点发生许多冲突,这种情况经常出现在大量代理在狭小空间内运行的场景中,通常会导致沉重的计算负担。冲突繁荣发生的位置被视为冲突繁荣顶点。本文提出了一种新方法--虚拟障碍规程,以加快 MAPF 的算法求解过程(如 EECBS)。所提方法能识别冲突繁荣顶点,并将其作为全局或局部虚拟障碍进行策略性调节,以规避集中冲突。这样,冲突繁荣顶点上的成对冲突解决过程就大大简化了,从而加快了整个算法的运行时间--通常冲突解决时间是算法运行时间的主要部分。数值研究验证了这种方法的有效性。
{"title":"Virtual Obstacles Regulation for Multi-Agent Path Finding","authors":"Sike Zeng;Xi Chen;Li Chai","doi":"10.1109/LRA.2024.3494653","DOIUrl":"https://doi.org/10.1109/LRA.2024.3494653","url":null,"abstract":"Multi-agent path finding (MAPF) involves finding collision-free paths for multiple agents while minimizing the total path costs. Explicit estimation conflict-based search (EECBS) represents a state-of-the-art variant of the widely used conflict-based search (CBS) method, offering bounded-suboptimal solutions. However, both CBS and its variants rely on pairwise conflict resolution methods. A conflict boom means many conflicts occur at one location, which frequently exists in scenarios that a large number of agents operate in small space, and usually leads to heavy computational burden. The location that conflict boom occurs is regarded as conflict boom vertex. This letter proposes a novel method, the Virtual Obstacles Regulation, to expedite algorithmic solving processes (such as EECBS) for MAPF. The proposed method identifies conflicts boom vertices and strategically regulates them as global or local virtual obstacles to circumvent concentrated conflicts. Then, the pairwise conflict resolution processes on conflicts boom vertices are significantly simplified, hence accelerating overall algorithm runtime–often dominated by conflict resolution. Numerical studies validate the efficacy of this approach.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11417-11424"},"PeriodicalIF":4.6,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142636311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unbounded-GS: Extending 3D Gaussian Splatting With Hybrid Representation for Unbounded Large-Scale Scene Reconstruction 无界-GS:利用混合表示法扩展三维高斯拼接,实现无界大规模场景重建
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-11-08 DOI: 10.1109/LRA.2024.3494652
Wanzhang Li;Fukun Yin;Wen Liu;Yiying Yang;Xin Chen;Biao Jiang;Gang Yu;Jiayuan Fan
Modeling large-scale scenes from multi-view images is challenging due to the trade-off dilemma between visual quality and computational cost. Existing NeRF-based methods have made advancements in neural implicit representation through volumetric ray-marching, but still struggle to deal with cubically growing sampling space in large-scale scenes. Fortunately, the rendering approach based on 3D Gaussian splatting (3DGS) has shown promising results, inspiring further exploration in the splatting setting. However, 3DGS has the limitation of inadequate Gaussian points for modeling distant backgrounds, leading to “splotchy” artifacts. To address this problem, we introduce a novel hybrid neural representation called Unbounded 3D Gaussian. For foreground area, we employs an explicit 3D Gaussian representation to efficiently model the geometry and appearance through splatting weighted Gaussians. For far-away background, we additionally introduce an implicit module comprising Multi-layer Perceptions (MLPs) to directly predict far-away background colors from positional encodings of view positions and ray directions. Furthermore, we design a seamless blending mechanism between the color predictions of the explicit splatting and implicit branches to reconstruct holistic scenes. Extensive experiments demonstrate that our proposed Unbounded-GS inherits the advantages of both faster convergence and high-fidelity rendering quality.
由于视觉质量和计算成本之间的权衡难题,通过多视角图像对大规模场景进行建模具有挑战性。现有的基于 NeRF 的方法通过体积光线行进在神经隐式表示方面取得了进步,但在处理大规模场景中立方体增长的采样空间时仍有困难。幸运的是,基于三维高斯拼接(3DGS)的渲染方法取得了可喜的成果,激发了对拼接设置的进一步探索。然而,3DGS 有其局限性,即高斯点不足以对远处的背景进行建模,从而导致 "斑点 "现象。为了解决这个问题,我们引入了一种名为无界三维高斯的新型混合神经表示法。对于前景区域,我们采用明确的三维高斯表示法,通过拼接加权高斯有效地建立几何和外观模型。对于远处的背景,我们还引入了由多层感知(MLP)组成的隐式模块,通过视图位置和光线方向的位置编码直接预测远处背景的颜色。此外,我们还在显式拼接和隐式分支的颜色预测之间设计了一种无缝混合机制,以重建整体场景。广泛的实验证明,我们提出的无边界-GS 继承了更快收敛和高保真渲染质量的优点。
{"title":"Unbounded-GS: Extending 3D Gaussian Splatting With Hybrid Representation for Unbounded Large-Scale Scene Reconstruction","authors":"Wanzhang Li;Fukun Yin;Wen Liu;Yiying Yang;Xin Chen;Biao Jiang;Gang Yu;Jiayuan Fan","doi":"10.1109/LRA.2024.3494652","DOIUrl":"https://doi.org/10.1109/LRA.2024.3494652","url":null,"abstract":"Modeling large-scale scenes from multi-view images is challenging due to the trade-off dilemma between visual quality and computational cost. Existing NeRF-based methods have made advancements in neural implicit representation through volumetric ray-marching, but still struggle to deal with cubically growing sampling space in large-scale scenes. Fortunately, the rendering approach based on 3D Gaussian splatting (3DGS) has shown promising results, inspiring further exploration in the splatting setting. However, 3DGS has the limitation of inadequate Gaussian points for modeling distant backgrounds, leading to “splotchy” artifacts. To address this problem, we introduce a novel hybrid neural representation called Unbounded 3D Gaussian. For foreground area, we employs an explicit 3D Gaussian representation to efficiently model the geometry and appearance through splatting weighted Gaussians. For far-away background, we additionally introduce an implicit module comprising Multi-layer Perceptions (MLPs) to directly predict far-away background colors from positional encodings of view positions and ray directions. Furthermore, we design a seamless blending mechanism between the color predictions of the explicit splatting and implicit branches to reconstruct holistic scenes. Extensive experiments demonstrate that our proposed Unbounded-GS inherits the advantages of both faster convergence and high-fidelity rendering quality.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11529-11536"},"PeriodicalIF":4.6,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142672009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Voltage Minimization Control Method for Magnetic Navigation Systems to Enhance the Rotating Magnetic Field 增强旋转磁场的磁导航系统电压最小化控制方法
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-11-08 DOI: 10.1109/LRA.2024.3495374
Junhyoung Kwon;Junchi Sa;Serim Lee;Gunhee Jang
Magnetic helical robots (MHRs) can be actuated by a rotating magnetic field (RMF) and can tunnel through a clogged blood vessel using their rotational motion. We propose a method minimizing the voltage required for each coil to generate the RMF for a magnetic navigation system (MNS). The proposed method maximizes the RMF under the rated voltage of the MNS. In addition, to suppress the increase in impedance during high-speed rotational motion of the MHR, the proposed method utilizes the resonance to which the minimax optimization method is applied. The voltage needed to generate the RMF was analytically derived, with the goal of a fast optimization process in mind, so that the MHR could be controlled in real-time. The proposed method was experimentally verified by measuring the magnetic flux density. In addition, we demonstrated the enhanced navigating and tunneling performance of the MHR from in vitro experiments. Finally, we validated the navigating and tunneling motion of the MHR in an in vivo experiment within a superficial femoral artery of a pig.
磁螺旋机器人(MHR)可由旋转磁场(RMF)驱动,并可利用其旋转运动穿过堵塞的血管。我们提出了一种方法,可最大限度地降低磁导航系统(MNS)每个线圈产生 RMF 所需的电压。在磁导航系统的额定电压下,所提出的方法能使 RMF 最大化。此外,为了抑制 MHR 高速旋转运动时阻抗的增加,所提出的方法利用了共振,在共振中应用了最小优化法。产生 RMF 所需的电压是通过分析得出的,目的是实现快速优化过程,以便实时控制 MHR。通过测量磁通密度,对所提出的方法进行了实验验证。此外,我们还通过体外实验证明了 MHR 增强的导航和隧道性能。最后,我们在猪股浅动脉的体内实验中验证了 MHR 的导航和隧道运动。
{"title":"A Voltage Minimization Control Method for Magnetic Navigation Systems to Enhance the Rotating Magnetic Field","authors":"Junhyoung Kwon;Junchi Sa;Serim Lee;Gunhee Jang","doi":"10.1109/LRA.2024.3495374","DOIUrl":"https://doi.org/10.1109/LRA.2024.3495374","url":null,"abstract":"Magnetic helical robots (MHRs) can be actuated by a rotating magnetic field (RMF) and can tunnel through a clogged blood vessel using their rotational motion. We propose a method minimizing the voltage required for each coil to generate the RMF for a magnetic navigation system (MNS). The proposed method maximizes the RMF under the rated voltage of the MNS. In addition, to suppress the increase in impedance during high-speed rotational motion of the MHR, the proposed method utilizes the resonance to which the minimax optimization method is applied. The voltage needed to generate the RMF was analytically derived, with the goal of a fast optimization process in mind, so that the MHR could be controlled in real-time. The proposed method was experimentally verified by measuring the magnetic flux density. In addition, we demonstrated the enhanced navigating and tunneling performance of the MHR from \u0000<italic>in vitro</i>\u0000 experiments. Finally, we validated the navigating and tunneling motion of the MHR in an \u0000<italic>in vivo</i>\u0000 experiment within a superficial femoral artery of a pig.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11561-11568"},"PeriodicalIF":4.6,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142672096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Shape Visual Servoing of a Cable Suspended Between Two Drones 对悬挂在两架无人机之间的缆线进行形状可视化伺服控制
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-11-07 DOI: 10.1109/LRA.2024.3494655
Lev Smolentsev;Alexandre Krupa;François Chaumette
In this letter, we propose a shape visual servoing approach for manipulating a suspended cable attached between two quadrotor drones. A leader-follower control strategy is presented, where a human operator controls the rigid motion of the cable by teleoperating one drone (the leader), while the second drone (the follower) performs a shape visual servoing task to autonomously apply a desired deformation to the cable. The proposed shape visual servoing approach uses an RGB-D camera embedded on the follower drone and has the advantage to rely on a simple geometrical model of the cable that only requires the knowledge of its length. In the same time, our control strategy maintains the best visibility of the cable in the camera field of view. A robust image processing pipeline allows detecting and tracking in real-time the cable shape from the data provided by the onboard RGB-D camera. Experimental results demonstrate the effectiveness of the proposed visual control approach to shape a flexible cable into a desired shape. In addition, we demonstrate experimentally that such system can be used to perform an aerial transport task by grasping with the cable an object fitted with a hook, then moving and releasing it at another location.
在这封信中,我们提出了一种形状视觉伺服方法,用于操纵连接在两架四旋翼无人机之间的悬挂缆绳。我们提出了一种 "领导者-追随者 "控制策略,即人类操作员通过遥控一架无人机(领导者)来控制缆绳的刚性运动,而第二架无人机(追随者)则执行形状视觉伺服任务,自主地对缆绳施加所需的变形。所提出的形状视觉伺服方法使用了嵌入在跟随者无人机上的 RGB-D 摄像机,其优势在于可以依赖简单的缆线几何模型,只需了解缆线的长度即可。同时,我们的控制策略还能在相机视场中保持电缆的最佳可见度。通过强大的图像处理管道,可以从机载 RGB-D 摄像机提供的数据中实时检测和跟踪电缆的形状。实验结果表明,所提出的视觉控制方法能有效地将柔性电缆塑造成所需的形状。此外,我们还通过实验证明,这种系统可用于执行空中运输任务,即用缆绳抓住一个装有钩子的物体,然后将其移动并释放到另一个位置。
{"title":"Shape Visual Servoing of a Cable Suspended Between Two Drones","authors":"Lev Smolentsev;Alexandre Krupa;François Chaumette","doi":"10.1109/LRA.2024.3494655","DOIUrl":"https://doi.org/10.1109/LRA.2024.3494655","url":null,"abstract":"In this letter, we propose a shape visual servoing approach for manipulating a suspended cable attached between two quadrotor drones. A leader-follower control strategy is presented, where a human operator controls the rigid motion of the cable by teleoperating one drone (the leader), while the second drone (the follower) performs a shape visual servoing task to autonomously apply a desired deformation to the cable. The proposed shape visual servoing approach uses an RGB-D camera embedded on the follower drone and has the advantage to rely on a simple geometrical model of the cable that only requires the knowledge of its length. In the same time, our control strategy maintains the best visibility of the cable in the camera field of view. A robust image processing pipeline allows detecting and tracking in real-time the cable shape from the data provided by the onboard RGB-D camera. Experimental results demonstrate the effectiveness of the proposed visual control approach to shape a flexible cable into a desired shape. In addition, we demonstrate experimentally that such system can be used to perform an aerial transport task by grasping with the cable an object fitted with a hook, then moving and releasing it at another location.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11473-11480"},"PeriodicalIF":4.6,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142636274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Bifurcation and Environmental Adaptability of a Novel Line-Symmetric Double-Centered Metamorphic Mechanism
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-11-07 DOI: 10.1109/LRA.2024.3494662
Song Lin;Jiansheng Dai;Yifeng Song;Hongguang Wang;Bingbing Yuan;Yingbin Feng
To improve the environmental adaptability of the mechanism, this letter proposes a metamorphic mechanism configuration synthesis method based on environmental constraint characteristics. In particular, a novel 6R line-symmetric double-centered metamorphic mechanism is synthesized and the multi-bifurcation characteristics and environmental adaptability are analyzed and verified. Firstly, a configuration synthesis method is proposed based on the bifocal geometric constraints of the elliptical oblique section of the pipe. The variation of the instantaneous mobility with the number of links is derived and m=n=3 is the minimum number of links that the mechanism is movable. Then, for the 6R mechanism, the joint velocity solution spaces of all the motion branches are revealed through the closed-loop and higher-order kinematics. Further, the correlation between the joint angle, configuration, and environment layers is demonstrated. The results show that the 6R mechanism has four motion branches (MB$_{1}$, MB$_{2}$, MB$_{3}$, and MB$_{4}$) with different topological structures corresponding to different pipe environments and four 4R serial motion branches (SMB$_{1}$, SMB$_{2}$, SMB$_{3}$, and SMB$_{4}$). Eventually, a wheeled metamorphic robot is designed, and the controllable motion branch transformation of the 6R mechanism and the pipe environment adaptability of the robot are proved by prototype experiments.
{"title":"Multi-Bifurcation and Environmental Adaptability of a Novel Line-Symmetric Double-Centered Metamorphic Mechanism","authors":"Song Lin;Jiansheng Dai;Yifeng Song;Hongguang Wang;Bingbing Yuan;Yingbin Feng","doi":"10.1109/LRA.2024.3494662","DOIUrl":"https://doi.org/10.1109/LRA.2024.3494662","url":null,"abstract":"To improve the environmental adaptability of the mechanism, this letter proposes a metamorphic mechanism configuration synthesis method based on environmental constraint characteristics. In particular, a novel 6R line-symmetric double-centered metamorphic mechanism is synthesized and the multi-bifurcation characteristics and environmental adaptability are analyzed and verified. Firstly, a configuration synthesis method is proposed based on the bifocal geometric constraints of the elliptical oblique section of the pipe. The variation of the instantaneous mobility with the number of links is derived and \u0000<italic>m</i>\u0000=\u0000<italic>n</i>\u0000=3 is the minimum number of links that the mechanism is movable. Then, for the 6R mechanism, the joint velocity solution spaces of all the motion branches are revealed through the closed-loop and higher-order kinematics. Further, the correlation between the joint angle, configuration, and environment layers is demonstrated. The results show that the 6R mechanism has four motion branches (\u0000<italic>MB</i>\u0000<inline-formula><tex-math>$_{1}$</tex-math></inline-formula>\u0000, \u0000<italic>MB</i>\u0000<inline-formula><tex-math>$_{2}$</tex-math></inline-formula>\u0000, \u0000<italic>MB</i>\u0000<inline-formula><tex-math>$_{3}$</tex-math></inline-formula>\u0000, and \u0000<italic>MB</i>\u0000<inline-formula><tex-math>$_{4}$</tex-math></inline-formula>\u0000) with different topological structures corresponding to different pipe environments and four 4R serial motion branches (\u0000<italic>SMB</i>\u0000<inline-formula><tex-math>$_{1}$</tex-math></inline-formula>\u0000, \u0000<italic>SMB</i>\u0000<inline-formula><tex-math>$_{2}$</tex-math></inline-formula>\u0000, \u0000<italic>SMB</i>\u0000<inline-formula><tex-math>$_{3}$</tex-math></inline-formula>\u0000, and \u0000<italic>SMB</i>\u0000<inline-formula><tex-math>$_{4}$</tex-math></inline-formula>\u0000). Eventually, a wheeled metamorphic robot is designed, and the controllable motion branch transformation of the 6R mechanism and the pipe environment adaptability of the robot are proved by prototype experiments.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11746-11753"},"PeriodicalIF":4.6,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142691699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Dynamic Calibration Framework for the Event-Frame Stereo Camera System 事件帧立体相机系统的动态校准框架
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-11-04 DOI: 10.1109/LRA.2024.3491426
Rui Hu;Jürgen Kogler;Margrit Gelautz;Min Lin;Yuanqing Xia
The fusion of event cameras and conventional frame cameras is a novel research field, and a stereo structure consisting of an event camera and a frame camera can incorporate the advantages of both. This letter develops a dynamic calibration framework for the event-frame stereo camera system. In this framework, the first step is to complete the initial detection on a circle-grid calibration pattern, and a sliding-window time matching method is proposed to match the event-frame pairs. Then, a refining method is devised for two cameras to get the accurate information of the pattern. Particularly, for the event camera, a patch-size motion compensation method with high computational efficiency is designed to achieve time synchronization for two cameras and fit circles in an image of warped events. Finally, the pose between two cameras is globally optimized by constructing a pose-landmark graph with two types of edges. The proposed calibration framework has the advantages of high real-time performance and easy deployment, and its effectiveness is verified by experiments based on self-recorded datasets.
事件相机与传统框架相机的融合是一个新的研究领域,而由事件相机和框架相机组成的立体结构可以兼具两者的优点。本文为事件帧立体相机系统开发了一个动态校准框架。在该框架中,第一步是在圆网格校准模式上完成初始检测,并提出一种滑动窗口时间匹配方法来匹配事件帧对。然后,针对两台摄像机设计了一种细化方法,以获得图案的准确信息。特别是针对事件摄像机,设计了一种具有高计算效率的补丁大小运动补偿方法,以实现两台摄像机的时间同步,并在翘曲事件图像中拟合圆形。最后,通过构建具有两类边的姿态地标图,对两台摄像机之间的姿态进行全局优化。所提出的校准框架具有实时性高、易于部署等优点,其有效性已通过基于自记录数据集的实验得到验证。
{"title":"A Dynamic Calibration Framework for the Event-Frame Stereo Camera System","authors":"Rui Hu;Jürgen Kogler;Margrit Gelautz;Min Lin;Yuanqing Xia","doi":"10.1109/LRA.2024.3491426","DOIUrl":"https://doi.org/10.1109/LRA.2024.3491426","url":null,"abstract":"The fusion of event cameras and conventional frame cameras is a novel research field, and a stereo structure consisting of an event camera and a frame camera can incorporate the advantages of both. This letter develops a dynamic calibration framework for the event-frame stereo camera system. In this framework, the first step is to complete the initial detection on a circle-grid calibration pattern, and a sliding-window time matching method is proposed to match the event-frame pairs. Then, a refining method is devised for two cameras to get the accurate information of the pattern. Particularly, for the event camera, a patch-size motion compensation method with high computational efficiency is designed to achieve time synchronization for two cameras and fit circles in an image of warped events. Finally, the pose between two cameras is globally optimized by constructing a pose-landmark graph with two types of edges. The proposed calibration framework has the advantages of high real-time performance and easy deployment, and its effectiveness is verified by experiments based on self-recorded datasets.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11465-11472"},"PeriodicalIF":4.6,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142636428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Minimal Impact Pokes to Place Objects on Planar Surfaces 将物体放置在平面上的最小冲击戳
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-11-04 DOI: 10.1109/LRA.2024.3491412
Ahmed Zermane;Léo Moussafir;Youcan Yan;Abderrahmane Kheddar
We present a planning and control method that computes a minimal sequence of pokes to slide a given object from an initial pose to a desired final one (or as close to it as possible) on a planar surface. Both planning and control are based on impact models to generate pokes. Our framework takes into account the object's dynamics with a rich contact model and parameters to plan the poking sequence. The planning is conducted in the joint-space and generates trajectories tracked using an impact-aware QP control, which corrects for post-pokes errors using discrete visual feedback. We implemented our method on a Panda robot arm and assessed its versatility and robustness. The experimental results show that the proposed poking approach can bring the object to the desired position and orientation with minimal errors (0.05 m for translation and 0.2 rd for rotation), highlighting its potential application in diverse industrial scenarios such as logistics.
我们提出了一种规划和控制方法,它能计算出一个最小的戳序列,使给定物体在平面上从初始姿态滑动到所需的最终姿态(或尽可能接近最终姿态)。规划和控制均基于冲击模型来生成戳。我们的框架通过丰富的接触模型和参数考虑了物体的动态,以规划戳动序列。规划在接合空间中进行,生成的轨迹使用冲击感知 QP 控制跟踪,该控制使用离散视觉反馈纠正戳后误差。我们在熊猫机械臂上实施了我们的方法,并对其多功能性和鲁棒性进行了评估。实验结果表明,所提出的戳击方法能以最小的误差(平移误差为 0.05 米,旋转误差为 0.2 rd)将物体带到所需的位置和方位,突出了其在物流等各种工业场景中的应用潜力。
{"title":"Minimal Impact Pokes to Place Objects on Planar Surfaces","authors":"Ahmed Zermane;Léo Moussafir;Youcan Yan;Abderrahmane Kheddar","doi":"10.1109/LRA.2024.3491412","DOIUrl":"https://doi.org/10.1109/LRA.2024.3491412","url":null,"abstract":"We present a planning and control method that computes a minimal sequence of pokes to slide a given object from an initial pose to a desired final one (or as close to it as possible) on a planar surface. Both planning and control are based on impact models to generate pokes. Our framework takes into account the object's dynamics with a rich contact model and parameters to plan the poking sequence. The planning is conducted in the joint-space and generates trajectories tracked using an impact-aware QP control, which corrects for post-pokes errors using discrete visual feedback. We implemented our method on a Panda robot arm and assessed its versatility and robustness. The experimental results show that the proposed poking approach can bring the object to the desired position and orientation with minimal errors (0.05 m for translation and 0.2 rd for rotation), highlighting its potential application in diverse industrial scenarios such as logistics.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11393-11400"},"PeriodicalIF":4.6,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142636430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Benchmark Dataset for Collaborative SLAM in Service Environments 服务环境中协作式 SLAM 的基准数据集
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-11-04 DOI: 10.1109/LRA.2024.3491415
Harin Park;Inha Lee;Minje Kim;Hyungyu Park;Kyungdon Joo
We introduce a new multi-modal collaborative SLAM (C-SLAM) dataset for multiple service robots in various indoor service environments, called C-SLAM dataset in Service Environments (CSE). We use the NVIDIA Isaac Sim to generate data in various indoor service environments with the challenges that may occur in real-world service environments. By using the simulator, we can provide precisely time-synchronized sensor data, such as stereo RGB/depth, IMU, and ground truth (GT) poses. We configure three common indoor service environments (Hospital, Office, and Warehouse), each featuring dynamic objects performing motions suited to the environment. In addition, we drive the robots to mimic the actions of real service robots. Through these factors, we generate a realistic C-SLAM dataset for multiple service robots. We demonstrate our CSE dataset by evaluating diverse state-of-the-art single-robot SLAM and multi-robot SLAM methods. Additionally, we provide a detailed tutorial on generating C-SLAM data using the simulator.
我们介绍了一种新的多模式协作 SLAM(C-SLAM)数据集,用于在各种室内服务环境中使用多个服务机器人,称为服务环境中的 C-SLAM 数据集(CSE)。我们使用英伟达 Isaac Sim 在各种室内服务环境中生成数据,以应对真实世界服务环境中可能出现的挑战。通过使用模拟器,我们可以提供精确的时间同步传感器数据,如立体 RGB/深度、IMU 和地面实况(GT)姿势。我们配置了三种常见的室内服务环境(医院、办公室和仓库),每种环境中的动态物体都会做出与环境相适应的动作。此外,我们还模仿真实服务机器人的动作来驱动机器人。通过这些因素,我们为多个服务机器人生成了一个逼真的 C-SLAM 数据集。我们通过评估各种最先进的单机器人 SLAM 和多机器人 SLAM 方法来展示我们的 CSE 数据集。此外,我们还提供了使用模拟器生成 C-SLAM 数据的详细教程。
{"title":"A Benchmark Dataset for Collaborative SLAM in Service Environments","authors":"Harin Park;Inha Lee;Minje Kim;Hyungyu Park;Kyungdon Joo","doi":"10.1109/LRA.2024.3491415","DOIUrl":"https://doi.org/10.1109/LRA.2024.3491415","url":null,"abstract":"We introduce a new multi-modal collaborative SLAM (C-SLAM) dataset for multiple service robots in various indoor service environments, called \u0000<monospace>C</monospace>\u0000-SLAM dataset in \u0000<monospace>S</monospace>\u0000ervice \u0000<monospace>E</monospace>\u0000nvironments (\u0000<monospace>CSE</monospace>\u0000). We use the NVIDIA Isaac Sim to generate data in various indoor service environments with the challenges that may occur in real-world service environments. By using the simulator, we can provide precisely time-synchronized sensor data, such as stereo RGB/depth, IMU, and ground truth (GT) poses. We configure three common indoor service environments (\u0000<italic>Hospital</i>\u0000, \u0000<italic>Office</i>\u0000, and \u0000<italic>Warehouse</i>\u0000), each featuring dynamic objects performing motions suited to the environment. In addition, we drive the robots to mimic the actions of real service robots. Through these factors, we generate a realistic C-SLAM dataset for multiple service robots. We demonstrate our \u0000<monospace>CSE</monospace>\u0000 dataset by evaluating diverse state-of-the-art single-robot SLAM and multi-robot SLAM methods. Additionally, we provide a detailed tutorial on generating C-SLAM data using the simulator.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11337-11344"},"PeriodicalIF":4.6,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DLO Perceiver: Grounding Large Language Model for Deformable Linear Objects Perception DLO 感知器:为可变形线性物体感知建立大型语言模型
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-11-04 DOI: 10.1109/LRA.2024.3491428
Alessio Caporali;Kevin Galassi;Gianluca Palli
The perception of Deformable Linear Objects (DLOs) is a challenging task due to their complex and ambiguous appearance, lack of discernible features, typically small sizes, and deformability. Despite these challenges, achieving a robust and effective segmentation of DLOs is crucial to introduce robots into environments where they are currently underrepresented, such as domestic and complex industrial settings. In this context, the integration of language-based inputs can simplify the perception task while also enabling the possibility of introducing robots as human companions. Therefore, this letter proposes a novel architecture for the perception of DLOs, wherein the input image is augmented with a text-based prompt guiding the segmentation of the target DLO. After encoding the image and text separately, a Perceiver-inspired structure is exploited to compress the concatenated data into transformer layers and generate the output mask from a latent vector representation. The method is experimentally evaluated on real-world images of DLOs like electrical cables and ropes, validating its efficacy and efficiency in real practical scenarios.
由于可变形线性物体(DLOs)的外观复杂而模糊,缺乏可辨认的特征,通常尺寸较小,而且容易变形,因此对它们的感知是一项具有挑战性的任务。尽管存在这些挑战,但实现对 DLO 的稳健而有效的分割,对于将机器人引入目前代表不足的环境(如家庭和复杂的工业环境)至关重要。在这种情况下,整合基于语言的输入可简化感知任务,同时也为引入机器人作为人类伴侣提供了可能。因此,本文提出了一种新颖的 DLO 感知架构,即在输入图像的基础上添加文字提示,引导对目标 DLO 进行分割。在分别对图像和文本进行编码后,受感知器启发的结构被用来将串联数据压缩到转换层中,并从潜向量表示法生成输出掩码。该方法在真实世界的电缆和绳索等 DLO 图像上进行了实验评估,验证了其在实际应用场景中的功效和效率。
{"title":"DLO Perceiver: Grounding Large Language Model for Deformable Linear Objects Perception","authors":"Alessio Caporali;Kevin Galassi;Gianluca Palli","doi":"10.1109/LRA.2024.3491428","DOIUrl":"https://doi.org/10.1109/LRA.2024.3491428","url":null,"abstract":"The perception of Deformable Linear Objects (DLOs) is a challenging task due to their complex and ambiguous appearance, lack of discernible features, typically small sizes, and deformability. Despite these challenges, achieving a robust and effective segmentation of DLOs is crucial to introduce robots into environments where they are currently underrepresented, such as domestic and complex industrial settings. In this context, the integration of language-based inputs can simplify the perception task while also enabling the possibility of introducing robots as human companions. Therefore, this letter proposes a novel architecture for the perception of DLOs, wherein the input image is augmented with a text-based prompt guiding the segmentation of the target DLO. After encoding the image and text separately, a Perceiver-inspired structure is exploited to compress the concatenated data into transformer layers and generate the output mask from a latent vector representation. The method is experimentally evaluated on real-world images of DLOs like electrical cables and ropes, validating its efficacy and efficiency in real practical scenarios.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11385-11392"},"PeriodicalIF":4.6,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10742556","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142636275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Proximal Control of UAVs With Federated Learning for Human-Robot Collaborative Domains 利用联合学习对无人飞行器进行近端控制,实现人机协作领域
IF 4.6 2区 计算机科学 Q2 ROBOTICS Pub Date : 2024-11-04 DOI: 10.1109/LRA.2024.3491417
Lucas Nogueira Nobrega;Ewerton de Oliveira;Martin Saska;Tiago Nascimento
The human-robot interaction (HRI) is a growing area of research. In HRI, complex command (action) classification is still an open problem that usually prevents the real applicability of such a technique. The literature presents some works that use neural networks to detect these actions. However, occlusion is still a major issue in HRI, especially when using uncrewed aerial vehicles (UAVs), since, during the robot's movement, the human operator is often out of the robot's field of view. Furthermore, in multi-robot scenarios, distributed training is also an open problem. In this sense, this work proposes an action recognition and control approach based on Long Short-Term Memory (LSTM) Deep Neural Networks with two layers in association with three densely connected layers and Federated Learning (FL) embedded in multiple drones. The FL enabled our approach to be trained in a distributed fashion, i.e., access to data without the need for cloud or other repositories, which facilitates the multi-robot system's learning. Furthermore, our multi-robot approach results also prevented occlusion situations, with experiments with real robots achieving an accuracy greater than 96%.
人机交互(HRI)是一个不断发展的研究领域。在人机交互领域,复杂指令(动作)分类仍是一个未决问题,这通常会阻碍此类技术的真正应用。文献中介绍了一些使用神经网络来检测这些动作的作品。然而,遮挡仍然是 HRI 中的一个主要问题,尤其是在使用无人驾驶飞行器(UAV)时,因为在机器人移动过程中,人类操作员往往不在机器人的视野范围内。此外,在多机器人场景中,分布式训练也是一个有待解决的问题。从这个意义上说,这项工作提出了一种基于长短期记忆(LSTM)深度神经网络的动作识别和控制方法,该网络有两层,三层密集连接,并嵌入了多无人机的联合学习(FL)。FL使我们的方法能够以分布式方式进行训练,即无需云或其他存储库即可访问数据,从而促进了多机器人系统的学习。此外,我们的多机器人方法还能防止出现闭塞情况,真实机器人实验的准确率超过 96%。
{"title":"Proximal Control of UAVs With Federated Learning for Human-Robot Collaborative Domains","authors":"Lucas Nogueira Nobrega;Ewerton de Oliveira;Martin Saska;Tiago Nascimento","doi":"10.1109/LRA.2024.3491417","DOIUrl":"https://doi.org/10.1109/LRA.2024.3491417","url":null,"abstract":"The human-robot interaction (HRI) is a growing area of research. In HRI, complex command (action) classification is still an open problem that usually prevents the real applicability of such a technique. The literature presents some works that use neural networks to detect these actions. However, occlusion is still a major issue in HRI, especially when using uncrewed aerial vehicles (UAVs), since, during the robot's movement, the human operator is often out of the robot's field of view. Furthermore, in multi-robot scenarios, distributed training is also an open problem. In this sense, this work proposes an action recognition and control approach based on Long Short-Term Memory (LSTM) Deep Neural Networks with two layers in association with three densely connected layers and Federated Learning (FL) embedded in multiple drones. The FL enabled our approach to be trained in a distributed fashion, i.e., access to data without the need for cloud or other repositories, which facilitates the multi-robot system's learning. Furthermore, our multi-robot approach results also prevented occlusion situations, with experiments with real robots achieving an accuracy greater than 96%.","PeriodicalId":13241,"journal":{"name":"IEEE Robotics and Automation Letters","volume":"9 12","pages":"11305-11312"},"PeriodicalIF":4.6,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142645395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Robotics and Automation Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1