{"title":"Adjusting Distributed Cameras for Robust Moving Object Pose Estimation","authors":"Yaoqing Hu;Shaoan Wang;Dongyue Li;Xingyu Chen;Mingzhu Zhu;Zhanhua Xin;Junzhi Yu","doi":"10.1109/TASE.2025.3527006","DOIUrl":null,"url":null,"abstract":"Robust moving object pose estimation is crucial in fine manipulation tasks, such as surgical instrument tracking. This paper presents a distributed-camera system with robotic adjustments to maintain consistent tracking of moving objects, thus avoiding tracking failures. An integrated framework for camera adjustment and pose estimation is developed for this distributed-camera system. In each detection cycle, the camera exhibiting the largest deviation with the object is adjusted by a visual servoing technique. After adjustment, the camera extrinsics are re-calibrated in the following detection cycles. For the unadjusted cameras, an online extrinsic optimization method based on multi-frame detection results is proposed to refine the camera extrinsics. Based on the refined camera extrinsics and detection results from multiple cameras, the pose of moving objects relative to the principal camera can be robustly estimated. We test the performance of this system in both simulation environments and real-world scenarios. The results indicate that our system achieves higher pose estimation accuracy and exhibits strong resistance to limited field-of-view (FoV) compared to conventional equivalent fixed multi-camera systems. Note to Practitioners—The motivation of this work is to tackle the challenge of FoV limitation during moving object pose estimation. Most optical tracking systems address this issue through the fusion of multiple sensors. However, their optical tracking devices are typically fixedly installed, which can lead to failures in pose estimation when objects move out of the device’s FoV. To expand the tracking scopes while simultaneously ensuring the pose estimation accuracy, we proposed to apply narrow-FoV cameras of which the locations can be adjusted by gimbals to track moving objects, keeping them at the center of the FoV. In this way, these cameras can always detect points with a wider distribution at the center of the FoV, which is conducive to improving detection accuracy. Our proposed system achieves a significant resistance towards FoV limitation during the pose estimation process of objects in large-scale motion.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"10650-10659"},"PeriodicalIF":6.4000,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10833659/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Robust moving object pose estimation is crucial in fine manipulation tasks, such as surgical instrument tracking. This paper presents a distributed-camera system with robotic adjustments to maintain consistent tracking of moving objects, thus avoiding tracking failures. An integrated framework for camera adjustment and pose estimation is developed for this distributed-camera system. In each detection cycle, the camera exhibiting the largest deviation with the object is adjusted by a visual servoing technique. After adjustment, the camera extrinsics are re-calibrated in the following detection cycles. For the unadjusted cameras, an online extrinsic optimization method based on multi-frame detection results is proposed to refine the camera extrinsics. Based on the refined camera extrinsics and detection results from multiple cameras, the pose of moving objects relative to the principal camera can be robustly estimated. We test the performance of this system in both simulation environments and real-world scenarios. The results indicate that our system achieves higher pose estimation accuracy and exhibits strong resistance to limited field-of-view (FoV) compared to conventional equivalent fixed multi-camera systems. Note to Practitioners—The motivation of this work is to tackle the challenge of FoV limitation during moving object pose estimation. Most optical tracking systems address this issue through the fusion of multiple sensors. However, their optical tracking devices are typically fixedly installed, which can lead to failures in pose estimation when objects move out of the device’s FoV. To expand the tracking scopes while simultaneously ensuring the pose estimation accuracy, we proposed to apply narrow-FoV cameras of which the locations can be adjusted by gimbals to track moving objects, keeping them at the center of the FoV. In this way, these cameras can always detect points with a wider distribution at the center of the FoV, which is conducive to improving detection accuracy. Our proposed system achieves a significant resistance towards FoV limitation during the pose estimation process of objects in large-scale motion.
期刊介绍:
The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.