Novel vision-LiDAR fusion framework for human action recognition based on dynamic lateral connection
Fei Yan, Guangyao Jin, Zheng Mu, Shouxing Zhang, Yinghao Cai, Tao Lu, Yan Zhuang
Over the past decades, substantial progress has been made in human action recognition. However, most existing studies and datasets rely on still images or videos as the primary modality, and image-based approaches are easily affected by adverse environmental conditions. In this paper, the authors propose combining RGB images with point clouds from LiDAR sensors for human action recognition. A dynamic lateral convolutional network (DLCN) is proposed to fuse features from the two modalities: the RGB features and the geometric information from the point clouds interact closely within the DLCN and complement each other for action recognition. Experimental results on the JRDB-Act dataset demonstrate that the proposed DLCN outperforms state-of-the-art approaches to human action recognition. The authors further show the potential of the DLCN in various complex scenarios, which is highly valuable for real-world applications.
{"title":"Novel vision-LiDAR fusion framework for human action recognition based on dynamic lateral connection","authors":"Fei Yan, Guangyao Jin, Zheng Mu, Shouxing Zhang, Yinghao Cai, Tao Lu, Yan Zhuang","doi":"10.1049/csy2.70005","DOIUrl":"https://doi.org/10.1049/csy2.70005","url":null,"abstract":"<p>In the past decades, substantial progress has been made in human action recognition. However, most existing studies and datasets for human action recognition utilise still images or videos as the primary modality. Image-based approaches can be easily impacted by adverse environmental conditions. In this paper, the authors propose combining RGB images and point clouds from LiDAR sensors for human action recognition. A dynamic lateral convolutional network (DLCN) is proposed to fuse features from multi-modalities. The RGB features and the geometric information from the point clouds closely interact with each other in the DLCN, which is complementary in action recognition. The experimental results on the JRDB-Act dataset demonstrate that the proposed DLCN outperforms the state-of-the-art approaches of human action recognition. The authors show the potential of the proposed DLCN in various complex scenarios, which is highly valuable in real-world applications.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":"6 4","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.70005","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143121447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Big2Small: Learning from masked image modelling with heterogeneous self-supervised knowledge distillation
Ziming Wang, Shumin Han, Xiaodi Wang, Jing Hao, Xianbin Cao, Baochang Zhang
Small convolutional neural network (CNN)-based models usually require knowledge transferred from a large model before they are deployed on computationally resource-limited edge devices. Masked image modelling (MIM) methods have achieved great success in various visual tasks but remain largely unexplored in knowledge distillation for heterogeneous deep models, mainly because of the significant discrepancy between the transformer-based large model and the CNN-based small network. In this paper, the authors develop the first heterogeneous self-supervised knowledge distillation (HSKD) method based on MIM, which efficiently transfers knowledge from large transformer models to small CNN-based models in a self-supervised fashion. The method builds a bridge between transformer-based models and CNNs by training a UNet-style student with sparse convolution that effectively mimics the visual representations inferred by the teacher under masked modelling. It is a simple yet effective learning paradigm for learning the visual representation and data distribution from heterogeneous teacher models, which can be pre-trained using advanced self-supervised methods. Extensive experiments show that the approach adapts well to various models and sizes, consistently achieving state-of-the-art performance in image classification, object detection, and semantic segmentation tasks. For example, on the ImageNet-1K dataset, HSKD improves the accuracy of ResNet-50 (sparse) from 76.98% to 80.01%.
{"title":"Big2Small: Learning from masked image modelling with heterogeneous self-supervised knowledge distillation","authors":"Ziming Wang, Shumin Han, Xiaodi Wang, Jing Hao, Xianbin Cao, Baochang Zhang","doi":"10.1049/csy2.70002","DOIUrl":"https://doi.org/10.1049/csy2.70002","url":null,"abstract":"<p>Small convolutional neural network (CNN)-based models usually require transferring knowledge from a large model before they are deployed in computationally resource-limited edge devices. Masked image modelling (MIM) methods achieve great success in various visual tasks but remain largely unexplored in knowledge distillation for heterogeneous deep models. The reason is mainly due to the significant discrepancy between the transformer-based large model and the CNN-based small network. In this paper, the authors develop the first heterogeneous self-supervised knowledge distillation (HSKD) based on MIM, which can efficiently transfer knowledge from large transformer models to small CNN-based models in a self-supervised fashion. Our method builds a bridge between transformer-based models and CNNs by training a UNet-style student with sparse convolution, which can effectively mimic the visual representation inferred by a teacher over masked modelling. Our method is a simple yet effective learning paradigm to learn the visual representation and distribution of data from heterogeneous teacher models, which can be pre-trained using advanced self-supervised methods. Extensive experiments show that it adapts well to various models and sizes, consistently achieving state-of-the-art performance in image classification, object detection, and semantic segmentation tasks. For example, in the Imagenet 1K dataset, HSKD improves the accuracy of Resnet-50 (sparse) from 76.98% to 80.01%.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":"6 4","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.70002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143121450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Automatic feature-based markerless calibration and navigation method for augmented reality assisted dental treatment
Faizan Ahmad, Jing Xiong, Zeyang Xia
Augmented reality (AR) is gaining traction in the field of computer-assisted treatment (CAT). Head-mounted display (HMD)-based AR in CAT provides dentists with enhanced visualisation by directly overlaying a three-dimensional (3D) model on the real patient during dental treatment. However, conventional AR-based treatments rely on optical markers and trackers, which makes them tedious, expensive, and uncomfortable for dentists. A markerless image-to-patient tracking system is therefore necessary to overcome these challenges and enhance system efficiency. This paper proposes a novel feature-based markerless calibration and navigation method for an HMD-based AR visualisation system. The authors address three sub-challenges: firstly, synthetic RGB-D data are generated to train a deep convolutional neural network (DCNN) for anatomical landmark detection; secondly, the HMD is automatically calibrated using the detected anatomical landmarks, eliminating the need for user input or optical trackers; and thirdly, a multi-iterative closest point (multi-ICP) algorithm is developed for effective real-time 3D-3D navigation. The authors conduct several experiments on a commercially available HMD (HoloLens 2) and compare the approach against state-of-the-art methods that employ the HoloLens. The proposed method achieves a calibration virtual-to-real re-projection distance of (1.09 ± 0.23) mm and navigation projection errors and accuracies of approximately (0.53 ± 0.19) mm and 93.87%, respectively.
{"title":"Automatic feature-based markerless calibration and navigation method for augmented reality assisted dental treatment","authors":"Faizan Ahmad, Jing Xiong, Zeyang Xia","doi":"10.1049/csy2.70003","DOIUrl":"https://doi.org/10.1049/csy2.70003","url":null,"abstract":"<p>Augmented reality (AR) is gaining traction in the field of computer-assisted treatment (CAT). Head-mounted display (HMD)-based AR in CAT provides dentists with enhanced visualisation by directly overlaying a three-dimensional (3D) model on a real patient during dental treatment. However, conventional AR-based treatments rely on optical markers and trackers, which makes them tedious, expensive, and uncomfortable for dentists. Therefore, a markerless image-to-patient tracking system is necessary to overcome these challenges and enhance system efficiency. This paper proposes a novel feature-based markerless calibration and navigation method for an HMD-based AR visualisation system. The authors address three sub-challenges: firstly, synthetic RGB-D data for anatomical landmark detection is generated to train a deep convolutional neural network (DCNN); secondly, the HMD is automatically calibrated using detected anatomical landmarks, eliminating the need for user input or optical trackers; and thirdly, a multi-iterative closest point (ICP) algorithm is developed for effective 3D-3D real-time navigation. The authors conduct several experiments on a commercially available HMD (HoloLens 2). Finally, the authors compare and evaluate the approach against state-of-the-art methods that employ HoloLens. The proposed method achieves a calibration virtual-to-real re-projection distance of (1.09 ± 0.23) mm and navigation projection errors and accuracies of approximately (0.53 ± 0.19) mm and 93.87%, respectively.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":"6 4","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.70003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143121449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Enhancing stability and safety: A novel multi-constraint model predictive control approach for forklift trajectory
Yizhen Sun, Junyou Yang, Donghui Zhao, Moses Chukwuka Okonkwo, Jianmin Zhang, Shuoyu Wang, Yang Liu
Advancements in intelligent manufacturing have made high-precision trajectory tracking crucial for improving the efficiency and safety of in-factory cargo transportation. This study addresses the limitations of current forklift navigation systems in trajectory control accuracy and stability by proposing the Enhanced Stability and Safety Model Predictive Control (ESS-MPC) method, which incorporates a multi-constraint strategy for improved stability and safety. The kinematic model of a forklift with a single steered front wheel is constructed with all state quantities known, including the steering angle, resulting in a more accurate model description and trajectory prediction. To ensure vehicle safety, the spatial safety boundary obtained from the trajectory planning module is imposed as a hard constraint on ESS-MPC tracking, and the optimisation constraints are further updated with the key kinematic and dynamic parameters of the forklift. In experimental validation in both simulated and real-world environments, ESS-MPC improved position accuracy, pose accuracy, and stability by 57.93%, 37.83%, and 57.51%, respectively. This study provides significant support for the development of autonomous navigation systems for industrial forklifts.
{"title":"Enhancing stability and safety: A novel multi-constraint model predictive control approach for forklift trajectory","authors":"Yizhen Sun, Junyou Yang, Donghui Zhao, Moses Chukwuka Okonkwo, Jianmin Zhang, Shuoyu Wang, Yang Liu","doi":"10.1049/csy2.70004","DOIUrl":"https://doi.org/10.1049/csy2.70004","url":null,"abstract":"<p>The advancements in intelligent manufacturing have made high-precision trajectory tracking technology crucial for improving the efficiency and safety of in-factory cargo transportation. This study addresses the limitations of current forklift navigation systems in trajectory control accuracy and stability by proposing the Enhanced Stability and Safety Model Predictive Control (ESS-MPC) method. This approach includes a multi-constraint strategy for improved stability and safety. The kinematic model for a single front steering-wheel forklift vehicle is constructed with all known state quantities, including the steering angle, resulting in a more accurate model description and trajectory prediction. To ensure vehicle safety, the spatial safety boundary obtained from the trajectory planning module is established as a hard constraint for ESS-MPC tracking. The optimisation constraints are also updated with the key kinematic and dynamic parameters of the forklift. The ESS-MPC method improved the position and pose accuracy and stability by 57.93%, 37.83%, and 57.51%, respectively, as demonstrated through experimental validation using simulation and real-world environments. This study provides significant support for the development of autonomous navigation systems for industrial forklifts.</p>","PeriodicalId":34110,"journal":{"name":"IET Cybersystems and Robotics","volume":"6 4","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/csy2.70004","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143121448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}