首页 > 最新文献

Robotic Intelligence and Automation最新文献

英文 中文
A transfer alignment algorithm based on combined double-time observation of velocity and attitude 基于速度和姿态双时间联合观测的传递对准算法
Pub Date : 2022-07-13 DOI: 10.1108/aa-03-2022-0048
Guangrun Sheng, Xixiang Liu, Zixuan Wang, Wenhao Pu, Xiaoqiang Wu, Xiaoshuang Ma

Purpose

This paper aims to present a novel transfer alignment method based on combined double-time observations with velocity and attitude for ships’ poor maneuverability to address the system errors introduced by flexural deformation and installing which are difficult to calibrate.

Design/methodology/approach

Based on velocity and attitude matching, redesigning and deducing Kalman filter model by combining double-time observation. By introducing the sampling of the previous update cycle of the strapdown inertial navigation system (SINS), current observation subtracts previous observation are used as measurements for transfer alignment filter, system error in measurement introduced by deformation and installing can be effectively removed.

Findings

The results of simulations and turntable tests show that when there is a system error, the proposed method can improve alignment accuracy, shorten the alignment process and not require any active maneuvers or additional sensor equipment.

Originality/value

Calibrating those deformations and installing errors during transfer alignment need special maneuvers along different axes, which is difficult to fulfill for ships’ poor maneuverability. Without additional sensor equipment and active maneuvers, the system errors in attitude measurement can be eliminated by the proposed algorithms, meanwhile improving the accuracy of the shipboard SINS transfer alignment.

目的针对船舶机动性能差的特点,提出一种基于航速和姿态双时间联合观测的传递对准方法,以解决船舶弯曲变形和安装带来的难以标定的系统误差。基于速度和姿态匹配,结合双时间观测,重新设计并推导卡尔曼滤波模型。通过引入捷联惯导系统前一更新周期的采样,以当前观测值减去前一观测值作为传递对准滤波器的测量值,有效地消除了由变形和安装引起的测量系统误差。仿真和转台试验结果表明,在存在系统误差的情况下,该方法可以提高对准精度,缩短对准过程,且不需要任何主动机动或额外的传感器设备。在传递对准过程中,这些变形和安装误差的校正需要沿着不同的轴线进行特殊的机动,而由于船舶的操纵性较差,这很难实现。该算法在不需要额外传感器设备和主动机动的情况下,消除了系统姿态测量误差,同时提高了舰载捷联惯导传递对准精度。
{"title":"A transfer alignment algorithm based on combined double-time observation of velocity and attitude","authors":"Guangrun Sheng, Xixiang Liu, Zixuan Wang, Wenhao Pu, Xiaoqiang Wu, Xiaoshuang Ma","doi":"10.1108/aa-03-2022-0048","DOIUrl":"https://doi.org/10.1108/aa-03-2022-0048","url":null,"abstract":"<h3>Purpose</h3>\u0000<p>This paper aims to present a novel transfer alignment method based on combined double-time observations with velocity and attitude for ships’ poor maneuverability to address the system errors introduced by flexural deformation and installing which are difficult to calibrate.</p><!--/ Abstract__block -->\u0000<h3>Design/methodology/approach</h3>\u0000<p>Based on velocity and attitude matching, redesigning and deducing Kalman filter model by combining double-time observation. By introducing the sampling of the previous update cycle of the strapdown inertial navigation system (SINS), current observation subtracts previous observation are used as measurements for transfer alignment filter, system error in measurement introduced by deformation and installing can be effectively removed.</p><!--/ Abstract__block -->\u0000<h3>Findings</h3>\u0000<p>The results of simulations and turntable tests show that when there is a system error, the proposed method can improve alignment accuracy, shorten the alignment process and not require any active maneuvers or additional sensor equipment.</p><!--/ Abstract__block -->\u0000<h3>Originality/value</h3>\u0000<p>Calibrating those deformations and installing errors during transfer alignment need special maneuvers along different axes, which is difficult to fulfill for ships’ poor maneuverability. Without additional sensor equipment and active maneuvers, the system errors in attitude measurement can be eliminated by the proposed algorithms, meanwhile improving the accuracy of the shipboard SINS transfer alignment.</p><!--/ Abstract__block -->","PeriodicalId":501194,"journal":{"name":"Robotic Intelligence and Automation","volume":"52 18","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An enhanced sensorless control based on active disturbance rejection controller for a PMSM system: design and hardware implementation 基于自抗扰控制器的永磁同步电机系统增强无传感器控制:设计与硬件实现
Pub Date : 2022-05-26 DOI: 10.1108/aa-01-2022-0016
Hao Lu, Shengquan Li, Bo Feng, Juan Li

Purpose

This paper mainly aims to deal with the problems of uncertainties including modelling errors, unknown dynamics and disturbances caused by load mutation in control of permanent magnet synchronous motor (PMSM).

Design/methodology/approach

This paper proposes an enhanced speed sensorless vector control method based on an active disturbance rejection controller (ADRC) for a PMSM. First, a state space model of the PMSM is obtained for the field orientation control strategy. Second, a sliding mode observer (SMO) based on back electromotive force (EMF) is introduced to replace the encode to estimate the rotor flux position angle and speed. Third, an infinite impulse response (IIR) filter is introduced to eliminate high frequency noise mixed in the output of the sliding mode observer. In addition, a speed control method based on an extended state observer (ESO) is proposed to estimate and compensate for the total disturbances. Finally, an experimental set-up is built to verify the effectiveness and superiority of the proposed ADRC-based control method.

Findings

The comparative experimental results show that the proposed speed sensorless control method with the IIR filter can achieve excellent robustness and speed tracking performance for PMSM system.

Research limitations/implications

An enhanced sensorless control method based on active disturbance rejection controller is designed to realize high precision control of the PMSM; the IIR filter is used to attenuate the chattering problem of traditional SMO; this method simplifies the system and saves the total cost due to the speed sensorless technology.

Practical implications

The use of sensorless can reduce costs and be more beneficial to actual industrial application.

Originality/value

The proposed enhanced speed sensorless vector control method based on an ADRC with the IIR filter enriches the control method of PMSM. It can ameliorate system robustness and achieve excellent speed tracking performance.

目的研究永磁同步电动机控制中存在的建模误差、动力学未知和负载突变引起的扰动等不确定性问题。本文提出了一种基于自抗扰控制器(ADRC)的永磁同步电机无速度传感器矢量控制方法。首先,建立了永磁同步电机磁场定向控制策略的状态空间模型。其次,引入基于反电动势(EMF)的滑模观测器(SMO)代替编码来估计转子磁链位置、角度和速度;第三,引入无限脉冲响应滤波器消除滑模观测器输出中的高频噪声。此外,提出了一种基于扩展状态观测器(ESO)的速度控制方法来估计和补偿总扰动。最后,通过实验验证了该控制方法的有效性和优越性。对比实验结果表明,采用IIR滤波器的无速度传感器控制方法对永磁同步电机系统具有良好的鲁棒性和速度跟踪性能。为实现永磁同步电机的高精度控制,设计了一种基于自抗扰控制器的增强型无传感器控制方法;采用IIR滤波器对传统SMO的抖振问题进行了衰减;由于采用了无速度传感器技术,该方法简化了系统,节省了总成本。实际意义使用无传感器可以降低成本,更有利于实际工业应用。提出了一种基于自抗扰和IIR滤波器的增强型无速度传感器矢量控制方法,丰富了永磁同步电机的控制方法。该方法可以提高系统的鲁棒性,实现良好的速度跟踪性能。
{"title":"An enhanced sensorless control based on active disturbance rejection controller for a PMSM system: design and hardware implementation","authors":"Hao Lu, Shengquan Li, Bo Feng, Juan Li","doi":"10.1108/aa-01-2022-0016","DOIUrl":"https://doi.org/10.1108/aa-01-2022-0016","url":null,"abstract":"<h3>Purpose</h3>\u0000<p>This paper mainly aims to deal with the problems of uncertainties including modelling errors, unknown dynamics and disturbances caused by load mutation in control of permanent magnet synchronous motor (PMSM).</p><!--/ Abstract__block -->\u0000<h3>Design/methodology/approach</h3>\u0000<p>This paper proposes an enhanced speed sensorless vector control method based on an active disturbance rejection controller (ADRC) for a PMSM. First, a state space model of the PMSM is obtained for the field orientation control strategy. Second, a sliding mode observer (SMO) based on back electromotive force (EMF) is introduced to replace the encode to estimate the rotor flux position angle and speed. Third, an infinite impulse response (IIR) filter is introduced to eliminate high frequency noise mixed in the output of the sliding mode observer. In addition, a speed control method based on an extended state observer (ESO) is proposed to estimate and compensate for the total disturbances. Finally, an experimental set-up is built to verify the effectiveness and superiority of the proposed ADRC-based control method.</p><!--/ Abstract__block -->\u0000<h3>Findings</h3>\u0000<p>The comparative experimental results show that the proposed speed sensorless control method with the IIR filter can achieve excellent robustness and speed tracking performance for PMSM system.</p><!--/ Abstract__block -->\u0000<h3>Research limitations/implications</h3>\u0000<p>An enhanced sensorless control method based on active disturbance rejection controller is designed to realize high precision control of the PMSM; the IIR filter is used to attenuate the chattering problem of traditional SMO; this method simplifies the system and saves the total cost due to the speed sensorless technology.</p><!--/ Abstract__block -->\u0000<h3>Practical implications</h3>\u0000<p>The use of sensorless can reduce costs and be more beneficial to actual industrial application.</p><!--/ Abstract__block -->\u0000<h3>Originality/value</h3>\u0000<p>The proposed enhanced speed sensorless vector control method based on an ADRC with the IIR filter enriches the control method of PMSM. It can ameliorate system robustness and achieve excellent speed tracking performance.</p><!--/ Abstract__block -->","PeriodicalId":501194,"journal":{"name":"Robotic Intelligence and Automation","volume":"53 7","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards extreme learning machine framework for lane detection on unmanned mobile robot 无人移动机器人车道检测的极限学习机框架研究
Pub Date : 2022-04-29 DOI: 10.1108/aa-10-2021-0125
Yingpeng Dai, Jiehao Li, Junzheng Wang, Jing Li, Xu Liu

Purpose

This paper aims to focus on lane detection of unmanned mobile robots. For the mobile robot, it is undesirable to spend lots of time detecting the lane. So quickly detecting the lane in a complex environment such as poor illumination and shadows becomes a challenge.

Design/methodology/approach

A new learning framework based on an integration of extreme learning machine (ELM) and an inception structure named multiscale ELM is proposed, making full use of the advantages that ELM has faster convergence and convolutional neural network could extract local features in different scales. The proposed architecture is divided into two main components: self-taught feature extraction by ELM with the convolution layer and bottom-up information classification based on the feature constraint. To overcome the disadvantages of poor performance under complex conditions such as shadows and illumination, this paper mainly solves four problems: local features learning: replaced the fully connected layer, the convolutional layer is used to extract local features; feature extraction in different scales: the integration of ELM and inception structure improves the parameters learning speed, but it also achieves spatial interactivity in different scales; and the validity of the training database: a method how to find a training data set is proposed.

Findings

Experimental results on various data sets reveal that the proposed algorithm effectively improves performance under complex conditions. In the actual environment, experimental results tested by the robot platform named BIT-NAZA show that the proposed algorithm achieves better performance and reliability.

Originality/value

This research can provide a theoretical and engineering basis for lane detection on unmanned robots.

本文主要研究无人移动机器人的车道检测问题。对于移动机器人来说,花费大量的时间进行车道检测是不可取的。因此,在光照不足和阴影等复杂环境下快速检测车道成为一项挑战。设计/方法/方法:充分利用极限学习机(ELM)收敛速度快和卷积神经网络可以提取不同尺度局部特征的优势,提出了一种基于极限学习机(ELM)和多尺度ELM初始结构相结合的学习框架。该体系结构分为两个主要部分:基于卷积层的ELM自学习特征提取和基于特征约束的自下而上信息分类。为了克服在阴影、光照等复杂条件下性能较差的缺点,本文主要解决了四个问题:局部特征学习:用卷积层代替全连接层提取局部特征;不同尺度下的特征提取:ELM与初始结构的融合在提高参数学习速度的同时,实现了不同尺度下的空间交互性;针对训练数据库的有效性问题,提出了一种寻找训练数据集的方法。在各种数据集上的实验结果表明,该算法有效地提高了复杂条件下的性能。在实际环境中,机器人平台BIT-NAZA的实验结果表明,本文提出的算法具有更好的性能和可靠性。独创性/价值本研究可为无人机器人的车道检测提供理论和工程基础。
{"title":"Towards extreme learning machine framework for lane detection on unmanned mobile robot","authors":"Yingpeng Dai, Jiehao Li, Junzheng Wang, Jing Li, Xu Liu","doi":"10.1108/aa-10-2021-0125","DOIUrl":"https://doi.org/10.1108/aa-10-2021-0125","url":null,"abstract":"<h3>Purpose</h3>\u0000<p>This paper aims to focus on lane detection of unmanned mobile robots. For the mobile robot, it is undesirable to spend lots of time detecting the lane. So quickly detecting the lane in a complex environment such as poor illumination and shadows becomes a challenge.</p><!--/ Abstract__block -->\u0000<h3>Design/methodology/approach</h3>\u0000<p>A new learning framework based on an integration of extreme learning machine (ELM) and an inception structure named multiscale ELM is proposed, making full use of the advantages that ELM has faster convergence and convolutional neural network could extract local features in different scales. The proposed architecture is divided into two main components: self-taught feature extraction by ELM with the convolution layer and bottom-up information classification based on the feature constraint. To overcome the disadvantages of poor performance under complex conditions such as shadows and illumination, this paper mainly solves four problems: local features learning: replaced the fully connected layer, the convolutional layer is used to extract local features; feature extraction in different scales: the integration of ELM and inception structure improves the parameters learning speed, but it also achieves spatial interactivity in different scales; and the validity of the training database: a method how to find a training data set is proposed.</p><!--/ Abstract__block -->\u0000<h3>Findings</h3>\u0000<p>Experimental results on various data sets reveal that the proposed algorithm effectively improves performance under complex conditions. In the actual environment, experimental results tested by the robot platform named BIT-NAZA show that the proposed algorithm achieves better performance and reliability.</p><!--/ Abstract__block -->\u0000<h3>Originality/value</h3>\u0000<p>This research can provide a theoretical and engineering basis for lane detection on unmanned robots.</p><!--/ Abstract__block -->","PeriodicalId":501194,"journal":{"name":"Robotic Intelligence and Automation","volume":"53 12","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An optimized cosine jerk motion profile with higher efficiency and flexibility 一个优化的余弦抖动运动轮廓具有更高的效率和灵活性
Pub Date : 2022-04-27 DOI: 10.1108/aa-11-2021-0165
Qixin Zhu, Yusheng Jin, Yonghong Zhu

Purpose

The purpose of this paper is to propose a new acceleration/deceleration (acc/dec) algorithm for motion profiles. The motion efficiency, flexibility of the motion profiles and the residual vibration of the movement are discussed in this paper.

Design/methodology/approach

A dynamics model is developed to assess the residual vibration of these two kinds of motion profile. And a Simulink model is created to assess the motion efficiency and flexibility of the motion profiles with the proposed acc/dec algorithm.

Findings

Considering the flexibility of trigonometric motion profiles and the higher motion efficiency of S-curve motion profiles, the authors add the polynomial parts into the jerk profile of the cosine function acc/dec algorithm to hold the jerk when it reaches the maximum so that the motion efficiency can increase and decrease residual vibration at the same time. And the cyclical parameter k shows the decisive factor for the flexibility of trigonometric motion profiles.

Originality/value

Comparing with the traditional motion profiles, the proposed motion profiles have higher motion efficiency and excite less residual vibration. The acc/dec algorithm proposed in this paper is useful for the present motion control and servo system.

目的提出一种新的运动轮廓加减速(acc/dec)算法。本文讨论了运动效率、运动轮廓的柔性和运动的残余振动。设计/方法/方法建立了一个动力学模型来评估这两种运动剖面的残余振动。并建立了Simulink模型,对所提出的acc/dec算法的运动效率和灵活性进行了评估。结果考虑到三角运动轮廓的灵活性和s曲线运动轮廓较高的运动效率,作者在余弦函数acc/dec算法的激振轮廓中加入多项式部分,在激振达到最大值时保持激振,从而在提高运动效率的同时减少残余振动。周期参数k是三角运动曲线灵活性的决定性因素。与传统的运动轮廓相比,所提出的运动轮廓具有更高的运动效率和较少的残余振动。本文提出的acc/dec算法适用于当前的运动控制和伺服系统。
{"title":"An optimized cosine jerk motion profile with higher efficiency and flexibility","authors":"Qixin Zhu, Yusheng Jin, Yonghong Zhu","doi":"10.1108/aa-11-2021-0165","DOIUrl":"https://doi.org/10.1108/aa-11-2021-0165","url":null,"abstract":"<h3>Purpose</h3>\u0000<p>The purpose of this paper is to propose a new acceleration/deceleration (acc/dec) algorithm for motion profiles. The motion efficiency, flexibility of the motion profiles and the residual vibration of the movement are discussed in this paper.</p><!--/ Abstract__block -->\u0000<h3>Design/methodology/approach</h3>\u0000<p>A dynamics model is developed to assess the residual vibration of these two kinds of motion profile. And a Simulink model is created to assess the motion efficiency and flexibility of the motion profiles with the proposed acc/dec algorithm.</p><!--/ Abstract__block -->\u0000<h3>Findings</h3>\u0000<p>Considering the flexibility of trigonometric motion profiles and the higher motion efficiency of S-curve motion profiles, the authors add the polynomial parts into the jerk profile of the cosine function acc/dec algorithm to hold the jerk when it reaches the maximum so that the motion efficiency can increase and decrease residual vibration at the same time. And the cyclical parameter k shows the decisive factor for the flexibility of trigonometric motion profiles.</p><!--/ Abstract__block -->\u0000<h3>Originality/value</h3>\u0000<p>Comparing with the traditional motion profiles, the proposed motion profiles have higher motion efficiency and excite less residual vibration. The acc/dec algorithm proposed in this paper is useful for the present motion control and servo system.</p><!--/ Abstract__block -->","PeriodicalId":501194,"journal":{"name":"Robotic Intelligence and Automation","volume":"53 13","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Press-fit process fault diagnosis using 1DCNN-LSTM method 基于1DCNN-LSTM方法的压合过程故障诊断
Pub Date : 2022-04-26 DOI: 10.1108/aa-06-2021-0072
Xialiang Ye, Minbo Li

Purpose

Press-fit with force and displacement monitoring is commonly adopted in automotive mechatronic system assembling. However, suitable methods for the press-fit study are still at initial investigation phase. The sequential data physical meaning, small data sets from different resources and computing efficiency should be considered. Therefore, this paper aims to better identify press-fit fault types.

Design/methodology/approach

This paper proposed one-dimensional convolutional neural network (1DCNN)–long short-term memory (LSTM) method to perform press-fit fault diagnosis into automotive assembling practice which is in accordance with current product development procedure. Specialized data augmentation method is proposed to merge different data resources and increase the sample size. Referring one-way sequential data characteristics, LSTM and batch normalization layers are integrated in 1DCNN to improve the performance.

Findings

The proposed 1DCNN-LSTM method is feasible with small data sets from different sources. Using data augmentation to make data unified and sample size increased, the accuracy could reach more than 99%. Training time has reduced from 90 s/Epoch to 4 s/Epoch compare to pure LSTM method.

Originality/value

The proposed method shows better performance with less training time compared to LSTM. Therefore, the method has practical value and is worthy of industrial application.

目的:在汽车机电系统装配中,通常采用带力与位移监测的压配合方式。然而,适合压合研究的方法仍处于初步研究阶段。要考虑序列数据的物理意义、不同资源的小数据集和计算效率。因此,本文旨在更好地识别压合断层类型。设计/方法/途径本文提出了一维卷积神经网络(1DCNN)长短期记忆(LSTM)方法,将其应用于汽车装配实践中,并与当前产品开发流程相适应。提出了专门的数据扩充方法来合并不同的数据资源,增加样本容量。参考单向序列数据的特点,在1DCNN中集成了LSTM和批处理归一化层,提高了性能。结果提出的1DCNN-LSTM方法对于不同来源的小数据集是可行的。采用数据增强,使数据统一,样本量增大,准确率可达99%以上。与纯LSTM方法相比,训练时间从90秒/Epoch减少到4秒/Epoch。与LSTM相比,该方法具有更好的性能和更少的训练时间。因此,该方法具有实用价值,值得工业应用。
{"title":"Press-fit process fault diagnosis using 1DCNN-LSTM method","authors":"Xialiang Ye, Minbo Li","doi":"10.1108/aa-06-2021-0072","DOIUrl":"https://doi.org/10.1108/aa-06-2021-0072","url":null,"abstract":"<h3>Purpose</h3>\u0000<p>Press-fit with force and displacement monitoring is commonly adopted in automotive mechatronic system assembling. However, suitable methods for the press-fit study are still at initial investigation phase. The sequential data physical meaning, small data sets from different resources and computing efficiency should be considered. Therefore, this paper aims to better identify press-fit fault types.</p><!--/ Abstract__block -->\u0000<h3>Design/methodology/approach</h3>\u0000<p>This paper proposed one-dimensional convolutional neural network (1DCNN)–long short-term memory (LSTM) method to perform press-fit fault diagnosis into automotive assembling practice which is in accordance with current product development procedure. Specialized data augmentation method is proposed to merge different data resources and increase the sample size. Referring one-way sequential data characteristics, LSTM and batch normalization layers are integrated in 1DCNN to improve the performance.</p><!--/ Abstract__block -->\u0000<h3>Findings</h3>\u0000<p>The proposed 1DCNN-LSTM method is feasible with small data sets from different sources. Using data augmentation to make data unified and sample size increased, the accuracy could reach more than 99%. Training time has reduced from 90 s/Epoch to 4 s/Epoch compare to pure LSTM method.</p><!--/ Abstract__block -->\u0000<h3>Originality/value</h3>\u0000<p>The proposed method shows better performance with less training time compared to LSTM. Therefore, the method has practical value and is worthy of industrial application.</p><!--/ Abstract__block -->","PeriodicalId":501194,"journal":{"name":"Robotic Intelligence and Automation","volume":"53 14","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
COWO: towards real-time spatiotemporal action localization in videos 面向视频实时时空动作定位
Pub Date : 2022-01-18 DOI: 10.1108/aa-07-2021-0098
Yang Yi, Yang Sun, Saimei Yuan, Yiji Zhu, Mengyi Zhang, Wenjun Zhu

Purpose

The purpose of this paper is to provide a fast and accurate network for spatiotemporal action localization in videos. It detects human actions both in time and space simultaneously in real-time, which is applicable in real-world scenarios such as safety monitoring and collaborative assembly.

Design/methodology/approach

This paper design an end-to-end deep learning network called collaborator only watch once (COWO). COWO recognizes the ongoing human activities in real-time with enhanced accuracy. COWO inherits from the architecture of you only watch once (YOWO), known to be the best performing network for online action localization to date, but with three major structural modifications: COWO enhances the intraclass compactness and enlarges the interclass separability in the feature level. A new correlation channel fusion and attention mechanism are designed based on the Pearson correlation coefficient. Accordingly, a correction loss function is designed. This function minimizes the same class distance and enhances the intraclass compactness. Use a probabilistic K-means clustering technique for selecting the initial seed points. The idea behind this is that the initial distance between cluster centers should be as considerable as possible. CIOU regression loss function is applied instead of the Smooth L1 loss function to help the model converge stably.

Findings

COWO outperforms the original YOWO with improvements of frame mAP 3% and 2.1% at a speed of 35.12 fps. Compared with the two-stream, T-CNN, C3D, the improvement is about 5% and 14.5% when applied to J-HMDB-21, UCF101-24 and AGOT data sets.

Originality/value

COWO extends more flexibility for assembly scenarios as it perceives spatiotemporal human actions in real-time. It contributes to many real-world scenarios such as safety monitoring and collaborative assembly.

目的为视频中动作的时空定位提供一个快速准确的网络。它可以实时检测人类在时间和空间上的行为,适用于安全监控和协同组装等现实场景。设计/方法/方法本文设计了一个端到端深度学习网络,称为协作者只看一次(coco)。coo实时识别正在进行的人类活动,并提高了准确性。COWO继承了you only watch one (YOWO)的架构,YOWO被认为是迄今为止性能最好的在线动作定位网络,但在结构上进行了三个主要的修改:COWO增强了类内紧凑性,并在特征级别上扩大了类间可分离性。基于Pearson相关系数,设计了一种新的相关通道融合和注意机制。据此,设计了修正损失函数。这个函数最小化了相同的类距离,增强了类内的紧凑性。使用概率k均值聚类技术来选择初始种子点。这背后的想法是,星团中心之间的初始距离应该尽可能大。采用CIOU回归损失函数代替光滑L1损失函数,使模型稳定收敛。在35.12 fps的速度下,scowo比原来的YOWO帧mAP分别提高了3%和2.1%。在J-HMDB-21、UCF101-24和AGOT数据集上,与双流、T-CNN、C3D相比,分别提高了约5%和14.5%。独创性/valueCOWO为装配场景扩展了更多的灵活性,因为它可以实时感知时空的人类行为。它有助于许多现实世界的场景,如安全监控和协作组装。
{"title":"COWO: towards real-time spatiotemporal action localization in videos","authors":"Yang Yi, Yang Sun, Saimei Yuan, Yiji Zhu, Mengyi Zhang, Wenjun Zhu","doi":"10.1108/aa-07-2021-0098","DOIUrl":"https://doi.org/10.1108/aa-07-2021-0098","url":null,"abstract":"<h3>Purpose</h3>\u0000<p>The purpose of this paper is to provide a fast and accurate network for spatiotemporal action localization in videos. It detects human actions both in time and space simultaneously in real-time, which is applicable in real-world scenarios such as safety monitoring and collaborative assembly.</p><!--/ Abstract__block -->\u0000<h3>Design/methodology/approach</h3>\u0000<p>This paper design an end-to-end deep learning network called collaborator only watch once (COWO). COWO recognizes the ongoing human activities in real-time with enhanced accuracy. COWO inherits from the architecture of you only watch once (YOWO), known to be the best performing network for online action localization to date, but with three major structural modifications: COWO enhances the intraclass compactness and enlarges the interclass separability in the feature level. A new correlation channel fusion and attention mechanism are designed based on the Pearson correlation coefficient. Accordingly, a correction loss function is designed. This function minimizes the same class distance and enhances the intraclass compactness. Use a probabilistic K-means clustering technique for selecting the initial seed points. The idea behind this is that the initial distance between cluster centers should be as considerable as possible. CIOU regression loss function is applied instead of the Smooth L1 loss function to help the model converge stably.</p><!--/ Abstract__block -->\u0000<h3>Findings</h3>\u0000<p>COWO outperforms the original YOWO with improvements of frame mAP 3% and 2.1% at a speed of 35.12 fps. Compared with the two-stream, T-CNN, C3D, the improvement is about 5% and 14.5% when applied to J-HMDB-21, UCF101-24 and AGOT data sets.</p><!--/ Abstract__block -->\u0000<h3>Originality/value</h3>\u0000<p>COWO extends more flexibility for assembly scenarios as it perceives spatiotemporal human actions in real-time. It contributes to many real-world scenarios such as safety monitoring and collaborative assembly.</p><!--/ Abstract__block -->","PeriodicalId":501194,"journal":{"name":"Robotic Intelligence and Automation","volume":"262 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Robotic Intelligence and Automation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1