首页 > 最新文献

2022 4th International Conference on Control and Robotics (ICCR)最新文献

英文 中文
The Influence Analysis of the Power Grid Topology on the Stray Current Invading Transformers 电网拓扑结构对杂散电流侵入变压器的影响分析
Pub Date : 2022-12-02 DOI: 10.1109/ICCR55715.2022.10053889
He Zhong, Jiangtao Liu, Mingwei Tang, Jianlei Zhang, Kai Liu, Song Xiao, Yujun Guo
During the operational process of the metro lines, the traction current discharged from the working grounding wheels flows back to the terrestrial traction substations through the steel rail, as the DC stray current may be generated due to the poor insulating condition between the rail and the ground. The characteristics of the DC stray current invading the transformer settled at the terrestrial substation is affected by a series elements including the train's operational condition, the ‘rail -ground’ transition resistance, the soil structure and power grid topology. Exploring the influence factors of the stray current lays the foundations for further preventing the negative impact brought from stray current. In order to analyze the influence of the power grid topology on the stray current invading transformers, a coupling model involving the up and down metro lines with the power grid is built, based on the finite element method (FEM). Based on this FEM model, the variation of stray current invading the transformer is analyzed along with the relative position between the power grid single circuit and the metro line varying, meanwhile the variation of the power grid's topology is also considered. It is found that with the complexity of the power grid topology, the total amount of stray current invading the power grid increases, whereas the stray current flowing through the majority of the transmission lines decreases. In addition, no matter the two transformers constituting the power grid circuit are on the same side or on the opposite side of the metro line, the stray current invading the grid tends to increase with the reduction of the angle between the metro line and the power grid circuit.
在地铁线路运行过程中,工作接地轮排出的牵引电流通过钢轨回流到地面牵引变电所,由于钢轨与地面之间绝缘不良,可能产生直流杂散电流。进入地面变电站变压器的直流杂散电流的特性受列车运行条件、轨地过渡电阻、土壤结构和电网拓扑等一系列因素的影响。探讨杂散电流的影响因素,为进一步防止杂散电流带来的负面影响奠定了基础。为了分析电网拓扑结构对杂散电流侵入变压器的影响,基于有限元法建立了地铁上下行线与电网的耦合模型。在此有限元模型的基础上,分析了侵入变压器的杂散电流随电网单回路与地铁线路相对位置的变化,同时考虑了电网拓扑结构的变化。研究发现,随着电网拓扑复杂度的增加,侵入电网的杂散电流总量增加,而流经大部分输电线路的杂散电流则减少。此外,无论构成电网回路的两台变压器在地铁线路的同侧还是相反侧,侵入电网的杂散电流都有随着地铁线路与电网回路夹角的减小而增大的趋势。
{"title":"The Influence Analysis of the Power Grid Topology on the Stray Current Invading Transformers","authors":"He Zhong, Jiangtao Liu, Mingwei Tang, Jianlei Zhang, Kai Liu, Song Xiao, Yujun Guo","doi":"10.1109/ICCR55715.2022.10053889","DOIUrl":"https://doi.org/10.1109/ICCR55715.2022.10053889","url":null,"abstract":"During the operational process of the metro lines, the traction current discharged from the working grounding wheels flows back to the terrestrial traction substations through the steel rail, as the DC stray current may be generated due to the poor insulating condition between the rail and the ground. The characteristics of the DC stray current invading the transformer settled at the terrestrial substation is affected by a series elements including the train's operational condition, the ‘rail -ground’ transition resistance, the soil structure and power grid topology. Exploring the influence factors of the stray current lays the foundations for further preventing the negative impact brought from stray current. In order to analyze the influence of the power grid topology on the stray current invading transformers, a coupling model involving the up and down metro lines with the power grid is built, based on the finite element method (FEM). Based on this FEM model, the variation of stray current invading the transformer is analyzed along with the relative position between the power grid single circuit and the metro line varying, meanwhile the variation of the power grid's topology is also considered. It is found that with the complexity of the power grid topology, the total amount of stray current invading the power grid increases, whereas the stray current flowing through the majority of the transmission lines decreases. In addition, no matter the two transformers constituting the power grid circuit are on the same side or on the opposite side of the metro line, the stray current invading the grid tends to increase with the reduction of the angle between the metro line and the power grid circuit.","PeriodicalId":441511,"journal":{"name":"2022 4th International Conference on Control and Robotics (ICCR)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126735410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of Small Animals Control in Substation Based on GBDT and LR Fusion Algorithm 基于GBDT和LR融合算法的变电站小动物控制应用
Pub Date : 2022-12-02 DOI: 10.1109/ICCR55715.2022.10053897
Zhaofeng Chen
Animals control is an important task for the safe operation of substations. Aiming at the problems existing in the control of small animals, with the superior prediction ability of machine learning, a prediction model of small animals hazard grade is proposed, which combines gradient boosting decision (GBDT) and logistic regression (LR) algorithm. The model combined substation operation and maintenance data with local meteorological data, performs features screening by calculating the variance value, and achieves classes balance by using sampling technology. And finally the model achieves the prediction of small animals hazard grade in substation. By using different data sets and not using GBDT algorithm to train the model, the prediction results are compared and analyzed. The proposed model is better in all prediction performance indicators, which verifies the validity of the method.
动物控制是变电站安全运行的一项重要任务。针对小动物控制中存在的问题,利用机器学习优越的预测能力,提出了一种结合梯度增强决策(GBDT)和逻辑回归(LR)算法的小动物危害等级预测模型。该模型将变电站运维数据与当地气象数据相结合,通过计算方差值进行特征筛选,并利用抽样技术实现类平衡。最后,该模型实现了变电站小动物危害等级的预测。通过使用不同的数据集,不使用GBDT算法训练模型,对预测结果进行比较分析。所提模型在各预测性能指标上均较好,验证了该方法的有效性。
{"title":"Application of Small Animals Control in Substation Based on GBDT and LR Fusion Algorithm","authors":"Zhaofeng Chen","doi":"10.1109/ICCR55715.2022.10053897","DOIUrl":"https://doi.org/10.1109/ICCR55715.2022.10053897","url":null,"abstract":"Animals control is an important task for the safe operation of substations. Aiming at the problems existing in the control of small animals, with the superior prediction ability of machine learning, a prediction model of small animals hazard grade is proposed, which combines gradient boosting decision (GBDT) and logistic regression (LR) algorithm. The model combined substation operation and maintenance data with local meteorological data, performs features screening by calculating the variance value, and achieves classes balance by using sampling technology. And finally the model achieves the prediction of small animals hazard grade in substation. By using different data sets and not using GBDT algorithm to train the model, the prediction results are compared and analyzed. The proposed model is better in all prediction performance indicators, which verifies the validity of the method.","PeriodicalId":441511,"journal":{"name":"2022 4th International Conference on Control and Robotics (ICCR)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127962202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stochastic Linear Quadratic Game for Discrete-time Systems Based-on Adaptive Dynamic Programming 基于自适应动态规划的离散系统随机线性二次对策
Pub Date : 2022-12-02 DOI: 10.1109/ICCR55715.2022.10053887
Shibo Na, Ruizhuo Song
In this paper, we proposed an adaptive dynamic programming (ADP) algorithm for discrete time stochastic linear quadratic game without system dynamics. Firstly, we described the problem and converted it into a deterministic form. Then, we solved the Bellman equation to obtain the control gain matrix and disturbance gain matrix when the system dynamics were known. After that, we implemented the ADP algorithm with unknown system through neural networks. Model network, action network, disturbance network and critic network were used to approximate the system model, control gain matrix, disturbance gain matrix and value function respectively. Finally, a simulation example was given to verify the effectiveness of the algorithm.
针对无系统动力学的离散时间随机线性二次对策,提出了一种自适应动态规划(ADP)算法。首先对问题进行描述,并将其转化为确定性形式。然后,在系统动力学已知的情况下,通过求解Bellman方程得到控制增益矩阵和扰动增益矩阵。在此基础上,通过神经网络实现了未知系统的ADP算法。采用模型网络、动作网络、干扰网络和批评家网络分别逼近系统模型、控制增益矩阵、干扰增益矩阵和值函数。最后通过仿真算例验证了算法的有效性。
{"title":"Stochastic Linear Quadratic Game for Discrete-time Systems Based-on Adaptive Dynamic Programming","authors":"Shibo Na, Ruizhuo Song","doi":"10.1109/ICCR55715.2022.10053887","DOIUrl":"https://doi.org/10.1109/ICCR55715.2022.10053887","url":null,"abstract":"In this paper, we proposed an adaptive dynamic programming (ADP) algorithm for discrete time stochastic linear quadratic game without system dynamics. Firstly, we described the problem and converted it into a deterministic form. Then, we solved the Bellman equation to obtain the control gain matrix and disturbance gain matrix when the system dynamics were known. After that, we implemented the ADP algorithm with unknown system through neural networks. Model network, action network, disturbance network and critic network were used to approximate the system model, control gain matrix, disturbance gain matrix and value function respectively. Finally, a simulation example was given to verify the effectiveness of the algorithm.","PeriodicalId":441511,"journal":{"name":"2022 4th International Conference on Control and Robotics (ICCR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128009250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mobile Humanoid Robot Control through Object Movement Imagery 基于物体运动图像的移动人形机器人控制
Pub Date : 2022-12-02 DOI: 10.1109/ICCR55715.2022.10053905
Eneo Petoku, G. Capi
Brain-Computer Interface research aims to build systems that can connect the brain to computer or a certain robotic application. The brain activity is solely used to generate commands that can be recognized by the computer. To generate a recognizable brain activity, usually the subject imagines the movements of one's limbs without performing any real movement. In the literature, this paradigm is called Motor Imagery (MI). The subject provides data through a particular recording technology, such as EEG, in a certain time frame, in which the subject forces himself/herself into the feeling of performing a particular action. Each recorded data is linked to a label, and different techniques are used to learn patterns, in order to map them correctly. The goal of this paper is to investigate, whether it is possible to generate similar results as in the case of imagining the movement of limbs, by imagining the movement of an external object. To investigate this, we compare the performance of Motor Imagery and Object Motor Imagery. In the first case the mental task consists of imagining the movements of arms, while in the second the imagining of moving an external box through solely brain activity. A video of a box that moves through a plane in two directions, right, left, is used as visual feedback in both cases. The recorded EEG data are split into training and testing subsets, and are fed to a deep neural network, that tries to learn the different patterns and to classify them. The results show that Object Motor Imagery can achieve better results compared to MI, despite the lack of embodiment and congruity with any daily neural command. The trained architecture is used to control a mobile humanoid, investigating the implementation of Object Motor Movement in robotic application.
脑机接口研究的目标是建立能够将大脑与计算机或某个机器人应用程序连接起来的系统。大脑活动仅用于生成可被计算机识别的命令。为了产生可识别的大脑活动,通常受试者想象自己的肢体运动,而不做任何实际的运动。在文献中,这种范式被称为运动意象(MI)。受试者通过特定的记录技术,如脑电图,在特定的时间框架内提供数据,在这个时间框架内,受试者强迫自己进入执行特定动作的感觉。每个记录的数据都与一个标签相关联,并且使用不同的技术来学习模式,以便正确地映射它们。本文的目的是研究是否有可能通过想象外部物体的运动来产生与想象肢体运动类似的结果。为了研究这一点,我们比较了运动意象和物体运动意象的表现。在第一种情况下,心理任务包括想象手臂的运动,而在第二种情况下,想象移动一个外部盒子只通过大脑活动。一个盒子在平面上向左右两个方向移动的视频,在两种情况下都被用作视觉反馈。记录下来的脑电图数据被分成训练和测试子集,并被送入深度神经网络,该网络试图学习不同的模式并对它们进行分类。结果表明,尽管缺乏与任何日常神经指令的体现和一致性,但与MI相比,目标运动图像可以获得更好的结果。将训练好的体系结构用于控制移动的人形机器人,研究了物体运动在机器人应用中的实现。
{"title":"Mobile Humanoid Robot Control through Object Movement Imagery","authors":"Eneo Petoku, G. Capi","doi":"10.1109/ICCR55715.2022.10053905","DOIUrl":"https://doi.org/10.1109/ICCR55715.2022.10053905","url":null,"abstract":"Brain-Computer Interface research aims to build systems that can connect the brain to computer or a certain robotic application. The brain activity is solely used to generate commands that can be recognized by the computer. To generate a recognizable brain activity, usually the subject imagines the movements of one's limbs without performing any real movement. In the literature, this paradigm is called Motor Imagery (MI). The subject provides data through a particular recording technology, such as EEG, in a certain time frame, in which the subject forces himself/herself into the feeling of performing a particular action. Each recorded data is linked to a label, and different techniques are used to learn patterns, in order to map them correctly. The goal of this paper is to investigate, whether it is possible to generate similar results as in the case of imagining the movement of limbs, by imagining the movement of an external object. To investigate this, we compare the performance of Motor Imagery and Object Motor Imagery. In the first case the mental task consists of imagining the movements of arms, while in the second the imagining of moving an external box through solely brain activity. A video of a box that moves through a plane in two directions, right, left, is used as visual feedback in both cases. The recorded EEG data are split into training and testing subsets, and are fed to a deep neural network, that tries to learn the different patterns and to classify them. The results show that Object Motor Imagery can achieve better results compared to MI, despite the lack of embodiment and congruity with any daily neural command. The trained architecture is used to control a mobile humanoid, investigating the implementation of Object Motor Movement in robotic application.","PeriodicalId":441511,"journal":{"name":"2022 4th International Conference on Control and Robotics (ICCR)","volume":"7 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114011899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robot Dynamic Path Planning Based on Improved A* and DWA Algorithms 基于改进A*和DWA算法的机器人动态路径规划
Pub Date : 2022-12-02 DOI: 10.1109/ICCR55715.2022.10053929
Chenxi Guan, Shuying Wang
When the traditional A* algorithm is applied to robot path planning, it has the problems of low efficiency and unable to avoid obstacles dynamically. In order to solve the above problems, a fusion algorithm based on improved A* algorithm and DWA algorithm is proposed. The A* algorithm is improved in three aspects: reducing the search direction of A* algorithm to reduce the search time, adding path information parameters to dynamically adjust the weight of heuristic function, and introducing important node extraction strategy to reduce the number of turns and shorten the path. Finally, the improved A* algorithm is fused with DWA algorithm. The experimental results show that the improved fusion algorithm can realize global optimal path planning and local real-time obstacle avoidance.
将传统的A*算法应用于机器人路径规划时,存在效率低、不能动态避障等问题。为了解决上述问题,提出了一种基于改进a *算法和DWA算法的融合算法。对A*算法进行了三方面的改进:减小A*算法的搜索方向,减少搜索时间;增加路径信息参数,动态调整启发式函数的权重;引入重要节点提取策略,减少匝数,缩短路径。最后,将改进的A*算法与DWA算法进行融合。实验结果表明,改进的融合算法可以实现全局最优路径规划和局部实时避障。
{"title":"Robot Dynamic Path Planning Based on Improved A* and DWA Algorithms","authors":"Chenxi Guan, Shuying Wang","doi":"10.1109/ICCR55715.2022.10053929","DOIUrl":"https://doi.org/10.1109/ICCR55715.2022.10053929","url":null,"abstract":"When the traditional A* algorithm is applied to robot path planning, it has the problems of low efficiency and unable to avoid obstacles dynamically. In order to solve the above problems, a fusion algorithm based on improved A* algorithm and DWA algorithm is proposed. The A* algorithm is improved in three aspects: reducing the search direction of A* algorithm to reduce the search time, adding path information parameters to dynamically adjust the weight of heuristic function, and introducing important node extraction strategy to reduce the number of turns and shorten the path. Finally, the improved A* algorithm is fused with DWA algorithm. The experimental results show that the improved fusion algorithm can realize global optimal path planning and local real-time obstacle avoidance.","PeriodicalId":441511,"journal":{"name":"2022 4th International Conference on Control and Robotics (ICCR)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131467692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Communication and Control Co-design for Networked Control Systems under DoS Attacks and Time-varying Delays DoS攻击和时变时延下网络控制系统的通信与控制协同设计
Pub Date : 2022-12-02 DOI: 10.1109/ICCR55715.2022.10053879
Lulu Zhou, Chen Peng, Z. Cao
This paper is concerned with the collaborative design of networked control systems (NCSs) subject to DoS attacks, scheduling protocols, and time-varying delays. First, to save limited network resources and prevent data collisions, the Try-Once-Discard (TOD) protocol is introduced to orchestrate the node access assignment in the sensor-to-controller channel. Then, denial-of-service (DoS) attacks that can cause communication blockages are addressed. Additionally, sufficient conditions are derived to guarantee the exponential mean-square stability of the resulting hybrid system based on which the controller gain and weighted matrix of scheduling protocols are co-designed. Finally, two simulation examples are used to illustrate the validity of the proposed method.
本文研究了受DoS攻击、调度协议和时变延迟影响的网络控制系统的协同设计。首先,为了节省有限的网络资源和防止数据冲突,在传感器到控制器的通道中引入了尝试一次丢弃(Try-Once-Discard, TOD)协议来编排节点访问分配。然后,解决可能导致通信阻塞的拒绝服务(DoS)攻击。在此基础上,给出了保证混合系统指数均方稳定性的充分条件,并在此基础上共同设计了控制器增益和调度协议加权矩阵。最后,通过两个仿真算例验证了所提方法的有效性。
{"title":"Communication and Control Co-design for Networked Control Systems under DoS Attacks and Time-varying Delays","authors":"Lulu Zhou, Chen Peng, Z. Cao","doi":"10.1109/ICCR55715.2022.10053879","DOIUrl":"https://doi.org/10.1109/ICCR55715.2022.10053879","url":null,"abstract":"This paper is concerned with the collaborative design of networked control systems (NCSs) subject to DoS attacks, scheduling protocols, and time-varying delays. First, to save limited network resources and prevent data collisions, the Try-Once-Discard (TOD) protocol is introduced to orchestrate the node access assignment in the sensor-to-controller channel. Then, denial-of-service (DoS) attacks that can cause communication blockages are addressed. Additionally, sufficient conditions are derived to guarantee the exponential mean-square stability of the resulting hybrid system based on which the controller gain and weighted matrix of scheduling protocols are co-designed. Finally, two simulation examples are used to illustrate the validity of the proposed method.","PeriodicalId":441511,"journal":{"name":"2022 4th International Conference on Control and Robotics (ICCR)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132394862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convolutional Neural Network Based Unmanned Ground Vehicle Control via Deep Reinforcement Learning 基于深度强化学习的卷积神经网络无人地面车辆控制
Pub Date : 2022-12-02 DOI: 10.1109/ICCR55715.2022.10053931
Yongxin Liu, Qiang He, Junhui Wang, Zhiliang Wang, Tianheng Chen, Shichen Jin, Chi Zhang, Zhiqiang Wang
In order to reduce the cost of human resources and material resources and improve the power line inspection efficiency, unmanned ground vehicle (UGV), which utilizes the modern artificial intelligence such as deep learning and reinforcement learning, is commonly introduced to replace of human to inspect power lines in the grid system. This paper provides a deep Q network (DQN) and convolutional neural network (CNN) based end-to-end control model to drive UGV to inspect automatically, and meanwhile to avoid obstacles. Specifically, we utilize the preprocessed grayscale image as the input of the CNN, and output the final Q value. This model simulates human learning behavior by interaction between UGV and the environment. Through repeated self-learning and reward value increasing in a simulation environment, the UGV successfully reaches the target position in a shortest time and meanwhile avoiding a variety of obstacles.
为了降低人力物力成本,提高电力线巡检效率,目前普遍引入无人地面车(UGV),利用深度学习、强化学习等现代人工智能技术代替人工对电网系统中的电力线进行巡检。本文提出了一种基于深度Q网络(DQN)和卷积神经网络(CNN)的端到端控制模型来驱动无人潜航车自动检测,同时实现避障。具体来说,我们利用预处理后的灰度图像作为CNN的输入,输出最终的Q值。该模型通过UGV与环境的相互作用来模拟人类的学习行为。通过在仿真环境中反复自我学习和增加奖励值,UGV成功地在最短时间内到达目标位置,同时避开了各种障碍物。
{"title":"Convolutional Neural Network Based Unmanned Ground Vehicle Control via Deep Reinforcement Learning","authors":"Yongxin Liu, Qiang He, Junhui Wang, Zhiliang Wang, Tianheng Chen, Shichen Jin, Chi Zhang, Zhiqiang Wang","doi":"10.1109/ICCR55715.2022.10053931","DOIUrl":"https://doi.org/10.1109/ICCR55715.2022.10053931","url":null,"abstract":"In order to reduce the cost of human resources and material resources and improve the power line inspection efficiency, unmanned ground vehicle (UGV), which utilizes the modern artificial intelligence such as deep learning and reinforcement learning, is commonly introduced to replace of human to inspect power lines in the grid system. This paper provides a deep Q network (DQN) and convolutional neural network (CNN) based end-to-end control model to drive UGV to inspect automatically, and meanwhile to avoid obstacles. Specifically, we utilize the preprocessed grayscale image as the input of the CNN, and output the final Q value. This model simulates human learning behavior by interaction between UGV and the environment. Through repeated self-learning and reward value increasing in a simulation environment, the UGV successfully reaches the target position in a shortest time and meanwhile avoiding a variety of obstacles.","PeriodicalId":441511,"journal":{"name":"2022 4th International Conference on Control and Robotics (ICCR)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125168325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Node Deployment and Energy Saving Optimization Method for Wireless Sensor Networks Based on Q-learning 基于q -学习的无线传感器网络节点部署及节能优化方法
Pub Date : 2022-12-02 DOI: 10.1109/ICCR55715.2022.10053885
Shujun Huang, Zhihua Zhang, Ruofeng Xie
The use of wireless sensor networks can achieve effective protection of the monitored area. The node deployment and energy saving optimization of wireless sensor network is important due to the constraints of limited battery capacity and short life span of nodes, and a node deployment and energy saving optimization method is proposed based on reinforcement learning. The Q-learning algorithm is used to screen the nodes that can detect the range of small animals, deploy the nodes autonomously and achieve effective energy saving optimization. Simulation results show that the method can reduce energy consumption by 30% to 35% with shorter convergence time.
利用无线传感器网络可以实现对被监控区域的有效保护。由于无线传感器网络的电池容量有限、节点寿命短,节点部署和节能优化问题显得尤为重要,提出了一种基于强化学习的节点部署和节能优化方法。采用Q-learning算法筛选能够探测到小动物范围的节点,自主部署节点,实现有效的节能优化。仿真结果表明,该方法可降低30% ~ 35%的能量消耗,且收敛时间较短。
{"title":"Node Deployment and Energy Saving Optimization Method for Wireless Sensor Networks Based on Q-learning","authors":"Shujun Huang, Zhihua Zhang, Ruofeng Xie","doi":"10.1109/ICCR55715.2022.10053885","DOIUrl":"https://doi.org/10.1109/ICCR55715.2022.10053885","url":null,"abstract":"The use of wireless sensor networks can achieve effective protection of the monitored area. The node deployment and energy saving optimization of wireless sensor network is important due to the constraints of limited battery capacity and short life span of nodes, and a node deployment and energy saving optimization method is proposed based on reinforcement learning. The Q-learning algorithm is used to screen the nodes that can detect the range of small animals, deploy the nodes autonomously and achieve effective energy saving optimization. Simulation results show that the method can reduce energy consumption by 30% to 35% with shorter convergence time.","PeriodicalId":441511,"journal":{"name":"2022 4th International Conference on Control and Robotics (ICCR)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114343136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Off-policy Q-learning-based Tracking Control for Stochastic Linear Discrete-Time Systems 基于离策略q学习的随机线性离散系统跟踪控制
Pub Date : 2022-12-02 DOI: 10.1109/ICCR55715.2022.10053863
X. Liu, Lei Zhang, Yunjian Peng
In this paper, an adaptive optimal control is investigated for a stochastic linear discrete-time system with multiplicative state-dependent noise and control-dependent noise without knowledge of the system dynamics. With the framework of Q-learning, an off-policy state feedback solution for stochastic linear quadratic tracking (SLQT) problem has been proposed. First, an augmented system of the original system and the reference command generator is established to solve SLQT problem. Then, we present an optimal control by solving stochastic algebraic Riccati equation (SARE). Next, we present the on-policy and off-policy algorithms to achieve an adaptive optimal control without knowing the system dynamics. Finally, a simulation test is finally setup to verify the performance of the proposed adaptive optimal control.
本文研究了在不知道系统动力学的情况下,具有状态相关噪声和控制相关噪声的随机线性离散系统的自适应最优控制问题。在q -学习的框架下,提出了随机线性二次跟踪(SLQT)问题的非策略状态反馈解。首先,在原系统的基础上建立了扩充系统和参考命令生成器来解决SLQT问题。然后,我们通过求解随机代数Riccati方程(SARE)给出了最优控制。其次,我们提出了在不知道系统动力学的情况下实现自适应最优控制的策略和非策略算法。最后,通过仿真实验验证了所提出的自适应最优控制的性能。
{"title":"Off-policy Q-learning-based Tracking Control for Stochastic Linear Discrete-Time Systems","authors":"X. Liu, Lei Zhang, Yunjian Peng","doi":"10.1109/ICCR55715.2022.10053863","DOIUrl":"https://doi.org/10.1109/ICCR55715.2022.10053863","url":null,"abstract":"In this paper, an adaptive optimal control is investigated for a stochastic linear discrete-time system with multiplicative state-dependent noise and control-dependent noise without knowledge of the system dynamics. With the framework of Q-learning, an off-policy state feedback solution for stochastic linear quadratic tracking (SLQT) problem has been proposed. First, an augmented system of the original system and the reference command generator is established to solve SLQT problem. Then, we present an optimal control by solving stochastic algebraic Riccati equation (SARE). Next, we present the on-policy and off-policy algorithms to achieve an adaptive optimal control without knowing the system dynamics. Finally, a simulation test is finally setup to verify the performance of the proposed adaptive optimal control.","PeriodicalId":441511,"journal":{"name":"2022 4th International Conference on Control and Robotics (ICCR)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114540501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparisons of RCM Generation Algorithms for Vision-Controlled Robotic Endoscope 视觉控制机器人内窥镜RCM生成算法的比较
Pub Date : 2022-12-02 DOI: 10.1109/ICCR55715.2022.10053894
Weibing Li, Biao Song, Yongping Pan
In minimally invasive surgery (MIS), a surgical endoscope is an essential instrument that provides visualization for the surgeon. One principal characteristic of surgical instruments is that remote center of motion (RCM) must be respected. To meet such a practical requirement, many physical RCM mechanisms and software-based RCM generation algorithms have been proposed. As compared with physical RCM mechanisms, RCM generation algorithms possess more flexibility due to the fact that the RCM point can be adjusted if required. This paper conducts comparisons of four typical RCM generation algorithms applied to a vision-controlled robotic endoscope under joint constraints. Kinematic models of the robotic endoscope and the four RCM generation algorithms are first briefly introduced. Then, a unified control formulation based on quadratic programming (QP) is constructed to incorporate kinematic, RCM, and physical constraints of the robotic endoscope. Based on the unified control scheme, comparative simulations and experiments are performed. The advantages and disadvantages of the four typical RCM generation algorithms are analyzed and discussed. When performing a same peg transfer task in the simulations, the RCM errors synthesized by RCM generation algorithms designed using a plane equation and an insertion equation are smaller. In the physical experiments, there are few differences in the RCM errors. Nevertheless, it is revealed that the joint velocities corresponding to the RCM generation algorithm based on a plane equation are the smallest, which means that the joint angles change more gently and it can be more friendly to MIS.
在微创手术(MIS)中,手术内窥镜是为外科医生提供可视化的重要工具。手术器械的一个主要特点是必须尊重远程运动中心(RCM)。为了满足这种实际需求,已经提出了许多物理RCM机制和基于软件的RCM生成算法。与物理RCM机制相比,RCM生成算法具有更大的灵活性,因为RCM点可以根据需要进行调整。本文对关节约束下视觉控制机器人内窥镜的四种典型RCM生成算法进行了比较。首先简要介绍了机器人内窥镜的运动学模型和四种RCM生成算法。然后,构建了基于二次规划(QP)的统一控制公式,将机器人内窥镜的运动学约束、RCM约束和物理约束结合起来。基于统一的控制方案,进行了对比仿真和实验。分析和讨论了四种典型RCM生成算法的优缺点。当在模拟中执行相同的peg转移任务时,采用平面方程和插入方程设计的RCM生成算法合成的RCM误差较小。在物理实验中,RCM误差差异不大。然而,基于平面方程的RCM生成算法所对应的关节速度最小,这意味着关节角度变化更平缓,对MIS更友好。
{"title":"Comparisons of RCM Generation Algorithms for Vision-Controlled Robotic Endoscope","authors":"Weibing Li, Biao Song, Yongping Pan","doi":"10.1109/ICCR55715.2022.10053894","DOIUrl":"https://doi.org/10.1109/ICCR55715.2022.10053894","url":null,"abstract":"In minimally invasive surgery (MIS), a surgical endoscope is an essential instrument that provides visualization for the surgeon. One principal characteristic of surgical instruments is that remote center of motion (RCM) must be respected. To meet such a practical requirement, many physical RCM mechanisms and software-based RCM generation algorithms have been proposed. As compared with physical RCM mechanisms, RCM generation algorithms possess more flexibility due to the fact that the RCM point can be adjusted if required. This paper conducts comparisons of four typical RCM generation algorithms applied to a vision-controlled robotic endoscope under joint constraints. Kinematic models of the robotic endoscope and the four RCM generation algorithms are first briefly introduced. Then, a unified control formulation based on quadratic programming (QP) is constructed to incorporate kinematic, RCM, and physical constraints of the robotic endoscope. Based on the unified control scheme, comparative simulations and experiments are performed. The advantages and disadvantages of the four typical RCM generation algorithms are analyzed and discussed. When performing a same peg transfer task in the simulations, the RCM errors synthesized by RCM generation algorithms designed using a plane equation and an insertion equation are smaller. In the physical experiments, there are few differences in the RCM errors. Nevertheless, it is revealed that the joint velocities corresponding to the RCM generation algorithm based on a plane equation are the smallest, which means that the joint angles change more gently and it can be more friendly to MIS.","PeriodicalId":441511,"journal":{"name":"2022 4th International Conference on Control and Robotics (ICCR)","volume":"185 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131787790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 4th International Conference on Control and Robotics (ICCR)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1