首页 > 最新文献

Cognitive Robotics最新文献

英文 中文
Scalable and cohesive swarm control based on reinforcement learning 基于强化学习的可扩展、有凝聚力的蜂群控制
Pub Date : 2024-01-01 DOI: 10.1016/j.cogr.2024.05.003
Marc-Andrė Blais, Moulay A. Akhloufi

Unmanned vehicles have seen a significant increase in a wide variety of fields such as for logistics, agriculture and other commercial applications. Controlling swarms of unmanned vehicles is a challenging task that requires complex autonomous control systems. Reinforcement learning has been proposed as a solution to this challenge. We propose an approach based on agent masking to enable a simple Deep Q-Network algorithm to scale on large swarms while training on relatively smaller swarms. We train our approach using multiple swarm sizes and learning rates and compare our results using metrics such as the number of collisions. We also compare the ability of our approach to scale on swarms ranging from five to 25 agents using metrics and visual analysis. Our proposed solution was able to guide a swarm of up to 100 agents to a target while keeping a good swarm cohesion and avoiding collision.

无人驾驶飞行器在物流、农业和其他商业应用等多个领域都有显著增长。控制无人车群是一项具有挑战性的任务,需要复杂的自主控制系统。强化学习已被提出作为应对这一挑战的解决方案。我们提出了一种基于代理掩蔽的方法,使简单的深度 Q 网络算法能够在大型蜂群上扩展,同时在相对较小的蜂群上进行训练。我们使用多种蜂群规模和学习率来训练我们的方法,并使用碰撞次数等指标来比较我们的结果。我们还使用指标和视觉分析比较了我们的方法在 5 到 25 个代理的蜂群上的扩展能力。我们提出的解决方案能够引导多达 100 个代理的蜂群到达目标,同时保持良好的蜂群凝聚力并避免碰撞。
{"title":"Scalable and cohesive swarm control based on reinforcement learning","authors":"Marc-Andrė Blais,&nbsp;Moulay A. Akhloufi","doi":"10.1016/j.cogr.2024.05.003","DOIUrl":"https://doi.org/10.1016/j.cogr.2024.05.003","url":null,"abstract":"<div><p>Unmanned vehicles have seen a significant increase in a wide variety of fields such as for logistics, agriculture and other commercial applications. Controlling swarms of unmanned vehicles is a challenging task that requires complex autonomous control systems. Reinforcement learning has been proposed as a solution to this challenge. We propose an approach based on agent masking to enable a simple Deep Q-Network algorithm to scale on large swarms while training on relatively smaller swarms. We train our approach using multiple swarm sizes and learning rates and compare our results using metrics such as the number of collisions. We also compare the ability of our approach to scale on swarms ranging from five to 25 agents using metrics and visual analysis. Our proposed solution was able to guide a swarm of up to 100 agents to a target while keeping a good swarm cohesion and avoiding collision.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 88-103"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241324000053/pdfft?md5=3db6df465ddd69e88b21f962232e9c5e&pid=1-s2.0-S2667241324000053-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141292146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent path planning for cognitive mobile robot based on Dhouib-Matrix-SPP method 基于 Dhouib-Matrix-SPP 方法的认知型移动机器人智能路径规划
Pub Date : 2024-01-01 DOI: 10.1016/j.cogr.2024.02.001
Souhail Dhouib

The Mobile Robot Path Problem looks to find the optimal shortest path from the starting point to the target point with collision-free for a mobile robot. This is a popular issue in robotics and in this paper the environment is considered as static and represented as a bidirectional grid map. Besides, the novel optimal method Dhouib-Matrix-SPP (DM-SPP) is applied to create the optimal shortest path for a mobile robot in a static environment. DM-SPP is a greedy method based on a column row navigation in the distance matrix and characterized by its rapidity to solve sparse graphs. The comparative analysis is conducted by applying DM-SPP on thirteen test cases and comparing its results to the results given by four metaheuristics the Max-Min Ant System, the Ant System with punitive measures, the A* and the Improved Hybrid A*. The outcomes acquired from different scenarios indicate that the proposed DM-SPP method can rapidly outperform the four predefined artificial intelligence methods.

移动机器人路径问题旨在为移动机器人找到从起点到目标点的最佳无碰撞最短路径。在本文中,环境被视为静态,并表示为双向网格图。此外,本文还采用了新颖的最优方法 Dhouib-Matrix-SPP (DM-SPP) 为移动机器人在静态环境中创建最优最短路径。DM-SPP 是一种基于距离矩阵列行导航的贪婪方法,其特点是能快速求解稀疏图。通过在 13 个测试案例中应用 DM-SPP 并将其结果与四种元启发式方法(最大最小蚂蚁系统、带惩罚措施的蚂蚁系统、A* 和改进的混合 A*)的结果进行比较分析。从不同场景获得的结果表明,拟议的 DM-SPP 方法可以迅速超越四种预定义的人工智能方法。
{"title":"Intelligent path planning for cognitive mobile robot based on Dhouib-Matrix-SPP method","authors":"Souhail Dhouib","doi":"10.1016/j.cogr.2024.02.001","DOIUrl":"10.1016/j.cogr.2024.02.001","url":null,"abstract":"<div><p>The Mobile Robot Path Problem looks to find the optimal shortest path from the starting point to the target point with collision-free for a mobile robot. This is a popular issue in robotics and in this paper the environment is considered as static and represented as a bidirectional grid map. Besides, the novel optimal method Dhouib-Matrix-SPP (DM-SPP) is applied to create the optimal shortest path for a mobile robot in a static environment. DM-SPP is a greedy method based on a column row navigation in the distance matrix and characterized by its rapidity to solve sparse graphs. The comparative analysis is conducted by applying DM-SPP on thirteen test cases and comparing its results to the results given by four metaheuristics the Max-Min Ant System, the Ant System with punitive measures, the A* and the Improved Hybrid A*. The outcomes acquired from different scenarios indicate that the proposed DM-SPP method can rapidly outperform the four predefined artificial intelligence methods.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 62-73"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241324000028/pdfft?md5=753a3935e8733e20519f0d68f97e618f&pid=1-s2.0-S2667241324000028-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139966008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Power inspection UAV task assignment matrix reversal genetic algorithm
Pub Date : 2024-01-01 DOI: 10.1016/j.cogr.2024.11.006
Kai Liu , Meizhao Liu , Ming Tang , Chen Zhang
Traditional manual power inspections are characterized by low efficiency, lengthy processes, and high costs. Existing research on UAV-based power inspections has often overlooked critical factors such as the risk levels of target tasks, the duration of tasks executed by UAVs, and the utility per unit task. To address these gaps, this paper proposes a task allocation method for UAV power inspections based on the Time Window Matrix Reversal Genetic Algorithm (TMGA). Firstly, the proposed cost model accounts for the risk levels of inspection tasks and the impact of low-altitude flight on energy consumption. Secondly, an inspection task allocation model is constructed with the goal of maximizing UAV inspection unit utility. The model is then optimized using two-point crossover and single-point reversal mutation operations, which enhance the UAV unit utility and generate an optimal allocation matrix. The performance of TMGA is evaluated through simulation experiments in three different scenarios, comparing it with existing algorithms. The results show that TMGA outperforms these algorithms in terms of average task time, task completion rate, and unit utility. Specifically, TMGA reduces the average task time by 37% compared to the Cluster Grouping Consensus-base Bundle Algorithm and improves task unit utility by 56.91% compared to the Genetic Algorithm.
{"title":"Power inspection UAV task assignment matrix reversal genetic algorithm","authors":"Kai Liu ,&nbsp;Meizhao Liu ,&nbsp;Ming Tang ,&nbsp;Chen Zhang","doi":"10.1016/j.cogr.2024.11.006","DOIUrl":"10.1016/j.cogr.2024.11.006","url":null,"abstract":"<div><div>Traditional manual power inspections are characterized by low efficiency, lengthy processes, and high costs. Existing research on UAV-based power inspections has often overlooked critical factors such as the risk levels of target tasks, the duration of tasks executed by UAVs, and the utility per unit task. To address these gaps, this paper proposes a task allocation method for UAV power inspections based on the Time Window Matrix Reversal Genetic Algorithm (TMGA). Firstly, the proposed cost model accounts for the risk levels of inspection tasks and the impact of low-altitude flight on energy consumption. Secondly, an inspection task allocation model is constructed with the goal of maximizing UAV inspection unit utility. The model is then optimized using two-point crossover and single-point reversal mutation operations, which enhance the UAV unit utility and generate an optimal allocation matrix. The performance of TMGA is evaluated through simulation experiments in three different scenarios, comparing it with existing algorithms. The results show that TMGA outperforms these algorithms in terms of average task time, task completion rate, and unit utility. Specifically, TMGA reduces the average task time by 37% compared to the Cluster Grouping Consensus-base Bundle Algorithm and improves task unit utility by 56.91% compared to the Genetic Algorithm.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 245-258"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143129866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robot assisted knee joint RoM exercise: A PID parallel compensator architecture through impedance estimation 机器人辅助膝关节 RoM 运动:通过阻抗估计实现 PID 并行补偿器架构
Pub Date : 2023-12-09 DOI: 10.1016/j.cogr.2023.11.003
M. Akhtaruzzaman , Amir A. Shafie , Md Raisuddin Khan , Md Mozasser Rahman

Knee joint rehabilitation exercise refers to a therapeutic procedure of a patient having dysfunctions in certain abilities to move knee joint due to some medical conditions like trauma or paralysis. The exercise is basically a series of repeated assistive physical movements within the range of motion (RoM) of the joint. Reflex action of limbs during RoM exercise causes inappropriate balance of load which may cause secondary injuries, such as damages of muscle or tendon tissues. Establishing correlation between impedance data and limb motions is important to solve this problem. This paper aims to design and modeling of a robotic arm with an original approach in control strategy which is developed based on the correlation in between the joint-impedances and joint-motion characteristics during exercise. The knee joint impedances are estimated based on the internal feedback of the system dynamics, that lead to design the torque compensator to improve the overall control signals in real time. This paper also demonstrates the characteristics of various responses of the system during exercise with human subject. Results have reflected good performances with low position and velocity tracking errors, ±0.02 and 0.04rad.sec1 during hold phase; and ±0.14 and 0.17rad.sec1 during motion phse. Though, the limitation of the prototype is its current RoM (limited to 025), the system has potential in the application of RoM exercise for paraplegic or monoplegic patients.

膝关节康复锻炼是指对因外伤或瘫痪等疾病导致膝关节活动能力障碍的患者进行的一种治疗程序。这项运动基本上是在关节活动范围(RoM)内反复进行一系列辅助性肢体运动。在 RoM 运动过程中,肢体的反射动作会导致不适当的负荷平衡,从而可能造成二次伤害,如肌肉或肌腱组织损伤。建立阻抗数据与肢体运动之间的相关性对于解决这一问题非常重要。本文旨在设计一种机械臂,并根据运动时关节阻抗和关节运动特性之间的相关性,采用一种新颖的控制策略对其进行建模。膝关节阻抗是根据系统动态的内部反馈进行估算的,从而设计出扭矩补偿器,实时改善整体控制信号。本文还展示了该系统在人体运动时的各种响应特性。结果表明,该系统性能良好,位置和速度跟踪误差小,在保持阶段分别为 ±0.02∘ 和 0.04rad.sec-1;在运动阶段分别为 ±0.14∘ 和 0.17rad.sec-1。虽然该原型的局限性在于其当前的 RoM(仅限于 0∘-25∘),但该系统在截瘫或单瘫患者的 RoM 运动应用方面具有潜力。
{"title":"Robot assisted knee joint RoM exercise: A PID parallel compensator architecture through impedance estimation","authors":"M. Akhtaruzzaman ,&nbsp;Amir A. Shafie ,&nbsp;Md Raisuddin Khan ,&nbsp;Md Mozasser Rahman","doi":"10.1016/j.cogr.2023.11.003","DOIUrl":"10.1016/j.cogr.2023.11.003","url":null,"abstract":"<div><p>Knee joint rehabilitation exercise refers to a therapeutic procedure of a patient having dysfunctions in certain abilities to move knee joint due to some medical conditions like trauma or paralysis. The exercise is basically a series of repeated assistive physical movements within the range of motion (RoM) of the joint. Reflex action of limbs during RoM exercise causes inappropriate balance of load which may cause secondary injuries, such as damages of muscle or tendon tissues. Establishing correlation between impedance data and limb motions is important to solve this problem. This paper aims to design and modeling of a robotic arm with an original approach in control strategy which is developed based on the correlation in between the joint-impedances and joint-motion characteristics during exercise. The knee joint impedances are estimated based on the internal feedback of the system dynamics, that lead to design the torque compensator to improve the overall control signals in real time. This paper also demonstrates the characteristics of various responses of the system during exercise with human subject. Results have reflected good performances with low position and velocity tracking errors, <span><math><mrow><mo>±</mo><mn>0</mn><mo>.</mo><msup><mn>02</mn><mo>∘</mo></msup></mrow></math></span> and <span><math><mrow><mn>0.04</mn><mi>r</mi><mi>a</mi><mi>d</mi><mo>.</mo><mi>s</mi><mi>e</mi><msup><mi>c</mi><mrow><mo>−</mo><mn>1</mn></mrow></msup></mrow></math></span> during hold phase; and <span><math><mrow><mo>±</mo><mn>0</mn><mo>.</mo><msup><mn>14</mn><mo>∘</mo></msup></mrow></math></span> and <span><math><mrow><mn>0.17</mn><mi>r</mi><mi>a</mi><mi>d</mi><mo>.</mo><mi>s</mi><mi>e</mi><msup><mi>c</mi><mrow><mo>−</mo><mn>1</mn></mrow></msup></mrow></math></span> during motion phse. Though, the limitation of the prototype is its current RoM (limited to <span><math><msup><mn>0</mn><mo>∘</mo></msup></math></span>–<span><math><msup><mn>25</mn><mo>∘</mo></msup></math></span>), the system has potential in the application of RoM exercise for paraplegic or monoplegic patients.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 42-61"},"PeriodicalIF":0.0,"publicationDate":"2023-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266724132300040X/pdfft?md5=e66adda08f021e960ed5946bd42e69d8&pid=1-s2.0-S266724132300040X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138619946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing Speech Emotion Recognition with Hilbert Curve and convolutional neural network 利用希尔伯特曲线和卷积神经网络优化语音情感识别
Pub Date : 2023-12-05 DOI: 10.1016/j.cogr.2023.12.001
Zijun Yang , Shi Zhou , Lifeng Zhang , Seiichi Serikawa

In the realm of speech emotion recognition, researchers strive to refine representation methods for improved emotional information capture. Traditional one-dimensional time series classification falls short in expressing intricate emotional patterns present in speech signals, posing challenges in accuracy and robustness. This study introduces an innovative algorithm leveraging Hilbert curves to transform one-dimensional speech data into two-dimensional form, enhancing feature extraction accuracy. A tiling module based on Hilbert curve maximizes Hilbert curve arrangements for improved emotional information capture. Results reveal spatial efficiency gains up to 23,195 times pixel units, enhancing data storage. With an exceptional 98.73% accuracy, the proposed approach traditional methods, affirming its superior emotion classification performance on the same dataset. These empirical findings underscore the effectiveness of our proposed method in advancing speech emotion recognition.

在语音情感识别领域,研究人员努力改进表示方法,以提高情感信息捕捉能力。传统的一维时间序列分类法无法表达语音信号中错综复杂的情感模式,在准确性和鲁棒性方面存在挑战。本研究引入了一种创新算法,利用希尔伯特曲线将一维语音数据转换为二维形式,从而提高特征提取的准确性。基于希尔伯特曲线的平铺模块最大限度地利用了希尔伯特曲线排列,从而提高了情感信息的捕捉能力。结果显示,空间效率提高了 23 195 倍像素单位,增强了数据存储能力。所提出的方法的准确率高达 98.73%,超越了传统方法,肯定了其在相同数据集上的卓越情感分类性能。这些实证研究结果凸显了我们提出的方法在推进语音情感识别方面的有效性。
{"title":"Optimizing Speech Emotion Recognition with Hilbert Curve and convolutional neural network","authors":"Zijun Yang ,&nbsp;Shi Zhou ,&nbsp;Lifeng Zhang ,&nbsp;Seiichi Serikawa","doi":"10.1016/j.cogr.2023.12.001","DOIUrl":"10.1016/j.cogr.2023.12.001","url":null,"abstract":"<div><p>In the realm of speech emotion recognition, researchers strive to refine representation methods for improved emotional information capture. Traditional one-dimensional time series classification falls short in expressing intricate emotional patterns present in speech signals, posing challenges in accuracy and robustness. This study introduces an innovative algorithm leveraging Hilbert curves to transform one-dimensional speech data into two-dimensional form, enhancing feature extraction accuracy. A tiling module based on Hilbert curve maximizes Hilbert curve arrangements for improved emotional information capture. Results reveal spatial efficiency gains up to 23,195 times pixel units, enhancing data storage. With an exceptional 98.73% accuracy, the proposed approach traditional methods, affirming its superior emotion classification performance on the same dataset. These empirical findings underscore the effectiveness of our proposed method in advancing speech emotion recognition.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 30-41"},"PeriodicalIF":0.0,"publicationDate":"2023-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241323000411/pdfft?md5=bfed8ff77493b33cdfb6f93a3ba0a2c9&pid=1-s2.0-S2667241323000411-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138609217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An improved single short detection method for smart vision-based water garbage cleaning robot 基于智能视觉的水上垃圾清洁机器人的改进型单短检测方法
Pub Date : 2023-11-22 DOI: 10.1016/j.cogr.2023.11.002
Anandakumar Haldorai, Babitha Lincy R, Suriya M, Minu Balakrishnan

These days, plastic trash is exponentially overwhelming our waterways. The catastrophe has attracted global attention at this point. As a result, protecting the environment on the water's surface has received increasing focus. Currently, manpower can be used to clean up contaminated water bodies like ponds, rivers, and oceans. Using the current cleaning approach results in low efficiency and hazard. The detection, collection, sorting, and removal of plastic trash from such water surfaces has been the subject of relatively little robotic research, despite the dire circumstances. From private sources, there are very few individual efforts to be found. In order to attain great efficiency without human assistance or operation, a fully autonomous water surface cleaning robot is proposed in this study. The robot was created to adapt to any type of water body found in the real world. An efficient object identification machine learning technique can be suggested for the creation of autonomous cleaning robots. This study improved the Single Short Detection (SSD) method to recognise objects accurately. Because of the enhanced detection techniques, the robot is able to collect trash on its own. With a mean average precision (mAP) of 94.099 % and a detection speed of up to 64.67 frames per second, experimental findings show that the enhanced SSD has exceptional detection speed and accuracy.

如今,塑料垃圾正以指数级的速度淹没我们的水道。目前,这场灾难已引起全球关注。因此,保护水面环境越来越受到重视。目前,可以利用人力清理池塘、河流和海洋等受污染的水体。目前的清理方法效率低、危害大。尽管情况危急,但有关检测、收集、分类和清除这些水体表面塑料垃圾的机器人研究却相对较少。从私人来源来看,也很少有单独的研究成果。为了在无人协助或操作的情况下实现高效率,本研究提出了一种完全自主的水面清洁机器人。该机器人可适应现实世界中任何类型的水体。建议采用高效的物体识别机器学习技术来创建自主清洁机器人。本研究改进了单短检测(SSD)方法,以准确识别物体。由于采用了增强型检测技术,机器人能够自行收集垃圾。实验结果表明,增强型 SSD 的平均精度 (mAP) 为 94.099 %,检测速度高达每秒 64.67 帧,具有出色的检测速度和精度。
{"title":"An improved single short detection method for smart vision-based water garbage cleaning robot","authors":"Anandakumar Haldorai,&nbsp;Babitha Lincy R,&nbsp;Suriya M,&nbsp;Minu Balakrishnan","doi":"10.1016/j.cogr.2023.11.002","DOIUrl":"https://doi.org/10.1016/j.cogr.2023.11.002","url":null,"abstract":"<div><p>These days, plastic trash is exponentially overwhelming our waterways. The catastrophe has attracted global attention at this point. As a result, protecting the environment on the water's surface has received increasing focus. Currently, manpower can be used to clean up contaminated water bodies like ponds, rivers, and oceans. Using the current cleaning approach results in low efficiency and hazard. The detection, collection, sorting, and removal of plastic trash from such water surfaces has been the subject of relatively little robotic research, despite the dire circumstances. From private sources, there are very few individual efforts to be found. In order to attain great efficiency without human assistance or operation, a fully autonomous water surface cleaning robot is proposed in this study. The robot was created to adapt to any type of water body found in the real world. An efficient object identification machine learning technique can be suggested for the creation of autonomous cleaning robots. This study improved the Single Short Detection (SSD) method to recognise objects accurately. Because of the enhanced detection techniques, the robot is able to collect trash on its own. With a mean average precision (mAP) of 94.099 % and a detection speed of up to 64.67 frames per second, experimental findings show that the enhanced SSD has exceptional detection speed and accuracy.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 19-29"},"PeriodicalIF":0.0,"publicationDate":"2023-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241323000393/pdfft?md5=a8305dcc49d8d37defb2594ad2b10d51&pid=1-s2.0-S2667241323000393-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138738971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on YOLOv3 model compression strategy for UAV deployment 无人机部署中YOLOv3模型压缩策略研究
Pub Date : 2023-11-17 DOI: 10.1016/j.cogr.2023.11.001
Fei Xu , Litao Huang , Xiaoyang Gao , Tingting Yu , Leyi Zhang

UAVs are often limited by limited resources when performing flight tasks, especially the contradiction between storage resources and computing resources when the huge YOLOv3 model is deployed on the edge UAVs. In this paper, we tend to compress YOLOv3 model in different aspects to achieve load availability at the edge. In this paper, deep separable convolution is introduced to reduce the computation of the model. Then, PR regularization term is used as the regularization term of sparse training to better distinguish scaling factors, and then the hybrid pruning combining channel pruning and layer pruning is carried out on the model according to scaling factors, in order to reduce the number of model parameters and the amount of calculation. Finally, since the training data is a 32-bit floating point number, DoReFa-Net quantization method is used to quantify the model, so as to compress the storage capacity of the model. The experimental results show that the compression scheme proposed in this paper can effectively reduce the number of parameters by 97.5 % and the calculation amount by 82.3 %, and can maintain the original detection efficiency of UAVs.

无人机在执行飞行任务时往往受到有限资源的限制,特别是在边缘无人机上部署庞大的YOLOv3模型时,存储资源与计算资源之间的矛盾尤为突出。在本文中,我们倾向于对YOLOv3模型进行不同方面的压缩,以实现边缘的负载可用性。本文引入深度可分离卷积来减少模型的计算量。然后,利用PR正则化项作为稀疏训练的正则化项,更好地区分尺度因子,然后根据尺度因子对模型进行通道剪枝和层剪枝相结合的混合剪枝,以减少模型参数的数量和计算量。最后,由于训练数据为32位浮点数,采用DoReFa-Net量化方法对模型进行量化,从而压缩模型的存储容量。实验结果表明,本文提出的压缩方案可有效减少97.5%的参数个数和82.3%的计算量,并能保持无人机原有的检测效率。
{"title":"Research on YOLOv3 model compression strategy for UAV deployment","authors":"Fei Xu ,&nbsp;Litao Huang ,&nbsp;Xiaoyang Gao ,&nbsp;Tingting Yu ,&nbsp;Leyi Zhang","doi":"10.1016/j.cogr.2023.11.001","DOIUrl":"https://doi.org/10.1016/j.cogr.2023.11.001","url":null,"abstract":"<div><p>UAVs are often limited by limited resources when performing flight tasks, especially the contradiction between storage resources and computing resources when the huge YOLOv3 model is deployed on the edge UAVs. In this paper, we tend to compress YOLOv3 model in different aspects to achieve load availability at the edge. In this paper, deep separable convolution is introduced to reduce the computation of the model. Then, PR regularization term is used as the regularization term of sparse training to better distinguish scaling factors, and then the hybrid pruning combining channel pruning and layer pruning is carried out on the model according to scaling factors, in order to reduce the number of model parameters and the amount of calculation. Finally, since the training data is a 32-bit floating point number, DoReFa-Net quantization method is used to quantify the model, so as to compress the storage capacity of the model. The experimental results show that the compression scheme proposed in this paper can effectively reduce the number of parameters by 97.5 % and the calculation amount by 82.3 %, and can maintain the original detection efficiency of UAVs.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 8-18"},"PeriodicalIF":0.0,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241323000381/pdfft?md5=c325aa36bb6e1759d99185c61f3a6b9f&pid=1-s2.0-S2667241323000381-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138439137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SRGAN in underwater vision 水下视觉中的SRGAN
Pub Date : 2023-11-03 DOI: 10.1016/j.cogr.2023.08.002
Dingqian Zhao

In recent years, the rapid industrialization of the world has led to an increasing importance of energy minerals. However, due to the scarcity of mineral resources, opportunities to rely on alternative energy are escalating. As a result, exploration of ocean resources, which exist abundantly in the sea, is being pursued. However, the manual exploration of ocean resources by diving and visually searching is dangerous and impractical. Therefore, it is pertinent to safely advance underwater exploration by having robots perform the work instead. In underwater environments, robots are commonly used as a mainstream exploration tool due to the various hazardous environmental conditions. However, there are several problems with controlling robots in underwater environments, and one of them is poor visibility underwater. Therefore, to improve visibility underwater, efforts are being made to achieve high resolution using super-resolution technology on underwater images. In this paper we first introduce the general model and architecture in GAN. Then we combine the GAN modal and characteristics of the underwater environment, elaborating how ESRGAN can be suitable for such circumstance. For data from ECCV2018 PIRM-SR, ESRGAN outperforms other traditional model like EnhanceNet [1], EDSR [2], RCAN [3], at least 24 % [4]. Such model can be equipped with robotics that highly depends on the resolution of the image, such as autonomous underwater vehicles (AUVs) and remotely operated vehicles (ROVs).

近年来,世界工业化的迅速发展使得能源矿产的重要性日益凸显。然而,由于矿产资源的稀缺,依赖替代能源的机会正在增加。因此,正在进行对海洋中丰富的海洋资源的勘探。然而,通过潜水和目视搜索对海洋资源进行人工勘探是危险和不切实际的。因此,用机器人代替机器人来安全推进水下探测是有意义的。在水下环境中,由于各种危险的环境条件,机器人被普遍用作主流的勘探工具。然而,在水下环境中控制机器人存在几个问题,其中之一是水下能见度差。因此,为了提高水下的能见度,人们正在努力利用超分辨率技术实现水下图像的高分辨率。本文首先介绍了GAN的一般模型和结构。然后结合GAN模态和水下环境的特点,阐述了ESRGAN如何适用于这种环境。对于来自ECCV2018 PIRM-SR的数据,ESRGAN比EnhanceNet[1]、EDSR[2]、RCAN[3]等其他传统模型至少高出24%[4]。这种模型可以装备高度依赖图像分辨率的机器人,如自主水下航行器(auv)和远程操作车辆(rov)。
{"title":"SRGAN in underwater vision","authors":"Dingqian Zhao","doi":"10.1016/j.cogr.2023.08.002","DOIUrl":"10.1016/j.cogr.2023.08.002","url":null,"abstract":"<div><p>In recent years, the rapid industrialization of the world has led to an increasing importance of energy minerals. However, due to the scarcity of mineral resources, opportunities to rely on alternative energy are escalating. As a result, exploration of ocean resources, which exist abundantly in the sea, is being pursued. However, the manual exploration of ocean resources by diving and visually searching is dangerous and impractical. Therefore, it is pertinent to safely advance underwater exploration by having robots perform the work instead. In underwater environments, robots are commonly used as a mainstream exploration tool due to the various hazardous environmental conditions. However, there are several problems with controlling robots in underwater environments, and one of them is poor visibility underwater. Therefore, to improve visibility underwater, efforts are being made to achieve high resolution using super-resolution technology on underwater images. In this paper we first introduce the general model and architecture in GAN. Then we combine the GAN modal and characteristics of the underwater environment, elaborating how ESRGAN can be suitable for such circumstance. For data from ECCV2018 PIRM-SR, ESRGAN outperforms other traditional model like EnhanceNet <span>[1]</span>, EDSR <span>[2]</span>, RCAN <span>[3]</span>, at least 24 % <span>[4]</span>. Such model can be equipped with robotics that highly depends on the resolution of the image, such as autonomous underwater vehicles (AUVs) and remotely operated vehicles (ROVs).</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 1-7"},"PeriodicalIF":0.0,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241323000289/pdfft?md5=327c4a7880ba070fb45e7c349a11ba1e&pid=1-s2.0-S2667241323000289-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135410456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Review Of The Latest Research Technologies Related To 3D Point Cloud 三维点云最新研究技术综述
Pub Date : 2023-09-01 DOI: 10.1016/j.cogr.2023.09.001
Zhang Xin
In recent years, point clouds have been widely used in fields such as computer vision, medical image processing, virtual and augmented reality, autonomous driving, and robotics. Despite the remarkable achievements of deep learning methods in processing 2D data, they still face some unique challenges when processing 3D point cloud data [1]. The unstructured and irregular nature of point clouds makes it difficult to directly apply traditional deep learning methods, so point cloud deep learning is still in its infancy. However, some progress has been made in the field of deep learning for point clouds. Researchers have proposed many innovative methods and network architectures for solving tasks such as classification, segmentation, generation, and detection of point cloud data. These methods include the network structure of PointNet [2], PointRCNN [9] and so on as well as various data enhancement and optimization strategies. These research results laid the foundation for the development of point cloud deep learning, and provided important reference and inspiration for future research.
近年来,点云在计算机视觉、医学图像处理、虚拟与增强现实、自动驾驶、机器人等领域得到了广泛的应用。尽管深度学习方法在处理二维数据方面取得了显著成就,但在处理三维点云数据时仍然面临一些独特的挑战[1]。点云的非结构化和不规则性使得传统的深度学习方法难以直接应用,因此点云深度学习还处于起步阶段。然而,在点云的深度学习领域已经取得了一些进展。研究人员提出了许多创新的方法和网络架构来解决点云数据的分类、分割、生成和检测等任务。这些方法包括PointNet[2]、PointRCNN[9]等网络结构以及各种数据增强和优化策略。这些研究成果为点云深度学习的发展奠定了基础,为今后的研究提供了重要的参考和启示。
{"title":"A Review Of The Latest Research Technologies Related To 3D Point Cloud","authors":"Zhang Xin","doi":"10.1016/j.cogr.2023.09.001","DOIUrl":"https://doi.org/10.1016/j.cogr.2023.09.001","url":null,"abstract":"In recent years, point clouds have been widely used in fields such as computer vision, medical image processing, virtual and augmented reality, autonomous driving, and robotics. Despite the remarkable achievements of deep learning methods in processing 2D data, they still face some unique challenges when processing 3D point cloud data [1]. The unstructured and irregular nature of point clouds makes it difficult to directly apply traditional deep learning methods, so point cloud deep learning is still in its infancy. However, some progress has been made in the field of deep learning for point clouds. Researchers have proposed many innovative methods and network architectures for solving tasks such as classification, segmentation, generation, and detection of point cloud data. These methods include the network structure of PointNet [2], PointRCNN [9] and so on as well as various data enhancement and optimization strategies. These research results laid the foundation for the development of point cloud deep learning, and provided important reference and inspiration for future research.","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135638057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unmanned aerial vehicles: A review 无人飞行器:综述
Pub Date : 2023-01-01 DOI: 10.1016/j.cogr.2022.12.004
Asif Ali Laghari , Awais Khan Jumani , Rashid Ali Laghari , Haque Nawaz

The lightweight Unmanned Aerial Vehicle (UAV) flight activities are constrained, particularly in the UAV range or activity span and perseverance, by the strategic correspondence link capabilities. This paper tends to the different overlap issue of trading off a set of mission prerequisites, the UAV execution parameters, and strategic credibility; thus compromising between the communication load characterized by a crucial, communication link transmitting power necessities, power accessibility onboard UAV as a weight-restricted parameter, and the UAV security.

轻型无人机的飞行活动受到战略通信链路能力的限制,特别是在无人机的射程或活动范围和毅力方面。本文倾向于权衡一组任务先决条件、无人机执行参数和战略可信度的不同重叠问题;从而在通信负载和无人机安全性之间进行折衷,通信负载的特征在于关键的通信链路传输功率需求、无人机上的功率可达性作为重量限制参数。
{"title":"Unmanned aerial vehicles: A review","authors":"Asif Ali Laghari ,&nbsp;Awais Khan Jumani ,&nbsp;Rashid Ali Laghari ,&nbsp;Haque Nawaz","doi":"10.1016/j.cogr.2022.12.004","DOIUrl":"https://doi.org/10.1016/j.cogr.2022.12.004","url":null,"abstract":"<div><p>The lightweight Unmanned Aerial Vehicle (UAV) flight activities are constrained, particularly in the UAV range or activity span and perseverance, by the strategic correspondence link capabilities. This paper tends to the different overlap issue of trading off a set of mission prerequisites, the UAV execution parameters, and strategic credibility; thus compromising between the communication load characterized by a crucial, communication link transmitting power necessities, power accessibility onboard UAV as a weight-restricted parameter, and the UAV security.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"3 ","pages":"Pages 8-22"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49723424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Cognitive Robotics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1