Pub Date : 2024-01-01DOI: 10.1016/j.cogr.2024.05.003
Marc-Andrė Blais, Moulay A. Akhloufi
Unmanned vehicles have seen a significant increase in a wide variety of fields such as for logistics, agriculture and other commercial applications. Controlling swarms of unmanned vehicles is a challenging task that requires complex autonomous control systems. Reinforcement learning has been proposed as a solution to this challenge. We propose an approach based on agent masking to enable a simple Deep Q-Network algorithm to scale on large swarms while training on relatively smaller swarms. We train our approach using multiple swarm sizes and learning rates and compare our results using metrics such as the number of collisions. We also compare the ability of our approach to scale on swarms ranging from five to 25 agents using metrics and visual analysis. Our proposed solution was able to guide a swarm of up to 100 agents to a target while keeping a good swarm cohesion and avoiding collision.
{"title":"Scalable and cohesive swarm control based on reinforcement learning","authors":"Marc-Andrė Blais, Moulay A. Akhloufi","doi":"10.1016/j.cogr.2024.05.003","DOIUrl":"https://doi.org/10.1016/j.cogr.2024.05.003","url":null,"abstract":"<div><p>Unmanned vehicles have seen a significant increase in a wide variety of fields such as for logistics, agriculture and other commercial applications. Controlling swarms of unmanned vehicles is a challenging task that requires complex autonomous control systems. Reinforcement learning has been proposed as a solution to this challenge. We propose an approach based on agent masking to enable a simple Deep Q-Network algorithm to scale on large swarms while training on relatively smaller swarms. We train our approach using multiple swarm sizes and learning rates and compare our results using metrics such as the number of collisions. We also compare the ability of our approach to scale on swarms ranging from five to 25 agents using metrics and visual analysis. Our proposed solution was able to guide a swarm of up to 100 agents to a target while keeping a good swarm cohesion and avoiding collision.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 88-103"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241324000053/pdfft?md5=3db6df465ddd69e88b21f962232e9c5e&pid=1-s2.0-S2667241324000053-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141292146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-01DOI: 10.1016/j.cogr.2024.02.001
Souhail Dhouib
The Mobile Robot Path Problem looks to find the optimal shortest path from the starting point to the target point with collision-free for a mobile robot. This is a popular issue in robotics and in this paper the environment is considered as static and represented as a bidirectional grid map. Besides, the novel optimal method Dhouib-Matrix-SPP (DM-SPP) is applied to create the optimal shortest path for a mobile robot in a static environment. DM-SPP is a greedy method based on a column row navigation in the distance matrix and characterized by its rapidity to solve sparse graphs. The comparative analysis is conducted by applying DM-SPP on thirteen test cases and comparing its results to the results given by four metaheuristics the Max-Min Ant System, the Ant System with punitive measures, the A* and the Improved Hybrid A*. The outcomes acquired from different scenarios indicate that the proposed DM-SPP method can rapidly outperform the four predefined artificial intelligence methods.
{"title":"Intelligent path planning for cognitive mobile robot based on Dhouib-Matrix-SPP method","authors":"Souhail Dhouib","doi":"10.1016/j.cogr.2024.02.001","DOIUrl":"10.1016/j.cogr.2024.02.001","url":null,"abstract":"<div><p>The Mobile Robot Path Problem looks to find the optimal shortest path from the starting point to the target point with collision-free for a mobile robot. This is a popular issue in robotics and in this paper the environment is considered as static and represented as a bidirectional grid map. Besides, the novel optimal method Dhouib-Matrix-SPP (DM-SPP) is applied to create the optimal shortest path for a mobile robot in a static environment. DM-SPP is a greedy method based on a column row navigation in the distance matrix and characterized by its rapidity to solve sparse graphs. The comparative analysis is conducted by applying DM-SPP on thirteen test cases and comparing its results to the results given by four metaheuristics the Max-Min Ant System, the Ant System with punitive measures, the A* and the Improved Hybrid A*. The outcomes acquired from different scenarios indicate that the proposed DM-SPP method can rapidly outperform the four predefined artificial intelligence methods.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 62-73"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241324000028/pdfft?md5=753a3935e8733e20519f0d68f97e618f&pid=1-s2.0-S2667241324000028-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139966008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-01DOI: 10.1016/j.cogr.2024.11.006
Kai Liu , Meizhao Liu , Ming Tang , Chen Zhang
Traditional manual power inspections are characterized by low efficiency, lengthy processes, and high costs. Existing research on UAV-based power inspections has often overlooked critical factors such as the risk levels of target tasks, the duration of tasks executed by UAVs, and the utility per unit task. To address these gaps, this paper proposes a task allocation method for UAV power inspections based on the Time Window Matrix Reversal Genetic Algorithm (TMGA). Firstly, the proposed cost model accounts for the risk levels of inspection tasks and the impact of low-altitude flight on energy consumption. Secondly, an inspection task allocation model is constructed with the goal of maximizing UAV inspection unit utility. The model is then optimized using two-point crossover and single-point reversal mutation operations, which enhance the UAV unit utility and generate an optimal allocation matrix. The performance of TMGA is evaluated through simulation experiments in three different scenarios, comparing it with existing algorithms. The results show that TMGA outperforms these algorithms in terms of average task time, task completion rate, and unit utility. Specifically, TMGA reduces the average task time by 37% compared to the Cluster Grouping Consensus-base Bundle Algorithm and improves task unit utility by 56.91% compared to the Genetic Algorithm.
{"title":"Power inspection UAV task assignment matrix reversal genetic algorithm","authors":"Kai Liu , Meizhao Liu , Ming Tang , Chen Zhang","doi":"10.1016/j.cogr.2024.11.006","DOIUrl":"10.1016/j.cogr.2024.11.006","url":null,"abstract":"<div><div>Traditional manual power inspections are characterized by low efficiency, lengthy processes, and high costs. Existing research on UAV-based power inspections has often overlooked critical factors such as the risk levels of target tasks, the duration of tasks executed by UAVs, and the utility per unit task. To address these gaps, this paper proposes a task allocation method for UAV power inspections based on the Time Window Matrix Reversal Genetic Algorithm (TMGA). Firstly, the proposed cost model accounts for the risk levels of inspection tasks and the impact of low-altitude flight on energy consumption. Secondly, an inspection task allocation model is constructed with the goal of maximizing UAV inspection unit utility. The model is then optimized using two-point crossover and single-point reversal mutation operations, which enhance the UAV unit utility and generate an optimal allocation matrix. The performance of TMGA is evaluated through simulation experiments in three different scenarios, comparing it with existing algorithms. The results show that TMGA outperforms these algorithms in terms of average task time, task completion rate, and unit utility. Specifically, TMGA reduces the average task time by 37% compared to the Cluster Grouping Consensus-base Bundle Algorithm and improves task unit utility by 56.91% compared to the Genetic Algorithm.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 245-258"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143129866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-12-09DOI: 10.1016/j.cogr.2023.11.003
M. Akhtaruzzaman , Amir A. Shafie , Md Raisuddin Khan , Md Mozasser Rahman
Knee joint rehabilitation exercise refers to a therapeutic procedure of a patient having dysfunctions in certain abilities to move knee joint due to some medical conditions like trauma or paralysis. The exercise is basically a series of repeated assistive physical movements within the range of motion (RoM) of the joint. Reflex action of limbs during RoM exercise causes inappropriate balance of load which may cause secondary injuries, such as damages of muscle or tendon tissues. Establishing correlation between impedance data and limb motions is important to solve this problem. This paper aims to design and modeling of a robotic arm with an original approach in control strategy which is developed based on the correlation in between the joint-impedances and joint-motion characteristics during exercise. The knee joint impedances are estimated based on the internal feedback of the system dynamics, that lead to design the torque compensator to improve the overall control signals in real time. This paper also demonstrates the characteristics of various responses of the system during exercise with human subject. Results have reflected good performances with low position and velocity tracking errors, and during hold phase; and and during motion phse. Though, the limitation of the prototype is its current RoM (limited to –), the system has potential in the application of RoM exercise for paraplegic or monoplegic patients.
膝关节康复锻炼是指对因外伤或瘫痪等疾病导致膝关节活动能力障碍的患者进行的一种治疗程序。这项运动基本上是在关节活动范围(RoM)内反复进行一系列辅助性肢体运动。在 RoM 运动过程中,肢体的反射动作会导致不适当的负荷平衡,从而可能造成二次伤害,如肌肉或肌腱组织损伤。建立阻抗数据与肢体运动之间的相关性对于解决这一问题非常重要。本文旨在设计一种机械臂,并根据运动时关节阻抗和关节运动特性之间的相关性,采用一种新颖的控制策略对其进行建模。膝关节阻抗是根据系统动态的内部反馈进行估算的,从而设计出扭矩补偿器,实时改善整体控制信号。本文还展示了该系统在人体运动时的各种响应特性。结果表明,该系统性能良好,位置和速度跟踪误差小,在保持阶段分别为 ±0.02∘ 和 0.04rad.sec-1;在运动阶段分别为 ±0.14∘ 和 0.17rad.sec-1。虽然该原型的局限性在于其当前的 RoM(仅限于 0∘-25∘),但该系统在截瘫或单瘫患者的 RoM 运动应用方面具有潜力。
{"title":"Robot assisted knee joint RoM exercise: A PID parallel compensator architecture through impedance estimation","authors":"M. Akhtaruzzaman , Amir A. Shafie , Md Raisuddin Khan , Md Mozasser Rahman","doi":"10.1016/j.cogr.2023.11.003","DOIUrl":"10.1016/j.cogr.2023.11.003","url":null,"abstract":"<div><p>Knee joint rehabilitation exercise refers to a therapeutic procedure of a patient having dysfunctions in certain abilities to move knee joint due to some medical conditions like trauma or paralysis. The exercise is basically a series of repeated assistive physical movements within the range of motion (RoM) of the joint. Reflex action of limbs during RoM exercise causes inappropriate balance of load which may cause secondary injuries, such as damages of muscle or tendon tissues. Establishing correlation between impedance data and limb motions is important to solve this problem. This paper aims to design and modeling of a robotic arm with an original approach in control strategy which is developed based on the correlation in between the joint-impedances and joint-motion characteristics during exercise. The knee joint impedances are estimated based on the internal feedback of the system dynamics, that lead to design the torque compensator to improve the overall control signals in real time. This paper also demonstrates the characteristics of various responses of the system during exercise with human subject. Results have reflected good performances with low position and velocity tracking errors, <span><math><mrow><mo>±</mo><mn>0</mn><mo>.</mo><msup><mn>02</mn><mo>∘</mo></msup></mrow></math></span> and <span><math><mrow><mn>0.04</mn><mi>r</mi><mi>a</mi><mi>d</mi><mo>.</mo><mi>s</mi><mi>e</mi><msup><mi>c</mi><mrow><mo>−</mo><mn>1</mn></mrow></msup></mrow></math></span> during hold phase; and <span><math><mrow><mo>±</mo><mn>0</mn><mo>.</mo><msup><mn>14</mn><mo>∘</mo></msup></mrow></math></span> and <span><math><mrow><mn>0.17</mn><mi>r</mi><mi>a</mi><mi>d</mi><mo>.</mo><mi>s</mi><mi>e</mi><msup><mi>c</mi><mrow><mo>−</mo><mn>1</mn></mrow></msup></mrow></math></span> during motion phse. Though, the limitation of the prototype is its current RoM (limited to <span><math><msup><mn>0</mn><mo>∘</mo></msup></math></span>–<span><math><msup><mn>25</mn><mo>∘</mo></msup></math></span>), the system has potential in the application of RoM exercise for paraplegic or monoplegic patients.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 42-61"},"PeriodicalIF":0.0,"publicationDate":"2023-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266724132300040X/pdfft?md5=e66adda08f021e960ed5946bd42e69d8&pid=1-s2.0-S266724132300040X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138619946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-12-05DOI: 10.1016/j.cogr.2023.12.001
Zijun Yang , Shi Zhou , Lifeng Zhang , Seiichi Serikawa
In the realm of speech emotion recognition, researchers strive to refine representation methods for improved emotional information capture. Traditional one-dimensional time series classification falls short in expressing intricate emotional patterns present in speech signals, posing challenges in accuracy and robustness. This study introduces an innovative algorithm leveraging Hilbert curves to transform one-dimensional speech data into two-dimensional form, enhancing feature extraction accuracy. A tiling module based on Hilbert curve maximizes Hilbert curve arrangements for improved emotional information capture. Results reveal spatial efficiency gains up to 23,195 times pixel units, enhancing data storage. With an exceptional 98.73% accuracy, the proposed approach traditional methods, affirming its superior emotion classification performance on the same dataset. These empirical findings underscore the effectiveness of our proposed method in advancing speech emotion recognition.
{"title":"Optimizing Speech Emotion Recognition with Hilbert Curve and convolutional neural network","authors":"Zijun Yang , Shi Zhou , Lifeng Zhang , Seiichi Serikawa","doi":"10.1016/j.cogr.2023.12.001","DOIUrl":"10.1016/j.cogr.2023.12.001","url":null,"abstract":"<div><p>In the realm of speech emotion recognition, researchers strive to refine representation methods for improved emotional information capture. Traditional one-dimensional time series classification falls short in expressing intricate emotional patterns present in speech signals, posing challenges in accuracy and robustness. This study introduces an innovative algorithm leveraging Hilbert curves to transform one-dimensional speech data into two-dimensional form, enhancing feature extraction accuracy. A tiling module based on Hilbert curve maximizes Hilbert curve arrangements for improved emotional information capture. Results reveal spatial efficiency gains up to 23,195 times pixel units, enhancing data storage. With an exceptional 98.73% accuracy, the proposed approach traditional methods, affirming its superior emotion classification performance on the same dataset. These empirical findings underscore the effectiveness of our proposed method in advancing speech emotion recognition.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 30-41"},"PeriodicalIF":0.0,"publicationDate":"2023-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241323000411/pdfft?md5=bfed8ff77493b33cdfb6f93a3ba0a2c9&pid=1-s2.0-S2667241323000411-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138609217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-11-22DOI: 10.1016/j.cogr.2023.11.002
Anandakumar Haldorai, Babitha Lincy R, Suriya M, Minu Balakrishnan
These days, plastic trash is exponentially overwhelming our waterways. The catastrophe has attracted global attention at this point. As a result, protecting the environment on the water's surface has received increasing focus. Currently, manpower can be used to clean up contaminated water bodies like ponds, rivers, and oceans. Using the current cleaning approach results in low efficiency and hazard. The detection, collection, sorting, and removal of plastic trash from such water surfaces has been the subject of relatively little robotic research, despite the dire circumstances. From private sources, there are very few individual efforts to be found. In order to attain great efficiency without human assistance or operation, a fully autonomous water surface cleaning robot is proposed in this study. The robot was created to adapt to any type of water body found in the real world. An efficient object identification machine learning technique can be suggested for the creation of autonomous cleaning robots. This study improved the Single Short Detection (SSD) method to recognise objects accurately. Because of the enhanced detection techniques, the robot is able to collect trash on its own. With a mean average precision (mAP) of 94.099 % and a detection speed of up to 64.67 frames per second, experimental findings show that the enhanced SSD has exceptional detection speed and accuracy.
{"title":"An improved single short detection method for smart vision-based water garbage cleaning robot","authors":"Anandakumar Haldorai, Babitha Lincy R, Suriya M, Minu Balakrishnan","doi":"10.1016/j.cogr.2023.11.002","DOIUrl":"https://doi.org/10.1016/j.cogr.2023.11.002","url":null,"abstract":"<div><p>These days, plastic trash is exponentially overwhelming our waterways. The catastrophe has attracted global attention at this point. As a result, protecting the environment on the water's surface has received increasing focus. Currently, manpower can be used to clean up contaminated water bodies like ponds, rivers, and oceans. Using the current cleaning approach results in low efficiency and hazard. The detection, collection, sorting, and removal of plastic trash from such water surfaces has been the subject of relatively little robotic research, despite the dire circumstances. From private sources, there are very few individual efforts to be found. In order to attain great efficiency without human assistance or operation, a fully autonomous water surface cleaning robot is proposed in this study. The robot was created to adapt to any type of water body found in the real world. An efficient object identification machine learning technique can be suggested for the creation of autonomous cleaning robots. This study improved the Single Short Detection (SSD) method to recognise objects accurately. Because of the enhanced detection techniques, the robot is able to collect trash on its own. With a mean average precision (mAP) of 94.099 % and a detection speed of up to 64.67 frames per second, experimental findings show that the enhanced SSD has exceptional detection speed and accuracy.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 19-29"},"PeriodicalIF":0.0,"publicationDate":"2023-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241323000393/pdfft?md5=a8305dcc49d8d37defb2594ad2b10d51&pid=1-s2.0-S2667241323000393-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138738971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
UAVs are often limited by limited resources when performing flight tasks, especially the contradiction between storage resources and computing resources when the huge YOLOv3 model is deployed on the edge UAVs. In this paper, we tend to compress YOLOv3 model in different aspects to achieve load availability at the edge. In this paper, deep separable convolution is introduced to reduce the computation of the model. Then, PR regularization term is used as the regularization term of sparse training to better distinguish scaling factors, and then the hybrid pruning combining channel pruning and layer pruning is carried out on the model according to scaling factors, in order to reduce the number of model parameters and the amount of calculation. Finally, since the training data is a 32-bit floating point number, DoReFa-Net quantization method is used to quantify the model, so as to compress the storage capacity of the model. The experimental results show that the compression scheme proposed in this paper can effectively reduce the number of parameters by 97.5 % and the calculation amount by 82.3 %, and can maintain the original detection efficiency of UAVs.
{"title":"Research on YOLOv3 model compression strategy for UAV deployment","authors":"Fei Xu , Litao Huang , Xiaoyang Gao , Tingting Yu , Leyi Zhang","doi":"10.1016/j.cogr.2023.11.001","DOIUrl":"https://doi.org/10.1016/j.cogr.2023.11.001","url":null,"abstract":"<div><p>UAVs are often limited by limited resources when performing flight tasks, especially the contradiction between storage resources and computing resources when the huge YOLOv3 model is deployed on the edge UAVs. In this paper, we tend to compress YOLOv3 model in different aspects to achieve load availability at the edge. In this paper, deep separable convolution is introduced to reduce the computation of the model. Then, PR regularization term is used as the regularization term of sparse training to better distinguish scaling factors, and then the hybrid pruning combining channel pruning and layer pruning is carried out on the model according to scaling factors, in order to reduce the number of model parameters and the amount of calculation. Finally, since the training data is a 32-bit floating point number, DoReFa-Net quantization method is used to quantify the model, so as to compress the storage capacity of the model. The experimental results show that the compression scheme proposed in this paper can effectively reduce the number of parameters by 97.5 % and the calculation amount by 82.3 %, and can maintain the original detection efficiency of UAVs.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 8-18"},"PeriodicalIF":0.0,"publicationDate":"2023-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241323000381/pdfft?md5=c325aa36bb6e1759d99185c61f3a6b9f&pid=1-s2.0-S2667241323000381-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138439137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-11-03DOI: 10.1016/j.cogr.2023.08.002
Dingqian Zhao
In recent years, the rapid industrialization of the world has led to an increasing importance of energy minerals. However, due to the scarcity of mineral resources, opportunities to rely on alternative energy are escalating. As a result, exploration of ocean resources, which exist abundantly in the sea, is being pursued. However, the manual exploration of ocean resources by diving and visually searching is dangerous and impractical. Therefore, it is pertinent to safely advance underwater exploration by having robots perform the work instead. In underwater environments, robots are commonly used as a mainstream exploration tool due to the various hazardous environmental conditions. However, there are several problems with controlling robots in underwater environments, and one of them is poor visibility underwater. Therefore, to improve visibility underwater, efforts are being made to achieve high resolution using super-resolution technology on underwater images. In this paper we first introduce the general model and architecture in GAN. Then we combine the GAN modal and characteristics of the underwater environment, elaborating how ESRGAN can be suitable for such circumstance. For data from ECCV2018 PIRM-SR, ESRGAN outperforms other traditional model like EnhanceNet [1], EDSR [2], RCAN [3], at least 24 % [4]. Such model can be equipped with robotics that highly depends on the resolution of the image, such as autonomous underwater vehicles (AUVs) and remotely operated vehicles (ROVs).
{"title":"SRGAN in underwater vision","authors":"Dingqian Zhao","doi":"10.1016/j.cogr.2023.08.002","DOIUrl":"10.1016/j.cogr.2023.08.002","url":null,"abstract":"<div><p>In recent years, the rapid industrialization of the world has led to an increasing importance of energy minerals. However, due to the scarcity of mineral resources, opportunities to rely on alternative energy are escalating. As a result, exploration of ocean resources, which exist abundantly in the sea, is being pursued. However, the manual exploration of ocean resources by diving and visually searching is dangerous and impractical. Therefore, it is pertinent to safely advance underwater exploration by having robots perform the work instead. In underwater environments, robots are commonly used as a mainstream exploration tool due to the various hazardous environmental conditions. However, there are several problems with controlling robots in underwater environments, and one of them is poor visibility underwater. Therefore, to improve visibility underwater, efforts are being made to achieve high resolution using super-resolution technology on underwater images. In this paper we first introduce the general model and architecture in GAN. Then we combine the GAN modal and characteristics of the underwater environment, elaborating how ESRGAN can be suitable for such circumstance. For data from ECCV2018 PIRM-SR, ESRGAN outperforms other traditional model like EnhanceNet <span>[1]</span>, EDSR <span>[2]</span>, RCAN <span>[3]</span>, at least 24 % <span>[4]</span>. Such model can be equipped with robotics that highly depends on the resolution of the image, such as autonomous underwater vehicles (AUVs) and remotely operated vehicles (ROVs).</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 1-7"},"PeriodicalIF":0.0,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241323000289/pdfft?md5=327c4a7880ba070fb45e7c349a11ba1e&pid=1-s2.0-S2667241323000289-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135410456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-09-01DOI: 10.1016/j.cogr.2023.09.001
Zhang Xin
In recent years, point clouds have been widely used in fields such as computer vision, medical image processing, virtual and augmented reality, autonomous driving, and robotics. Despite the remarkable achievements of deep learning methods in processing 2D data, they still face some unique challenges when processing 3D point cloud data [1]. The unstructured and irregular nature of point clouds makes it difficult to directly apply traditional deep learning methods, so point cloud deep learning is still in its infancy. However, some progress has been made in the field of deep learning for point clouds. Researchers have proposed many innovative methods and network architectures for solving tasks such as classification, segmentation, generation, and detection of point cloud data. These methods include the network structure of PointNet [2], PointRCNN [9] and so on as well as various data enhancement and optimization strategies. These research results laid the foundation for the development of point cloud deep learning, and provided important reference and inspiration for future research.
{"title":"A Review Of The Latest Research Technologies Related To 3D Point Cloud","authors":"Zhang Xin","doi":"10.1016/j.cogr.2023.09.001","DOIUrl":"https://doi.org/10.1016/j.cogr.2023.09.001","url":null,"abstract":"In recent years, point clouds have been widely used in fields such as computer vision, medical image processing, virtual and augmented reality, autonomous driving, and robotics. Despite the remarkable achievements of deep learning methods in processing 2D data, they still face some unique challenges when processing 3D point cloud data [1]. The unstructured and irregular nature of point clouds makes it difficult to directly apply traditional deep learning methods, so point cloud deep learning is still in its infancy. However, some progress has been made in the field of deep learning for point clouds. Researchers have proposed many innovative methods and network architectures for solving tasks such as classification, segmentation, generation, and detection of point cloud data. These methods include the network structure of PointNet [2], PointRCNN [9] and so on as well as various data enhancement and optimization strategies. These research results laid the foundation for the development of point cloud deep learning, and provided important reference and inspiration for future research.","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135638057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-01-01DOI: 10.1016/j.cogr.2022.12.004
Asif Ali Laghari , Awais Khan Jumani , Rashid Ali Laghari , Haque Nawaz
The lightweight Unmanned Aerial Vehicle (UAV) flight activities are constrained, particularly in the UAV range or activity span and perseverance, by the strategic correspondence link capabilities. This paper tends to the different overlap issue of trading off a set of mission prerequisites, the UAV execution parameters, and strategic credibility; thus compromising between the communication load characterized by a crucial, communication link transmitting power necessities, power accessibility onboard UAV as a weight-restricted parameter, and the UAV security.
{"title":"Unmanned aerial vehicles: A review","authors":"Asif Ali Laghari , Awais Khan Jumani , Rashid Ali Laghari , Haque Nawaz","doi":"10.1016/j.cogr.2022.12.004","DOIUrl":"https://doi.org/10.1016/j.cogr.2022.12.004","url":null,"abstract":"<div><p>The lightweight Unmanned Aerial Vehicle (UAV) flight activities are constrained, particularly in the UAV range or activity span and perseverance, by the strategic correspondence link capabilities. This paper tends to the different overlap issue of trading off a set of mission prerequisites, the UAV execution parameters, and strategic credibility; thus compromising between the communication load characterized by a crucial, communication link transmitting power necessities, power accessibility onboard UAV as a weight-restricted parameter, and the UAV security.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"3 ","pages":"Pages 8-22"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49723424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}