首页 > 最新文献

Cognitive Robotics最新文献

英文 中文
Robotic terrain classification based on convolutional and long short-term memory neural networks 基于卷积和长短期记忆神经网络的机器人地形分类
Pub Date : 2025-01-01 Epub Date: 2025-04-17 DOI: 10.1016/j.cogr.2025.04.002
YiGe Hu
Robotic mobility remains constrained by complex terrains and technological limitations, hindering real-world applications. This study presents a terrain classification framework integrating Fourier transform, adaptive filtering, and deep learning to enhance adaptability. Leveraging CNNs, LSTMs, and an attention mechanism, the approach improves feature fusion and classification accuracy. Evaluations on the Tampere University dataset demonstrate an 81 % classification accuracy, validating its effectiveness in terrain perception and autonomous navigation. The findings contribute to advancing robotic mobility in unstructured environments.
机器人的移动性仍然受到复杂地形和技术限制的制约,阻碍了现实世界的应用。本文提出了一种融合傅里叶变换、自适应滤波和深度学习的地形分类框架,以增强其自适应能力。该方法利用cnn、lstm和注意机制,提高了特征融合和分类精度。对坦佩雷大学数据集的评估表明,分类准确率达到81%,验证了其在地形感知和自主导航方面的有效性。这一发现有助于提高机器人在非结构化环境中的机动性。
{"title":"Robotic terrain classification based on convolutional and long short-term memory neural networks","authors":"YiGe Hu","doi":"10.1016/j.cogr.2025.04.002","DOIUrl":"10.1016/j.cogr.2025.04.002","url":null,"abstract":"<div><div>Robotic mobility remains constrained by complex terrains and technological limitations, hindering real-world applications. This study presents a terrain classification framework integrating Fourier transform, adaptive filtering, and deep learning to enhance adaptability. Leveraging CNNs, LSTMs, and an attention mechanism, the approach improves feature fusion and classification accuracy. Evaluations on the Tampere University dataset demonstrate an 81 % classification accuracy, validating its effectiveness in terrain perception and autonomous navigation. The findings contribute to advancing robotic mobility in unstructured environments.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 166-175"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143864324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robot assisted knee joint RoM exercise: A PID parallel compensator architecture through impedance estimation 机器人辅助膝关节 RoM 运动:通过阻抗估计实现 PID 并行补偿器架构
Pub Date : 2024-01-01 Epub Date: 2023-12-09 DOI: 10.1016/j.cogr.2023.11.003
M. Akhtaruzzaman , Amir A. Shafie , Md Raisuddin Khan , Md Mozasser Rahman

Knee joint rehabilitation exercise refers to a therapeutic procedure of a patient having dysfunctions in certain abilities to move knee joint due to some medical conditions like trauma or paralysis. The exercise is basically a series of repeated assistive physical movements within the range of motion (RoM) of the joint. Reflex action of limbs during RoM exercise causes inappropriate balance of load which may cause secondary injuries, such as damages of muscle or tendon tissues. Establishing correlation between impedance data and limb motions is important to solve this problem. This paper aims to design and modeling of a robotic arm with an original approach in control strategy which is developed based on the correlation in between the joint-impedances and joint-motion characteristics during exercise. The knee joint impedances are estimated based on the internal feedback of the system dynamics, that lead to design the torque compensator to improve the overall control signals in real time. This paper also demonstrates the characteristics of various responses of the system during exercise with human subject. Results have reflected good performances with low position and velocity tracking errors, ±0.02 and 0.04rad.sec1 during hold phase; and ±0.14 and 0.17rad.sec1 during motion phse. Though, the limitation of the prototype is its current RoM (limited to 025), the system has potential in the application of RoM exercise for paraplegic or monoplegic patients.

膝关节康复锻炼是指对因外伤或瘫痪等疾病导致膝关节活动能力障碍的患者进行的一种治疗程序。这项运动基本上是在关节活动范围(RoM)内反复进行一系列辅助性肢体运动。在 RoM 运动过程中,肢体的反射动作会导致不适当的负荷平衡,从而可能造成二次伤害,如肌肉或肌腱组织损伤。建立阻抗数据与肢体运动之间的相关性对于解决这一问题非常重要。本文旨在设计一种机械臂,并根据运动时关节阻抗和关节运动特性之间的相关性,采用一种新颖的控制策略对其进行建模。膝关节阻抗是根据系统动态的内部反馈进行估算的,从而设计出扭矩补偿器,实时改善整体控制信号。本文还展示了该系统在人体运动时的各种响应特性。结果表明,该系统性能良好,位置和速度跟踪误差小,在保持阶段分别为 ±0.02∘ 和 0.04rad.sec-1;在运动阶段分别为 ±0.14∘ 和 0.17rad.sec-1。虽然该原型的局限性在于其当前的 RoM(仅限于 0∘-25∘),但该系统在截瘫或单瘫患者的 RoM 运动应用方面具有潜力。
{"title":"Robot assisted knee joint RoM exercise: A PID parallel compensator architecture through impedance estimation","authors":"M. Akhtaruzzaman ,&nbsp;Amir A. Shafie ,&nbsp;Md Raisuddin Khan ,&nbsp;Md Mozasser Rahman","doi":"10.1016/j.cogr.2023.11.003","DOIUrl":"10.1016/j.cogr.2023.11.003","url":null,"abstract":"<div><p>Knee joint rehabilitation exercise refers to a therapeutic procedure of a patient having dysfunctions in certain abilities to move knee joint due to some medical conditions like trauma or paralysis. The exercise is basically a series of repeated assistive physical movements within the range of motion (RoM) of the joint. Reflex action of limbs during RoM exercise causes inappropriate balance of load which may cause secondary injuries, such as damages of muscle or tendon tissues. Establishing correlation between impedance data and limb motions is important to solve this problem. This paper aims to design and modeling of a robotic arm with an original approach in control strategy which is developed based on the correlation in between the joint-impedances and joint-motion characteristics during exercise. The knee joint impedances are estimated based on the internal feedback of the system dynamics, that lead to design the torque compensator to improve the overall control signals in real time. This paper also demonstrates the characteristics of various responses of the system during exercise with human subject. Results have reflected good performances with low position and velocity tracking errors, <span><math><mrow><mo>±</mo><mn>0</mn><mo>.</mo><msup><mn>02</mn><mo>∘</mo></msup></mrow></math></span> and <span><math><mrow><mn>0.04</mn><mi>r</mi><mi>a</mi><mi>d</mi><mo>.</mo><mi>s</mi><mi>e</mi><msup><mi>c</mi><mrow><mo>−</mo><mn>1</mn></mrow></msup></mrow></math></span> during hold phase; and <span><math><mrow><mo>±</mo><mn>0</mn><mo>.</mo><msup><mn>14</mn><mo>∘</mo></msup></mrow></math></span> and <span><math><mrow><mn>0.17</mn><mi>r</mi><mi>a</mi><mi>d</mi><mo>.</mo><mi>s</mi><mi>e</mi><msup><mi>c</mi><mrow><mo>−</mo><mn>1</mn></mrow></msup></mrow></math></span> during motion phse. Though, the limitation of the prototype is its current RoM (limited to <span><math><msup><mn>0</mn><mo>∘</mo></msup></math></span>–<span><math><msup><mn>25</mn><mo>∘</mo></msup></math></span>), the system has potential in the application of RoM exercise for paraplegic or monoplegic patients.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 42-61"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266724132300040X/pdfft?md5=e66adda08f021e960ed5946bd42e69d8&pid=1-s2.0-S266724132300040X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138619946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fourier Hilbert: The input transformation to enhance CNN models for speech emotion recognition 傅里叶·希尔伯特:输入变换增强CNN模型的语音情感识别
Pub Date : 2024-01-01 Epub Date: 2024-11-14 DOI: 10.1016/j.cogr.2024.11.002
Bao Long Ly
Signal processing in general, and speech emotion recognition in particular, have long been familiar Artificial Intelligence (AI) tasks. With the explosion of deep learning, CNN models are used more frequently, accompanied by the emergence of many signal transformations. However, these methods often require significant hardware and runtime. In an effort to address these issues, we analyze and learn from existing transformations, leading us to propose a new method: Fourier Hilbert Transformation (FHT). In general, this method applies the Hilbert curve to Fourier images. The resulting images are small and dense, which is a shape well-suited to the CNN architecture. Additionally, the better distribution of information on the image allows the filters to fully utilize their power. These points support the argument that FHT provides an optimal input for CNN. Experiments conducted on popular datasets yielded promising results. FHT saves a large amount of hardware usage and runtime while maintaining high performance, even offers greater stability compared to existing methods. This opens up opportunities for deploying signal processing tasks on real-time systems with limited hardware.
一般来说,信号处理,特别是语音情感识别,一直是人们熟悉的人工智能(AI)任务。随着深度学习的爆炸式发展,CNN模型的使用越来越频繁,伴随着许多信号变换的出现。然而,这些方法通常需要大量的硬件和运行时。为了解决这些问题,我们分析并学习了现有的变换,从而提出了一种新的方法:傅里叶希尔伯特变换(FHT)。一般来说,这种方法将希尔伯特曲线应用于傅里叶图像。生成的图像小而密集,这是一种非常适合CNN架构的形状。此外,图像上信息的更好分布允许滤波器充分利用它们的功率。这些观点支持了FHT为CNN提供最佳输入的论点。在流行的数据集上进行的实验产生了令人鼓舞的结果。FHT在保持高性能的同时节省了大量的硬件使用和运行时间,甚至比现有方法提供了更高的稳定性。这为在硬件有限的实时系统上部署信号处理任务提供了机会。
{"title":"Fourier Hilbert: The input transformation to enhance CNN models for speech emotion recognition","authors":"Bao Long Ly","doi":"10.1016/j.cogr.2024.11.002","DOIUrl":"10.1016/j.cogr.2024.11.002","url":null,"abstract":"<div><div>Signal processing in general, and speech emotion recognition in particular, have long been familiar Artificial Intelligence (AI) tasks. With the explosion of deep learning, CNN models are used more frequently, accompanied by the emergence of many signal transformations. However, these methods often require significant hardware and runtime. In an effort to address these issues, we analyze and learn from existing transformations, leading us to propose a new method: Fourier Hilbert Transformation (FHT). In general, this method applies the Hilbert curve to Fourier images. The resulting images are small and dense, which is a shape well-suited to the CNN architecture. Additionally, the better distribution of information on the image allows the filters to fully utilize their power. These points support the argument that FHT provides an optimal input for CNN. Experiments conducted on popular datasets yielded promising results. FHT saves a large amount of hardware usage and runtime while maintaining high performance, even offers greater stability compared to existing methods. This opens up opportunities for deploying signal processing tasks on real-time systems with limited hardware.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 228-236"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142748300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mobile robot path planning using deep deterministic policy gradient with differential gaming (DDPG-DG) exploration 利用深度确定性策略梯度与微分博弈(DDPG-DG)探索移动机器人路径规划
Pub Date : 2024-01-01 Epub Date: 2024-09-05 DOI: 10.1016/j.cogr.2024.08.002
Shripad V. Deshpande , Harikrishnan R , Babul Salam KSM Kader Ibrahim , Mahesh Datta Sai Ponnuru

Mobile robot path planning involves decision-making in uncertain, dynamic conditions, where Reinforcement Learning (RL) algorithms excel in generating safe and optimal paths. The Deep Deterministic Policy Gradient (DDPG) is an RL technique focused on mobile robot navigation. RL algorithms must balance exploitation and exploration to enable effective learning. The balance between these actions directly impacts learning efficiency.

This research proposes a method combining the DDPG strategy for exploitation with the Differential Gaming (DG) strategy for exploration. The DG algorithm ensures the mobile robot always reaches its target without collisions, thereby adding positive learning episodes to the memory buffer. An epsilon-greedy strategy determines whether to explore or exploit. When exploration is chosen, the DG algorithm is employed. The combination of DG strategy with DDPG facilitates faster learning by increasing the number of successful episodes and reducing the number of failure episodes in the experience buffer. The DDPG algorithm supports continuous state and action spaces, resulting in smoother, non-jerky movements and improved control over the turns when navigating obstacles. Reward shaping considers finer details, ensuring even small advantages in each iteration contribute to learning.

Through diverse test scenarios, it is demonstrated that DG exploration, compared to random exploration, results in an average increase of 389% in successful target reaches and a 39% decrease in collisions. Additionally, DG exploration shows a 69% improvement in the number of episodes where convergence is achieved within a maximum of 2000 steps.

移动机器人路径规划涉及在不确定的动态条件下进行决策,而强化学习(RL)算法在生成安全和最优路径方面表现出色。深度确定性策略梯度(DDPG)是一种专注于移动机器人导航的 RL 技术。RL 算法必须兼顾利用和探索,才能实现有效学习。本研究提出了一种方法,将用于开发的 DDPG 策略与用于探索的差分博弈(DG)策略相结合。DG 算法可确保移动机器人始终在无碰撞的情况下到达目标,从而为记忆缓冲区增加积极的学习事件。ε-贪婪策略决定是探索还是利用。当选择探索时,则采用 DG 算法。将 DG 策略与 DDPG 算法相结合,可以增加经验缓冲区中成功事件的数量,减少失败事件的数量,从而加快学习速度。DDPG 算法支持连续的状态和动作空间,从而使动作更平滑、不生涩,并改善了导航障碍物时对转弯的控制。奖励塑造考虑到了更精细的细节,确保每次迭代中的微小优势也能促进学习。通过各种测试场景证明,与随机探索相比,DG 探索使成功到达目标的次数平均增加了 389%,碰撞次数减少了 39%。此外,DG探索在最多2000步内实现收敛的次数提高了69%。
{"title":"Mobile robot path planning using deep deterministic policy gradient with differential gaming (DDPG-DG) exploration","authors":"Shripad V. Deshpande ,&nbsp;Harikrishnan R ,&nbsp;Babul Salam KSM Kader Ibrahim ,&nbsp;Mahesh Datta Sai Ponnuru","doi":"10.1016/j.cogr.2024.08.002","DOIUrl":"10.1016/j.cogr.2024.08.002","url":null,"abstract":"<div><p>Mobile robot path planning involves decision-making in uncertain, dynamic conditions, where Reinforcement Learning (RL) algorithms excel in generating safe and optimal paths. The Deep Deterministic Policy Gradient (DDPG) is an RL technique focused on mobile robot navigation. RL algorithms must balance exploitation and exploration to enable effective learning. The balance between these actions directly impacts learning efficiency.</p><p>This research proposes a method combining the DDPG strategy for exploitation with the Differential Gaming (DG) strategy for exploration. The DG algorithm ensures the mobile robot always reaches its target without collisions, thereby adding positive learning episodes to the memory buffer. An epsilon-greedy strategy determines whether to explore or exploit. When exploration is chosen, the DG algorithm is employed. The combination of DG strategy with DDPG facilitates faster learning by increasing the number of successful episodes and reducing the number of failure episodes in the experience buffer. The DDPG algorithm supports continuous state and action spaces, resulting in smoother, non-jerky movements and improved control over the turns when navigating obstacles. Reward shaping considers finer details, ensuring even small advantages in each iteration contribute to learning.</p><p>Through diverse test scenarios, it is demonstrated that DG exploration, compared to random exploration, results in an average increase of 389% in successful target reaches and a 39% decrease in collisions. Additionally, DG exploration shows a 69% improvement in the number of episodes where convergence is achieved within a maximum of 2000 steps.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 156-173"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241324000119/pdfft?md5=8c083de5d6ac1af9d3cedcb0733a30fa&pid=1-s2.0-S2667241324000119-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142271646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Emerging trends in human upper extremity rehabilitation robot 人体上肢康复机器人的新趋势
Pub Date : 2024-01-01 Epub Date: 2024-09-14 DOI: 10.1016/j.cogr.2024.09.001
Sk. Khairul Hasan, Subodh B. Bhujel, Gabrielle Sara Niemiec

Stroke is a leading cause of neurological disorders that result in physical disability, particularly among the elderly. Neurorehabilitation plays a crucial role in helping stroke patients recover from physical impairments and regain mobility. Physical therapy is one of the most effective forms of neurorehabilitation, but the growing number of patients requires a large workforce of trained therapists, which is currently insufficient. Robotic rehabilitation offers a promising alternative, capable of supplementing or even replacing human-assisted physical therapy through the use of rehabilitation robots. To design effective robotic devices for rehabilitation, a solid foundation of knowledge is essential. This article provides a comprehensive overview of the key elements needed to develop human upper extremity rehabilitation robots. It covers critical aspects such as upper extremity anatomy, joint range of motion, anthropometric parameters, disability assessment techniques, and robot-assisted training methods. Additionally, it reviews recent advancements in rehabilitation robots, including exoskeletons, end-effector-based robots, and planar robots. The article also evaluates existing upper extremity rehabilitation robots based on their mechanical design and functionality, identifies their limitations, and suggests future research directions for further improvement.

中风是导致身体残疾的神经系统疾病的主要原因,尤其是在老年人中。神经康复在帮助脑卒中患者从肢体损伤中康复并恢复行动能力方面发挥着至关重要的作用。物理治疗是最有效的神经康复方式之一,但由于患者人数不断增加,需要大量训练有素的治疗师,而目前这方面的人才还很缺乏。机器人康复提供了一种前景广阔的替代方案,通过使用康复机器人,能够补充甚至取代人类辅助物理治疗。要设计出有效的康复机器人设备,扎实的知识基础必不可少。本文全面概述了开发人类上肢康复机器人所需的关键要素。它涵盖了上肢解剖、关节活动范围、人体测量参数、残疾评估技术和机器人辅助训练方法等关键方面。此外,文章还回顾了康复机器人的最新进展,包括外骨骼、基于末端执行器的机器人和平面机器人。文章还根据机械设计和功能评估了现有的上肢康复机器人,指出了它们的局限性,并提出了进一步改进的未来研究方向。
{"title":"Emerging trends in human upper extremity rehabilitation robot","authors":"Sk. Khairul Hasan,&nbsp;Subodh B. Bhujel,&nbsp;Gabrielle Sara Niemiec","doi":"10.1016/j.cogr.2024.09.001","DOIUrl":"10.1016/j.cogr.2024.09.001","url":null,"abstract":"<div><p>Stroke is a leading cause of neurological disorders that result in physical disability, particularly among the elderly. Neurorehabilitation plays a crucial role in helping stroke patients recover from physical impairments and regain mobility. Physical therapy is one of the most effective forms of neurorehabilitation, but the growing number of patients requires a large workforce of trained therapists, which is currently insufficient. Robotic rehabilitation offers a promising alternative, capable of supplementing or even replacing human-assisted physical therapy through the use of rehabilitation robots. To design effective robotic devices for rehabilitation, a solid foundation of knowledge is essential. This article provides a comprehensive overview of the key elements needed to develop human upper extremity rehabilitation robots. It covers critical aspects such as upper extremity anatomy, joint range of motion, anthropometric parameters, disability assessment techniques, and robot-assisted training methods. Additionally, it reviews recent advancements in rehabilitation robots, including exoskeletons, end-effector-based robots, and planar robots. The article also evaluates existing upper extremity rehabilitation robots based on their mechanical design and functionality, identifies their limitations, and suggests future research directions for further improvement.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 174-190"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241324000120/pdfft?md5=a51e80d94f3f2f6ca53c667c4682ef83&pid=1-s2.0-S2667241324000120-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142271647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
POMDP-based probabilistic decision making for path planning in wheeled mobile robot 基于 POMDP 的轮式移动机器人路径规划概率决策
Pub Date : 2024-01-01 Epub Date: 2024-06-18 DOI: 10.1016/j.cogr.2024.06.001
Shripad V. Deshpande, Harikrishnan R, Rahee Walambe

Path Planning in a collaborative mobile robot system has been a research topic for many years. Uncertainty in robot states, actions, and environmental conditions makes finding the optimum path for navigation highly challenging for the robot. To achieve robust behavior for mobile robots in the presence of static and dynamic obstacles, it is pertinent that the robot employs a path-finding mechanism that is based on the probabilistic perception of the uncertainty in various parameters governing its movement. Partially Observable Markov Decision Process (POMDP) is being used by many researchers as a proven methodology for handling uncertainty. The POMDP framework requires manually setting up the state transition matrix, the observation matrix, and the reward values. This paper describes an approach for creating the POMDP model and demonstrates its working by simulating it on two mobile robots destined on a collision course. Selective test cases are run on the two robots with three categories – MDP (POMDP with belief state spread of 1), POMDP with distribution spread of belief state over ten observations, and distribution spread across two observations. Uncertainty in the sensor data is simulated with varying levels of up to 10 %. The results are compared and analyzed. It is demonstrated that when the observation probability spread is increased from 2 to 10, collision reduces from 34 % to 22 %, indicating that the system's robustness increases by 12 % with only a marginal increase of 3.4 % in the computational complexity.

多年来,协作式移动机器人系统的路径规划一直是一个研究课题。机器人状态、行动和环境条件的不确定性使得寻找最佳导航路径对机器人来说极具挑战性。为了实现移动机器人在静态和动态障碍物面前的稳健行为,机器人必须采用一种基于对支配其运动的各种参数的不确定性的概率感知的路径寻找机制。部分可观测马尔可夫决策过程(POMDP)被许多研究人员用作处理不确定性的成熟方法。POMDP 框架需要手动设置状态转换矩阵、观测矩阵和奖励值。本文介绍了一种创建 POMDP 模型的方法,并通过在两个注定会发生碰撞的移动机器人上进行模拟来演示其工作原理。在两个机器人上运行了三个类别的选择性测试案例--MDP(信念状态分布为 1 的 POMDP)、信念状态分布为 10 个观测值的 POMDP 和分布为 2 个观测值的 POMDP。对传感器数据的不确定性进行了模拟,不确定性最高可达 10%。对结果进行了比较和分析。结果表明,当观测概率分布从 2 增加到 10 时,碰撞率从 34% 降低到 22%,这表明系统的鲁棒性提高了 12%,而计算复杂度仅略微增加了 3.4%。
{"title":"POMDP-based probabilistic decision making for path planning in wheeled mobile robot","authors":"Shripad V. Deshpande,&nbsp;Harikrishnan R,&nbsp;Rahee Walambe","doi":"10.1016/j.cogr.2024.06.001","DOIUrl":"https://doi.org/10.1016/j.cogr.2024.06.001","url":null,"abstract":"<div><p>Path Planning in a collaborative mobile robot system has been a research topic for many years. Uncertainty in robot states, actions, and environmental conditions makes finding the optimum path for navigation highly challenging for the robot. To achieve robust behavior for mobile robots in the presence of static and dynamic obstacles, it is pertinent that the robot employs a path-finding mechanism that is based on the probabilistic perception of the uncertainty in various parameters governing its movement. Partially Observable Markov Decision Process (POMDP) is being used by many researchers as a proven methodology for handling uncertainty. The POMDP framework requires manually setting up the state transition matrix, the observation matrix, and the reward values. This paper describes an approach for creating the POMDP model and demonstrates its working by simulating it on two mobile robots destined on a collision course. Selective test cases are run on the two robots with three categories – MDP (POMDP with belief state spread of 1), POMDP with distribution spread of belief state over ten observations, and distribution spread across two observations. Uncertainty in the sensor data is simulated with varying levels of up to 10 %. The results are compared and analyzed. It is demonstrated that when the observation probability spread is increased from 2 to 10, collision reduces from 34 % to 22 %, indicating that the system's robustness increases by 12 % with only a marginal increase of 3.4 % in the computational complexity.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 104-115"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241324000077/pdfft?md5=ccfa806c0ae32c5aba224cbf968b6b8d&pid=1-s2.0-S2667241324000077-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141483724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SRGAN in underwater vision 水下视觉中的SRGAN
Pub Date : 2024-01-01 Epub Date: 2023-11-03 DOI: 10.1016/j.cogr.2023.08.002
Dingqian Zhao

In recent years, the rapid industrialization of the world has led to an increasing importance of energy minerals. However, due to the scarcity of mineral resources, opportunities to rely on alternative energy are escalating. As a result, exploration of ocean resources, which exist abundantly in the sea, is being pursued. However, the manual exploration of ocean resources by diving and visually searching is dangerous and impractical. Therefore, it is pertinent to safely advance underwater exploration by having robots perform the work instead. In underwater environments, robots are commonly used as a mainstream exploration tool due to the various hazardous environmental conditions. However, there are several problems with controlling robots in underwater environments, and one of them is poor visibility underwater. Therefore, to improve visibility underwater, efforts are being made to achieve high resolution using super-resolution technology on underwater images. In this paper we first introduce the general model and architecture in GAN. Then we combine the GAN modal and characteristics of the underwater environment, elaborating how ESRGAN can be suitable for such circumstance. For data from ECCV2018 PIRM-SR, ESRGAN outperforms other traditional model like EnhanceNet [1], EDSR [2], RCAN [3], at least 24 % [4]. Such model can be equipped with robotics that highly depends on the resolution of the image, such as autonomous underwater vehicles (AUVs) and remotely operated vehicles (ROVs).

近年来,世界工业化的迅速发展使得能源矿产的重要性日益凸显。然而,由于矿产资源的稀缺,依赖替代能源的机会正在增加。因此,正在进行对海洋中丰富的海洋资源的勘探。然而,通过潜水和目视搜索对海洋资源进行人工勘探是危险和不切实际的。因此,用机器人代替机器人来安全推进水下探测是有意义的。在水下环境中,由于各种危险的环境条件,机器人被普遍用作主流的勘探工具。然而,在水下环境中控制机器人存在几个问题,其中之一是水下能见度差。因此,为了提高水下的能见度,人们正在努力利用超分辨率技术实现水下图像的高分辨率。本文首先介绍了GAN的一般模型和结构。然后结合GAN模态和水下环境的特点,阐述了ESRGAN如何适用于这种环境。对于来自ECCV2018 PIRM-SR的数据,ESRGAN比EnhanceNet[1]、EDSR[2]、RCAN[3]等其他传统模型至少高出24%[4]。这种模型可以装备高度依赖图像分辨率的机器人,如自主水下航行器(auv)和远程操作车辆(rov)。
{"title":"SRGAN in underwater vision","authors":"Dingqian Zhao","doi":"10.1016/j.cogr.2023.08.002","DOIUrl":"10.1016/j.cogr.2023.08.002","url":null,"abstract":"<div><p>In recent years, the rapid industrialization of the world has led to an increasing importance of energy minerals. However, due to the scarcity of mineral resources, opportunities to rely on alternative energy are escalating. As a result, exploration of ocean resources, which exist abundantly in the sea, is being pursued. However, the manual exploration of ocean resources by diving and visually searching is dangerous and impractical. Therefore, it is pertinent to safely advance underwater exploration by having robots perform the work instead. In underwater environments, robots are commonly used as a mainstream exploration tool due to the various hazardous environmental conditions. However, there are several problems with controlling robots in underwater environments, and one of them is poor visibility underwater. Therefore, to improve visibility underwater, efforts are being made to achieve high resolution using super-resolution technology on underwater images. In this paper we first introduce the general model and architecture in GAN. Then we combine the GAN modal and characteristics of the underwater environment, elaborating how ESRGAN can be suitable for such circumstance. For data from ECCV2018 PIRM-SR, ESRGAN outperforms other traditional model like EnhanceNet <span>[1]</span>, EDSR <span>[2]</span>, RCAN <span>[3]</span>, at least 24 % <span>[4]</span>. Such model can be equipped with robotics that highly depends on the resolution of the image, such as autonomous underwater vehicles (AUVs) and remotely operated vehicles (ROVs).</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 1-7"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241323000289/pdfft?md5=327c4a7880ba070fb45e7c349a11ba1e&pid=1-s2.0-S2667241323000289-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135410456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on YOLOv3 model compression strategy for UAV deployment 无人机部署中YOLOv3模型压缩策略研究
Pub Date : 2024-01-01 Epub Date: 2023-11-17 DOI: 10.1016/j.cogr.2023.11.001
Fei Xu , Litao Huang , Xiaoyang Gao , Tingting Yu , Leyi Zhang

UAVs are often limited by limited resources when performing flight tasks, especially the contradiction between storage resources and computing resources when the huge YOLOv3 model is deployed on the edge UAVs. In this paper, we tend to compress YOLOv3 model in different aspects to achieve load availability at the edge. In this paper, deep separable convolution is introduced to reduce the computation of the model. Then, PR regularization term is used as the regularization term of sparse training to better distinguish scaling factors, and then the hybrid pruning combining channel pruning and layer pruning is carried out on the model according to scaling factors, in order to reduce the number of model parameters and the amount of calculation. Finally, since the training data is a 32-bit floating point number, DoReFa-Net quantization method is used to quantify the model, so as to compress the storage capacity of the model. The experimental results show that the compression scheme proposed in this paper can effectively reduce the number of parameters by 97.5 % and the calculation amount by 82.3 %, and can maintain the original detection efficiency of UAVs.

无人机在执行飞行任务时往往受到有限资源的限制,特别是在边缘无人机上部署庞大的YOLOv3模型时,存储资源与计算资源之间的矛盾尤为突出。在本文中,我们倾向于对YOLOv3模型进行不同方面的压缩,以实现边缘的负载可用性。本文引入深度可分离卷积来减少模型的计算量。然后,利用PR正则化项作为稀疏训练的正则化项,更好地区分尺度因子,然后根据尺度因子对模型进行通道剪枝和层剪枝相结合的混合剪枝,以减少模型参数的数量和计算量。最后,由于训练数据为32位浮点数,采用DoReFa-Net量化方法对模型进行量化,从而压缩模型的存储容量。实验结果表明,本文提出的压缩方案可有效减少97.5%的参数个数和82.3%的计算量,并能保持无人机原有的检测效率。
{"title":"Research on YOLOv3 model compression strategy for UAV deployment","authors":"Fei Xu ,&nbsp;Litao Huang ,&nbsp;Xiaoyang Gao ,&nbsp;Tingting Yu ,&nbsp;Leyi Zhang","doi":"10.1016/j.cogr.2023.11.001","DOIUrl":"https://doi.org/10.1016/j.cogr.2023.11.001","url":null,"abstract":"<div><p>UAVs are often limited by limited resources when performing flight tasks, especially the contradiction between storage resources and computing resources when the huge YOLOv3 model is deployed on the edge UAVs. In this paper, we tend to compress YOLOv3 model in different aspects to achieve load availability at the edge. In this paper, deep separable convolution is introduced to reduce the computation of the model. Then, PR regularization term is used as the regularization term of sparse training to better distinguish scaling factors, and then the hybrid pruning combining channel pruning and layer pruning is carried out on the model according to scaling factors, in order to reduce the number of model parameters and the amount of calculation. Finally, since the training data is a 32-bit floating point number, DoReFa-Net quantization method is used to quantify the model, so as to compress the storage capacity of the model. The experimental results show that the compression scheme proposed in this paper can effectively reduce the number of parameters by 97.5 % and the calculation amount by 82.3 %, and can maintain the original detection efficiency of UAVs.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 8-18"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241323000381/pdfft?md5=c325aa36bb6e1759d99185c61f3a6b9f&pid=1-s2.0-S2667241323000381-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138439137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing Food Sample Handling and Placement Pattern Recognition with YOLO: Advanced Techniques in Robotic Object Detection 利用 YOLO 优化食品样品处理和放置模式识别:机器人物体检测的先进技术
Pub Date : 2024-01-01 DOI: 10.1016/j.cogr.2024.01.001
Shoki Koga, Keisuke Hamamoto, Huimin Lu, Y. Nakatoh
{"title":"Optimizing Food Sample Handling and Placement Pattern Recognition with YOLO: Advanced Techniques in Robotic Object Detection","authors":"Shoki Koga, Keisuke Hamamoto, Huimin Lu, Y. Nakatoh","doi":"10.1016/j.cogr.2024.01.001","DOIUrl":"https://doi.org/10.1016/j.cogr.2024.01.001","url":null,"abstract":"","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":" 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139392652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Autonomous novel class discovery for vision-based recognition in non-interactive environments 在非交互式环境中自主发现基于视觉识别的新类别
Pub Date : 2024-01-01 Epub Date: 2024-11-16 DOI: 10.1016/j.cogr.2024.10.002
Xuelin Zhang , Feng Liu , Xuelian Cheng , Siyuan Yan , Zhibin Liao , Zongyuan Ge
Visual recognition with deep learning has recently been shown to be effective in robotic vision. However, these algorithms tend to be build under fixed and structured environment, which is rarely the case in real life. When facing unknown objects, avoidance or human interactions are required, which may miss critical objects or be prohibitively costly to obtain on robots in the real world. We consider a practical problem setting that aims to allow robots to automatically discover novel classes with only labelled known class samples in hand, defined as open-set clustering (OSC). To address the OSC problem, we propose a framework combining three approaches: 1) using selfsupervised vision transformers to mitigate the discard of information needed for clustering unknown classes; 2) adaptive weighting for image patches to prioritize patches with richer textures; and 3) incorporating a temperature scaling strategy to generate more separable feature embeddings for clustering. We demonstrate the efficacy of our approach in six fine-grained image datasets.
利用深度学习进行视觉识别最近被证明在机器人视觉领域非常有效。然而,这些算法往往是在固定和结构化的环境下构建的,而现实生活中很少出现这种情况。在面对未知物体时,需要进行回避或人机交互,这可能会错过关键物体,或者在现实世界中机器人获得这些物体的成本过高。我们考虑了一个实际问题,其目的是让机器人在只掌握已知类别样本的情况下自动发现新类别,这被定义为开放集群(Open-Set Clustering,OSC)。为了解决开放集群问题,我们提出了一个结合三种方法的框架:1) 使用自监督视觉转换器来减少聚类未知类别所需的信息丢弃;2) 自适应图像片段加权,优先考虑纹理更丰富的片段;3) 结合温度缩放策略,生成更多可分离的特征嵌入,用于聚类。我们在六个细粒度图像数据集中展示了我们的方法的有效性。
{"title":"Autonomous novel class discovery for vision-based recognition in non-interactive environments","authors":"Xuelin Zhang ,&nbsp;Feng Liu ,&nbsp;Xuelian Cheng ,&nbsp;Siyuan Yan ,&nbsp;Zhibin Liao ,&nbsp;Zongyuan Ge","doi":"10.1016/j.cogr.2024.10.002","DOIUrl":"10.1016/j.cogr.2024.10.002","url":null,"abstract":"<div><div>Visual recognition with deep learning has recently been shown to be effective in robotic vision. However, these algorithms tend to be build under fixed and structured environment, which is rarely the case in real life. When facing unknown objects, avoidance or human interactions are required, which may miss critical objects or be prohibitively costly to obtain on robots in the real world. We consider a practical problem setting that aims to allow robots to automatically discover novel classes with only labelled known class samples in hand, defined as open-set clustering (OSC). To address the OSC problem, we propose a framework combining three approaches: 1) using selfsupervised vision transformers to mitigate the discard of information needed for clustering unknown classes; 2) adaptive weighting for image patches to prioritize patches with richer textures; and 3) incorporating a temperature scaling strategy to generate more separable feature embeddings for clustering. We demonstrate the efficacy of our approach in six fine-grained image datasets.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"4 ","pages":"Pages 191-203"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142704753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Cognitive Robotics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1