Pub Date : 2025-12-05DOI: 10.1016/j.cogr.2025.11.002
Guanze Shen , Jingxuan Zhang , Zhe Chen
Underwater images suffer from haze effects and low contrast due to wavelength- and distance-dependent scattering and attenuation. These issues present significant challenges for various underwater vision applications. Super resolution (SR) of underwater images offers an effective solution for enhancing both detail refinement and overall image visibility. However, underwater image SR remains challenging owing to the severe degradation of texture and color information. This paper proposes a multidomain learning-based SR network to enhance the performance of underwater image SR. Specifically, we introduce a multidomain encoder network that integrates grayscale and dual-color spaces into a unified framework. This architecture enables our model to simultaneously improve the underwater image quality through texture enhancement and color correction. By incorporating a channel attention mechanism, the most discriminative features extracted from multiple domains can be adaptively weighted and fused. Consequently, our network effectively boosts image resolution and enhances visual quality by leveraging multidomain data and the advantages of learning-based approaches. Experimental results demonstrate the superior performance of the proposed model in underwater image SR.
{"title":"Underwater image super-resolution via multi-domain learning","authors":"Guanze Shen , Jingxuan Zhang , Zhe Chen","doi":"10.1016/j.cogr.2025.11.002","DOIUrl":"10.1016/j.cogr.2025.11.002","url":null,"abstract":"<div><div>Underwater images suffer from haze effects and low contrast due to wavelength- and distance-dependent scattering and attenuation. These issues present significant challenges for various underwater vision applications. Super resolution (SR) of underwater images offers an effective solution for enhancing both detail refinement and overall image visibility. However, underwater image SR remains challenging owing to the severe degradation of texture and color information. This paper proposes a multidomain learning-based SR network to enhance the performance of underwater image SR. Specifically, we introduce a multidomain encoder network that integrates grayscale and dual-color spaces into a unified framework. This architecture enables our model to simultaneously improve the underwater image quality through texture enhancement and color correction. By incorporating a channel attention mechanism, the most discriminative features extracted from multiple domains can be adaptively weighted and fused. Consequently, our network effectively boosts image resolution and enhances visual quality by leveraging multidomain data and the advantages of learning-based approaches. Experimental results demonstrate the superior performance of the proposed model in underwater image SR.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"6 ","pages":"Pages 20-31"},"PeriodicalIF":0.0,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145737051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper presents a novel adaptive control framework for robotic grippers that handles a wide range of compliant objects by mimicking human grasping behaviour. The proposed system integrates three distinct control strategies: classical Proportional-Integral-Derivative (PID), Proportional-Integral-based Fuzzy Logic Control (PI-FLC), and Reinforcement Learning (RL) to achieve precise and safe force modulation during object manipulation. A two-finger gripper prototype was developed and experimentally validated using objects of varying stiffness levels, including rigid (iron, plastic) and deformable materials (silicone, foam, sponge). Real-time force control was benchmarked against human-defined reference profiles derived from tactile interaction experiments. The results demonstrate that while PID control provides satisfactory performance for rigid objects, it fails to adapt to nonlinear dynamics in soft materials. In contrast, the PI-Fuzzy and RL controllers can achieve superior force tracking, stability, and generalisation, closely aligning with human-like grasping patterns. The PI-Fuzzy controller excels in rule-based adaptability, while RL shows potential in learning optimal strategies across different compliance levels. This study underscores the significance of integrating classical and intelligent control strategies to improve robotic dexterity, safety, and autonomy, particularly in unstructured environments. The findings have meaningful implications for industrial automation, human-robot collaboration, and the effective manipulation of objects with varying stiffness.
{"title":"Self-adaptive control of a two-point contact gripper for the precise handling of compliant objects in industrial robotics","authors":"Sarawit Cheewaratchanon , Jutamanee Auysakul , Paramin Neranon , Arisara Romyen","doi":"10.1016/j.cogr.2025.11.001","DOIUrl":"10.1016/j.cogr.2025.11.001","url":null,"abstract":"<div><div>This paper presents a novel adaptive control framework for robotic grippers that handles a wide range of compliant objects by mimicking human grasping behaviour. The proposed system integrates three distinct control strategies: classical Proportional-Integral-Derivative (PID), Proportional-Integral-based Fuzzy Logic Control (PI-FLC), and Reinforcement Learning (RL) to achieve precise and safe force modulation during object manipulation. A two-finger gripper prototype was developed and experimentally validated using objects of varying stiffness levels, including rigid (iron, plastic) and deformable materials (silicone, foam, sponge). Real-time force control was benchmarked against human-defined reference profiles derived from tactile interaction experiments. The results demonstrate that while PID control provides satisfactory performance for rigid objects, it fails to adapt to nonlinear dynamics in soft materials. In contrast, the PI-Fuzzy and RL controllers can achieve superior force tracking, stability, and generalisation, closely aligning with human-like grasping patterns. The PI-Fuzzy controller excels in rule-based adaptability, while RL shows potential in learning optimal strategies across different compliance levels. This study underscores the significance of integrating classical and intelligent control strategies to improve robotic dexterity, safety, and autonomy, particularly in unstructured environments. The findings have meaningful implications for industrial automation, human-robot collaboration, and the effective manipulation of objects with varying stiffness.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"6 ","pages":"Pages 1-19"},"PeriodicalIF":0.0,"publicationDate":"2025-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145600536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01DOI: 10.1016/j.cogr.2025.02.002
Sameer Agrawal , Bhumeshwar K. Patle , Sudarshan Sanap
The development of drones in various applications makes it essential to address the critical issue of providing collision-free and optimal navigation in uncertain environments. The current research work aims to develop, simulate and experiment with the Probability Fuzzy Logic (PFL) controller for route planning and obstacle avoidance for drones in uncertain static and dynamic environments. The PFL system uses probability-based impact assessment and fuzzy logic rules to deal with unknowns and environmental changes. The fuzzy logic system takes in input about the distance of objects from the drone's front, left, and right sides, as well as the probability of collision based on the drone's speed and how close it is to the obstacles. The set of thirty fuzzy rules based on the distance of the obstacle from front left and right are defined to decide the output, i.e. speed of the drone and heading angle. The simulation environment is developed using MATLAB, with grid-based motion planning that accounts for static and dynamic obstacles. The system's performance is validated through simulations and real-world experiments, comparing path length and travel time. On comparing the simulation and experimental results, the proposed PFL-based controller has been proven to be efficient, accurate, and robust for both static and dynamic and simple to complex environments. The drones can plan the shortest and most collision-free path across all the scenarios, as depicted in the simulation and experimentation results. However, due to communication delay, inaccuracy of sensor response, environmental impact and motor delay, there are slight deviations between the simulation and experimentation values. Upon performing the error analysis, it is found that the error between the simulation and experimental value is within the range of 6.66 % in all the studied scenarios.
{"title":"Navigation control of unmanned aerial vehicles in dynamic collaborative indoor environment using probability fuzzy logic approach","authors":"Sameer Agrawal , Bhumeshwar K. Patle , Sudarshan Sanap","doi":"10.1016/j.cogr.2025.02.002","DOIUrl":"10.1016/j.cogr.2025.02.002","url":null,"abstract":"<div><div>The development of drones in various applications makes it essential to address the critical issue of providing collision-free and optimal navigation in uncertain environments. The current research work aims to develop, simulate and experiment with the Probability Fuzzy Logic (PFL) controller for route planning and obstacle avoidance for drones in uncertain static and dynamic environments. The PFL system uses probability-based impact assessment and fuzzy logic rules to deal with unknowns and environmental changes. The fuzzy logic system takes in input about the distance of objects from the drone's front, left, and right sides, as well as the probability of collision based on the drone's speed and how close it is to the obstacles. The set of thirty fuzzy rules based on the distance of the obstacle from front left and right are defined to decide the output, i.e. speed of the drone and heading angle. The simulation environment is developed using MATLAB, with grid-based motion planning that accounts for static and dynamic obstacles. The system's performance is validated through simulations and real-world experiments, comparing path length and travel time. On comparing the simulation and experimental results, the proposed PFL-based controller has been proven to be efficient, accurate, and robust for both static and dynamic and simple to complex environments. The drones can plan the shortest and most collision-free path across all the scenarios, as depicted in the simulation and experimentation results. However, due to communication delay, inaccuracy of sensor response, environmental impact and motor delay, there are slight deviations between the simulation and experimentation values. Upon performing the error analysis, it is found that the error between the simulation and experimental value is within the range of 6.66 % in all the studied scenarios.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 86-113"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143579569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-01DOI: 10.1016/j.cogr.2025.10.001
Xuan Jin , Sheng Wang , Miaomiao Zhang , Guoteng Xu , Bingqi Hu , Hanlin Tang
Seal recognition, as a fundamental perception capability, is crucial for enabling cognitive robotic systems to autonomously interact with and understand physical documents in intelligent office and archival environments. While Transformer based optical character recognition (OCR) methods have recently achieved remarkable progress, the recognition of curved and degraded seal text remains a significant challenge. Traditional approaches often rely on cumbersome pipelines with limited robustness, which hampers their integration into robotic cognitive platforms. To address these issues, this paper proposes a novel perception framework that integrates the YOLO-based detection module with the TrOCR recognition model for seal content analysis. The framework enhances robotic perception through three core mechanisms: precise spatial localization, adaptive noise suppression, and efficient curved-text recognition. Experimental results demonstrate that the proposed approach achieves 94.8% accuracy in bent seal text recognition tasks, validating its effectiveness in complex, real-world scenarios. These findings highlight the potential of the method to serve as a reliable perception module within cognitive robotic systems for document understanding and autonomous decision-making.
{"title":"TrOCR-driven seal instrument detection and recognition for cognitive robotic applications","authors":"Xuan Jin , Sheng Wang , Miaomiao Zhang , Guoteng Xu , Bingqi Hu , Hanlin Tang","doi":"10.1016/j.cogr.2025.10.001","DOIUrl":"10.1016/j.cogr.2025.10.001","url":null,"abstract":"<div><div>Seal recognition, as a fundamental perception capability, is crucial for enabling cognitive robotic systems to autonomously interact with and understand physical documents in intelligent office and archival environments. While Transformer based optical character recognition (OCR) methods have recently achieved remarkable progress, the recognition of curved and degraded seal text remains a significant challenge. Traditional approaches often rely on cumbersome pipelines with limited robustness, which hampers their integration into robotic cognitive platforms. To address these issues, this paper proposes a novel perception framework that integrates the YOLO-based detection module with the TrOCR recognition model for seal content analysis. The framework enhances robotic perception through three core mechanisms: precise spatial localization, adaptive noise suppression, and efficient curved-text recognition. Experimental results demonstrate that the proposed approach achieves 94.8% accuracy in bent seal text recognition tasks, validating its effectiveness in complex, real-world scenarios. These findings highlight the potential of the method to serve as a reliable perception module within cognitive robotic systems for document understanding and autonomous decision-making.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 286-298"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145361999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}