Snake robots with limbless structure and rich locomotion gaits have been designed and built for wide application in various fields including military reconnaissance, pipeline operation, disaster search and rescue, etc. However, the problem how to flexibly and smoothly control switch and change of different locomotion gaits is still facing enormous challenges. A novel unified design rule of the CPG network model composed of improved Hopf oscillators is proposed, based on which a variety of different network structures can be created by designing connection distances and coupling weights among all oscillator units. Through the relationships between the control parameters of the Hopf oscillator, decoupling of the bifurcation parameters is achieved to solve inconsistent output waveform amplitude when the bifurcation parameters are not completely equal. Furthermore, five typical movement modes of biological snake are designed and smooth switch between different locomotion gaits is realized. A control system is constructed based on the Robot Operating System (ROS) and a prototype of snake robot is built, and the effectiveness of the proposed CPG model in controlling locomotion gaits was verified through simulations and experiments. The CPG modeling approach has important theoretical significance and practical instructive value for motion planning and gait control of snake robots in complex environments.
{"title":"Locomotion gait control of snake robots based on a novel unified CPG network model composed of Hopf oscillators","authors":"Xupeng Liu , Yong Zang , Zhiying Gao , Maolin Liao","doi":"10.1016/j.robot.2024.104746","DOIUrl":"https://doi.org/10.1016/j.robot.2024.104746","url":null,"abstract":"<div><p>Snake robots with limbless structure and rich locomotion gaits have been designed and built for wide application in various fields including military reconnaissance, pipeline operation, disaster search and rescue, etc. However, the problem how to flexibly and smoothly control switch and change of different locomotion gaits is still facing enormous challenges. A novel unified design rule of the CPG network model composed of improved Hopf oscillators is proposed, based on which a variety of different network structures can be created by designing connection distances and coupling weights among all oscillator units. Through the relationships between the control parameters of the Hopf oscillator, decoupling of the bifurcation parameters is achieved to solve inconsistent output waveform amplitude when the bifurcation parameters are not completely equal. Furthermore, five typical movement modes of biological snake are designed and smooth switch between different locomotion gaits is realized. A control system is constructed based on the Robot Operating System (ROS) and a prototype of snake robot is built, and the effectiveness of the proposed CPG model in controlling locomotion gaits was verified through simulations and experiments. The CPG modeling approach has important theoretical significance and practical instructive value for motion planning and gait control of snake robots in complex environments.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"179 ","pages":"Article 104746"},"PeriodicalIF":4.3,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141539433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-02DOI: 10.1016/j.robot.2024.104743
Jamil Ahmad , Vasco Fanti , Darwin G. Caldwell , Christian Di Natali
Work-related Musculoskeletal Disorders (WMSDs) are the most common occupational diseases caused by the prolonged performance of strenuous work, such as manual handling of loads or long-term maintenance of incongruous postures. Different safety protocols are implemented to reduce WMSDs and optimize the working environment, but one of the most promising solutions is using occupational exoskeletons (OEs). However, to truly acknowledge the benefits of OEs and be able to introduce them into daily business use, devices must pass several development and testing stages that determine the Technology Readiness Level (TRL). This review study aims to present an up-to-date collection of the most advanced assessments of exoskeletons for upper and back support, ranging from laboratory real-task simulations to operational scenarios in industrial sites. To identify relevant studies, we conducted comprehensive searches across different electronic databases, i.e., PubMed, Scopus, and Web of Science. Different keywords were used for the literature search, e.g., occupational exoskeleton, industrial exoskeleton, etc. Studies were included if they investigated the assessment of exoskeletons in the laboratory with real tasks or an industrial environment. We identified 45 research articles that fulfilled this selection criterion. Several features are compared and discussed in detail, such as industrial environment, experimental protocol, task performed, and exoskeleton typology. These data allowed us to formulate results that report the correspondence or discrepancy between the number of papers testing exoskeletons and WMSDs in different industrial sectors, the type of assessment performed, and the impact of exoskeletons on workers and industries at different TRLs. Among the results, the incidence of WMSDs in the manufacturing industry is 21.13%, while the adoption of exoskeletons in the same field is the highest with respect to the other industrial fields, at 44.45%. Electromyography (EMG) and Questionnaires were the most evaluated typologies across all development and testing stages (with an incidence of 64% across the selected articles). Additionally, an average reduction of EMG activity was reported, with 24% for Upper Limb and 20% for Back Support. Regarding the subjective assessment reported in the questionnaires, 68% of the studies reported a positive evaluation. Based on these outcomes, this work provides a framework for an effective evaluation process for the OEs to raise TRL with recommendations for future research activities.
{"title":"Framework for the adoption, evaluation and impact of occupational Exoskeletons at different technology readiness levels: A systematic review","authors":"Jamil Ahmad , Vasco Fanti , Darwin G. Caldwell , Christian Di Natali","doi":"10.1016/j.robot.2024.104743","DOIUrl":"https://doi.org/10.1016/j.robot.2024.104743","url":null,"abstract":"<div><p>Work-related Musculoskeletal Disorders (WMSDs) are the most common occupational diseases caused by the prolonged performance of strenuous work, such as manual handling of loads or long-term maintenance of incongruous postures. Different safety protocols are implemented to reduce WMSDs and optimize the working environment, but one of the most promising solutions is using occupational exoskeletons (OEs). However, to truly acknowledge the benefits of OEs and be able to introduce them into daily business use, devices must pass several development and testing stages that determine the Technology Readiness Level (TRL). This review study aims to present an up-to-date collection of the most advanced assessments of exoskeletons for upper and back support, ranging from laboratory real-task simulations to operational scenarios in industrial sites. To identify relevant studies, we conducted comprehensive searches across different electronic databases, i.e., PubMed, Scopus, and Web of Science. Different keywords were used for the literature search, e.g., occupational exoskeleton, industrial exoskeleton, etc. Studies were included if they investigated the assessment of exoskeletons in the laboratory with real tasks or an industrial environment. We identified 45 research articles that fulfilled this selection criterion. Several features are compared and discussed in detail, such as industrial environment, experimental protocol, task performed, and exoskeleton typology. These data allowed us to formulate results that report the correspondence or discrepancy between the number of papers testing exoskeletons and WMSDs in different industrial sectors, the type of assessment performed, and the impact of exoskeletons on workers and industries at different TRLs. Among the results, the incidence of WMSDs in the manufacturing industry is 21.13%, while the adoption of exoskeletons in the same field is the highest with respect to the other industrial fields, at 44.45%. Electromyography (EMG) and Questionnaires were the most evaluated typologies across all development and testing stages (with an incidence of 64% across the selected articles). Additionally, an average reduction of EMG activity was reported, with 24% for Upper Limb and 20% for Back Support. Regarding the subjective assessment reported in the questionnaires, 68% of the studies reported a positive evaluation. Based on these outcomes, this work provides a framework for an effective evaluation process for the OEs to raise TRL with recommendations for future research activities.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"179 ","pages":"Article 104743"},"PeriodicalIF":4.3,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0921889024001271/pdfft?md5=ac460432a2a6a01712c39e905436a5b0&pid=1-s2.0-S0921889024001271-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141539420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-28DOI: 10.1016/j.robot.2024.104745
Yaguang Yan , Minan Tang , Wenjuan Wang , Yaqi Zhang , Bo An
Wearable rehabilitation robots have become an important auxiliary tool in rehabilitation therapy, providing effective rehabilitation training and helping to recover damaged muscles and joints. In response to the difficulty of traditional control methods in solving various constraints in the trajectory tracking process of the Upper Limb Rehabilitation Robot (ULRR), this study uses model predictive control to study the trajectory tracking problem of the upper limb rehabilitation robot. Firstly, based on the Lagrangian dynamic model of wearable rehabilitation robots, an extended state space model with pseudo linearization of the system was established. Given the performance indicators and various constraints of the system, a corresponding model predictive controller is designed based on the Laguerre model to ensure system performance while greatly reducing the computational complexity of predictive control. Secondly, the stability of the model predictive controller is demonstrated, and a disturbance observer is introduced into the controller to achieve compensation for slow-varying perturbations; a joint space sliding mode variable is also introduced to achieve simultaneous tracking of the joint’s desired position and desired velocity. Finally, taking a planar two bar robot as an example, comparative simulation verification was conducted on unconstrained joint trajectory tracking and constrained joint trajectory tracking. The simulation results show that the model predictive controller can achieve simultaneous tracking of joint expected trajectory and expected speed while meeting various constraints. It has good effects in improving patient motion control ability and reducing patient fatigue, providing new research ideas and methods for the field of rehabilitation therapy.
{"title":"Trajectory tracking control of wearable upper limb rehabilitation robot based on Laguerre model predictive control","authors":"Yaguang Yan , Minan Tang , Wenjuan Wang , Yaqi Zhang , Bo An","doi":"10.1016/j.robot.2024.104745","DOIUrl":"https://doi.org/10.1016/j.robot.2024.104745","url":null,"abstract":"<div><p>Wearable rehabilitation robots have become an important auxiliary tool in rehabilitation therapy, providing effective rehabilitation training and helping to recover damaged muscles and joints. In response to the difficulty of traditional control methods in solving various constraints in the trajectory tracking process of the Upper Limb Rehabilitation Robot (ULRR), this study uses model predictive control to study the trajectory tracking problem of the upper limb rehabilitation robot. Firstly, based on the Lagrangian dynamic model of wearable rehabilitation robots, an extended state space model with pseudo linearization of the system was established. Given the performance indicators and various constraints of the system, a corresponding model predictive controller is designed based on the Laguerre model to ensure system performance while greatly reducing the computational complexity of predictive control. Secondly, the stability of the model predictive controller is demonstrated, and a disturbance observer is introduced into the controller to achieve compensation for slow-varying perturbations; a joint space sliding mode variable is also introduced to achieve simultaneous tracking of the joint’s desired position and desired velocity. Finally, taking a planar two bar robot as an example, comparative simulation verification was conducted on unconstrained joint trajectory tracking and constrained joint trajectory tracking. The simulation results show that the model predictive controller can achieve simultaneous tracking of joint expected trajectory and expected speed while meeting various constraints. It has good effects in improving patient motion control ability and reducing patient fatigue, providing new research ideas and methods for the field of rehabilitation therapy.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"179 ","pages":"Article 104745"},"PeriodicalIF":4.3,"publicationDate":"2024-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141539435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-26DOI: 10.1016/j.robot.2024.104748
Zhi Cai , Jiahang Liu , Lin Xu , Jiayi Wang
The rapid development of robotics technology has driven the growth of robot types and the development of related technologies. As an important aspect of robot research, path planning technology plays an irreplaceable role in practical production and application. Ant colony algorithm has a wide range of applications in robot path planning, but there is also a problem of performance overly relying on initial parameter selection. In order to solve this problem and improve the performance of mobile robot path planning, an improved ant colony algorithm based on firefly algorithm was studied and designed in a two-dimensional environment. In order to further explore the performance of ant colony algorithm in solving robot coordinated path planning problems, an improved ant colony algorithm based on heuristic function was also designed. In a three-dimensional environment, an improved ant colony algorithm based on the improved artificial potential field method was designed. The research results show that the maximum running time of the improved ant colony algorithm based on the firefly algorithm in different grid environments is 819.36 s, 847.01 s, and 811.54 s, respectively. The average running time of the improved ant colony algorithm based on heuristic function in different grid environments is 5.19 s, 5.97 s, and 9.09 s, with average path lengths of 29.90 cm, 31.08 cm, and 37.01 cm, and path length variances of 0.35, 0.87, and 2.21, respectively. The ant colony algorithm based on the improved artificial potential field method has a running time of 1.930 s, 3.182 s, and 4.662 s in different grid environments, and a path length of 29.275 cm, 49.447 cm, and 67.057 cm, respectively. The ant colony algorithm for research and design optimization has good performance. The contribution of the research lies in the design of three path planning methods for mobile robots, including two-dimensional path planning and three-dimensional path planning, which improves the time of path planning and shortens the average path length. The novelty of the research is reflected in the design of a path planning method for mobile robots in two-dimensional and three-dimensional environments, which improves the ant colony algorithm through firefly algorithm and heuristic function, and combines the ant colony algorithm with the improved artificial potential field method. The method designed by the research institute can provide technical support for path planning of mobile robots.
机器人技术的飞速发展推动了机器人种类的增加和相关技术的发展。作为机器人研究的一个重要方面,路径规划技术在实际生产和应用中发挥着不可替代的作用。蚁群算法在机器人路径规划中有着广泛的应用,但也存在性能过于依赖初始参数选择的问题。为了解决这一问题,提高移动机器人路径规划的性能,在二维环境下研究并设计了一种基于萤火虫算法的改进蚁群算法。为了进一步探索蚁群算法在解决机器人协调路径规划问题中的性能,还设计了一种基于启发式函数的改进蚁群算法。在三维环境中,设计了基于改进人工势场方法的改进蚁群算法。研究结果表明,基于萤火虫算法的改进蚁群算法在不同网格环境下的最大运行时间分别为 819.36 秒、847.01 秒和 811.54 秒。基于启发式函数的改进蚁群算法在不同网格环境下的平均运行时间分别为 5.19 s、5.97 s 和 9.09 s,平均路径长度分别为 29.90 cm、31.08 cm 和 37.01 cm,路径长度方差分别为 0.35、0.87 和 2.21。基于改进人工势场方法的蚁群算法在不同网格环境下的运行时间分别为 1.930 s、3.182 s 和 4.662 s,路径长度分别为 29.275 cm、49.447 cm 和 67.057 cm。蚁群算法在研究和设计优化方面具有良好的性能。该研究的贡献在于为移动机器人设计了三种路径规划方法,包括二维路径规划和三维路径规划,提高了路径规划的时间,缩短了平均路径长度。研究的新颖性体现在设计了一种二维和三维环境下的移动机器人路径规划方法,该方法通过萤火虫算法和启发式函数改进了蚁群算法,并将蚁群算法与改进的人工势场方法相结合。研究所设计的方法可为移动机器人的路径规划提供技术支持。
{"title":"Cooperative path planning study of distributed multi-mobile robots based on optimised ACO algorithm","authors":"Zhi Cai , Jiahang Liu , Lin Xu , Jiayi Wang","doi":"10.1016/j.robot.2024.104748","DOIUrl":"https://doi.org/10.1016/j.robot.2024.104748","url":null,"abstract":"<div><p>The rapid development of robotics technology has driven the growth of robot types and the development of related technologies. As an important aspect of robot research, path planning technology plays an irreplaceable role in practical production and application. Ant colony algorithm has a wide range of applications in robot path planning, but there is also a problem of performance overly relying on initial parameter selection. In order to solve this problem and improve the performance of mobile robot path planning, an improved ant colony algorithm based on firefly algorithm was studied and designed in a two-dimensional environment. In order to further explore the performance of ant colony algorithm in solving robot coordinated path planning problems, an improved ant colony algorithm based on heuristic function was also designed. In a three-dimensional environment, an improved ant colony algorithm based on the improved artificial potential field method was designed. The research results show that the maximum running time of the improved ant colony algorithm based on the firefly algorithm in different grid environments is 819.36 s, 847.01 s, and 811.54 s, respectively. The average running time of the improved ant colony algorithm based on heuristic function in different grid environments is 5.19 s, 5.97 s, and 9.09 s, with average path lengths of 29.90 cm, 31.08 cm, and 37.01 cm, and path length variances of 0.35, 0.87, and 2.21, respectively. The ant colony algorithm based on the improved artificial potential field method has a running time of 1.930 s, 3.182 s, and 4.662 s in different grid environments, and a path length of 29.275 cm, 49.447 cm, and 67.057 cm, respectively. The ant colony algorithm for research and design optimization has good performance. The contribution of the research lies in the design of three path planning methods for mobile robots, including two-dimensional path planning and three-dimensional path planning, which improves the time of path planning and shortens the average path length. The novelty of the research is reflected in the design of a path planning method for mobile robots in two-dimensional and three-dimensional environments, which improves the ant colony algorithm through firefly algorithm and heuristic function, and combines the ant colony algorithm with the improved artificial potential field method. The method designed by the research institute can provide technical support for path planning of mobile robots.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"179 ","pages":"Article 104748"},"PeriodicalIF":4.3,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141606688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This note scrutinizes an adaptive fault-tolerant control (FTC) approach tailored for unmanned aerial vehicles (UAVs), addressing the critical need for both fault accommodation and disturbance suppression. Departing from traditional reliance on robust discontinuous control strategies prone to chattering and demanding precise uncertainty bounds, our FTC method ensures fixed-time stability, guaranteeing the convergence of attitude tracking errors to zero. Central to our approach is an adaptive algorithm adept at concurrently estimating unknown actuator faults and upper bounds of lumped uncertainties. Moreover, our adaptive schemes accurately estimate the upper bound of the lumped uncertainty term, encompassing model uncertainties, external disturbances, and unmodeled dynamics, thereby eliminating the need for assuming known bounds on uncertainties. Stability analysis under the developed control law is thoroughly performed using the Lyapunov stability theory. Notably, our strategy employs an extended Kalman filter (EKF) observer for state estimation and fault detection, facilitating fault detection through an adaptive threshold technique dynamically adjusted based on real-time mean and variance of the residual signal. Through comprehensive simulation and experimental validations, our proposed methodology demonstrates significant advancements in ensuring safety and reliability in UAVs.
{"title":"Robust fault detection and adaptive fixed-time fault-tolerant control for quadrotor UAVs","authors":"Mahmood Mazare, Mostafa Taghizadeh, Pegah Ghaf-Ghanbari, Ehsan Davoodi","doi":"10.1016/j.robot.2024.104747","DOIUrl":"https://doi.org/10.1016/j.robot.2024.104747","url":null,"abstract":"<div><p>This note scrutinizes an adaptive fault-tolerant control (FTC) approach tailored for unmanned aerial vehicles (UAVs), addressing the critical need for both fault accommodation and disturbance suppression. Departing from traditional reliance on robust discontinuous control strategies prone to chattering and demanding precise uncertainty bounds, our FTC method ensures fixed-time stability, guaranteeing the convergence of attitude tracking errors to zero. Central to our approach is an adaptive algorithm adept at concurrently estimating unknown actuator faults and upper bounds of lumped uncertainties. Moreover, our adaptive schemes accurately estimate the upper bound of the lumped uncertainty term, encompassing model uncertainties, external disturbances, and unmodeled dynamics, thereby eliminating the need for assuming known bounds on uncertainties. Stability analysis under the developed control law is thoroughly performed using the Lyapunov stability theory. Notably, our strategy employs an extended Kalman filter (EKF) observer for state estimation and fault detection, facilitating fault detection through an adaptive threshold technique dynamically adjusted based on real-time mean and variance of the residual signal. Through comprehensive simulation and experimental validations, our proposed methodology demonstrates significant advancements in ensuring safety and reliability in UAVs.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"179 ","pages":"Article 104747"},"PeriodicalIF":4.3,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141539419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-20DOI: 10.1016/j.robot.2024.104732
Soo Ho Woo , Soon-Geul Lee , JaeHwan Choi , JunKi Hong , Jae-Hong Lee , HaoTian Xie
This paper presents adaptive and occupancy grid map algorithms for automatic lane change technology, a core technology in autonomous vehicles. The objectives are to improve driver safety and convenience with technology that automatically changes lanes at the request of the driver. The algorithms construct nine grids on the basis of ego vehicles and generate adaptive and occupancy grid maps by using the relative speeds of ego and target vehicles. When a driver requests a lane change, the adaptive grid map reduces the number of cases where the target vehicles may exist around the ego vehicle from 256 to 32, thus decreasing the calculation amount. Therefore, the algorithms are suitable for use in autonomous vehicles that require real-time calculations. An occupancy grid map is formed in accordance with the location of the target vehicles, and whether lane changes are possible is determined. The algorithms generate a virtual simulation environment with the CarMaker and are simulated using Matlab/Simulink. An experiment is conducted in a real driving environment with real vehicles to prove the validity of the algorithms.
Pub Date : 2024-06-13DOI: 10.1016/j.robot.2024.104731
Viktor Wiberg , Erik Wallin , Arvid Fälldin , Tobias Semberg , Morgan Rossander , Eddie Wadbro , Martin Servin
We explore sim-to-real transfer of deep reinforcement learning controllers for a heavy vehicle with active suspensions designed for traversing rough terrain. While related research primarily focuses on lightweight robots with electric motors and fast actuation, this study uses a forestry vehicle with a complex hydraulic driveline and slow actuation. We simulate the vehicle using multibody dynamics and apply system identification to find an appropriate set of simulation parameters. We then train policies in simulation using various techniques to mitigate the sim-to-real gap, including domain randomization, action delays, and a reward penalty to encourage smooth control. In reality, the policies trained with action delays and a penalty for erratic actions perform nearly at the same level as in simulation. In experiments on level ground, the motion trajectories closely overlap when turning to either side, as well as in a route tracking scenario. When faced with a ramp that requires active use of the suspensions, the simulated and real motions are in close alignment. This shows that the actuator model together with system identification yields a sufficiently accurate model of the actuators. We observe that policies trained without the additional action penalty exhibit fast switching or bang–bang control. These present smooth motions and high performance in simulation but transfer poorly to reality. We find that policies make marginal use of the local height map for perception, showing no indications of predictive planning. However, the strong transfer capabilities entail that further development concerning perception and performance can be largely confined to simulation.
{"title":"Sim-to-real transfer of active suspension control using deep reinforcement learning","authors":"Viktor Wiberg , Erik Wallin , Arvid Fälldin , Tobias Semberg , Morgan Rossander , Eddie Wadbro , Martin Servin","doi":"10.1016/j.robot.2024.104731","DOIUrl":"https://doi.org/10.1016/j.robot.2024.104731","url":null,"abstract":"<div><p>We explore sim-to-real transfer of deep reinforcement learning controllers for a heavy vehicle with active suspensions designed for traversing rough terrain. While related research primarily focuses on lightweight robots with electric motors and fast actuation, this study uses a forestry vehicle with a complex hydraulic driveline and slow actuation. We simulate the vehicle using multibody dynamics and apply system identification to find an appropriate set of simulation parameters. We then train policies in simulation using various techniques to mitigate the sim-to-real gap, including domain randomization, action delays, and a reward penalty to encourage smooth control. In reality, the policies trained with action delays and a penalty for erratic actions perform nearly at the same level as in simulation. In experiments on level ground, the motion trajectories closely overlap when turning to either side, as well as in a route tracking scenario. When faced with a ramp that requires active use of the suspensions, the simulated and real motions are in close alignment. This shows that the actuator model together with system identification yields a sufficiently accurate model of the actuators. We observe that policies trained without the additional action penalty exhibit fast switching or bang–bang control. These present smooth motions and high performance in simulation but transfer poorly to reality. We find that policies make marginal use of the local height map for perception, showing no indications of predictive planning. However, the strong transfer capabilities entail that further development concerning perception and performance can be largely confined to simulation.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"179 ","pages":"Article 104731"},"PeriodicalIF":4.3,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0921889024001155/pdfft?md5=e7bbe412bd07a5f03c52e1e36921e3d4&pid=1-s2.0-S0921889024001155-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141480382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-13DOI: 10.1016/j.robot.2024.104744
Canjun Yang , Xin Wu , Mingwei Lin , Ri Lin , Di Wu
Underwater humanoid robots (UHRs) have emerged as a significant area of interest in robotics, with the potential to overcome the limitations of traditional underwater robots and revolutionize underwater activities. This review examines the development of UHRs, focusing on their perception, decision-making, and execution capabilities within a hierarchical human-machine cooperation framework. The Perception Layer involves gathering information from the environment and human collaborators. The Decision-making Layer explores different levels of robot autonomy and the current status of human-UHR collaborative decision-making. The Execution Layer encompasses modeling, control, and actuation mechanisms to translate high-level intentions into physical actions. Various UHR implementations across research teams are reviewed to provide a comprehensive overview of current advancements. Discussions and challenges surrounding UHR progress are provided as well. Continued research and development efforts of UHR represent a promising avenue for advancing human-machine cooperation and pushing the boundaries of underwater exploration, contributing to scientific discoveries and societal benefits in this captivating realm.
{"title":"A review of advances in underwater humanoid robots for human–machine cooperation","authors":"Canjun Yang , Xin Wu , Mingwei Lin , Ri Lin , Di Wu","doi":"10.1016/j.robot.2024.104744","DOIUrl":"10.1016/j.robot.2024.104744","url":null,"abstract":"<div><p>Underwater humanoid robots (UHRs) have emerged as a significant area of interest in robotics, with the potential to overcome the limitations of traditional underwater robots and revolutionize underwater activities. This review examines the development of UHRs, focusing on their perception, decision-making, and execution capabilities within a hierarchical human-machine cooperation framework. The Perception Layer involves gathering information from the environment and human collaborators. The Decision-making Layer explores different levels of robot autonomy and the current status of human-UHR collaborative decision-making. The Execution Layer encompasses modeling, control, and actuation mechanisms to translate high-level intentions into physical actions. Various UHR implementations across research teams are reviewed to provide a comprehensive overview of current advancements. Discussions and challenges surrounding UHR progress are provided as well. Continued research and development efforts of UHR represent a promising avenue for advancing human-machine cooperation and pushing the boundaries of underwater exploration, contributing to scientific discoveries and societal benefits in this captivating realm.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"179 ","pages":"Article 104744"},"PeriodicalIF":4.3,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141404094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-13DOI: 10.1016/j.robot.2024.104742
Ke Wang , Zhaoyang Jacopo Hu , Peter Tisnikar , Oskar Helander , Digby Chappell , Petar Kormushev
Online footstep planning is essential for bipedal walking robots, allowing them to walk in the presence of disturbances and sensory noise. Most of the literature on the topic has focused on optimizing the footstep placement while keeping the step timing constant. In this work, we introduce a footstep planner capable of optimizing footstep placement and step time online. The proposed planner, consisting of an Interior Point Optimizer (IPOPT) and an optimizer based on Augmented Lagrangian (AL) method with analytical gradient descent, solves the full dynamics of the Linear Inverted Pendulum (LIP) model in real time to optimize for footstep location as well as step timing at the rate of 200 Hz. We show that such asynchronous real-time optimization with the AL method (ARTO-AL) provides the required robustness and speed for successful online footstep planning. Furthermore, ARTO-AL can be extended to plan footsteps in 3D, allowing terrain-aware footstep planning on uneven terrains. Compared to an algorithm with no footstep time adaptation, our proposed ARTO-AL demonstrates increased stability in simulated walking experiments as it can resist pushes on flat ground and on a ramp up to 120 N and 100 N respectively. Videos2 and open-source code3 are released.
在线脚步规划对于双足行走机器人来说至关重要,它使机器人能够在干扰和感知噪声的情况下行走。有关该主题的大部分文献都侧重于优化脚步位置,同时保持步进时间不变。在这项工作中,我们介绍了一种能够在线优化脚步位置和步进时间的脚步规划器。所提出的规划器由一个内部点优化器(IPOPT)和一个基于分析梯度下降的增强拉格朗日(AL)方法的优化器组成,实时求解线性倒立摆(LIP)模型的全部动力学,以 200 Hz 的速率优化脚步位置和步进时间。我们的研究表明,AL 方法(ARTO-AL)的这种异步实时优化为成功的在线脚步规划提供了所需的鲁棒性和速度。此外,ARTO-AL 还可以扩展到三维脚步规划,从而在不平坦的地形上实现地形感知脚步规划。与没有脚步时间自适应的算法相比,我们提出的 ARTO-AL 在模拟行走实验中表现出更高的稳定性,因为它可以在平地上和 10° 斜坡上分别抵抗高达 120 N 和 100 N 的推力。视频2和开源代码3已发布。
{"title":"When and where to step: Terrain-aware real-time footstep location and timing optimization for bipedal robots","authors":"Ke Wang , Zhaoyang Jacopo Hu , Peter Tisnikar , Oskar Helander , Digby Chappell , Petar Kormushev","doi":"10.1016/j.robot.2024.104742","DOIUrl":"https://doi.org/10.1016/j.robot.2024.104742","url":null,"abstract":"<div><p>Online footstep planning is essential for bipedal walking robots, allowing them to walk in the presence of disturbances and sensory noise. Most of the literature on the topic has focused on optimizing the footstep placement while keeping the step timing constant. In this work, we introduce a footstep planner capable of optimizing footstep placement and step time online. The proposed planner, consisting of an Interior Point Optimizer (IPOPT) and an optimizer based on Augmented Lagrangian (AL) method with analytical gradient descent, solves the full dynamics of the Linear Inverted Pendulum (LIP) model in real time to optimize for footstep location as well as step timing at the rate of 200 Hz. We show that such asynchronous real-time optimization with the AL method (ARTO-AL) provides the required robustness and speed for successful online footstep planning. Furthermore, ARTO-AL can be extended to plan footsteps in 3D, allowing terrain-aware footstep planning on uneven terrains. Compared to an algorithm with no footstep time adaptation, our proposed ARTO-AL demonstrates increased stability in simulated walking experiments as it can resist pushes on flat ground and on a <span><math><mrow><mn>10</mn><mo>°</mo></mrow></math></span> ramp up to 120 N and 100 N respectively. Videos<span><sup>2</sup></span> and open-source code<span><sup>3</sup></span> are released.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"179 ","pages":"Article 104742"},"PeriodicalIF":4.3,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S092188902400126X/pdfft?md5=599882b40704d445bb509be303dd3163&pid=1-s2.0-S092188902400126X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141434213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-07DOI: 10.1016/j.robot.2024.104730
Mehrez Boulares, Afef Fehri, Mohamed Jemni
In the context of real world application, Search and Rescue Missions on the ocean surface remain a complex task due to the large-scale area and the forces of the ocean currents, spreading lost targets and debris in an unpredictable way. In this work, we present a Path Planning Approach to search for a lost target on ocean surface using a swarm of UAVs. The combination of GlobCurrent dataset and a Lagrangian simulator is used to determine where the particles are moved by the ocean currents forces while Deep Q-learning algorithm is applied to learn from their dynamics. The evaluation results of the trained models show that our search strategy is effective and efficient. Over a total search area (red Sea zone), surface of 453422 Km, we have shown that our strategy Search Success Rate is 98.61%, the maximum Search Time to detection is 15 days and the average Search Time to detection is almost 15 h.
{"title":"UAV path planning algorithm based on Deep Q-Learning to search for a floating lost target in the ocean","authors":"Mehrez Boulares, Afef Fehri, Mohamed Jemni","doi":"10.1016/j.robot.2024.104730","DOIUrl":"10.1016/j.robot.2024.104730","url":null,"abstract":"<div><p>In the context of real world application, Search and Rescue Missions on the ocean surface remain a complex task due to the large-scale area and the forces of the ocean currents, spreading lost targets and debris in an unpredictable way. In this work, we present a Path Planning Approach to search for a lost target on ocean surface using a swarm of UAVs. The combination of GlobCurrent dataset and a Lagrangian simulator is used to determine where the particles are moved by the ocean currents forces while Deep Q-learning algorithm is applied to learn from their dynamics. The evaluation results of the trained models show that our search strategy is effective and efficient. Over a total search area (red Sea zone), surface of 453422 Km<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>, we have shown that our strategy Search Success Rate is 98.61%, the maximum Search Time to detection is 15 days and the average Search Time to detection is almost 15 h.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"179 ","pages":"Article 104730"},"PeriodicalIF":4.3,"publicationDate":"2024-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141399395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}