Pub Date : 2025-12-09DOI: 10.1007/s10489-025-07035-7
Chenyang Miao, Yingzhuo Jiang, Yunduan Cui, Yidong Chen, Tianfu Sun
A novel Reinforcement Learning (RL) approach Multi-agent Joint Control with State-Action Embedding (MASAE) is proposed in this paper to address the sample-efficiency issue of RL in robot control. It combines the relative entropy regularization and high update-to-data (UTD) ratios in one multi-agent framework to accelerate the learning process while naturally mitigating the overestimation of value functions caused by high UTD ratios by multiple agents. The state-action embeddings are employed to adaptively abstract the hidden features behind the state-action space for enhanced learning efficiency. Evaluated by several simulated benchmark control tasks and a real-world Unitree Go1 quadruped robot system, MASAE demonstrates significant advantages in learning capability and sampling efficiency compared to various related RL baselines, indicating its potential in learning challenging real-world robot systems with a limited number of samples. The open-source code of MASAE is available at https://github.com/AdrienLin1/MASAE.
{"title":"Sample-efficient multi-agent reinforcement learning with high update-to-data ratio and state-action embedding","authors":"Chenyang Miao, Yingzhuo Jiang, Yunduan Cui, Yidong Chen, Tianfu Sun","doi":"10.1007/s10489-025-07035-7","DOIUrl":"10.1007/s10489-025-07035-7","url":null,"abstract":"<div><p>A novel Reinforcement Learning (RL) approach Multi-agent Joint Control with State-Action Embedding (MASAE) is proposed in this paper to address the sample-efficiency issue of RL in robot control. It combines the relative entropy regularization and high update-to-data (UTD) ratios in one multi-agent framework to accelerate the learning process while naturally mitigating the overestimation of value functions caused by high UTD ratios by multiple agents. The state-action embeddings are employed to adaptively abstract the hidden features behind the state-action space for enhanced learning efficiency. Evaluated by several simulated benchmark control tasks and a real-world Unitree Go1 quadruped robot system, MASAE demonstrates significant advantages in learning capability and sampling efficiency compared to various related RL baselines, indicating its potential in learning challenging real-world robot systems with a limited number of samples. The open-source code of MASAE is available at https://github.com/AdrienLin1/MASAE.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145729908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Fetal heart rate (FHR) signals are widely used for fetal health assessment in clinical settings, making them popular in artificial intelligence-based algorithms for fetal health diagnosis. However, a major challenge for such algorithms is the need for a large amount of labeled and category-balanced clinical data to train the models. Like other medical data, FHR faces severe class imbalance in pathological data. Therefore, this paper proposes a minority sample generation method to generate high-quality pathological FHR signals to improve downstream classification task performance. We propose a long time series progressive growing generative adversarial network, TSP-GAN, which dynamically increases the network during training to achieve a transition from coarse-grained to fine-grained time features, thus generating long-time series with rich detailed information. The loss function of this network introduces L2 regularization on the basis of Wasserstein distance and gradient penalty terms to generate high-fidelity signals while avoiding mode collapse. On the one hand, visual and quantitative comparison experiments are designed and the results show that signals of different lengths generated by our network all obtained superior performance. On the other hand, downstream classification tasks are designed and the results indicate that the augmented category-balanced dataset improved by 10% in accuracy compared to the original unbalanced dataset. Therefore, TSP-GAN developed in this paper has practical application value in addressing the problem of sample imbalance in time series.
{"title":"A time-series progressive generative adversarial network for improving imbalanced fetal heart rate signal classification","authors":"Yanjun Deng, Yefei Zhang, Hao Wang, Pengfei Jiao, Gang Li, Zhidong Zhao","doi":"10.1007/s10489-025-07008-w","DOIUrl":"10.1007/s10489-025-07008-w","url":null,"abstract":"<div><p>Fetal heart rate (FHR) signals are widely used for fetal health assessment in clinical settings, making them popular in artificial intelligence-based algorithms for fetal health diagnosis. However, a major challenge for such algorithms is the need for a large amount of labeled and category-balanced clinical data to train the models. Like other medical data, FHR faces severe class imbalance in pathological data. Therefore, this paper proposes a minority sample generation method to generate high-quality pathological FHR signals to improve downstream classification task performance. We propose a long time series progressive growing generative adversarial network, TSP-GAN, which dynamically increases the network during training to achieve a transition from coarse-grained to fine-grained time features, thus generating long-time series with rich detailed information. The loss function of this network introduces L2 regularization on the basis of Wasserstein distance and gradient penalty terms to generate high-fidelity signals while avoiding mode collapse. On the one hand, visual and quantitative comparison experiments are designed and the results show that signals of different lengths generated by our network all obtained superior performance. On the other hand, downstream classification tasks are designed and the results indicate that the augmented category-balanced dataset improved by 10% in accuracy compared to the original unbalanced dataset. Therefore, TSP-GAN developed in this paper has practical application value in addressing the problem of sample imbalance in time series.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145675642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-05DOI: 10.1007/s10489-025-06922-3
Weixiong Jiang, Jun Wu, Zuoyi Chen, Haiping Zhu, Yaqiong Lv
Axial piston pump fault diagnosis plays a critical role in industrial application field. However, the existing methods face tremendous difficulties in disposing of multi-sensor data-driven and varied pressure pulsation issues. This makes it impossible to select effective diagnostic evidence from multi-sensor data, and the reserved diagnosis model trained under known pressure pulsation fails to adapt for new operation condition. Dedicated to these problems, this paper proposes the manifold transfer (MT) and ensemble filter strategy (EFS) for pump fault diagnosis. In this work, MT is constructed for dimension reduction and feature transformation based on curvilinear component analysis (CCA). It is capable of nonlinear manifold learning to address the issue of varied pressure pulsation. Then, an ensemble filter strategy with an information filtrate function is designed to improve the fault diagnosis performance. The effectiveness of the proposed method is validated by a fault experiment on axial piston pump. The experimental results demonstrate that compared with other existing methods, the proposed method is competitive in terms of diagnostic accuracy and efficiency. Highlights. The manifold transfer is proposed to solve the varied pressure pulsation issue. An ensemble filter strategy is devised to achieve accurate and efficient fault diagnosis without manual intervention. Axial piston pump fault simulation experiments are conducted to validate the effectiveness of proposed method.
{"title":"Manifold transfer and ensemble filter strategy for axial piston pump fault diagnosis under varied pressure pulsation","authors":"Weixiong Jiang, Jun Wu, Zuoyi Chen, Haiping Zhu, Yaqiong Lv","doi":"10.1007/s10489-025-06922-3","DOIUrl":"10.1007/s10489-025-06922-3","url":null,"abstract":"<div><p>Axial piston pump fault diagnosis plays a critical role in industrial application field. However, the existing methods face tremendous difficulties in disposing of multi-sensor data-driven and varied pressure pulsation issues. This makes it impossible to select effective diagnostic evidence from multi-sensor data, and the reserved diagnosis model trained under known pressure pulsation fails to adapt for new operation condition. Dedicated to these problems, this paper proposes the manifold transfer (MT) and ensemble filter strategy (EFS) for pump fault diagnosis. In this work, MT is constructed for dimension reduction and feature transformation based on curvilinear component analysis (CCA). It is capable of nonlinear manifold learning to address the issue of varied pressure pulsation. Then, an ensemble filter strategy with an information filtrate function is designed to improve the fault diagnosis performance. The effectiveness of the proposed method is validated by a fault experiment on axial piston pump. The experimental results demonstrate that compared with other existing methods, the proposed method is competitive in terms of diagnostic accuracy and efficiency. Highlights. The manifold transfer is proposed to solve the varied pressure pulsation issue. An ensemble filter strategy is devised to achieve accurate and efficient fault diagnosis without manual intervention. Axial piston pump fault simulation experiments are conducted to validate the effectiveness of proposed method.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145675570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}