首页 > 最新文献

自主智能系统(英文)最新文献

英文 中文
Risk assessment in autonomous driving: a comprehensive survey of risk sources, methodologies, and system architectures 自动驾驶中的风险评估:风险源、方法和系统架构的全面调查
Pub Date : 2025-09-22 DOI: 10.1007/s43684-025-00112-1
Dongyuan Lu, Haoyang Du, Zhengfei Wu, Shuo Yang

As autonomous driving technology advances from assisted to higher levels of autonomy, the complexity of operational environments and the uncertainty of driving tasks continue to increase, posing significant challenges to system safety. The key to ensuring safety lies in conducting comprehensive and rational risk assessments to identify potential hazards and inform policy optimization. Consequently, risk assessment has emerged as a critical component for ensuring the safe operation of higher-level autonomous driving systems. This review focuses on research into risk assessment for autonomous driving. It systematically surveys the state-of-the-art literature from three key perspectives: risk sources, assessment methodologies, data foundations, and system architectures. For each perspective, the paper provides an in-depth analysis of representative technical approaches, modeling principles, and typical application scenarios, while summarizing their research characteristics and applicable boundaries. Finally, this paper synthesizes the three fundamental challenges that persist in current research and further explores future directions and development opportunities. It provides a theoretical foundation and methodological references for the development of autonomous driving systems that exhibit high safety and reliability.

随着自动驾驶技术从辅助驾驶向更高水平的自主驾驶发展,操作环境的复杂性和驾驶任务的不确定性不断增加,对系统安全性提出了重大挑战。确保安全的关键在于进行全面合理的风险评估,识别潜在危险,为政策优化提供信息。因此,风险评估已成为确保高级自动驾驶系统安全运行的关键组成部分。本文对自动驾驶风险评估的研究进行了综述。它从三个关键角度系统地调查了最新的文献:风险源、评估方法、数据基础和系统架构。针对每个视角,深入分析了具有代表性的技术方法、建模原理和典型应用场景,总结了各自的研究特点和适用范围。最后,本文综合了当前研究中存在的三个根本性挑战,并进一步探讨了未来的研究方向和发展机遇。为开发高安全性、高可靠性的自动驾驶系统提供了理论基础和方法参考。
{"title":"Risk assessment in autonomous driving: a comprehensive survey of risk sources, methodologies, and system architectures","authors":"Dongyuan Lu,&nbsp;Haoyang Du,&nbsp;Zhengfei Wu,&nbsp;Shuo Yang","doi":"10.1007/s43684-025-00112-1","DOIUrl":"10.1007/s43684-025-00112-1","url":null,"abstract":"<div><p>As autonomous driving technology advances from assisted to higher levels of autonomy, the complexity of operational environments and the uncertainty of driving tasks continue to increase, posing significant challenges to system safety. The key to ensuring safety lies in conducting comprehensive and rational risk assessments to identify potential hazards and inform policy optimization. Consequently, risk assessment has emerged as a critical component for ensuring the safe operation of higher-level autonomous driving systems. This review focuses on research into risk assessment for autonomous driving. It systematically surveys the state-of-the-art literature from three key perspectives: risk sources, assessment methodologies, data foundations, and system architectures. For each perspective, the paper provides an in-depth analysis of representative technical approaches, modeling principles, and typical application scenarios, while summarizing their research characteristics and applicable boundaries. Finally, this paper synthesizes the three fundamental challenges that persist in current research and further explores future directions and development opportunities. It provides a theoretical foundation and methodological references for the development of autonomous driving systems that exhibit high safety and reliability.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-025-00112-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145100773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: An intelligent surface roughness prediction method based on automatic feature extraction and adaptive data fusion 一种基于自动特征提取和自适应数据融合的表面粗糙度智能预测方法
Pub Date : 2025-09-10 DOI: 10.1007/s43684-025-00107-y
Xun Zhang, Sibao Wang, Fangrui Gao, Hao Wang, Haoyu Wu, Ying Liu
{"title":"Correction to: An intelligent surface roughness prediction method based on automatic feature extraction and adaptive data fusion","authors":"Xun Zhang,&nbsp;Sibao Wang,&nbsp;Fangrui Gao,&nbsp;Hao Wang,&nbsp;Haoyu Wu,&nbsp;Ying Liu","doi":"10.1007/s43684-025-00107-y","DOIUrl":"10.1007/s43684-025-00107-y","url":null,"abstract":"","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-025-00107-y.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Output-based adaptive distributed observer for general linear leader systems over periodic switching digraphs 修正:周期切换有向图上一般线性先导系统的基于输出的自适应分布式观测器
Pub Date : 2025-09-10 DOI: 10.1007/s43684-025-00109-w
Changran He, Jie Huang
{"title":"Correction to: Output-based adaptive distributed observer for general linear leader systems over periodic switching digraphs","authors":"Changran He,&nbsp;Jie Huang","doi":"10.1007/s43684-025-00109-w","DOIUrl":"10.1007/s43684-025-00109-w","url":null,"abstract":"","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-025-00109-w.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Multi-domain fusion for cargo UAV fault diagnosis knowledge graph construction 修正:基于多域融合的货运无人机故障诊断知识图谱构建
Pub Date : 2025-09-10 DOI: 10.1007/s43684-025-00106-z
Ao Xiao, Wei Yan, Xumei Zhang, Ying Liu, Hua Zhang, Qi Liu
{"title":"Correction to: Multi-domain fusion for cargo UAV fault diagnosis knowledge graph construction","authors":"Ao Xiao,&nbsp;Wei Yan,&nbsp;Xumei Zhang,&nbsp;Ying Liu,&nbsp;Hua Zhang,&nbsp;Qi Liu","doi":"10.1007/s43684-025-00106-z","DOIUrl":"10.1007/s43684-025-00106-z","url":null,"abstract":"","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-025-00106-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: A novel method for measuring center-axis velocity of unmanned aerial vehicles through synthetic motion blur images 一种利用合成运动模糊图像测量无人机中心轴速度的新方法
Pub Date : 2025-09-10 DOI: 10.1007/s43684-025-00108-x
Quanxi Zhan, Yanmin Zhou, Junrui Zhang, Chenyang Sun, Runjie Shen, Bin He
{"title":"Correction to: A novel method for measuring center-axis velocity of unmanned aerial vehicles through synthetic motion blur images","authors":"Quanxi Zhan,&nbsp;Yanmin Zhou,&nbsp;Junrui Zhang,&nbsp;Chenyang Sun,&nbsp;Runjie Shen,&nbsp;Bin He","doi":"10.1007/s43684-025-00108-x","DOIUrl":"10.1007/s43684-025-00108-x","url":null,"abstract":"","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-025-00108-x.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction: Explanation framework for industrial recommendation systems based on the generative adversarial network with embedding constraints 更正:基于嵌入约束的生成对抗网络的工业推荐系统的解释框架
Pub Date : 2025-09-10 DOI: 10.1007/s43684-025-00110-3
Binchuan Qi, Wei Gong, Li Li
{"title":"Correction: Explanation framework for industrial recommendation systems based on the generative adversarial network with embedding constraints","authors":"Binchuan Qi,&nbsp;Wei Gong,&nbsp;Li Li","doi":"10.1007/s43684-025-00110-3","DOIUrl":"10.1007/s43684-025-00110-3","url":null,"abstract":"","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-025-00110-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145028147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large language models for PHM: a review of optimization techniques and applications PHM的大型语言模型:优化技术和应用综述
Pub Date : 2025-08-19 DOI: 10.1007/s43684-025-00100-5
Tingyi Yu, Junya Tang, Qingyun Yu, Li Li, Ying Liu, Raul Poler

The rapid advancement of Large Language Models (LLMs) has created unprecedented opportunities for industrial automation, process optimization, and decision support systems. As industries seek to leverage LLMs for industrial tasks, understanding their architecture, deployment strategies, and fine-tuning methods becomes critical. In this review, we aim to summarize the challenges, key technologies, current status, and future directions of LLM in Prognostics and Health Management(PHM). First, this review introduces deep learning for PHM. We begin by analyzing the architectural considerations and deployment strategies for industrial environments, including acceleration techniques and quantization methods that enable efficient operation on resource-constrained industrial hardware. Second, we investigate Parameter Efficient Fine-Tuning (PEFT) techniques that allow industry-specific adaptation without prohibitive computational costs. Multi-modal capabilities extending LLMs beyond text to process sensor data, images, and time-series information are also discussed. Finally, we explore emerging PHM including anomaly detection systems that identify equipment malfunctions, fault diagnosis frameworks that determine root causes, and specialized question-answering systems that empower workers with instant domain expertise. We conclude by identifying key challenges and future research directions for LLM deployment in PHM. This review provides a timely resource for researchers, engineers, and decision-makers navigating the transformative potential of language models in industry 4.0 environments.

大型语言模型(llm)的快速发展为工业自动化、流程优化和决策支持系统创造了前所未有的机会。随着行业寻求利用llm来完成工业任务,了解llm的体系结构、部署策略和微调方法变得至关重要。本文综述了预后与健康管理(PHM)法学硕士面临的挑战、关键技术、现状和未来发展方向。首先,本文介绍了PHM的深度学习。我们首先分析工业环境的体系结构考虑因素和部署策略,包括加速技术和量化方法,它们可以在资源受限的工业硬件上实现高效操作。其次,我们研究了参数高效微调(PEFT)技术,该技术允许行业特定的适应,而不需要高昂的计算成本。还讨论了将llm扩展到文本之外的多模式功能,以处理传感器数据、图像和时间序列信息。最后,我们探讨了新兴的PHM,包括识别设备故障的异常检测系统,确定根本原因的故障诊断框架,以及赋予工人即时领域专业知识的专业问答系统。最后,我们确定了LLM在PHM中部署的主要挑战和未来的研究方向。这篇综述为研究人员、工程师和决策者在工业4.0环境中导航语言模型的变革潜力提供了及时的资源。
{"title":"Large language models for PHM: a review of optimization techniques and applications","authors":"Tingyi Yu,&nbsp;Junya Tang,&nbsp;Qingyun Yu,&nbsp;Li Li,&nbsp;Ying Liu,&nbsp;Raul Poler","doi":"10.1007/s43684-025-00100-5","DOIUrl":"10.1007/s43684-025-00100-5","url":null,"abstract":"<div><p>The rapid advancement of Large Language Models (LLMs) has created unprecedented opportunities for industrial automation, process optimization, and decision support systems. As industries seek to leverage LLMs for industrial tasks, understanding their architecture, deployment strategies, and fine-tuning methods becomes critical. In this review, we aim to summarize the challenges, key technologies, current status, and future directions of LLM in Prognostics and Health Management(PHM). First, this review introduces deep learning for PHM. We begin by analyzing the architectural considerations and deployment strategies for industrial environments, including acceleration techniques and quantization methods that enable efficient operation on resource-constrained industrial hardware. Second, we investigate Parameter Efficient Fine-Tuning (PEFT) techniques that allow industry-specific adaptation without prohibitive computational costs. Multi-modal capabilities extending LLMs beyond text to process sensor data, images, and time-series information are also discussed. Finally, we explore emerging PHM including anomaly detection systems that identify equipment malfunctions, fault diagnosis frameworks that determine root causes, and specialized question-answering systems that empower workers with instant domain expertise. We conclude by identifying key challenges and future research directions for LLM deployment in PHM. This review provides a timely resource for researchers, engineers, and decision-makers navigating the transformative potential of language models in industry 4.0 environments.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-025-00100-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144868630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing predictive maintenance and mission assignment to enhance fleet readiness under uncertainty 优化预测性维护和任务分配,增强不确定条件下的机队战备状态
Pub Date : 2025-08-15 DOI: 10.1007/s43684-025-00104-1
Ryan O’Neil, Abdelhakim Khatab, Claver Diallo

In many industrial settings, fleets of assets are required to operate through alternating missions and breaks. Fleet Selective Maintenance (FSM) is widely used in such contexts to improve the fleet performance. However, existing FSM models assume that upcoming missions are identical and require only a single system configuration for completion. Additionally, these models typically assume that all missions must be completed, overlooking resource constraints that may prevent readying all systems within the available break duration. This makes mission prioritization and assignment a necessary consideration for the decision-maker. This work proposes a novel FSM model that jointly optimizes system to mission assignment, component and maintenance level selection, and repair task allocation. The proposed framework integrates analytical models for standard components and Deep Neural Networks (DNNs) for sensor-monitored ones, enabling a hybrid reliability assessment approach that better reflects real-world multi-component systems. To account for uncertainties in maintenance and break durations, a chance-constrained optimization model is developed to ensure that maintenance is completed within the available break duration with a specified confidence level. The optimization model is reformulated using two well-known techniques: Sample Average Approximation (SAA) and Conditional Value-at-Risk (CVaR) approximation. A case study of military aircraft fleet maintenance is investigated to demonstrate the accuracy and added value of the proposed approach.

在许多工业环境中,资产车队需要通过交替的任务和休息来运行。在这种情况下,车队选择性维护(FSM)被广泛用于提高车队的性能。然而,现有的FSM模型假设即将到来的任务是相同的,并且只需要一个系统配置即可完成。此外,这些模型通常假设所有任务都必须完成,忽略了可能妨碍在可用的中断时间内准备所有系统的资源限制。这使得任务的优先级和分配成为决策者的必要考虑因素。本文提出了一种新的FSM模型,该模型对系统的任务分配、部件和维护级别的选择以及维修任务的分配进行了联合优化。提出的框架集成了标准组件的分析模型和传感器监测组件的深度神经网络(dnn),使混合可靠性评估方法能够更好地反映现实世界的多组件系统。为了考虑维护和中断持续时间的不确定性,开发了一个机会约束优化模型,以确保在指定的置信水平下,在可用的中断持续时间内完成维护。优化模型采用两种著名的技术:样本平均近似(SAA)和条件风险值(CVaR)近似。以军用飞机机队维修为例,验证了该方法的准确性和附加价值。
{"title":"Optimizing predictive maintenance and mission assignment to enhance fleet readiness under uncertainty","authors":"Ryan O’Neil,&nbsp;Abdelhakim Khatab,&nbsp;Claver Diallo","doi":"10.1007/s43684-025-00104-1","DOIUrl":"10.1007/s43684-025-00104-1","url":null,"abstract":"<div><p>In many industrial settings, fleets of assets are required to operate through alternating missions and breaks. Fleet Selective Maintenance (FSM) is widely used in such contexts to improve the fleet performance. However, existing FSM models assume that upcoming missions are identical and require only a single system configuration for completion. Additionally, these models typically assume that all missions must be completed, overlooking resource constraints that may prevent readying all systems within the available break duration. This makes mission prioritization and assignment a necessary consideration for the decision-maker. This work proposes a novel FSM model that jointly optimizes system to mission assignment, component and maintenance level selection, and repair task allocation. The proposed framework integrates analytical models for standard components and Deep Neural Networks (DNNs) for sensor-monitored ones, enabling a hybrid reliability assessment approach that better reflects real-world multi-component systems. To account for uncertainties in maintenance and break durations, a chance-constrained optimization model is developed to ensure that maintenance is completed within the available break duration with a specified confidence level. The optimization model is reformulated using two well-known techniques: Sample Average Approximation (SAA) and Conditional Value-at-Risk (CVaR) approximation. A case study of military aircraft fleet maintenance is investigated to demonstrate the accuracy and added value of the proposed approach.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-025-00104-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144843250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning to trade autonomously in stocks and shares: integrating uncertainty into trading strategies 学习自主交易股票:将不确定性纳入交易策略
Pub Date : 2025-08-11 DOI: 10.1007/s43684-025-00101-4
Yuyang Li, Minghui Liwang, Li Li

Machine learning, a revolutionary and advanced technology, has been widely applied in the field of stock trading. However, training an autonomous trading strategy which can effectively balance risk and Return On Investment without human supervision in the stock market with high uncertainty is still a bottleneck. This paper constructs a Bayesian-inferenced Gated Recurrent Unit architecture to support long-term stock price prediction based on characteristics of the stock information learned from historical data, augmented with memory of recent up- and-down fluctuations occur in the data of short-term stock movement. The Gated Recurrent Unit architecture incorporates uncertainty estimation into the prediction process, which take care of decision-making in an ever-changing dynamic environment. Three trading strategies were implemented in this model; namely, a Price Model Strategy, a Probabilistic Model Strategy, and a Bayesian Gated Recurrent Unit Strategy, each leveraging the respective model’s outputs to optimize trading decisions. The experimental results show that, compared with the standard Gated Recurrent Unit models, the modified model exhibits a huge tremendous/dramatic advantage in managing volatility and improving return on investment Return On Investment. The results and findings underscore the significant potential of combining Bayesian inference with machine learning to operate effectively in chaotic decision-making environments.

机器学习是一项革命性的先进技术,在股票交易领域得到了广泛的应用。然而,在具有高度不确定性的股票市场中,训练一种能够在无人监督的情况下有效平衡风险和投资回报的自主交易策略仍然是一个瓶颈。本文构建了一个贝叶斯推理的门控循环单元架构,基于从历史数据中学习到的股票信息的特征来支持长期股票价格预测,并增强了短期股票运动数据中近期涨跌波动的记忆。门控循环单元体系结构将不确定性估计纳入预测过程,在不断变化的动态环境中进行决策。该模型实现了三种交易策略;即价格模型策略、概率模型策略和贝叶斯门控循环单元策略,每种策略都利用各自模型的输出来优化交易决策。实验结果表明,与标准的门控循环单元模型相比,改进后的模型在管理波动率和提高投资回报率方面具有巨大的优势。结果和发现强调了将贝叶斯推理与机器学习结合起来在混乱的决策环境中有效运行的巨大潜力。
{"title":"Learning to trade autonomously in stocks and shares: integrating uncertainty into trading strategies","authors":"Yuyang Li,&nbsp;Minghui Liwang,&nbsp;Li Li","doi":"10.1007/s43684-025-00101-4","DOIUrl":"10.1007/s43684-025-00101-4","url":null,"abstract":"<div><p>Machine learning, a revolutionary and advanced technology, has been widely applied in the field of stock trading. However, training an autonomous trading strategy which can effectively balance risk and Return On Investment without human supervision in the stock market with high uncertainty is still a bottleneck. This paper constructs a Bayesian-inferenced Gated Recurrent Unit architecture to support long-term stock price prediction based on characteristics of the stock information learned from historical data, augmented with memory of recent up- and-down fluctuations occur in the data of short-term stock movement. The Gated Recurrent Unit architecture incorporates uncertainty estimation into the prediction process, which take care of decision-making in an ever-changing dynamic environment. Three trading strategies were implemented in this model; namely, a Price Model Strategy, a Probabilistic Model Strategy, and a Bayesian Gated Recurrent Unit Strategy, each leveraging the respective model’s outputs to optimize trading decisions. The experimental results show that, compared with the standard Gated Recurrent Unit models, the modified model exhibits a huge tremendous/dramatic advantage in managing volatility and improving return on investment Return On Investment. The results and findings underscore the significant potential of combining Bayesian inference with machine learning to operate effectively in chaotic decision-making environments.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-025-00101-4.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144810783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated reinforcement learning for sequential ordering problem using hyperparameter optimization and metalearning 基于超参数优化和元学习的序列排序问题自动强化学习
Pub Date : 2025-07-29 DOI: 10.1007/s43684-025-00103-2
André Luiz Carvalho Ottoni

AutoML systems seek to assist Artificial Intelligence users in finding the best configurations for machine learning models. Following this line, recently the area of Automated Reinforcement Learning (AutoRL) has become increasingly relevant, given the growing increase in applications for reinforcement learning algorithms. However, the literature still lacks specific AutoRL systems for combinatorial optimization, especially for the Sequential Ordering Problem (SOP). Therefore, this paper aims to present a new AutoRL approach for SOP. For this, two new methods are proposed using hyperparameter optimization and metalearning: AutoRL-SOP and AutoRL-SOP-MtL. The proposed AutoRL techniques enable the combined tuning of three SARSA hyperparameters, being ϵ-greedy policy, learning rate, and discount factor. Furthermore, the new metalearning approach enables the transfer of hyperparameters between two combinatorial optimization domains: TSP (source) and SOP (target). The results show that the application of metalearning generates a reduction in computational cost in hyperparameter optimization. Furthermore, the proposed AutoRL methods achieved the best solutions in 23 out of 28 simulated TSPLIB instances compared to recent literature studies.

AutoML系统旨在帮助人工智能用户找到机器学习模型的最佳配置。沿着这条线,鉴于强化学习算法的应用日益增加,最近自动强化学习(AutoRL)领域变得越来越相关。然而,文献中仍然缺乏针对组合优化的特定自动驾驶系统,特别是针对顺序排序问题(SOP)。因此,本文旨在为SOP提供一种新的AutoRL方法。为此,提出了两种基于超参数优化和元学习的新方法:AutoRL-SOP和AutoRL-SOP- mtl。提出的AutoRL技术能够组合调整三个SARSA超参数,即ϵ-greedy策略、学习率和折现系数。此外,新的元学习方法能够在TSP(源)和SOP(目标)两个组合优化域之间传递超参数。结果表明,元学习的应用减少了超参数优化的计算成本。此外,与最近的文献研究相比,所提出的AutoRL方法在28个模拟TSPLIB实例中的23个中获得了最佳解决方案。
{"title":"Automated reinforcement learning for sequential ordering problem using hyperparameter optimization and metalearning","authors":"André Luiz Carvalho Ottoni","doi":"10.1007/s43684-025-00103-2","DOIUrl":"10.1007/s43684-025-00103-2","url":null,"abstract":"<div><p>AutoML systems seek to assist Artificial Intelligence users in finding the best configurations for machine learning models. Following this line, recently the area of Automated Reinforcement Learning (AutoRL) has become increasingly relevant, given the growing increase in applications for reinforcement learning algorithms. However, the literature still lacks specific AutoRL systems for combinatorial optimization, especially for the Sequential Ordering Problem (SOP). Therefore, this paper aims to present a new AutoRL approach for SOP. For this, two new methods are proposed using hyperparameter optimization and metalearning: AutoRL-SOP and AutoRL-SOP-MtL. The proposed AutoRL techniques enable the combined tuning of three SARSA hyperparameters, being <i>ϵ</i>-greedy policy, learning rate, and discount factor. Furthermore, the new metalearning approach enables the transfer of hyperparameters between two combinatorial optimization domains: TSP (source) and SOP (target). The results show that the application of metalearning generates a reduction in computational cost in hyperparameter optimization. Furthermore, the proposed AutoRL methods achieved the best solutions in 23 out of 28 simulated TSPLIB instances compared to recent literature studies.</p></div>","PeriodicalId":71187,"journal":{"name":"自主智能系统(英文)","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43684-025-00103-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145171555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
自主智能系统(英文)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1