Deep learning techniques have demonstrated significant potential for the quantitative identification of Acoustic Emission (AE) wire breakage signals generated by wire breakage in bridge cables, which is critical for ensuring the structural safety of cable-supported bridges. However, under actual bridge operating conditions, AE wire breakage signals are extremely limited and difficult to collect. This leads to data imbalance due to scarcity of wire breakage samples and significantly reduces the accuracy of wire breakage identification. To address this challenge, we propose an innovative method combining a Wasserstein distance and Gradient penalty-enhanced Deep Convolutional Generative Adversarial Network (WGDCGAN) and the Shifted windows (Swin) Transformer model for AE wire breakage signal identification. The effectiveness of the proposed method is validated through full-scale bridge cable tests. Experimental results demonstrate that the proposed approach achieves superior performance in terms of total accuracy AT, breakage true detection rate BT (equivalent to sensitivity), and F1-Score F1, while effectively overcoming the performance degradation typically caused by imbalanced data. These findings highlight the method's strong potential for improving the reliability of AE-based wire breakage monitoring in bridge engineering.
{"title":"Identification of acoustic emission wire breakage signals in bridge cables under imbalanced data conditions","authors":"Kaixuan Hui , Guangming Li , Guizhen Niu , Shuai Zhao","doi":"10.1016/j.engappai.2026.114085","DOIUrl":"10.1016/j.engappai.2026.114085","url":null,"abstract":"<div><div>Deep learning techniques have demonstrated significant potential for the quantitative identification of Acoustic Emission (AE) wire breakage signals generated by wire breakage in bridge cables, which is critical for ensuring the structural safety of cable-supported bridges. However, under actual bridge operating conditions, AE wire breakage signals are extremely limited and difficult to collect. This leads to data imbalance due to scarcity of wire breakage samples and significantly reduces the accuracy of wire breakage identification. To address this challenge, we propose an innovative method combining a Wasserstein distance and Gradient penalty-enhanced Deep Convolutional Generative Adversarial Network (WGDCGAN) and the Shifted windows (Swin) Transformer model for AE wire breakage signal identification. The effectiveness of the proposed method is validated through full-scale bridge cable tests. Experimental results demonstrate that the proposed approach achieves superior performance in terms of total accuracy <em>A</em><sub>T</sub>, breakage true detection rate <em>B</em><sub>T</sub> (equivalent to sensitivity), and <em>F</em><sub>1</sub>-Score <em>F</em><sub>1</sub>, while effectively overcoming the performance degradation typically caused by imbalanced data. These findings highlight the method's strong potential for improving the reliability of AE-based wire breakage monitoring in bridge engineering.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114085"},"PeriodicalIF":8.0,"publicationDate":"2026-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-10DOI: 10.1016/j.engappai.2026.114110
Peng Shi , Jingjing Guo , Lu Deng , Yingkai Liu , Lizhi Long , Shaopeng Xu
Automatic recognition of Railway Sleeper Serial Numbers (RSSNs) is essential for traceability, quality management, and lifecycle maintenance of railway infrastructure. In practice, embossed serial numbers on concrete surfaces exhibit extremely low discernibility with minimal height variations (<1 mm) and negligible color differentiation. Traditional Red-Green-Blue-based (RGB-based) image enhancement and Optical Character Recognition (OCR) methods face a fundamental limitation: they cannot directly capture the three-dimensional geometric features distinguishing embossed characters from their surroundings. To address this challenge, this study proposes an integrated framework based on line laser height imaging and position prior-guided detection with three key innovations: (1) a cascaded processing framework leverages geometric height information to overcome RGB-based method limitations; (2) a Dual-stage Adaptive Image Enhancement (DAIE) strategy converts 16 binary-digit (bit) height images into optimized 8-bit visualizations by systematically selecting optimal methods: modulo truncation for global structure and Minimum-Maximum (Min-Max) normalization for local detail enhancement; and (3) a Position Prior-guided Spatial Attention (PPSA) Feature Pyramid Network (FPN) integrates statistically-derived position priors to enhance small target detection. Comprehensive validation on 2234 images demonstrates superior performance: 98.2% F1-score and 99.38% recognition accuracy at 27 Frames Per Second (FPS), achieving 2.4% improvement over state-of-the-art methods. Ablation experiments confirm the individual contributions of the PPSA module (4.0%), the Small Target Enhancement (STE) module (1.4%), and the DAIE strategy (3.08%). Field testing in a prefabricated factory validates industrial applicability, providing a scalable technical framework and valuable reference for low-discernibility embossed industrial character recognition. Code is publicly available at https://github.com/shipeng38/RSSN-recognition.
{"title":"Enhanced recognition of low-discernibility Railway Sleeper Serial Numbers via dual-stage adaptive image enhancement and position prior-guided detection","authors":"Peng Shi , Jingjing Guo , Lu Deng , Yingkai Liu , Lizhi Long , Shaopeng Xu","doi":"10.1016/j.engappai.2026.114110","DOIUrl":"10.1016/j.engappai.2026.114110","url":null,"abstract":"<div><div>Automatic recognition of Railway Sleeper Serial Numbers (RSSNs) is essential for traceability, quality management, and lifecycle maintenance of railway infrastructure. In practice, embossed serial numbers on concrete surfaces exhibit extremely low discernibility with minimal height variations (<1 mm) and negligible color differentiation. Traditional Red-Green-Blue-based (RGB-based) image enhancement and Optical Character Recognition (OCR) methods face a fundamental limitation: they cannot directly capture the three-dimensional geometric features distinguishing embossed characters from their surroundings. To address this challenge, this study proposes an integrated framework based on line laser height imaging and position prior-guided detection with three key innovations: (1) a cascaded processing framework leverages geometric height information to overcome RGB-based method limitations; (2) a Dual-stage Adaptive Image Enhancement (DAIE) strategy converts 16 binary-digit (bit) height images into optimized 8-bit visualizations by systematically selecting optimal methods: modulo truncation for global structure and Minimum-Maximum (Min-Max) normalization for local detail enhancement; and (3) a Position Prior-guided Spatial Attention (PPSA) Feature Pyramid Network (FPN) integrates statistically-derived position priors to enhance small target detection. Comprehensive validation on 2234 images demonstrates superior performance: 98.2% F1-score and 99.38% recognition accuracy at 27 Frames Per Second (FPS), achieving 2.4% improvement over state-of-the-art methods. Ablation experiments confirm the individual contributions of the PPSA module (4.0%), the Small Target Enhancement (STE) module (1.4%), and the DAIE strategy (3.08%). Field testing in a prefabricated factory validates industrial applicability, providing a scalable technical framework and valuable reference for low-discernibility embossed industrial character recognition. Code is publicly available at <span><span>https://github.com/shipeng38/RSSN-recognition</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114110"},"PeriodicalIF":8.0,"publicationDate":"2026-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-09DOI: 10.1016/j.engappai.2026.114106
Jiakang Zhang , Guoan Gan , Kun Long , Allen A. Zhang , Jing Shang , Chuanqi Yan , Changfa Ai
Asphalt materials form the foundation of pavement durability, with styrene–butadiene–styrene (SBS) copolymers widely used to enhance performance. However, the preparation of SBS-modified asphalt (SBSMA) still relies heavily on inefficient trial-and-error approaches. Although artificial intelligence–based methods have been applied to asphalt performance prediction, most existing models directly map preparation parameters to macro-performance, neglecting cross-scale mechanisms linking preparation parameters, micro-properties, and macroscopic behavior. This limitation reduces their robustness and practical applicability in complex material systems. To address this issue, this study proposes a Cross-Scale Hybrid Attention Network (CSA-Net) that explicitly models hierarchical information transfer from preparation parameters to micro-properties and further to macro-performance. CSA-Net adopts a dual-branch architecture: a micro-branch predicts micro-properties using attention-enhanced preparation features, while a macro-branch integrates attention-refined preparation features and predicted micro-features through a second attention module. Joint optimization of micro- and macro-level tasks is achieved via a composite loss function. A comprehensive experimental dataset comprising 864 SBSMA samples was established. Results show that CSA-Net achieves high accuracy in macro-performance prediction, with coefficients of determination (R2) consistently exceeding 0.982, mean absolute percentage errors below 5%, and root mean square errors within experimental uncertainty ranges. Compared with single-scale, multi-scale, and non-attention benchmark models, CSA-Net exhibits improved robustness, as demonstrated by Monte Carlo simulations, with the interquartile range of R2 reduced by more than 25%. Shapley additive explanations analysis further reveals meaningful cross-scale relationships between preparation parameters, microstructural evolution, and macroscopic performance. Overall, CSA-Net provides a robust and interpretable framework for intelligent design and performance prediction of modified asphalt binders.
{"title":"Cross-scale hybrid attention network for enhancing performance prediction of modified asphalt binder preparation","authors":"Jiakang Zhang , Guoan Gan , Kun Long , Allen A. Zhang , Jing Shang , Chuanqi Yan , Changfa Ai","doi":"10.1016/j.engappai.2026.114106","DOIUrl":"10.1016/j.engappai.2026.114106","url":null,"abstract":"<div><div>Asphalt materials form the foundation of pavement durability, with styrene–butadiene–styrene (SBS) copolymers widely used to enhance performance. However, the preparation of SBS-modified asphalt (SBSMA) still relies heavily on inefficient trial-and-error approaches. Although artificial intelligence–based methods have been applied to asphalt performance prediction, most existing models directly map preparation parameters to macro-performance, neglecting cross-scale mechanisms linking preparation parameters, micro-properties, and macroscopic behavior. This limitation reduces their robustness and practical applicability in complex material systems. To address this issue, this study proposes a Cross-Scale Hybrid Attention Network (CSA-Net) that explicitly models hierarchical information transfer from preparation parameters to micro-properties and further to macro-performance. CSA-Net adopts a dual-branch architecture: a micro-branch predicts micro-properties using attention-enhanced preparation features, while a macro-branch integrates attention-refined preparation features and predicted micro-features through a second attention module. Joint optimization of micro- and macro-level tasks is achieved via a composite loss function. A comprehensive experimental dataset comprising 864 SBSMA samples was established. Results show that CSA-Net achieves high accuracy in macro-performance prediction, with coefficients of determination (R<sup>2</sup>) consistently exceeding 0.982, mean absolute percentage errors below 5%, and root mean square errors within experimental uncertainty ranges. Compared with single-scale, multi-scale, and non-attention benchmark models, CSA-Net exhibits improved robustness, as demonstrated by Monte Carlo simulations, with the interquartile range of R<sup>2</sup> reduced by more than 25%. Shapley additive explanations analysis further reveals meaningful cross-scale relationships between preparation parameters, microstructural evolution, and macroscopic performance. Overall, CSA-Net provides a robust and interpretable framework for intelligent design and performance prediction of modified asphalt binders.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114106"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-09DOI: 10.1016/j.engappai.2026.114054
Zhengwei Zhu , Zhixuan Chen , Chenyang Zhu , Wen Si , Fang Wang
Partially observable reinforcement learning extends the reinforcement learning framework to environments in which agents have limited visibility of the state space, making it particularly relevant for applications in robotics and autonomous vehicle navigation. However, a primary challenge in partially observable reinforcement learning is defining effective reward functions that can guide the learning process despite partial observability. To address this challenge, this paper introduces a novel approach for constructing potential-based reward automata by employing genetic local search methods. Specifically, our method constructs these automata from compressed representations of exploration trajectories, which succinctly capture critical decision points and essential state transitions while eliminating redundant steps. By optimizing trajectory samples and shortening agent trajectories to their crucial transitions, our technique significantly reduces computational overhead. Formally, we define the learning objective as an optimization problem aimed at maximizing the log-likelihood of future observations while simultaneously minimizing the structural complexity of the learned reward automata. Furthermore, by incorporating value-based strategies to estimate potential values within the reward automata, our approach improves learning efficiency and facilitates the identification of optimal reward structures. We empirically evaluate our proposed method on seven partially observable grid-world benchmarks. Experimental results demonstrate that our method achieves superior performance relative to state-of-the-art reward automata-based techniques, exhibiting both accelerated learning speeds and higher accumulated rewards. Additionally, our genetic local search algorithm consistently outperforms comparative heuristic methods in terms of learning curves and reward accumulation.
{"title":"Optimizing potential-based reward automata in partially observable reinforcement learning using genetic local search","authors":"Zhengwei Zhu , Zhixuan Chen , Chenyang Zhu , Wen Si , Fang Wang","doi":"10.1016/j.engappai.2026.114054","DOIUrl":"10.1016/j.engappai.2026.114054","url":null,"abstract":"<div><div>Partially observable reinforcement learning extends the reinforcement learning framework to environments in which agents have limited visibility of the state space, making it particularly relevant for applications in robotics and autonomous vehicle navigation. However, a primary challenge in partially observable reinforcement learning is defining effective reward functions that can guide the learning process despite partial observability. To address this challenge, this paper introduces a novel approach for constructing potential-based reward automata by employing genetic local search methods. Specifically, our method constructs these automata from compressed representations of exploration trajectories, which succinctly capture critical decision points and essential state transitions while eliminating redundant steps. By optimizing trajectory samples and shortening agent trajectories to their crucial transitions, our technique significantly reduces computational overhead. Formally, we define the learning objective as an optimization problem aimed at maximizing the log-likelihood of future observations while simultaneously minimizing the structural complexity of the learned reward automata. Furthermore, by incorporating value-based strategies to estimate potential values within the reward automata, our approach improves learning efficiency and facilitates the identification of optimal reward structures. We empirically evaluate our proposed method on seven partially observable grid-world benchmarks. Experimental results demonstrate that our method achieves superior performance relative to state-of-the-art reward automata-based techniques, exhibiting both accelerated learning speeds and higher accumulated rewards. Additionally, our genetic local search algorithm consistently outperforms comparative heuristic methods in terms of learning curves and reward accumulation.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114054"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-09DOI: 10.1016/j.engappai.2026.114130
Xu Yuan , Jiaqiang Wang , Shaokui Gu , Yi Guo , Ange Qi , Shijin Li , Liang Zhao
The stock market is a highly dynamic, complex, and uncertain environment, where traditional investment strategies and technical analysis tools often fail to provide reliable guidance, leading to increased investment risk and uncertainty. This study aims to develop an adaptive multi-agent stock trading decision support system that can effectively respond to volatile market conditions while balancing returns and risk management. We propose a deep reinforcement learning framework based on the Dueling Deep Q-Network (Dueling DQN) algorithm, in which multiple agents independently make optimal trading decisions based on the constructed environment state. The system incorporates a redesigned reward function, a dynamic exploration strategy, and a risk management mechanism to ensure real-time adaptation to market feedback. Extensive experiments on domestic and international market data demonstrate that the proposed system outperforms existing models, effectively responds to market shocks, and exhibits superior adaptability across different market conditions. The proposed multi-agent trading system achieves a robust balance between profitability and risk control, indicating its potential economic value and applicability in dynamic financial markets.
股票市场是一个高度动态、复杂和不确定的环境,传统的投资策略和技术分析工具往往不能提供可靠的指导,导致投资风险和不确定性增加。本研究旨在开发一套自适应的多智能体股票交易决策支持系统,能在平衡收益与风险管理的同时,有效地因应多变的市场环境。我们提出了一种基于Dueling deep Q-Network (Dueling DQN)算法的深度强化学习框架,其中多个智能体根据构建的环境状态独立地做出最优交易决策。该系统整合了重新设计的奖励功能、动态勘探策略和风险管理机制,以确保实时适应市场反馈。国内外市场数据的大量实验表明,该系统优于现有模型,有效应对市场冲击,并在不同市场条件下表现出卓越的适应性。所提出的多智能体交易系统在盈利能力和风险控制之间取得了良好的平衡,显示了其潜在的经济价值和在动态金融市场中的适用性。
{"title":"Adaptive multi-agent stock trading decision support system based on deep reinforcement learning","authors":"Xu Yuan , Jiaqiang Wang , Shaokui Gu , Yi Guo , Ange Qi , Shijin Li , Liang Zhao","doi":"10.1016/j.engappai.2026.114130","DOIUrl":"10.1016/j.engappai.2026.114130","url":null,"abstract":"<div><div>The stock market is a highly dynamic, complex, and uncertain environment, where traditional investment strategies and technical analysis tools often fail to provide reliable guidance, leading to increased investment risk and uncertainty. This study aims to develop an adaptive multi-agent stock trading decision support system that can effectively respond to volatile market conditions while balancing returns and risk management. We propose a deep reinforcement learning framework based on the Dueling Deep Q-Network (Dueling DQN) algorithm, in which multiple agents independently make optimal trading decisions based on the constructed environment state. The system incorporates a redesigned reward function, a dynamic exploration strategy, and a risk management mechanism to ensure real-time adaptation to market feedback. Extensive experiments on domestic and international market data demonstrate that the proposed system outperforms existing models, effectively responds to market shocks, and exhibits superior adaptability across different market conditions. The proposed multi-agent trading system achieves a robust balance between profitability and risk control, indicating its potential economic value and applicability in dynamic financial markets.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114130"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-09DOI: 10.1016/j.engappai.2026.113894
Zhihui Gao , Baomin Xu , Jidong Yuan , Jinfeng Wang , Xu Li
Multivariate time series (MTS) representation learning poses a significant challenge in data mining. Current deep learning-based MTS representation methods mostly utilize neural networks to model temporal dependencies within individual univariate sequences, while failing to adequately consider the spatial relationships among different channels within MTS data. While a few methods leverage graph neural networks (GNNs) to model spatial dependencies, but they often do not effectively capture both global and local features simultaneously, potentially limiting the quality of MTS data representations. To overcome these limitations, we present MTGL, a novel Multi-Task Graph Neural Network-based MTS Representation Learning Framework. It leverages MTS reconstruction, global-level graph learning, and local-level graph learning to capture latent spatio-temporal dependencies without relying on predefined graph structures. To obtain global graph-level representations, MTGL performs message-passing and graph pooling operations, and simultaneously leverages a dynamic graph mechanism to capture associations across different windows for local-level representations. By fusing global and local features in a unified framework, MTGL effectively supports a variety of MTS tasks. Extensive experiments show that the proposed method outperforms existing state-of-the-art baselines on benchmark MTS datasets and the tunnel boring machine dataset.
{"title":"Multivariate time series representation learning with multi-task graph neural network","authors":"Zhihui Gao , Baomin Xu , Jidong Yuan , Jinfeng Wang , Xu Li","doi":"10.1016/j.engappai.2026.113894","DOIUrl":"10.1016/j.engappai.2026.113894","url":null,"abstract":"<div><div>Multivariate time series (MTS) representation learning poses a significant challenge in data mining. Current deep learning-based MTS representation methods mostly utilize neural networks to model temporal dependencies within individual univariate sequences, while failing to adequately consider the spatial relationships among different channels within MTS data. While a few methods leverage graph neural networks (GNNs) to model spatial dependencies, but they often do not effectively capture both global and local features simultaneously, potentially limiting the quality of MTS data representations. To overcome these limitations, we present <strong>MTGL</strong>, a novel <strong>M</strong>ulti-<strong>T</strong>ask <strong>G</strong>raph Neural Network-based MTS Representation <strong>L</strong>earning Framework. It leverages MTS reconstruction, global-level graph learning, and local-level graph learning to capture latent spatio-temporal dependencies without relying on predefined graph structures. To obtain global graph-level representations, MTGL performs message-passing and graph pooling operations, and simultaneously leverages a dynamic graph mechanism to capture associations across different windows for local-level representations. By fusing global and local features in a unified framework, MTGL effectively supports a variety of MTS tasks. Extensive experiments show that the proposed method outperforms existing state-of-the-art baselines on benchmark MTS datasets and the tunnel boring machine dataset.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 113894"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-09DOI: 10.1016/j.engappai.2026.114024
Yi-Ning Weng , Hsuan-Wei Lee
Cooperation emergence in multi-agent systems represents a fundamental statistical physics problem where microscopic learning rules drive macroscopic collective behavior transitions. We propose a Q-learning-based variant of adaptive rewiring that builds on mechanisms studied in the literature. This method combines temporal difference learning with network restructuring so that agents can optimize strategies and social connections based on interaction histories. Through neighbor-specific Q-learning, agents develop sophisticated partnership management strategies that enable cooperator cluster formation, creating spatial separation between cooperative and defective regions. Using power-law networks that reflect real-world heterogeneous connectivity patterns, we evaluate emergent behaviors under varying rewiring constraint levels, revealing distinct cooperation regimes across parameter space, characterized by qualitative changes in macroscopic cooperation behavior. Our systematic analysis identifies three behavioral regimes: a permissive regime (low constraints) enabling rapid cooperative cluster formation, an intermediate regime with sensitive dependence on dilemma strength, and a patient regime (high constraints) where strategic accumulation gradually optimizes network structure. Comparative analysis against Bush–Mosteller stimulus–response learning demonstrates that Q-learning’s temporal credit assignment capabilities produce superior cooperation outcomes, particularly under intermediate rewiring constraints where long-term relationship assessment becomes crucial. Simulation results show that while moderate constraints create transition-like zones that suppress cooperation, fully adaptive rewiring enhances cooperation levels through systematic exploration of favorable network configurations. Quantitative analysis reveals that increased rewiring frequency drives large-scale cluster formation. Our results establish a new paradigm for understanding intelligence-driven cooperation pattern formation in complex adaptive systems, revealing how machine learning serves as an alternative driving force for spontaneous organization in multi-agent networks.
{"title":"Q-learning-driven adaptive rewiring for cooperative control in heterogeneous networks","authors":"Yi-Ning Weng , Hsuan-Wei Lee","doi":"10.1016/j.engappai.2026.114024","DOIUrl":"10.1016/j.engappai.2026.114024","url":null,"abstract":"<div><div>Cooperation emergence in multi-agent systems represents a fundamental statistical physics problem where microscopic learning rules drive macroscopic collective behavior transitions. We propose a Q-learning-based variant of adaptive rewiring that builds on mechanisms studied in the literature. This method combines temporal difference learning with network restructuring so that agents can optimize strategies and social connections based on interaction histories. Through neighbor-specific Q-learning, agents develop sophisticated partnership management strategies that enable cooperator cluster formation, creating spatial separation between cooperative and defective regions. Using power-law networks that reflect real-world heterogeneous connectivity patterns, we evaluate emergent behaviors under varying rewiring constraint levels, revealing distinct cooperation regimes across parameter space, characterized by qualitative changes in macroscopic cooperation behavior. Our systematic analysis identifies three behavioral regimes: a permissive regime (low constraints) enabling rapid cooperative cluster formation, an intermediate regime with sensitive dependence on dilemma strength, and a patient regime (high constraints) where strategic accumulation gradually optimizes network structure. Comparative analysis against Bush–Mosteller stimulus–response learning demonstrates that Q-learning’s temporal credit assignment capabilities produce superior cooperation outcomes, particularly under intermediate rewiring constraints where long-term relationship assessment becomes crucial. Simulation results show that while moderate constraints create transition-like zones that suppress cooperation, fully adaptive rewiring enhances cooperation levels through systematic exploration of favorable network configurations. Quantitative analysis reveals that increased rewiring frequency drives large-scale cluster formation. Our results establish a new paradigm for understanding intelligence-driven cooperation pattern formation in complex adaptive systems, revealing how machine learning serves as an alternative driving force for spontaneous organization in multi-agent networks.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114024"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-09DOI: 10.1016/j.engappai.2026.113933
Agam Sanghera , Paramveer Singh , Elaine Chu , Sumin Leem , Ruizhi Li , Sogol Ghattan , Andy Man Yeung Tai
Background/Problem
Maritime crimes such as Illegal, Unreported, and Unregulated (IUU) fishing, piracy, and smuggling pose significant threats to marine ecosystems, trade, and coastal security, especially in developing regions. Automatic Identification System (AIS) data offers a scalable solution for vessel monitoring, but the use of fully supervised machine learning models is constrained by the substantial manual effort and expert input required to label training data.
Methods
To address this challenge, authors propose a semi-supervised machine learning pipeline that classifies vessel activities from AIS data without relying on pre-labeled datasets. Our approach leverages scaled geospatial and temporal features, including latitude, longitude, speed, and time difference, to train multiple Hidden Markov Models (HMMs) on trajectory segments. These segments are then grouped using similarity-based K-means clustering and subsequently classified with supervised models, including Random Forest and Long Short-Term Memory (LSTM) networks. The pipeline effectively identifies and labels maritime activities such as sailing, fishing, idling, and other activities.
Results/conclusions
Experiments were conducted on a dataset comprising 156,379 AIS points, partitioned into training and test sets. The LSTM-based supervised model achieved an F1 score of 0.86 on the local test set, while the end-to-end pipeline achieved an F1 score of 0.5 on a global evaluation set. These results demonstrate the feasibility of automating maritime activity classification through artificial intelligence and hybrid learning, offering a scalable solution for real-world maritime surveillance.
{"title":"Semi-supervised vessel trajectory analysis for unregulated fishing activity detection","authors":"Agam Sanghera , Paramveer Singh , Elaine Chu , Sumin Leem , Ruizhi Li , Sogol Ghattan , Andy Man Yeung Tai","doi":"10.1016/j.engappai.2026.113933","DOIUrl":"10.1016/j.engappai.2026.113933","url":null,"abstract":"<div><h3>Background/Problem</h3><div>Maritime crimes such as Illegal, Unreported, and Unregulated (IUU) fishing, piracy, and smuggling pose significant threats to marine ecosystems, trade, and coastal security, especially in developing regions. Automatic Identification System (AIS) data offers a scalable solution for vessel monitoring, but the use of fully supervised machine learning models is constrained by the substantial manual effort and expert input required to label training data.</div></div><div><h3>Methods</h3><div>To address this challenge, authors propose a semi-supervised machine learning pipeline that classifies vessel activities from AIS data without relying on pre-labeled datasets. Our approach leverages scaled geospatial and temporal features, including latitude, longitude, speed, and time difference, to train multiple Hidden Markov Models (HMMs) on trajectory segments. These segments are then grouped using similarity-based K-means clustering and subsequently classified with supervised models, including Random Forest and Long Short-Term Memory (LSTM) networks. The pipeline effectively identifies and labels maritime activities such as sailing, fishing, idling, and other activities.</div></div><div><h3>Results/conclusions</h3><div>Experiments were conducted on a dataset comprising 156,379 AIS points, partitioned into training and test sets. The LSTM-based supervised model achieved an F1 score of 0.86 on the local test set, while the end-to-end pipeline achieved an F1 score of 0.5 on a global evaluation set. These results demonstrate the feasibility of automating maritime activity classification through artificial intelligence and hybrid learning, offering a scalable solution for real-world maritime surveillance.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 113933"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-09DOI: 10.1016/j.engappai.2026.114118
Zhenlong Chen , Xiao Zhuang , Di Zhou , Weifang Sun , Jing Cai , Jiawei Xiang
In the real world, the data imbalance problem is an everlasting challenge in the field of knowledge-based fault diagnosis. The cycle-consistent generative adversarial network (CycleGAN) has been widely used in sample generation tasks with good results, but still faces the discrepancy between generator-generated fault samples and synthesized fault samples, which leads to the joint participation of the same degenerate branch affecting the final performance, resulting in the tendency to produce artifacts in difficult scenarios. To better solve the data imbalanced problem in infrared image-based fault diagnosis, in this paper, a novel Dual-attention Semi-Cycled Data Augmentation Structure using generative adversarial network (DASCGAN) is proposed to generate high-quality infrared thermal images to solve the fault diagnosis of gearbox under data imbalance. Firstly, the Dual-attention Semi-Cycled Data Augmentation Structure is constructed, which includes the forward and backward semi-cycle sub-network to generate diverse infrared images. The two semi-cycle sub-networks consist of two independent fault-free generation branches and a shared fault generation branch. Secondly, Convolutional Block Attention Module (CBAM) is embedded in the fault-free generation branch to increase the attention of the fault part by suppressing irrelevant features. Thirdly, self-attention module is embedded in the fault generation branch to increase the attention of the fault generation branch on the global hot pixels. Finally, comparison experiments are conducted. Experimental results show that the proposed DASCGAN method outperforms other benchmark generative adversarial networks. The proposed DASCGAN can effectively solve the data imbalance so as to improve the reliability and accuracy of gearbox fault diagnosis.
{"title":"Dual-attention semi-cycled generative adversarial network data augmentation structure for gearbox fault diagnosis using infrared thermal images","authors":"Zhenlong Chen , Xiao Zhuang , Di Zhou , Weifang Sun , Jing Cai , Jiawei Xiang","doi":"10.1016/j.engappai.2026.114118","DOIUrl":"10.1016/j.engappai.2026.114118","url":null,"abstract":"<div><div>In the real world, the data imbalance problem is an everlasting challenge in the field of knowledge-based fault diagnosis. The cycle-consistent generative adversarial network (CycleGAN) has been widely used in sample generation tasks with good results, but still faces the discrepancy between generator-generated fault samples and synthesized fault samples, which leads to the joint participation of the same degenerate branch affecting the final performance, resulting in the tendency to produce artifacts in difficult scenarios. To better solve the data imbalanced problem in infrared image-based fault diagnosis, in this paper, a novel Dual-attention Semi-Cycled Data Augmentation Structure using generative adversarial network (DASCGAN) is proposed to generate high-quality infrared thermal images to solve the fault diagnosis of gearbox under data imbalance. Firstly, the Dual-attention Semi-Cycled Data Augmentation Structure is constructed, which includes the forward and backward semi-cycle sub-network to generate diverse infrared images. The two semi-cycle sub-networks consist of two independent fault-free generation branches and a shared fault generation branch. Secondly, Convolutional Block Attention Module (CBAM) is embedded in the fault-free generation branch to increase the attention of the fault part by suppressing irrelevant features. Thirdly, self-attention module is embedded in the fault generation branch to increase the attention of the fault generation branch on the global hot pixels. Finally, comparison experiments are conducted. Experimental results show that the proposed DASCGAN method outperforms other benchmark generative adversarial networks. The proposed DASCGAN can effectively solve the data imbalance so as to improve the reliability and accuracy of gearbox fault diagnosis.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114118"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-09DOI: 10.1016/j.engappai.2026.114117
Xinyang Meng , Keliang Pang , Zhiyuan Gu , Youzhi Zheng , Fujun Liu , Chaoran Wan , Haotian Wu , Minmin Sun , Hua Zhao
Coke dry quenching (CDQ) is a common, environmentally friendly technology applied in iron and steel production and plays an important role in improving coke quality as well as in emission reduction and pollution reduction. Material location prediction is crucial for ensuring the stable operation of dry quenching systems. In this paper, we propose a novel artificial intelligence approach for predicting the location of coke materials in CDQ furnaces by incorporating a method known as physical information feature reconstruction (PIFR). This method integrates physical a priori knowledge (such as the law of mass conservation and furnace structural characteristics) into the feature engineering process, effectively improving the accuracy and stability of time-series predictions in both single-step and multistep forecasting tasks. The experimental results demonstrate that PIFR significantly enhances the performance of various deep learning models. Specifically, for the long short-term memory model, the mean squared error and mean absolute error decreased by 51.25% and 37.63%, respectively, whereas the coefficient of determination increased to 0.941. Moreover, PIFR effectively mitigates issues commonly encountered in multi-step prediction, such as cumulative error and prediction curve flattening. The application of PIFR not only improves the accuracy of the model but also significantly enhances its generalization capability.
{"title":"Deep learning-based coke dry quenching material location prediction using physical information reconstruction features","authors":"Xinyang Meng , Keliang Pang , Zhiyuan Gu , Youzhi Zheng , Fujun Liu , Chaoran Wan , Haotian Wu , Minmin Sun , Hua Zhao","doi":"10.1016/j.engappai.2026.114117","DOIUrl":"10.1016/j.engappai.2026.114117","url":null,"abstract":"<div><div>Coke dry quenching (CDQ) is a common, environmentally friendly technology applied in iron and steel production and plays an important role in improving coke quality as well as in emission reduction and pollution reduction. Material location prediction is crucial for ensuring the stable operation of dry quenching systems. In this paper, we propose a novel artificial intelligence approach for predicting the location of coke materials in CDQ furnaces by incorporating a method known as physical information feature reconstruction (PIFR). This method integrates physical <em>a priori</em> knowledge (such as the law of mass conservation and furnace structural characteristics) into the feature engineering process, effectively improving the accuracy and stability of time-series predictions in both single-step and multistep forecasting tasks. The experimental results demonstrate that PIFR significantly enhances the performance of various deep learning models. Specifically, for the long short-term memory model, the mean squared error and mean absolute error decreased by 51.25% and 37.63%, respectively, whereas the coefficient of determination increased to 0.941. Moreover, PIFR effectively mitigates issues commonly encountered in multi-step prediction, such as cumulative error and prediction curve flattening. The application of PIFR not only improves the accuracy of the model but also significantly enhances its generalization capability.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114117"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}