首页 > 最新文献

Engineering Applications of Artificial Intelligence最新文献

英文 中文
System for diagnosis and optimization of combustion in pulverized coal boilers based on artificial intelligence methods 基于人工智能方法的煤粉锅炉燃烧诊断与优化系统
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-09 DOI: 10.1016/j.engappai.2026.114103
S.S. Abdurakipov, E.B. Butakov, E.P. Kopyev, D.M. Markovich
Efficient and reliable monitoring of processes in coal-fired boilers requires advanced combustion diagnostic and optimization methods. This paper introduces an integrated system based on machine learning and deep learning technologies to diagnose and optimize combustion regimes in pulverized coal-fired boilers, aiming to improve efficiency and safety. An artificial neural network accurately simulated thermogravimetric mass loss curves, achieving an average coefficient of determination of 99%. Deep learning methods were employed to detect combustion regimes by monitoring the coal flame and identifying anomalies in flame images. For datasets lacking precise measurements, an unsupervised autoencoder was developed. It achieved an average precision of 77% and a recall of 66%. For datasets with measurements, a supervised convolutional neural network provided a higher average recall of 89%. Various machine learning algorithms were employed to predict deviations from stable combustion modes, and a long short-term memory network with an attention mechanism performed best. It had a mean absolute percentage error of up to 8% and an average coefficient of determination of 91%.
高效、可靠的燃煤锅炉过程监测需要先进的燃烧诊断和优化方法。本文介绍了一种基于机器学习和深度学习技术的综合系统,用于煤粉锅炉燃烧状态的诊断和优化,以提高效率和安全性。人工神经网络准确模拟了热重质量损失曲线,平均确定系数达到99%。采用深度学习方法通过监测煤炭火焰和识别火焰图像中的异常来检测燃烧状态。对于缺乏精确测量的数据集,开发了一种无监督自编码器。它的平均准确率为77%,召回率为66%。对于具有测量值的数据集,有监督的卷积神经网络提供了89%的平均召回率。不同的机器学习算法被用于预测偏离稳定燃烧模式,其中带有注意机制的长短期记忆网络表现最好。其平均绝对百分比误差高达8%,平均确定系数为91%。
{"title":"System for diagnosis and optimization of combustion in pulverized coal boilers based on artificial intelligence methods","authors":"S.S. Abdurakipov,&nbsp;E.B. Butakov,&nbsp;E.P. Kopyev,&nbsp;D.M. Markovich","doi":"10.1016/j.engappai.2026.114103","DOIUrl":"10.1016/j.engappai.2026.114103","url":null,"abstract":"<div><div>Efficient and reliable monitoring of processes in coal-fired boilers requires advanced combustion diagnostic and optimization methods. This paper introduces an integrated system based on machine learning and deep learning technologies to diagnose and optimize combustion regimes in pulverized coal-fired boilers, aiming to improve efficiency and safety. An artificial neural network accurately simulated thermogravimetric mass loss curves, achieving an average coefficient of determination of 99%. Deep learning methods were employed to detect combustion regimes by monitoring the coal flame and identifying anomalies in flame images. For datasets lacking precise measurements, an unsupervised autoencoder was developed. It achieved an average precision of 77% and a recall of 66%. For datasets with measurements, a supervised convolutional neural network provided a higher average recall of 89%. Various machine learning algorithms were employed to predict deviations from stable combustion modes, and a long short-term memory network with an attention mechanism performed best. It had a mean absolute percentage error of up to 8% and an average coefficient of determination of 91%.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114103"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Communicating vessels detection network for small object detection in realistic scenario 面向现实场景小目标检测的通信船舶检测网络
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-09 DOI: 10.1016/j.engappai.2026.114062
Wenkai Pang , Zhi Tan
Object detection is a crucial task in computer vision. Despite recent advancements in generalized object detection, accurately detecting small objects remains a significant challenge due to feature degradation in neural networks. Although increasing the input image resolution is widely recognized as the most effective countermeasure against feature degradation effects, this approach inevitably results in substantial computational cost increases. To this end, we propose the Communicating Vessels Detection Network, which presents two major contributions. Initially, the Communicating Vessels Backbone enhances the resolution of deep feature maps and fuses them with shallow-layer features, effectively mitigating feature degradation. In addition, we design the Global Information Coding Module, which adaptively captures contextual dependencies from horizontal, vertical, and global directions, providing critical guidance for small object detection. On the challenging Vision Meets Drone: A Challenge 2019 dataset, the Communicating Vessels Detection Network improves the mean average precision, the average precision for small objects, and the average precision for medium objects by 1.5%, 0.64%, and 1.02%, respectively. Furthermore, our model demonstrates superior performance on the Traffic Signal Detection dataset and two Safety Helmet Wearing Detection datasets compared to state-of-the-art methods.
目标检测是计算机视觉中的一项重要任务。尽管最近在广义目标检测方面取得了进展,但由于神经网络的特征退化,准确检测小目标仍然是一个重大挑战。虽然提高输入图像分辨率被广泛认为是对抗特征退化效应的最有效的对策,但这种方法不可避免地会导致大量的计算成本增加。为此,我们提出了通信船舶检测网络,它有两个主要贡献。首先,通信血管主干增强了深层特征图的分辨率,并将其与浅层特征融合,有效缓解了特征退化。此外,我们还设计了全局信息编码模块,该模块可以自适应地从水平、垂直和全局方向捕获上下文依赖关系,为小目标检测提供关键指导。在具有挑战性的Vision Meets Drone: A Challenge 2019数据集上,通信船只检测网络将平均精度、小物体的平均精度和中等物体的平均精度分别提高了1.5%、0.64%和1.02%。此外,与最先进的方法相比,我们的模型在交通信号检测数据集和两个安全帽佩戴检测数据集上表现出优越的性能。
{"title":"Communicating vessels detection network for small object detection in realistic scenario","authors":"Wenkai Pang ,&nbsp;Zhi Tan","doi":"10.1016/j.engappai.2026.114062","DOIUrl":"10.1016/j.engappai.2026.114062","url":null,"abstract":"<div><div>Object detection is a crucial task in computer vision. Despite recent advancements in generalized object detection, accurately detecting small objects remains a significant challenge due to feature degradation in neural networks. Although increasing the input image resolution is widely recognized as the most effective countermeasure against feature degradation effects, this approach inevitably results in substantial computational cost increases. To this end, we propose the Communicating Vessels Detection Network, which presents two major contributions. Initially, the Communicating Vessels Backbone enhances the resolution of deep feature maps and fuses them with shallow-layer features, effectively mitigating feature degradation. In addition, we design the Global Information Coding Module, which adaptively captures contextual dependencies from horizontal, vertical, and global directions, providing critical guidance for small object detection. On the challenging Vision Meets Drone: A Challenge 2019 dataset, the Communicating Vessels Detection Network improves the mean average precision, the average precision for small objects, and the average precision for medium objects by 1.5%, 0.64%, and 1.02%, respectively. Furthermore, our model demonstrates superior performance on the Traffic Signal Detection dataset and two Safety Helmet Wearing Detection datasets compared to state-of-the-art methods.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114062"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Experimental investigation and explainable artificial intelligence-based modeling of punching shear behavior in self-compacting concrete flat-slabs with low hybrid fiber content 低混杂纤维含量自密实混凝土平板冲剪特性试验研究及可解释性人工智能建模
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-09 DOI: 10.1016/j.engappai.2026.114116
Abdulkerim Ari , Metin Katlav , Izzeddin Donmez , Kazim Turk
Flat-slab systems manufactured with self-compacting concrete (SCC) incorporating low hybrid fiber content offer a promising alternative for improving punching shear performance while enhancing constructability in building applications. In this paper, the punching shear behavior of flat-slabs produced with single, binary, and ternary fiber-reinforced SCC was experimentally investigated in terms of load–deflection response, ductility, toughness, cracking behavior, and failure mode. In parallel, a comprehensive database comprising 268 fiber-reinforced concrete flat-slab test results collected from the literature was established, and artificial intelligence (AI)-based predictive models were developed to estimate punching shear capacity (Vpun). Model performance was evaluated using statistical indicators, whereas SHapley Additive exPlanations (SHAP) feature importance and partial dependence plots (PDPs) were employed to enhance interpretability and reveal the governing parameters influencing punching capacity. The outcomes demonstrate that binary hybrid fiber systems provide the most effective enhancement in punching capacity and post-cracking performance, even at low fiber contents, outperforming conventional solutions such as shear studs. Among the developed AI models, the Extra Trees Regressor and Random Forest algorithms exhibited the highest prediction accuracy for the Vpun. Finally, the AI models were integrated into a user-friendly graphical interface to facilitate practical engineering applications. Overall, this research contributes by experimentally validating low-fiber SCC flat-slabs as an efficient punching solution and by proposing an explainable, data-driven decision-support framework for engineering design.
采用低混合纤维含量的自密实混凝土(SCC)制造的平板系统为改善冲剪性能,同时增强建筑应用中的可施工性提供了一种有希望的替代方案。本文从荷载-挠曲响应、延性、韧性、开裂行为和破坏模式等方面对单、双、三元纤维增强SCC平板的冲剪行为进行了实验研究。同时,建立了一个综合数据库,包括从文献中收集的268个纤维钢筋混凝土平板试验结果,并开发了基于人工智能(AI)的预测模型来估计冲剪能力(Vpun)。采用统计指标评价模型性能,采用SHapley加性解释(SHAP)特征重要性和部分依赖图(pdp)增强可解释性,揭示影响冲压能力的控制参数。结果表明,即使在纤维含量较低的情况下,二元混合纤维体系也能最有效地提高冲孔能力和开裂后性能,优于剪切螺柱等传统解决方案。在已开发的人工智能模型中,Extra Trees Regressor和Random Forest算法对Vpun的预测精度最高。最后,将人工智能模型集成到用户友好的图形界面中,以方便实际工程应用。总的来说,本研究通过实验验证了低纤维SCC平板作为一种有效的冲压解决方案,并为工程设计提出了一个可解释的、数据驱动的决策支持框架。
{"title":"Experimental investigation and explainable artificial intelligence-based modeling of punching shear behavior in self-compacting concrete flat-slabs with low hybrid fiber content","authors":"Abdulkerim Ari ,&nbsp;Metin Katlav ,&nbsp;Izzeddin Donmez ,&nbsp;Kazim Turk","doi":"10.1016/j.engappai.2026.114116","DOIUrl":"10.1016/j.engappai.2026.114116","url":null,"abstract":"<div><div>Flat-slab systems manufactured with self-compacting concrete (SCC) incorporating low hybrid fiber content offer a promising alternative for improving punching shear performance while enhancing constructability in building applications. In this paper, the punching shear behavior of flat-slabs produced with single, binary, and ternary fiber-reinforced SCC was experimentally investigated in terms of load–deflection response, ductility, toughness, cracking behavior, and failure mode. In parallel, a comprehensive database comprising 268 fiber-reinforced concrete flat-slab test results collected from the literature was established, and artificial intelligence (AI)-based predictive models were developed to estimate punching shear capacity (<em>V</em><sub><em>pun</em></sub>). Model performance was evaluated using statistical indicators, whereas SHapley Additive exPlanations (SHAP) feature importance and partial dependence plots (PDPs) were employed to enhance interpretability and reveal the governing parameters influencing punching capacity. The outcomes demonstrate that binary hybrid fiber systems provide the most effective enhancement in punching capacity and post-cracking performance, even at low fiber contents, outperforming conventional solutions such as shear studs. Among the developed AI models, the Extra Trees Regressor and Random Forest algorithms exhibited the highest prediction accuracy for the <em>V</em><sub><em>pun</em></sub>. Finally, the AI models were integrated into a user-friendly graphical interface to facilitate practical engineering applications. Overall, this research contributes by experimentally validating low-fiber SCC flat-slabs as an efficient punching solution and by proposing an explainable, data-driven decision-support framework for engineering design.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114116"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two-stage automated design of railway vertical alignments with topography-driven Fourier transform and constrained A-Star search 基于地形驱动傅立叶变换和约束A-Star搜索的铁路垂直线路两阶段自动化设计
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-09 DOI: 10.1016/j.engappai.2026.114061
Taoran Song , Hao Pu , Hong Zhang , Paul Schonfeld , Lihui Peng
Vertical alignment design is crucial for a railway project since it largely determines its construction investment, lifecycle costs, and various other impacts. However, among the theoretically-infinite numbers of possible alternatives, it is difficult to optimize a vertical alignment matching the drastically-undulating terrain line along the railway, while also considering large structure tradeoffs (such as bridges and tunnels) and various design constraints. To solve this problem, a two-stage method is proposed for automated vertical alignment optimization. In stage I, the railway terrain line is converted to a spatial wave and modelled through the spectral signature analysis of a topography-driven Fast Fourier Transform (FFT). Afterward, by identifying the key terrain characteristic locations based on the derived Fourier series function, feasible search regions for vertical alignment design are determined by processing specific design constraints. For stage II, an A-Star algorithm is customized for vertical alignment search. First, A-Star nodes are discretized within the above feasible search spaces. Then, a comprehensive constraint-handling operator is devised to guarantee a solution’s feasibility during optimization. Moreover, a deterministic simulation algorithm is integrated to create a weighted directed graph for A-Star path generation. Ultimately, the developed method is applied to a complex mountain railway alignment case. The algorithm performances of the two stages are both discussed in detail.
垂直线路设计对铁路项目至关重要,因为它在很大程度上决定了其建设投资、生命周期成本和各种其他影响。然而,在理论上无限多的可能选择中,在考虑大型结构权衡(如桥梁和隧道)和各种设计约束的同时,很难优化与铁路沿线急剧起伏的地形线相匹配的垂直路线。为了解决这一问题,提出了一种两阶段自动垂直排列优化方法。在第一阶段,将铁路地形线转换为空间波,并通过地形驱动的快速傅立叶变换(FFT)的频谱特征分析进行建模。然后,根据导出的傅立叶级数函数识别关键地形特征位置,通过处理特定设计约束确定垂直路线设计的可行搜索区域。第二阶段,定制A-Star算法进行垂直对齐搜索。首先,在上述可行搜索空间内离散A-Star节点。然后,设计了一个综合约束处理算子,以保证优化过程中解的可行性。结合确定性仿真算法,建立了a - star路径生成的加权有向图。最后,将该方法应用于一个复杂的山地铁路线形实例。详细讨论了这两个阶段的算法性能。
{"title":"Two-stage automated design of railway vertical alignments with topography-driven Fourier transform and constrained A-Star search","authors":"Taoran Song ,&nbsp;Hao Pu ,&nbsp;Hong Zhang ,&nbsp;Paul Schonfeld ,&nbsp;Lihui Peng","doi":"10.1016/j.engappai.2026.114061","DOIUrl":"10.1016/j.engappai.2026.114061","url":null,"abstract":"<div><div>Vertical alignment design is crucial for a railway project since it largely determines its construction investment, lifecycle costs, and various other impacts. However, among the theoretically-infinite numbers of possible alternatives, it is difficult to optimize a vertical alignment matching the drastically-undulating terrain line along the railway, while also considering large structure tradeoffs (such as bridges and tunnels) and various design constraints. To solve this problem, a two-stage method is proposed for automated vertical alignment optimization. In stage I, the railway terrain line is converted to a spatial wave and modelled through the spectral signature analysis of a topography-driven Fast Fourier Transform (FFT). Afterward, by identifying the key terrain characteristic locations based on the derived Fourier series function, feasible search regions for vertical alignment design are determined by processing specific design constraints. For stage II, an A-Star algorithm is customized for vertical alignment search. First, A-Star nodes are discretized within the above feasible search spaces. Then, a comprehensive constraint-handling operator is devised to guarantee a solution’s feasibility during optimization. Moreover, a deterministic simulation algorithm is integrated to create a weighted directed graph for A-Star path generation. Ultimately, the developed method is applied to a complex mountain railway alignment case. The algorithm performances of the two stages are both discussed in detail.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114061"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of machine learning to mechanical properties of composite materials: A ten-year review (2015–2025) 机器学习在复合材料力学性能研究中的应用:十年回顾(2015-2025)
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-09 DOI: 10.1016/j.engappai.2026.114068
Jinjing Zhu , Ruixiang Bai , Yaoxing Xu , Jiantong Wang , Longchao He , Zhenkun Lei , Cheng Yan
Artificial intelligence (AI), particularly machine learning (ML), has advanced rapidly and is increasingly applied across scientific domains. This paper presents a review of ML applications in predicting mechanical properties and optimizing designs of composite materials, analyzing 165 studies published between 2015 and 2025. Valued in aerospace for lightweight and high-strength properties, composites present modeling challenges due to nonlinear behavior, multi-scale characteristics, and manufacturing sensitivity that limit the accuracy and efficiency of traditional methods. ML techniques have demonstrated remarkable potential in overcoming these limitations through data-driven paradigms, particularly in multi-scale modeling. This study examines the evolutionary trajectory of supervised, unsupervised, reinforcement, and deep learning (DL) within algorithmic frameworks, with a focus on applications in multi-scale analysis of composite materials. At the microscale, ML enables modulus prediction, strength evaluation, and interface analysis via representative volume elements (RVE). For macroscale mechanical analysis, it improves predictions of nonlinear responses, damage progression, and impact behavior in laminates for the rapid optimization design of composite stiffened plates based on buckling and collapse load. Special attention is given to damage evolution, health monitoring, multi-physics field and multi-scale mechanical properties of composite materials. Future research directions, including the development of hybrid data-physics models, digital twin integration, and cross-disciplinary collaboration for enabling breakthroughs in intelligent composite design, are also prospected.
人工智能(AI),特别是机器学习(ML),发展迅速,越来越多地应用于科学领域。本文回顾了机器学习在预测复合材料力学性能和优化设计方面的应用,分析了2015年至2025年间发表的165项研究。在航空航天领域,复合材料因其轻量化和高强度特性而受到重视,但由于非线性行为、多尺度特性和制造灵敏度限制了传统方法的准确性和效率,因此复合材料的建模面临挑战。机器学习技术通过数据驱动的范例,特别是在多尺度建模方面,已经显示出克服这些限制的巨大潜力。本研究考察了算法框架内监督学习、无监督学习、强化学习和深度学习(DL)的进化轨迹,重点研究了复合材料多尺度分析中的应用。在微观尺度上,ML可以通过代表性体积元素(RVE)实现模量预测、强度评估和界面分析。对于宏观力学分析,它改进了非线性响应、损伤进展和层合板冲击行为的预测,从而实现了基于屈曲和崩溃载荷的复合材料加筋板的快速优化设计。特别关注复合材料的损伤演化、健康监测、多物理场和多尺度力学性能。展望了未来的研究方向,包括混合数据物理模型的发展、数字孪生集成和跨学科协作,以实现智能复合材料设计的突破。
{"title":"Application of machine learning to mechanical properties of composite materials: A ten-year review (2015–2025)","authors":"Jinjing Zhu ,&nbsp;Ruixiang Bai ,&nbsp;Yaoxing Xu ,&nbsp;Jiantong Wang ,&nbsp;Longchao He ,&nbsp;Zhenkun Lei ,&nbsp;Cheng Yan","doi":"10.1016/j.engappai.2026.114068","DOIUrl":"10.1016/j.engappai.2026.114068","url":null,"abstract":"<div><div>Artificial intelligence (AI), particularly machine learning (ML), has advanced rapidly and is increasingly applied across scientific domains. This paper presents a review of ML applications in predicting mechanical properties and optimizing designs of composite materials, analyzing 165 studies published between 2015 and 2025. Valued in aerospace for lightweight and high-strength properties, composites present modeling challenges due to nonlinear behavior, multi-scale characteristics, and manufacturing sensitivity that limit the accuracy and efficiency of traditional methods. ML techniques have demonstrated remarkable potential in overcoming these limitations through data-driven paradigms, particularly in multi-scale modeling. This study examines the evolutionary trajectory of supervised, unsupervised, reinforcement, and deep learning (DL) within algorithmic frameworks, with a focus on applications in multi-scale analysis of composite materials. At the microscale, ML enables modulus prediction, strength evaluation, and interface analysis via representative volume elements (RVE). For macroscale mechanical analysis, it improves predictions of nonlinear responses, damage progression, and impact behavior in laminates for the rapid optimization design of composite stiffened plates based on buckling and collapse load. Special attention is given to damage evolution, health monitoring, multi-physics field and multi-scale mechanical properties of composite materials. Future research directions, including the development of hybrid data-physics models, digital twin integration, and cross-disciplinary collaboration for enabling breakthroughs in intelligent composite design, are also prospected.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114068"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Versatile electromagnetic vortex beamforming for phased array using hybrid neural network-based modeling and optimization 基于混合神经网络的相控阵多用途电磁涡旋波束形成建模与优化
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-09 DOI: 10.1016/j.engappai.2026.114122
Le Kang , Mengyao Dai , Hui Li , Xinhuai Wang
Vortex beams featuring low-sidelobe, null-steering, co-divergence and multi-mode capabilities hold significant promise for advanced radio-frequency systems. To achieve this, a hybrid neural network (NN)-based framework is proposed for phased arrays. It employs a multilayer perceptron (MLP) followed by a physics-informed network, integrating surrogate modeling for the array with optimization for element excitations. The established model learns the non-linear mapping from input orbital angular momentum (OAM) features to output excitations, and facilitates an optimization process that operates simultaneously with training. The physics-informed network further shifts the learning paradigm from mere data fitting to finding physically guided solutions. This mitigates the reliance on densely layered architectures, and empowers the model with enhanced functionality for beamforming. For verification, numerical experiments involving various scenarios have been conducted. For a 16 × 16 uniform rectangular array without failure, the generated vortex beams achieve sidelobe levels ≤ −19.5 decibel (dB), null depths ≤ −20.2 dB, mode purities ≥85 %, and null steering up to (30°, 30°). Co-divergence and coaxial multi-mode capabilities are also supported. The implementation, involving 512 variables and 5 constraints, requires 1.7 × 107 floating-point operations (FLOPs) and an average time cost of 20 s. Meanwhile, it enables excitation quantization as well as compatibility with diverse array configurations and element patterns. Compared with the state-of-the-art beamforming methods, this work demonstrates concurrent multifunctionality, broader applicability to complex and higher-dimensional problems, and superior computational efficiency.
涡旋波束具有低旁瓣、零转向、共发散和多模式能力,在先进的射频系统中具有重要的前景。为此,提出了一种基于混合神经网络的相控阵框架。它采用多层感知器(MLP),然后是物理信息网络,将阵列的代理建模与元素激励的优化相结合。建立的模型学习了输入轨道角动量(OAM)特征到输出激励的非线性映射,便于优化过程与训练同时进行。物理信息网络进一步将学习范式从单纯的数据拟合转变为寻找物理指导的解决方案。这减轻了对密集分层架构的依赖,并使模型具有增强的波束形成功能。为了验证,进行了各种场景的数值实验。对于无故障的16 × 16均匀矩形阵列,产生的涡旋光束的旁瓣电平≤- 19.5分贝(dB),零深度≤- 20.2 dB,模式纯度≥85%,零转向可达(30°,30°)。还支持共散和同轴多模功能。该实现涉及512个变量和5个约束,需要1.7 × 107个浮点运算(FLOPs),平均时间成本为20秒。同时,它可以实现激励量化以及与各种阵列配置和元素模式的兼容性。与最先进的波束形成方法相比,这项工作显示了并发多功能性,更广泛地适用于复杂和高维问题,以及优越的计算效率。
{"title":"Versatile electromagnetic vortex beamforming for phased array using hybrid neural network-based modeling and optimization","authors":"Le Kang ,&nbsp;Mengyao Dai ,&nbsp;Hui Li ,&nbsp;Xinhuai Wang","doi":"10.1016/j.engappai.2026.114122","DOIUrl":"10.1016/j.engappai.2026.114122","url":null,"abstract":"<div><div>Vortex beams featuring low-sidelobe, null-steering, co-divergence and multi-mode capabilities hold significant promise for advanced radio-frequency systems. To achieve this, a hybrid neural network (NN)-based framework is proposed for phased arrays. It employs a multilayer perceptron (MLP) followed by a physics-informed network, integrating surrogate modeling for the array with optimization for element excitations. The established model learns the non-linear mapping from input orbital angular momentum (OAM) features to output excitations, and facilitates an optimization process that operates simultaneously with training. The physics-informed network further shifts the learning paradigm from mere data fitting to finding physically guided solutions. This mitigates the reliance on densely layered architectures, and empowers the model with enhanced functionality for beamforming. For verification, numerical experiments involving various scenarios have been conducted. For a 16 × 16 uniform rectangular array without failure, the generated vortex beams achieve sidelobe levels ≤ −19.5 decibel (dB), null depths ≤ −20.2 dB, mode purities ≥85 %, and null steering up to (30°, 30°). Co-divergence and coaxial multi-mode capabilities are also supported. The implementation, involving 512 variables and 5 constraints, requires 1.7 × 10<sup>7</sup> floating-point operations (FLOPs) and an average time cost of 20 s. Meanwhile, it enables excitation quantization as well as compatibility with diverse array configurations and element patterns. Compared with the state-of-the-art beamforming methods, this work demonstrates concurrent multifunctionality, broader applicability to complex and higher-dimensional problems, and superior computational efficiency.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114122"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hesitant fuzzy three-way decision-making for large-scale data based on a new distance measure and behavioral theory 基于新距离测度和行为理论的大数据犹豫模糊三向决策
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-09 DOI: 10.1016/j.engappai.2026.114105
Jing Li , Haidong Zhang , Zhuoma Dawa , Yanping He
In highly uncertain real-world environments, making robust decisions amid incomplete information and the cognitive biases of decision-makers remain critical challenges in project management and other complex system decision-making-related domains. Although the three-way decision-making (3WD) method based on hesitant fuzzy (HF) environments provides an effective approach for managing uncertainty, the current research still has shortcomings in several key areas. On the one hand, the distance formulas that are commonly used for computing hesitant fuzzy elements (HFEs) generally suffer from insufficient sensitivity to the information captured by the score function and weak discriminative power. On the other hand, the process of determining loss functions is subjective and fails to consider the behavioral psychological factors of decision-makers, making it difficult to reflect the cognitive characteristics of humans during actual decision-making processes. These issues collectively limit the adaptability and practicality of the existing decision-making methods in complex real-world environments. To address the aforementioned issues, this study is aimed at constructing an HF 3WD framework that possesses both cognitive rationality and computational robustness. To this end, the core contributions of this work are as follows. First, a novel HF distance measure is developed, significantly improving the ability to distinguish fuzzy information differences. Second, a novel o-dominance relation is introduced, and the conditional probability is calculated using a data-driven approach, eliminating the reliance on expert scoring and thereby improving the objectivity and accuracy of the conditional probability. Finally, an objective loss function is established, effectively capturing the decision-maker’s nonlinear value perceptions and comparative psychology in gain and loss scenarios. Furthermore, comparative experiments and parameter analyses are conducted in big data scenarios to validate the fact that the proposed method outperforms the existing methods in terms of classification accuracy and decision stability, demonstrating superior effectiveness and robustness. We believe that by simulating human judgments made under uncertainty, this method opens up new avenues for implementing artificial intelligence-based decision-making systems in high-risk scenarios.
在高度不确定的现实世界环境中,在不完整的信息和决策者的认知偏差中做出稳健的决策仍然是项目管理和其他复杂系统决策相关领域的关键挑战。基于犹豫模糊(HF)环境的三向决策(3WD)方法为管理不确定性提供了一种有效的方法,但目前的研究在几个关键领域仍存在不足。一方面,通常用于计算犹豫模糊元素(hfe)的距离公式对分数函数捕获的信息灵敏度不足,判别能力较弱。另一方面,损失函数的确定过程是主观的,没有考虑决策者的行为心理因素,难以反映人在实际决策过程中的认知特征。这些问题共同限制了现有决策方法在复杂现实环境中的适应性和实用性。为了解决上述问题,本研究旨在构建一个既具有认知合理性又具有计算鲁棒性的高频3WD框架。为此,本工作的核心贡献如下:首先,提出了一种新的高频距离测度,显著提高了模糊信息差异的识别能力。其次,引入了一种新的o-优势关系,采用数据驱动的方法计算条件概率,消除了对专家评分的依赖,提高了条件概率的客观性和准确性;最后,建立了目标损失函数,有效捕捉了决策者在得失情景下的非线性价值感知和比较心理。并在大数据场景下进行对比实验和参数分析,验证了所提方法在分类精度和决策稳定性方面优于现有方法,显示出优越的有效性和鲁棒性。我们相信,通过模拟人类在不确定性下做出的判断,该方法为在高风险场景中实施基于人工智能的决策系统开辟了新的途径。
{"title":"Hesitant fuzzy three-way decision-making for large-scale data based on a new distance measure and behavioral theory","authors":"Jing Li ,&nbsp;Haidong Zhang ,&nbsp;Zhuoma Dawa ,&nbsp;Yanping He","doi":"10.1016/j.engappai.2026.114105","DOIUrl":"10.1016/j.engappai.2026.114105","url":null,"abstract":"<div><div>In highly uncertain real-world environments, making robust decisions amid incomplete information and the cognitive biases of decision-makers remain critical challenges in project management and other complex system decision-making-related domains. Although the three-way decision-making (3WD) method based on hesitant fuzzy (HF) environments provides an effective approach for managing uncertainty, the current research still has shortcomings in several key areas. On the one hand, the distance formulas that are commonly used for computing hesitant fuzzy elements (HFEs) generally suffer from insufficient sensitivity to the information captured by the score function and weak discriminative power. On the other hand, the process of determining loss functions is subjective and fails to consider the behavioral psychological factors of decision-makers, making it difficult to reflect the cognitive characteristics of humans during actual decision-making processes. These issues collectively limit the adaptability and practicality of the existing decision-making methods in complex real-world environments. To address the aforementioned issues, this study is aimed at constructing an HF 3WD framework that possesses both cognitive rationality and computational robustness. To this end, the core contributions of this work are as follows. First, a novel HF distance measure is developed, significantly improving the ability to distinguish fuzzy information differences. Second, a novel <span><math><mi>o</mi></math></span>-dominance relation is introduced, and the conditional probability is calculated using a data-driven approach, eliminating the reliance on expert scoring and thereby improving the objectivity and accuracy of the conditional probability. Finally, an objective loss function is established, effectively capturing the decision-maker’s nonlinear value perceptions and comparative psychology in gain and loss scenarios. Furthermore, comparative experiments and parameter analyses are conducted in big data scenarios to validate the fact that the proposed method outperforms the existing methods in terms of classification accuracy and decision stability, demonstrating superior effectiveness and robustness. We believe that by simulating human judgments made under uncertainty, this method opens up new avenues for implementing artificial intelligence-based decision-making systems in high-risk scenarios.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114105"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessment and evaluation of machine learning algorithms for recommendation system of E-commerce based on bipolar fuzzy- method based on the removal effects of criteria-elimination and choice translating Reality-I approach 基于标准消除和选择翻译现实- i方法去除效果的双极模糊方法的电子商务推荐系统机器学习算法的评估与评价
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-09 DOI: 10.1016/j.engappai.2026.114107
Muhammad Arsal, Ubaid ur Rehman, Abaid ur Rehman Virk
E-commerce websites are increasingly depending on machine learning (ML) algorithms to improve their recommendation systems to deliver personalized user experiences and enhance customer satisfaction. ML models provide useful solutions, but choosing the best ML model is still difficult, especially for e-commerce platforms' recommendation systems. This selection challenge involves dual aspects (bipolarity), which means that both the positive and negative aspects of the evaluation criteria for ML models must be taken into consideration. It is a multi-criteria decision-making (MCDM) problem with uncertainty. However, previous research has failed to consider the bipolar character of these model selection criteria. Finding the right weights for the evaluation criteria also becomes important in MCDM, particularly when handling the bipolarity and uncertainty present in actual e-commerce situations. To overcome this limitation, we introduce a novel bipolar fuzzy (BF)-method based on the removal effects of criteria (MEREC)-elimination and choice translating reality (ELECTRE-I) methodology that combines the method based on the removal effects of criteria (MEREC) method's objective weighting capability with the elimination and choice translating reality (ELECTRE-I) technique's decision-ranking power. The methodology is used in a real-world case study that focuses on choosing the best ML model for an e-commerce platform's recommendation system. The proposed structure provides a strong decision-support tool that tackles the difficulties associated with choosing an ML algorithm and captures the dual nature of evaluation criteria. Comparative findings show how well the BF-MEREC-ELECTRE-I method works to provide data-driven, interpretable, and useful recommendations for e-commerce applications.
电子商务网站越来越依赖于机器学习(ML)算法来改进他们的推荐系统,以提供个性化的用户体验并提高客户满意度。ML模型提供了有用的解决方案,但选择最好的ML模型仍然很困难,特别是对于电子商务平台的推荐系统。这种选择挑战涉及双重方面(双极性),这意味着必须考虑ML模型评估标准的积极和消极方面。这是一个具有不确定性的多准则决策问题。然而,以往的研究未能考虑到这些模型选择标准的双极性特征。在MCDM中,为评估标准找到正确的权重也变得很重要,特别是在处理实际电子商务情况中存在的两极化和不确定性时。为了克服这一局限性,我们引入了一种新的基于标准去除效果的双极模糊(BF)方法(MEREC)-消除和选择翻译现实(electrei)方法,该方法将基于标准去除效果(MEREC)方法的客观加权能力与基于消除和选择翻译现实(electrei)技术的决策排序能力相结合。该方法被用于一个现实世界的案例研究,重点是为电子商务平台的推荐系统选择最佳的机器学习模型。所提出的结构提供了一个强大的决策支持工具,解决了与选择ML算法相关的困难,并捕获了评估标准的双重性质。对比结果显示BF-MEREC-ELECTRE-I方法在为电子商务应用程序提供数据驱动的、可解释的和有用的建议方面是多么有效。
{"title":"Assessment and evaluation of machine learning algorithms for recommendation system of E-commerce based on bipolar fuzzy- method based on the removal effects of criteria-elimination and choice translating Reality-I approach","authors":"Muhammad Arsal,&nbsp;Ubaid ur Rehman,&nbsp;Abaid ur Rehman Virk","doi":"10.1016/j.engappai.2026.114107","DOIUrl":"10.1016/j.engappai.2026.114107","url":null,"abstract":"<div><div>E-commerce websites are increasingly depending on machine learning (ML) algorithms to improve their recommendation systems to deliver personalized user experiences and enhance customer satisfaction. ML models provide useful solutions, but choosing the best ML model is still difficult, especially for e-commerce platforms' recommendation systems. This selection challenge involves dual aspects (bipolarity), which means that both the positive and negative aspects of the evaluation criteria for ML models must be taken into consideration. It is a multi-criteria decision-making (MCDM) problem with uncertainty. However, previous research has failed to consider the bipolar character of these model selection criteria. Finding the right weights for the evaluation criteria also becomes important in MCDM, particularly when handling the bipolarity and uncertainty present in actual e-commerce situations. To overcome this limitation, we introduce a novel bipolar fuzzy (BF)-method based on the removal effects of criteria (MEREC)-elimination and choice translating reality (ELECTRE-I) methodology that combines the method based on the removal effects of criteria (MEREC) method's objective weighting capability with the elimination and choice translating reality (ELECTRE-I) technique's decision-ranking power. The methodology is used in a real-world case study that focuses on choosing the best ML model for an e-commerce platform's recommendation system. The proposed structure provides a strong decision-support tool that tackles the difficulties associated with choosing an ML algorithm and captures the dual nature of evaluation criteria. Comparative findings show how well the BF-MEREC-ELECTRE-I method works to provide data-driven, interpretable, and useful recommendations for e-commerce applications.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114107"},"PeriodicalIF":8.0,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A double deep Q network-powered approach to discovering symbolic solutions for nonlinear integral equations 非线性积分方程符号解的双深Q网络求解方法
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-08 DOI: 10.1016/j.engappai.2026.114113
Mahdi Movahedian Moghaddam , Hassan Dana Mazraeh , Kourosh Parand
This study introduces a reinforcement learning framework using Double Deep Q-Networks (DDQN) to discover interpretable, symbolic solutions to integral and integro-differential equations. The method leverages Context-Free Grammars to guide expression generation and Physics-Informed Neural Networks (PINN) to optimize coefficients. Evaluated on diverse equations (Fredholm, Volterra, and fractional types), DDQN outperforms standard DQN in stability, convergence, and interpretability, and succeeds in complex cases like population models where DQN fails. This approach is highly valuable for engineering domains such as control theory, electromagnetics, and fluid mechanics, where such equations are prevalent. It provides engineers with compact, analytical expressions that offer immediate physical insight, unlike opaque numerical solutions. These symbolic models enable faster system analysis, real-time control, and robust design optimization, bridging the gap between high-fidelity simulation and practical, interpretable models.
本研究引入了一个强化学习框架,使用双深度q网络(DDQN)来发现积分方程和积分微分方程的可解释的符号解。该方法利用上下文无关语法来指导表达式生成和物理信息神经网络(PINN)来优化系数。在多种方程(Fredholm, Volterra和分数型)上进行评估,DDQN在稳定性,收敛性和可解释性方面优于标准DQN,并在DQN失败的人口模型等复杂情况下取得成功。这种方法对于控制理论、电磁学和流体力学等工程领域非常有价值,因为这些领域的方程非常普遍。它为工程师提供了紧凑的解析表达式,提供即时的物理洞察力,不像不透明的数值解决方案。这些符号模型实现了更快的系统分析、实时控制和稳健的设计优化,弥合了高保真仿真和实用、可解释模型之间的差距。
{"title":"A double deep Q network-powered approach to discovering symbolic solutions for nonlinear integral equations","authors":"Mahdi Movahedian Moghaddam ,&nbsp;Hassan Dana Mazraeh ,&nbsp;Kourosh Parand","doi":"10.1016/j.engappai.2026.114113","DOIUrl":"10.1016/j.engappai.2026.114113","url":null,"abstract":"<div><div>This study introduces a reinforcement learning framework using Double Deep Q-Networks (DDQN) to discover interpretable, symbolic solutions to integral and integro-differential equations. The method leverages Context-Free Grammars to guide expression generation and Physics-Informed Neural Networks (PINN) to optimize coefficients. Evaluated on diverse equations (Fredholm, Volterra, and fractional types), DDQN outperforms standard DQN in stability, convergence, and interpretability, and succeeds in complex cases like population models where DQN fails. This approach is highly valuable for engineering domains such as control theory, electromagnetics, and fluid mechanics, where such equations are prevalent. It provides engineers with compact, analytical expressions that offer immediate physical insight, unlike opaque numerical solutions. These symbolic models enable faster system analysis, real-time control, and robust design optimization, bridging the gap between high-fidelity simulation and practical, interpretable models.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114113"},"PeriodicalIF":8.0,"publicationDate":"2026-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforced semantic information acquiring for contrastive clustering 增强对比聚类的语义信息获取
IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-08 DOI: 10.1016/j.engappai.2026.114115
Pengfei Cui , Haowei Wu , Jun Yin
The goal of deep image clustering is to assign images to their corresponding clusters in an unsupervised manner. Contrastive Clustering with Effective Sample pairs construction (CCES) (Yin et al., 2023) has demonstrated the effectiveness of combining ContrastiveCrop (Peng et al., 2022) with nearest-neighbor mining for clustering. However, a key limitation of CCES is that it treats augmentations of different samples from the same class as negative pairs, which introduces false negatives and conflicts with the goal of clustering. To address this issue, we propose a novel contrastive clustering framework termed Reinforced Semantic Information Acquiring for Contrastive Clustering (RASCC). We extend CCES by introducing a pseudo-label-guided dual-level fine-tuning strategy that mitigates the adverse impact of false negatives and improves clustering quality. Specifically, RASCC consists of two stages. In Stage 1, we follow CCES to learn discriminative features and generate high-confidence pseudo labels based on clustering predictions. In Stage 2, we leverage these pseudo labels to perform dual-level fine-tuning. At the instance level, we mitigate false negatives by excluding same-pseudo-label pairs from the negative set. At the cluster level, we improve assignment quality and balance by refining cluster predictions with a class-balanced self-labeling loss that each sample is weighted inversely proportional to the frequency of its high-confidence pseudo-label. We conduct experiments on four challenging datasets. The proposed method surpasses many state-of-the-art deep clustering algorithms, demonstrating the effectiveness of sample pair construction and fine-tuning strategy.
深度图像聚类的目标是以无监督的方式将图像分配到相应的聚类中。对比聚类与有效样本对构建(CCES) (Yin等人,2023)已经证明了将对比聚类(Peng等人,2022)与最近邻挖掘相结合进行聚类的有效性。然而,CCES的一个关键限制是,它将来自同一类的不同样本的增强视为负对,这引入了假阴性,并与聚类的目标相冲突。为了解决这个问题,我们提出了一种新的对比聚类框架,称为增强语义信息获取对比聚类(RASCC)。我们通过引入伪标签引导的双级微调策略来扩展cce,该策略减轻了假阴性的不利影响并提高了聚类质量。具体来说,RASCC包括两个阶段。在第一阶段,我们遵循CCES学习判别特征,并基于聚类预测生成高置信度的伪标签。在阶段2中,我们利用这些伪标签来执行双级微调。在实例级,我们通过从否定集中排除相同的伪标签对来减少假否定。在聚类水平上,我们通过使用类平衡的自标记损失(每个样本的权重与其高置信度伪标签的频率成反比)来改进聚类预测,从而提高分配质量和平衡。我们在四个具有挑战性的数据集上进行实验。该方法优于许多最先进的深度聚类算法,证明了样本对构建和微调策略的有效性。
{"title":"Reinforced semantic information acquiring for contrastive clustering","authors":"Pengfei Cui ,&nbsp;Haowei Wu ,&nbsp;Jun Yin","doi":"10.1016/j.engappai.2026.114115","DOIUrl":"10.1016/j.engappai.2026.114115","url":null,"abstract":"<div><div>The goal of deep image clustering is to assign images to their corresponding clusters in an unsupervised manner. Contrastive Clustering with Effective Sample pairs construction (CCES) (Yin et al., 2023) has demonstrated the effectiveness of combining ContrastiveCrop (Peng et al., 2022) with nearest-neighbor mining for clustering. However, a key limitation of CCES is that it treats augmentations of different samples from the same class as negative pairs, which introduces false negatives and conflicts with the goal of clustering. To address this issue, we propose a novel contrastive clustering framework termed Reinforced Semantic Information Acquiring for Contrastive Clustering (RASCC). We extend CCES by introducing a pseudo-label-guided dual-level fine-tuning strategy that mitigates the adverse impact of false negatives and improves clustering quality. Specifically, RASCC consists of two stages. In Stage 1, we follow CCES to learn discriminative features and generate high-confidence pseudo labels based on clustering predictions. In Stage 2, we leverage these pseudo labels to perform dual-level fine-tuning. At the instance level, we mitigate false negatives by excluding same-pseudo-label pairs from the negative set. At the cluster level, we improve assignment quality and balance by refining cluster predictions with a class-balanced self-labeling loss that each sample is weighted inversely proportional to the frequency of its high-confidence pseudo-label. We conduct experiments on four challenging datasets. The proposed method surpasses many state-of-the-art deep clustering algorithms, demonstrating the effectiveness of sample pair construction and fine-tuning strategy.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"169 ","pages":"Article 114115"},"PeriodicalIF":8.0,"publicationDate":"2026-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Engineering Applications of Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1