
Latest Publications in Applied Intelligence

STVAD: A Spatio-temporal Coupled Based Transformer for Unsupervised Video Anomaly Detection
IF 3.5 · CAS Tier 2 (Computer Science) · Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE · Pub Date: 2026-02-16 · DOI: 10.1007/s10489-025-07065-1
Huiyu Mu, Luhui Wang, Hongjian Yin, Yonggan Li, Lanxue Dang, Yang Liu, Xianyu Zuo

Video anomaly detection aims to detect incidental and sudden event patterns in complex scenes, and plays an essential role in maintaining public security and averting potential risks. Unsupervised learning is particularly effective for the problem of insufficient annotation of abnormal video behaviour in low-resource scenarios. However, because the appearance/motion features of videos contain complex entangled factors, overcoming the barrier of spatio-temporal coupling remains a formidable task for unsupervised video anomaly detection. Most related methods focus on constructing a compact normal manifold from a training set of normal data or on tracking objects; they often neglect the distinct visual characteristics of different anomalies, leading to a static coupling bias. To unravel the spatio-temporal coupling mechanism, we present STVAD, an unsupervised approach that focuses on efficient context modelling and on eliminating the spatio-temporal factor gap within a novel encoder-decoder network. Additionally, we propose a new paradigm that integrates spatio-temporal dependencies via self-attention to boost the discriminative capacity for identifying human-related irregular events in surveillance video sequences. Extensive experiments validate the accuracy and computational efficiency of our method on three challenging benchmarks: the UCSD Pedestrian (Ped1 and Ped2), CUHK Avenue, and ShanghaiTech datasets. Notably, our method exploits spatio-temporal clues to estimate associations and improves the accuracy of detecting anomalous behaviours.
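The abstract gives no implementation details, but the paradigm it describes (integrating spatio-temporal dependencies via self-attention inside an encoder-decoder) can be illustrated with a minimal PyTorch sketch. The module name, the factorized time-then-space attention order, and all dimensions below are illustrative assumptions, not the paper's architecture.

```python
# Minimal sketch of factorized spatio-temporal self-attention over video
# clip features. All names and dimensions are illustrative.
import torch
import torch.nn as nn

class SpatioTemporalBlock(nn.Module):
    def __init__(self, dim=256, heads=4):
        super().__init__()
        self.temporal_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.spatial_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm1 = nn.LayerNorm(dim)
        self.norm2 = nn.LayerNorm(dim)

    def forward(self, x):                     # x: (batch, time, patches, dim)
        b, t, p, d = x.shape
        # Attend across time for each spatial patch.
        xt = x.permute(0, 2, 1, 3).reshape(b * p, t, d)
        xt = self.norm1(xt + self.temporal_attn(xt, xt, xt)[0])
        x = xt.reshape(b, p, t, d).permute(0, 2, 1, 3)
        # Attend across spatial patches within each frame.
        xs = x.reshape(b * t, p, d)
        xs = self.norm2(xs + self.spatial_attn(xs, xs, xs)[0])
        return xs.reshape(b, t, p, d)

clip = torch.randn(2, 8, 49, 256)             # 8 frames, 7x7 patches
print(SpatioTemporalBlock()(clip).shape)      # torch.Size([2, 8, 49, 256])
```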

Citations: 0
Detection of transcoding from HEVC to VVC based on CU types and Motion Vector Map
IF 3.5 · CAS Tier 2 (Computer Science) · Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE · Pub Date: 2026-02-16 · DOI: 10.1007/s10489-026-07134-z
Jiajian Lin, Jianlin Zhang, Shan Bian, Chuntao Wang

The adoption of new coding standards brings a wider range of applications as well as new video-forensics challenges. To solve the transcoding detection problem under the new H.266/Versatile Video Coding (VVC) standard, we propose an algorithm based on Coding Unit (CU) block-type classification and Motion Vector Map (MVM) statistics to detect VVC-transcoded videos. We first analyze how HEVC-to-VVC transcoding leaves traces in the video and define MVM graphs for constructing features. We then calculate the average frequency of each CU partition type separately for intra-predicted and inter-predicted frames, concatenate these frequencies, and feed them as distinguishing features to a support vector machine for classification. Experimental results show that the method achieves high recognition accuracy under the new coding standard. Specifically, the algorithm constructs a 51-dimensional feature vector by combining intra-frame and inter-frame features, capturing transcoding traces such as changes in CU partitioning and motion-vector distributions. Extensive experiments on 1080p and 4K video datasets, across various bitrate combinations, quantization parameters, and Group of Pictures (GOP) structures, demonstrate an average detection accuracy of 96.81% on the 1080p datasets, outperforming existing methods. The robustness of the proposed approach across diverse encoding configurations highlights its effectiveness for forensic analysis under the VVC standard.
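As a rough illustration of the described pipeline, the sketch below computes average CU-partition-type frequencies separately for intra- and inter-predicted frames, concatenates them, and trains a support vector machine. The four-type layout and the synthetic data are assumptions; the paper's actual feature vector is 51-dimensional.

```python
# Toy version of the CU-frequency + SVM pipeline. Feature layout and
# synthetic data are assumptions, not the paper's exact design.
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

def cu_type_frequencies(frames, n_types=4):
    """Average frequency of each CU partition type over a list of frames."""
    counts = np.zeros(n_types)
    for frame in frames:                      # frame: array of CU type ids
        counts += np.bincount(frame, minlength=n_types)[:n_types]
    return counts / max(counts.sum(), 1)

rng = np.random.default_rng(0)

def fake_video_features(transcoded):
    # Transcoded videos are nudged toward finer partitions in this toy data.
    p = [0.15, 0.2, 0.3, 0.35] if transcoded else [0.35, 0.3, 0.2, 0.15]
    intra = [rng.choice(4, size=200, p=p) for _ in range(3)]
    inter = [rng.choice(4, size=200, p=p) for _ in range(12)]
    return np.concatenate([cu_type_frequencies(intra),
                           cu_type_frequencies(inter)])

X = np.array([fake_video_features(t) for t in [False] * 40 + [True] * 40])
y = np.array([0] * 40 + [1] * 40)
clf = make_pipeline(StandardScaler(), SVC(kernel="rbf")).fit(X, y)
print(clf.score(X, y))                        # ~1.0 on this toy data
```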

Citations: 0
Blunder prediction in chess
IF 3.5 · CAS Tier 2 (Computer Science) · Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE · Pub Date: 2026-02-16 · DOI: 10.1007/s10489-026-07131-2
Yarden Rokach, Bracha Shapira

The ability to predict blunders in chess plays a crucial role in improving players' performance and enabling strategic decision-making. We introduce a novel, scalable, and personalized blunder prediction model for chess. Unlike prior work requiring a separate model per player, our unified architecture learns a collaborative user embedding space, allowing it to generalize weaknesses across players and to new users. Our hybrid model, inspired by Deep Factorization Machines (DeepFM), fuses a frozen pre-trained CNN (for board embeddings) with dynamically learned user embeddings to model player-board interactions while still utilizing metadata about the state of the board and the user. We demonstrate that this latent 'blunder profile' is a significantly more powerful predictor of error than a player's explicit Elo rating. The system achieves state-of-the-art performance (0.801 AUC) on both immediate and non-immediate blunders, offering an efficient and data-sparse-friendly solution for personalized chess analysis. Ultimately, this approach demonstrates the practical viability of deep personalization in complex strategy games, facilitating highly efficient, user-centric learning environments.
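A minimal PyTorch sketch of the described hybrid: a frozen board encoder standing in for the pre-trained CNN, a learned per-user embedding, and a DeepFM-style combination of a factorization (dot-product) term with a deep MLP term. All names and dimensions are hypothetical.

```python
# DeepFM-style blunder predictor sketch; names/dimensions are assumptions.
import torch
import torch.nn as nn

class BlunderPredictor(nn.Module):
    def __init__(self, n_users, board_dim=128, user_dim=32):
        super().__init__()
        # Stand-in for a pre-trained CNN over 12 board planes (8x8).
        self.board_enc = nn.Sequential(
            nn.Conv2d(12, 32, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(32, board_dim),
        )
        for p in self.board_enc.parameters():   # frozen, as the abstract states
            p.requires_grad = False
        self.user_emb = nn.Embedding(n_users, user_dim)
        self.proj = nn.Linear(board_dim, user_dim)
        self.mlp = nn.Sequential(
            nn.Linear(board_dim + user_dim, 64), nn.ReLU(), nn.Linear(64, 1))

    def forward(self, board, user_id):
        b = self.board_enc(board)
        u = self.user_emb(user_id)
        fm = (self.proj(b) * u).sum(-1, keepdim=True)   # FM-style interaction
        deep = self.mlp(torch.cat([b, u], dim=-1))      # deep component
        return torch.sigmoid(fm + deep)                 # blunder probability

model = BlunderPredictor(n_users=1000)
p = model(torch.randn(4, 12, 8, 8), torch.tensor([3, 7, 7, 42]))
print(p.shape)   # torch.Size([4, 1])
```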

Citations: 0
Multimodal hand feature recognition based on modality-related feature interaction
IF 3.5 · CAS Tier 2 (Computer Science) · Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE · Pub Date: 2026-02-14 · DOI: 10.1007/s10489-026-07128-x
Huabin Wang, Li Zhang, Shicheng Wei, Fei Liu, Liang Tao

Existing approaches to multimodal hand feature fusion typically extract features from each modality independently and then merge them directly. However, these methods overlook potential correlations between modalities, limiting their ability to leverage complementary cross-modal information. This paper therefore proposes a multimodal hand feature fusion method based on modality-related feature interaction (MR-FIFM). First, a specific-shared feature extraction network (SSFEN) is constructed, preserving the integrity of each modality's features while enhancing inter-modality correlations. Second, a class-constrained multimodal metric loss is proposed to avoid the vanishing-gradient problem and enhance the category-representation capability of the features. Finally, a feature interaction fusion module (FIFM) is proposed, allowing the model to attend simultaneously to features at different positions within each modality to enhance intra-modality features. At the same time, the features of the two modalities interact, enabling the model to dynamically adjust its own feature representation based on the features of the other modality. Experimental results on public datasets, including palmprint + palm-vein (PolyU_Palmprint+PolyU_NIR, PolyU_Blue+PolyU_NIR) and fingerprint + finger-vein (NUPT_FP+NUPT_FV), demonstrate that the proposed method outperforms existing multimodal fusion recognition methods for hand features.
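The following PyTorch sketch illustrates the cross-modal interaction idea, with each modality attending to the other so its representation adapts to its partner. It is a loose illustration of the FIFM concept under assumed shapes, not the paper's module.

```python
# Cross-modal feature interaction sketch; names and shapes are assumptions.
import torch
import torch.nn as nn

class CrossModalInteraction(nn.Module):
    def __init__(self, dim=256, heads=4):
        super().__init__()
        self.a2b = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.b2a = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm_a = nn.LayerNorm(dim)
        self.norm_b = nn.LayerNorm(dim)

    def forward(self, feat_a, feat_b):        # (batch, tokens, dim) each
        # Modality A queries modality B, and vice versa.
        a = self.norm_a(feat_a + self.b2a(feat_a, feat_b, feat_b)[0])
        b = self.norm_b(feat_b + self.a2b(feat_b, feat_a, feat_a)[0])
        return torch.cat([a.mean(1), b.mean(1)], dim=-1)  # fused descriptor

palm = torch.randn(2, 49, 256)   # e.g. palmprint patch features
vein = torch.randn(2, 49, 256)   # e.g. palm-vein patch features
print(CrossModalInteraction()(palm, vein).shape)  # torch.Size([2, 512])
```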

Citations: 0
Stream-DINO: exploring DETR-based online object detection with streaming perception
IF 3.5 · CAS Tier 2 (Computer Science) · Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE · Pub Date: 2026-02-14 · DOI: 10.1007/s10489-026-07139-8
Man Zhang, Yongqiang Zhang, Rui Tian, Yin Zhang, Zian Zhang, Jinwei Sun

Online object detection is a crucial task for the safety of autonomous driving during dynamic movement. While previous research has achieved significant breakthroughs in low-latency detection by pursuing higher FPS, there has been no clear analysis of the relationship between detection speed and online object detection performance, particularly for non-real-time detectors (e.g. DETR-based detectors). To address this issue, we propose a metric named fAP that selects a future frame as the matching target for the current frame according to the detector's FPS. Through this new metric, we reduce the evaluation bias caused by input-frame inconsistency and verify the potential of transformer-based object detection models for the online object detection task. To this end, we develop a novel end-to-end object detector with streaming perception, named Stream-DINO, which has better online object detection capability than other transformer-based models. Specifically, a Feature Encoder Acceleration Module is proposed to reduce computation cost, letting the detector focus on small target objects in each frame. We also introduce an Associated Frame Loss that uses object information captured in the associated frame to supervise the detection results and obtain accurate target-object locations. Notably, Stream-DINO is the first transformer-based online object detection method, and it achieves SOTA performance (19.9% fAP and 18.6% sAP) on the Argoverse-HD dataset, outperforming the DINO baseline by 2.2% fAP and 3.0% sAP.
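The fAP idea of matching detections against a future frame can be sketched as a small matching rule: given the stream frame rate and the detector's throughput, score each frame's output against the ground truth of the frame that will be current when the detector finishes. The exact rule below is an assumption; the paper defines the metric precisely.

```python
# Illustrative matching rule for an fAP-style metric; the precise
# definition is the paper's, not this sketch's.
def matched_gt_frame(frame_idx, detector_fps, stream_fps, n_frames):
    """Index of the future frame whose ground truth scores this detection."""
    latency_frames = max(1, round(stream_fps / detector_fps))
    return min(frame_idx + latency_frames, n_frames - 1)

# A 30 FPS stream with a 10 FPS detector: outputs lag by ~3 frames.
print([matched_gt_frame(i, 10, 30, 100) for i in range(5)])  # [3, 4, 5, 6, 7]
```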

Citations: 0
FinBERT-BiLSTM: a deep learning model for predicting volatile cryptocurrency market prices using market sentiment dynamics
IF 3.5 · CAS Tier 2 (Computer Science) · Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE · Pub Date: 2026-02-14 · DOI: 10.1007/s10489-026-07086-4
Mabsur Fatin Bin Hossain, Lubna Zahan Lamia, Md Mahmudur Rahman, Md Mosaddek Khan

Time series forecasting is a key tool in financial markets, helping to predict asset prices and guide investment decisions. In highly volatile markets like Bitcoin (BTC) and Ethereum (ETH), forecasting is challenged by sharp price swings driven by sentiment, technological shifts, and regulatory changes. Traditionally, forecasting relied on statistical methods, but as markets became more complex, deep learning models like LSTM, BiLSTM, and the newer FinBERT-LSTM emerged to capture intricate patterns. Building upon these advancements and addressing the volatility inherent in cryptocurrency markets, we propose a novel hybrid model that combines Bidirectional Long Short-Term Memory (BiLSTM) networks with FinBERT. In volatile markets like cryptocurrencies, where price movements reflect both immediate reactions and delayed responses to sentiment, the FinBERT-BiLSTM model offers a clear advantage over unidirectional approaches. We evaluate the model against traditional baselines (LSTM, BiLSTM), sentiment-aware variants (FinBERT-LSTM, FinBERT-TCN), and state-of-the-art models, including Transformer-based (Informer, TCN, TFT) and GPT-based (TimeGPT) architectures. These evaluations span short-term (intra-day and one-day-ahead) and long-term (30-day-ahead) forecasting horizons, augmented with a realistic trading simulation. Experimental results show that FinBERT-BiLSTM achieves low Mean Absolute Percentage Error values across all horizons—2.03% (BTC) and 2.52% (ETH) for intra-day, 2.20% (BTC) and 2.77% (ETH) for one-day-ahead, and 1.62% (BTC) and 5.10% (ETH) for 30-day-ahead prediction. Moreover, it outperforms all competing models in simulated trading profitability, demonstrating both statistical and practical value for sentiment-informed cryptocurrency forecasting.
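A minimal PyTorch sketch of the fusion idea: per-day sentiment scores (e.g., produced by a FinBERT sentiment classifier over news headlines) are appended to the price window and fed to a BiLSTM forecaster. The input layout and dimensions are assumptions, not the paper's exact design.

```python
# Sentiment-augmented BiLSTM forecaster sketch; layout is an assumption.
import torch
import torch.nn as nn

class SentimentBiLSTM(nn.Module):
    def __init__(self, n_features=2, hidden=64):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True,
                            bidirectional=True)
        self.head = nn.Linear(2 * hidden, 1)   # next-day price

    def forward(self, x):             # x: (batch, days, [price, sentiment])
        out, _ = self.lstm(x)
        return self.head(out[:, -1])  # read out the last time step

window = torch.randn(8, 30, 2)        # 30-day windows for 8 samples
print(SentimentBiLSTM()(window).shape)  # torch.Size([8, 1])
```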

Citations: 0
Visual design for empathy in virtual reality: a systematic review of emotional engagement (2018–2023)
IF 3.5 · CAS Tier 2 (Computer Science) · Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE · Pub Date: 2026-02-13 · DOI: 10.1007/s10489-025-06970-9
Wenjing Liu, Amirrudin Kamsin, Hanafi Hussin

This study investigates how visual design elements—specifically color, lighting, and spatial composition—affect emotional and empathetic engagement in Virtual Reality (VR) environments. It focuses on adult users across diverse domains such as education, psychotherapy, and humanitarian advocacy. A systematic review of literature from 2018 to 2023 was conducted using Scopus, Web of Science, and JSTOR, adhering to PRISMA guidelines. A total of 21 peer-reviewed studies were selected based on their focus on visual modalities in VR and empathy-related outcomes for adult users. The review identifies a consistent impact of visual elements on users’ empathetic and emotional responses in immersive VR contexts. However, it also uncovers key challenges: (1) a lack of standardized, validated metrics for measuring empathy in VR; (2) a fragmented approach to visual emotion analysis that isolates design elements rather than considering their interplay; and (3) limited investigation into cultural variability in interpreting visual stimuli. This study offers a novel contribution by synthesizing visual design practices in VR to foster empathy, a focus not previously addressed in our research. Unlike existing studies, including any prior work by our team, this review integrates cross-cultural perspectives, proposes standardized empathy metrics, and examines the interplay of visual elements (color, lighting, spatial composition) across diverse domains, providing a cohesive framework for empathy-driven VR design.

Citations: 0
Entity injection with contrastive learning encoder for Chinese few-shot natural language inference
IF 3.5 · CAS Tier 2 (Computer Science) · Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE · Pub Date: 2026-02-12 · DOI: 10.1007/s10489-026-07124-1
Peichao Lai, Feiyang Ye, Yanggeng Fu, Ruiqing Wang, Yingjie Wu, Yilei Wang

Recognizing Textual Entailment (RTE) is a fundamental task in natural language processing with broad downstream applications. In low-resource scenarios, existing methods struggle due to their reliance on external knowledge, sensitivity to semantic distribution bias, and limitations in capturing fine-grained entity distinctions. To address these challenges, we propose a novel framework that enhances semantic awareness of sequence entities through two key strategies: Sample-based Entity Replacement Augmentation and Integration of Entity Features. First, we extract and categorize entities from the dataset, then augment the data by replacing entities with others of the same type, leveraging semi-supervised learning to enhance model training. Second, we introduce the Advanced Contrastive Learning Encoder (ACE) model, which integrates filtered entity features into sequence representations and employs sequence-entity contrastive learning to improve entity distinction. Our approach significantly reduces dependence on external knowledge, mitigates semantic distribution bias, and enhances entity-awareness in RTE. Experimental results demonstrate that our methods consistently achieve state-of-the-art performance across all benchmark datasets for low-resource Chinese RTE tasks.
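Sample-based entity replacement augmentation can be sketched in a few lines: swap each recognized entity for another entity of the same type drawn from a pool built over the dataset. The entity pool, spans, and types below are hypothetical and assumed to come from an upstream NER step.

```python
# Toy entity-replacement augmentation; pool and spans are hypothetical.
import random

entity_pool = {"LOC": ["北京", "上海", "广州"], "ORG": ["清华大学", "复旦大学"]}

def augment(text, entities, rng=random.Random(0)):
    """entities: list of (surface_string, type) pairs found in `text`."""
    for surface, etype in entities:
        candidates = [e for e in entity_pool.get(etype, []) if e != surface]
        if candidates:
            # Replace the first occurrence with a same-type entity.
            text = text.replace(surface, rng.choice(candidates), 1)
    return text

print(augment("北京的清华大学很有名", [("北京", "LOC"), ("清华大学", "ORG")]))
```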

Citations: 0
Forged anomaly detection using advanced deep learning
IF 3.5 · CAS Tier 2 (Computer Science) · Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE · Pub Date: 2026-02-12 · DOI: 10.1007/s10489-026-07114-3
Nomica Choudhry, Jemal Abawajy, Shamsul Huda, Imran Rao

We propose a comprehensive system architecture designed to mitigate the growing risks associated with modern DeepFake technologies. The system comprises three primary components, each playing a vital role in the detection process. In the first component, we propose a GAN framework with a dual discriminator, developed specifically for initial image-authenticity verification; this is essential for differentiating genuine images from artificially manipulated ones. Next, we employ a specialized Residual Block-Based CNN to further analyze the images verified by the GAN. This deep network, consisting of 33 layers, excels at extracting and processing intricate image features, a critical factor in identifying complex patterns indicative of anomalies. Lastly, the system incorporates a Multiple Instance Learning (MIL) framework, which significantly enhances image anomaly detection at the event level rather than only the instance level. The system was rigorously evaluated on two benchmark datasets: FaceForensics++ (FF++) and CelebDF. It achieved a classification accuracy of 96.01% with a False Negative Rate (FNR) of 18.86% and an average computation time of 1801.10 seconds on the FF++ dataset. On the CelebDF dataset, it attained an accuracy of 94.3%, an FNR of 5.9%, and an average computation time of 110.04 seconds. These results demonstrate the framework's effectiveness and efficiency in detecting complex image-forgery cases without extensive manual annotation, making it suitable for real-time surveillance and media-integrity verification systems.
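The abstract names a Residual Block-Based CNN; a representative residual block of the kind such a 33-layer network would stack is sketched below. Channel counts and layout are assumptions, since the exact architecture is not given here.

```python
# Generic residual block; channel counts and layout are assumptions.
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1, bias=False),
            nn.BatchNorm2d(channels), nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1, bias=False),
            nn.BatchNorm2d(channels),
        )
        self.act = nn.ReLU(inplace=True)

    def forward(self, x):
        # The skip connection keeps gradients healthy in deep stacks.
        return self.act(x + self.body(x))

x = torch.randn(1, 64, 56, 56)
print(ResidualBlock(64)(x).shape)   # torch.Size([1, 64, 56, 56])
```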

Citations: 0
Mamba-integrated spatio-temporal attention graph convolutional network for session-based recommendation
IF 3.5 · CAS Tier 2 (Computer Science) · Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE · Pub Date: 2026-02-11 · DOI: 10.1007/s10489-026-07097-1
Yafang Li, Miao Wang, Baokai Zu, Caiyan Jia, Hongyuan Wang

Session-based recommendation (SBR) faces critical limitations in existing approaches that hinder recommendation accuracy. Current methods suffer from three fundamental deficiencies: insufficient modeling of temporal dependencies, where standard attention mechanisms treat all sequence positions equally and fail to capture the relative importance of temporal proximity; noisy aggregation in graph neural networks (GNNs), where multi-layer GNN architectures cause over-smoothing and feature information loss as node representations converge to indistinguishable states; and computational inefficiency, where quadratic-complexity self-attention mechanisms introduce noise through indiscriminate all-to-all item connections. To address these challenges, we propose the Mamba-Integrated Spatio-Temporal Attention Graph Convolutional Network (MSTA-GNN). MSTA-GNN tackles temporal dependency neglect through a convolution-based temporal attention mechanism that explicitly encodes temporal ordering and assigns higher weights to temporally closer items. To overcome GNN noise aggregation, we design a single-layer Spatio-Temporal Attention Graph Convolutional Layer (STA-GCL) that achieves higher-order neighbor aggregation while avoiding over-smoothing. For computational efficiency, we replace quadratic self-attention with a linear-complexity Mamba mechanism that selectively filters noise while efficiently capturing long-term dependencies. Additionally, we introduce virtual nodes and a global-level sparse attention encoder to further mitigate noise and capture inter-session dependencies. Extensive experiments on two public datasets demonstrate that MSTA-GNN significantly outperforms state-of-the-art methods, achieving improvements of 10.22%-14.89% in precision metrics and 1.1%-7.66% in MRR metrics.
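A minimal PyTorch sketch of the convolution-based temporal attention idea: a causal 1-D convolution scores each position, and a learnable recency prior adds weight to temporally closer items before the softmax. This illustrates the stated mechanism under assumed shapes, not the paper's exact formulation.

```python
# Convolutional temporal attention with a recency prior; an illustration
# of the stated idea, with hypothetical names and shapes.
import torch
import torch.nn as nn

class TemporalAttention(nn.Module):
    def __init__(self, dim=64, kernel=3):
        super().__init__()
        self.conv = nn.Conv1d(dim, 1, kernel, padding=kernel - 1)  # causal pad
        self.decay = nn.Parameter(torch.tensor(0.1))

    def forward(self, x):                     # x: (batch, seq_len, dim)
        b, n, d = x.shape
        scores = self.conv(x.transpose(1, 2))[:, :, :n].squeeze(1)  # (b, n)
        # Older positions receive a larger penalty, boosting recent items.
        recency = -self.decay * torch.arange(n - 1, -1, -1, device=x.device)
        attn = torch.softmax(scores + recency, dim=-1)
        return (attn.unsqueeze(-1) * x).sum(1)    # session representation

session = torch.randn(4, 10, 64)
print(TemporalAttention()(session).shape)     # torch.Size([4, 64])
```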

Citations: 0