首页 > 最新文献

IEEE transactions on artificial intelligence最新文献

英文 中文
Phenotype and Genotype Based Sample Aware Surrogate-Assisted Genetic Programming in Dynamic Flexible Job Shop Scheduling 基于表型和基因型的样本感知代理辅助遗传规划在动态柔性作业车间调度中的应用
Pub Date : 2025-04-17 DOI: 10.1109/TAI.2025.3562161
Luyao Zhu;Fangfang Zhang;Xiaodong Zhu;Ke Chen;Mengjie Zhang
Genetic programming (GP) has been widely applied to evolve scheduling heuristics for dynamic flexible job shop scheduling (DFJSS). However, the evaluation of GP individuals is computationally expensive, especially in large scale DFJSS scenarios. A k-nearest neighbor (KNN) based surrogate has been successfully used to reduce individual evaluation time for GP by predicting the fitness of an individual with the most similar sample in KNN. Particularly, the phenotypes of GP individuals have been utilized to generate samples for KNN-based surrogates with a precondition that the fitness of individuals with the same phenotype is the same or similar. However, their real fitness may differ greatly due to different input decision situations for fitness calculations in DFJSS. Thus, only considering phenotypes of GP individuals to extract samples could decrease the accuracy of KNN surrogates. This article proposes a KNN-based surrogate assisted GP algorithm by considering both the phenotype and genotype of GP individuals to generate samples. Specifically, a genotypic characterization based on terminal frequency is designed to measure the similarity of individual genotypes. The results show that with the same training time, the proposed algorithm can converge fast and achieve better scheduling heuristics than the state-of-the-art algorithms in most examined scenarios. With the same number of generations, the proposed algorithm can obtain comparable performance but only needs about one third of the training time of baseline GP. The effectiveness of the proposed algorithm is also verified from different aspects, e.g., relation between genotype correlation and fitness difference of individuals, and population diversity.
遗传规划(GP)被广泛应用于动态柔性作业车间调度(DFJSS)的进化调度启发式算法。然而,GP个体的评估在计算上是昂贵的,特别是在大规模DFJSS场景中。基于k最近邻(KNN)的代理通过预测最相似样本的个体适应度,成功地减少了GP的个体评估时间。特别是,在具有相同表型的个体的适应度相同或相似的前提下,GP个体的表型已被用来为基于knn的替代品生成样本。然而,由于DFJSS中适应度计算的输入决策情况不同,它们的实际适应度可能相差很大。因此,仅考虑GP个体的表型来提取样本可能会降低KNN替代品的准确性。本文提出了一种基于knn的代理辅助GP算法,通过考虑GP个体的表型和基因型来生成样本。具体来说,基于终端频率的基因型表征被设计用来测量个体基因型的相似性。结果表明,在相同的训练时间下,在大多数测试场景下,该算法收敛速度快,调度启发式优于现有算法。在相同的代数下,该算法可以获得相当的性能,而所需的训练时间仅为基线GP的三分之一左右。从个体基因型相关性与适应度差异的关系、种群多样性等方面验证了算法的有效性。
{"title":"Phenotype and Genotype Based Sample Aware Surrogate-Assisted Genetic Programming in Dynamic Flexible Job Shop Scheduling","authors":"Luyao Zhu;Fangfang Zhang;Xiaodong Zhu;Ke Chen;Mengjie Zhang","doi":"10.1109/TAI.2025.3562161","DOIUrl":"https://doi.org/10.1109/TAI.2025.3562161","url":null,"abstract":"Genetic programming (GP) has been widely applied to evolve scheduling heuristics for dynamic flexible job shop scheduling (DFJSS). However, the evaluation of GP individuals is computationally expensive, especially in large scale DFJSS scenarios. A k-nearest neighbor (KNN) based surrogate has been successfully used to reduce individual evaluation time for GP by predicting the fitness of an individual with the most similar sample in KNN. Particularly, the phenotypes of GP individuals have been utilized to generate samples for KNN-based surrogates with a precondition that the fitness of individuals with the same phenotype is the same or similar. However, their real fitness may differ greatly due to different input decision situations for fitness calculations in DFJSS. Thus, only considering phenotypes of GP individuals to extract samples could decrease the accuracy of KNN surrogates. This article proposes a KNN-based surrogate assisted GP algorithm by considering both the phenotype and genotype of GP individuals to generate samples. Specifically, a genotypic characterization based on terminal frequency is designed to measure the similarity of individual genotypes. The results show that with the same training time, the proposed algorithm can converge fast and achieve better scheduling heuristics than the state-of-the-art algorithms in most examined scenarios. With the same number of generations, the proposed algorithm can obtain comparable performance but only needs about one third of the training time of baseline GP. The effectiveness of the proposed algorithm is also verified from different aspects, e.g., relation between genotype correlation and fitness difference of individuals, and population diversity.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 12","pages":"3232-3247"},"PeriodicalIF":0.0,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145612210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforcement Learning for Efficient Multiagent Task Allocation in Potential Game Model 潜在博弈模型中高效多智能体任务分配的强化学习
Pub Date : 2025-04-17 DOI: 10.1109/TAI.2025.3562160
Yuxing Xing;Caixia Chen;Jie Wu;Jie Chen
The potential game has been widely used to describe multiagent task allocation. However, the application of traditional game-theoretic algorithms has shown unsatisfactory performance in scenarios with a high agent count. For this, we employ reinforcement learning algorithm to enable each agent to independently make decision in response to other agents’ decisions and variations in the number of agents, ultimately working towards achieving a desired goal. First, we construct a potential game for multiagent task allocation and design a corresponding utility function for each agent. Then, we propose a deep q-network algorithm based on graph neural network, and enhance the agent selection mechanism in this learning algorithm. During each iteration, a task is randomly selected for an agent from the participant set, and each agent updates its strategy accordingly. Finally, by comparing several representative game theoretical algorithms, the numerical simulations highlight the advantages and performance of our proposed GDQ-Net algorithm across various tasks and numbers of agents under the constructed model.
潜在博弈被广泛用于描述多智能体任务分配。然而,传统的博弈论算法在智能体数量较多的情况下表现不理想。为此,我们采用强化学习算法,使每个智能体能够独立地做出决策,以响应其他智能体的决策和智能体数量的变化,最终朝着预期的目标努力。首先,我们构造了一个多智能体任务分配的潜在博弈,并为每个智能体设计了相应的效用函数。然后,我们提出了一种基于图神经网络的深度q-network算法,并对该学习算法中的智能体选择机制进行了改进。在每次迭代过程中,从参与者集中随机为一个代理选择一个任务,每个代理相应地更新其策略。最后,通过比较几种具有代表性的博弈理论算法,通过数值仿真,突出了本文提出的GDQ-Net算法在构建的模型下,在各种任务和智能体数量下的优势和性能。
{"title":"Reinforcement Learning for Efficient Multiagent Task Allocation in Potential Game Model","authors":"Yuxing Xing;Caixia Chen;Jie Wu;Jie Chen","doi":"10.1109/TAI.2025.3562160","DOIUrl":"https://doi.org/10.1109/TAI.2025.3562160","url":null,"abstract":"The potential game has been widely used to describe multiagent task allocation. However, the application of traditional game-theoretic algorithms has shown unsatisfactory performance in scenarios with a high agent count. For this, we employ reinforcement learning algorithm to enable each agent to independently make decision in response to other agents’ decisions and variations in the number of agents, ultimately working towards achieving a desired goal. First, we construct a potential game for multiagent task allocation and design a corresponding utility function for each agent. Then, we propose a deep q-network algorithm based on graph neural network, and enhance the agent selection mechanism in this learning algorithm. During each iteration, a task is randomly selected for an agent from the participant set, and each agent updates its strategy accordingly. Finally, by comparing several representative game theoretical algorithms, the numerical simulations highlight the advantages and performance of our proposed GDQ-Net algorithm across various tasks and numbers of agents under the constructed model.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 12","pages":"3217-3231"},"PeriodicalIF":0.0,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145612204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ownership Infringement Detection for Generative Adversarial Networks Against Model Stealing 针对模型窃取的生成对抗网络所有权侵权检测
Pub Date : 2025-04-16 DOI: 10.1109/TAI.2025.3560921
Hailong Hu;Jun Pang
Generative adversarial networks (GANs) have shown remarkable success in image synthesis, making GAN models themselves commercially valuable to legitimate model owners. Therefore, it is critical to technically protect the intellectual property of GANs. Prior works need to tamper with the training set or training process to verify the ownership of a GAN. In this article, we show that these methods are not robust to emerging model extraction attacks. Then, we propose a new method GAN-Guards which utilizes the common characteristics of a target model and its stolen models for ownership infringement detection. Our method can be directly applicable to all well-trained GANs as it does not require retraining target models. Extensive experimental results show that our new method achieves superior detection performance, compared with the watermark-based and fingerprint-based methods. Finally, we demonstrate the effectiveness of our method with respect to the number of generations of model extraction attacks, the number of generated samples, and adaptive attacks.
生成对抗网络(GAN)在图像合成方面取得了显著的成功,使得GAN模型本身对合法的模型所有者具有商业价值。因此,从技术上保护gan的知识产权是至关重要的。先前的工作需要篡改训练集或训练过程来验证GAN的所有权。在本文中,我们展示了这些方法对新出现的模型提取攻击的鲁棒性。然后,我们提出了一种新的gan - guard方法,该方法利用目标模型和被盗模型的共同特征进行所有权侵权检测。我们的方法可以直接适用于所有训练良好的gan,因为它不需要再训练目标模型。大量的实验结果表明,与基于水印和指纹的检测方法相比,该方法具有更好的检测性能。最后,我们证明了我们的方法在模型提取攻击的代数、生成样本的数量和自适应攻击方面的有效性。
{"title":"Ownership Infringement Detection for Generative Adversarial Networks Against Model Stealing","authors":"Hailong Hu;Jun Pang","doi":"10.1109/TAI.2025.3560921","DOIUrl":"https://doi.org/10.1109/TAI.2025.3560921","url":null,"abstract":"Generative adversarial networks (GANs) have shown remarkable success in image synthesis, making GAN models themselves commercially valuable to legitimate model owners. Therefore, it is critical to technically protect the intellectual property of GANs. Prior works need to tamper with the training set or training process to verify the ownership of a GAN. In this article, we show that these methods are not robust to emerging model extraction attacks. Then, we propose a new method GAN-Guards which utilizes the common characteristics of a target model and its stolen models for ownership infringement detection. Our method can be directly applicable to all well-trained GANs as it does not require retraining target models. Extensive experimental results show that our new method achieves superior detection performance, compared with the watermark-based and fingerprint-based methods. Finally, we demonstrate the effectiveness of our method with respect to the number of generations of model extraction attacks, the number of generated samples, and adaptive attacks.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 11","pages":"3018-3029"},"PeriodicalIF":0.0,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Privacy-Enhancing Framework for Low-Dose CT Denoising 一种新的低剂量CT去噪隐私增强框架
Pub Date : 2025-04-16 DOI: 10.1109/TAI.2025.3561092
Ziyuan Yang;Huijie Huangfu;Maosong Ran;Zhiwen Wang;Hui Yu;Mengyu Sun;Yi Zhang
Deep learning (DL) has made significant advancements in tomographic imaging, particularly in low-dose computed tomography (LDCT) denoising. A recent trend involves servers training powerful models with enormous self-collected data and providing application programming interfaces (APIs) for users, such as Chat-GPT. To avoid model leakage, users are required to upload their data to the server. This approach is particularly advantageous for devices with limited computational capabilities, as it offloads computation to the server, easing the workload on the devices themselves. However, this way raises public concerns about the privacy disclosure risk. Hence, to alleviate related concerns, we propose to directly denoise LDCT in the encrypted domain to achieve privacy-preserving cloud services without exposing private data to the server. Concretely, we employ homomorphic encryption to encrypt private LDCT, which is then transferred to the server model trained with plaintext LDCT for further denoising. Since fundamental DL operations, such as convolution and linear transformation, cannot be directly used in the encrypted domain, we transform the fundamental mathematic operations in the plaintext domain into the operations in the encrypted domain. Moreover, we present two interactive frameworks for linear and nonlinear models, both of which can achieve lossless operating. In this way, the proposed methods can achieve two merits, the data privacy is well protected, and the server model is free from the risk of model leakage. Moreover, we provide theoretical proof to validate the lossless property of our framework. Finally, experiments were conducted to demonstrate that the transferred contents are well protected and cannot be reconstructed.1

The codes are released at https://github.com/Zi-YuanYang/Encrypt_LDCT_Recon

深度学习(DL)在层析成像方面取得了重大进展,特别是在低剂量计算机断层扫描(LDCT)去噪方面。最近的一个趋势是,服务器使用大量自收集的数据训练强大的模型,并为用户提供应用程序编程接口(api),比如Chat-GPT。为了避免模型泄漏,用户需要将他们的数据上传到服务器。这种方法对于计算能力有限的设备特别有利,因为它将计算转移到服务器上,从而减轻了设备本身的工作负载。然而,这种方式引发了公众对隐私泄露风险的担忧。因此,为了缓解相关问题,我们建议在加密域中直接对LDCT进行去噪,以实现保护隐私的云服务,而不会将私有数据暴露给服务器。具体而言,我们使用同态加密对私有LDCT进行加密,然后将其传输到使用明文LDCT训练的服务器模型中进行进一步去噪。由于基本的DL运算,如卷积和线性变换,不能直接在加密域中使用,我们将明文域中的基本数学运算转换为加密域中的运算。此外,我们提出了线性和非线性模型的两种交互框架,这两种框架都可以实现无损操作。这样,所提出的方法可以达到两个优点,一是数据隐私得到了很好的保护,二是服务器模型没有模型泄漏的风险。此外,我们还提供了理论证明来验证我们的框架的无损性。最后,通过实验验证了传输的内容得到了很好的保护,并且不能被重构。代码在https://github.com/Zi-YuanYang/Encrypt_LDCT_Recon上发布
{"title":"A Novel Privacy-Enhancing Framework for Low-Dose CT Denoising","authors":"Ziyuan Yang;Huijie Huangfu;Maosong Ran;Zhiwen Wang;Hui Yu;Mengyu Sun;Yi Zhang","doi":"10.1109/TAI.2025.3561092","DOIUrl":"https://doi.org/10.1109/TAI.2025.3561092","url":null,"abstract":"Deep learning (DL) has made significant advancements in tomographic imaging, particularly in low-dose computed tomography (LDCT) denoising. A recent trend involves servers training powerful models with enormous self-collected data and providing application programming interfaces (APIs) for users, such as Chat-GPT. To avoid model leakage, users are required to upload their data to the server. This approach is particularly advantageous for devices with limited computational capabilities, as it offloads computation to the server, easing the workload on the devices themselves. However, this way raises public concerns about the privacy disclosure risk. Hence, to alleviate related concerns, we propose to directly denoise LDCT in the encrypted domain to achieve privacy-preserving cloud services without exposing private data to the server. Concretely, we employ homomorphic encryption to encrypt private LDCT, which is then transferred to the server model trained with plaintext LDCT for further denoising. Since fundamental DL operations, such as convolution and linear transformation, cannot be directly used in the encrypted domain, we transform the fundamental mathematic operations in the plaintext domain into the operations in the encrypted domain. Moreover, we present two interactive frameworks for linear and nonlinear models, both of which can achieve lossless operating. In this way, the proposed methods can achieve two merits, the data privacy is well protected, and the server model is free from the risk of model leakage. Moreover, we provide theoretical proof to validate the lossless property of our framework. Finally, experiments were conducted to demonstrate that the transferred contents are well protected and cannot be reconstructed.<xref><sup>1</sup></xref><fn><label><sup>1</sup></label><p>The codes are released at <uri>https://github.com/Zi-YuanYang/Encrypt_LDCT_Recon</uri></p></fn>","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 11","pages":"3043-3055"},"PeriodicalIF":0.0,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating Large Language Model for Improved Causal Discovery 集成大语言模型改进因果发现
Pub Date : 2025-04-16 DOI: 10.1109/TAI.2025.3560927
Taiyu Ban;Lyuzhou Chen;Derui Lyu;Xiangyu Wang;Qinrui Zhu;Qiang Tu;Huanhuan Chen
Recovering the structure of causal graphical models from observational data is an essential yet challenging task for causal discovery in scientific scenarios. Domain-specific causal discovery usually relies on expert validation or prior analysis to improve the reliability of recovered causality, which is yet limited by the scarcity of expert resources. Recently, large language models (LLM) have been used for causal analysis across various domain-specific scenarios, suggesting its potential as autonomous expert roles in guiding data-based structure learning. However, integrating LLMs into causal discovery faces challenges due to inaccuracies in LLM-based reasoning on revealing the actual causal structure. To address this challenge, we propose an error-tolerant LLM-driven causal discovery framework. The error-tolerant mechanism is designed three-fold with sufficient consideration on potential inaccuracies. In the LLM-based reasoning process, an accuracy-oriented prompting strategy restricts causal analysis to a reliable range. Next, a knowledge-to-structure transition aligns LLM-derived causal statements with structural causal interactions. In the structure learning process, the goodness-of-fit to data and adherence to LLM-derived priors are balanced to further address prior inaccuracies. Evaluation of eight real-world causal structures demonstrates the efficacy of our LLM-driven approach in improving data-based causal discovery, along with its robustness to inaccurate LLM-derived priors.
从观测数据中恢复因果图模型的结构是科学情景中因果发现的一项重要但具有挑战性的任务。特定领域的因果关系发现通常依赖于专家验证或先验分析来提高因果关系恢复的可靠性,但这受到专家资源稀缺的限制。最近,大型语言模型(LLM)已被用于各种领域特定场景的因果分析,这表明它在指导基于数据的结构学习方面具有自主专家角色的潜力。然而,将法学硕士整合到因果发现中面临挑战,因为基于法学硕士的推理在揭示实际因果结构方面存在不准确性。为了应对这一挑战,我们提出了一个容错的法学硕士驱动的因果发现框架。容错机制设计了三层,充分考虑了潜在的误差。在基于llm的推理过程中,以准确性为导向的提示策略将因果分析限制在可靠的范围内。接下来,知识到结构的转变将法学硕士衍生的因果陈述与结构因果相互作用结合起来。在结构学习过程中,对数据的拟合优度和对llm推导的先验的依从性进行了平衡,以进一步解决先验的不准确性。对八个现实世界因果结构的评估证明了我们的法学硕士驱动方法在改进基于数据的因果发现方面的有效性,以及它对不准确的法学硕士衍生先验的鲁棒性。
{"title":"Integrating Large Language Model for Improved Causal Discovery","authors":"Taiyu Ban;Lyuzhou Chen;Derui Lyu;Xiangyu Wang;Qinrui Zhu;Qiang Tu;Huanhuan Chen","doi":"10.1109/TAI.2025.3560927","DOIUrl":"https://doi.org/10.1109/TAI.2025.3560927","url":null,"abstract":"Recovering the structure of causal graphical models from observational data is an essential yet challenging task for causal discovery in scientific scenarios. Domain-specific causal discovery usually relies on expert validation or prior analysis to improve the reliability of recovered causality, which is yet limited by the scarcity of expert resources. Recently, large language models (LLM) have been used for causal analysis across various domain-specific scenarios, suggesting its potential as autonomous expert roles in guiding data-based structure learning. However, integrating LLMs into causal discovery faces challenges due to inaccuracies in LLM-based reasoning on revealing the actual causal structure. To address this challenge, we propose an error-tolerant LLM-driven causal discovery framework. The error-tolerant mechanism is designed three-fold with sufficient consideration on potential inaccuracies. In the LLM-based reasoning process, an accuracy-oriented prompting strategy restricts causal analysis to a reliable range. Next, a knowledge-to-structure transition aligns LLM-derived causal statements with structural causal interactions. In the structure learning process, the goodness-of-fit to data and adherence to LLM-derived priors are balanced to further address prior inaccuracies. Evaluation of eight real-world causal structures demonstrates the efficacy of our LLM-driven approach in improving data-based causal discovery, along with its robustness to inaccurate LLM-derived priors.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 11","pages":"3030-3042"},"PeriodicalIF":0.0,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145456013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mixture-of-Experts for Open Set Domain Adaptation: A Dual-Space Detection Approach 开集域自适应的专家混合:一种双空间检测方法
Pub Date : 2025-04-14 DOI: 10.1109/TAI.2025.3560590
Zhenbang Du;Jiayu An;Yunlu Tu;Jiahao Hong;Dongrui Wu
Open set domain adaptation (OSDA) copes with the distribution and label shifts between the source and target domains simultaneously, performing accurate classification for known classes while identifying unknown class samples in the target domain. Most existing OSDA approaches, depending on the final image feature space of deep models, require manually-tuned thresholds, and may easily misclassify unknown samples as known classes. Mixture-of-experts (MoE) could be a remedy. Within an MoE, different experts handle distinct input features, producing unique expert routing patterns for various classes in a routing feature space. As a result, unknown class samples may display different expert routing patterns to known classes. This article proposes dual-space detection, which exploits the inconsistencies between the image feature space and the routing feature space to detect unknown class samples without any threshold. A graph router is further introduced to better make use of the spatial information among the image patches. Experiments on three datasets validated the effectiveness and superiority of our approach.
开放集域自适应(OSDA)同时处理源域和目标域之间的分布和标签转移,在识别目标域中未知类样本的同时,对已知类样本进行准确分类。大多数现有的OSDA方法依赖于深度模型的最终图像特征空间,需要手动调整阈值,并且很容易将未知样本误分类为已知类。专家混合(MoE)可能是一种补救措施。在MoE中,不同的专家处理不同的输入特征,为路由特征空间中的各种类生成独特的专家路由模式。因此,未知的类样本可能会显示与已知类不同的专家路由模式。本文提出双空间检测,利用图像特征空间与路由特征空间的不一致性,不设阈值检测未知类样本。为了更好地利用图像块间的空间信息,进一步引入了图形路由器。在三个数据集上的实验验证了我们方法的有效性和优越性。
{"title":"Mixture-of-Experts for Open Set Domain Adaptation: A Dual-Space Detection Approach","authors":"Zhenbang Du;Jiayu An;Yunlu Tu;Jiahao Hong;Dongrui Wu","doi":"10.1109/TAI.2025.3560590","DOIUrl":"https://doi.org/10.1109/TAI.2025.3560590","url":null,"abstract":"Open set domain adaptation (OSDA) copes with the distribution and label shifts between the source and target domains simultaneously, performing accurate classification for known classes while identifying unknown class samples in the target domain. Most existing OSDA approaches, depending on the final image feature space of deep models, require manually-tuned thresholds, and may easily misclassify unknown samples as known classes. Mixture-of-experts (MoE) could be a remedy. Within an MoE, different experts handle distinct input features, producing unique expert routing patterns for various classes in a routing feature space. As a result, unknown class samples may display different expert routing patterns to known classes. This article proposes dual-space detection, which exploits the inconsistencies between the image feature space and the routing feature space to detect unknown class samples without any threshold. A graph router is further introduced to better make use of the spatial information among the image patches. Experiments on three datasets validated the effectiveness and superiority of our approach.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 12","pages":"3207-3216"},"PeriodicalIF":0.0,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145612207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incremental Semisupervised Learning With Adaptive Locality Preservation for High-Dimensional Data 基于自适应局部保存的高维数据增量半监督学习
Pub Date : 2025-04-14 DOI: 10.1109/TAI.2025.3560592
Guojie Li;Zhiwen Yu;Kaixiang Yang;Ziwei Fan;C. L. Philip Chen
Broad learning system (BLS) has been widely researched and applied in the field of semisupervised learning. However, current semisupervised BLS methods rely on predefined graph structures. High-dimensional small-sample data, characterized by abundant redundant and noisy features with complex distribution patterns, often leads to the construction of poor-quality predefined graphs, thereby constraining the model’s performance. Additionally, the random generation of feature and enhancement nodes in BLS, combined with limited data labels, results in suboptimal model performance. To address these issues, this article first proposes a broad learning system with adaptive locality preservation (BLS-ALP). This method employs adaptive locality preservation constraints in the output space to ensure that similar samples share the same label, iteratively updating the graph structure. To further enhance the performance of BLS-ALP, an incremental ensemble framework (IBLS-ALP) is proposed. This framework effectively mitigates the impact of redundant and noisy features by using multiple random subspaces instead of the original high-dimensional space. Additionally, IBLS-ALP enhances the utilization of a small number of labels by incorporating residual labels, thereby significantly improving the model’s overall performance. Extensive experiments conducted on various high-dimensional small-sample datasets demonstrate that IBLS-ALP exhibits superior performance.
广义学习系统(BLS)在半监督学习领域得到了广泛的研究和应用。然而,目前的半监督BLS方法依赖于预定义的图结构。高维小样本数据具有丰富的冗余和噪声特征,分布模式复杂,往往导致构建质量较差的预定义图,从而制约了模型的性能。此外,BLS中特征和增强节点的随机生成,加上有限的数据标签,导致模型性能不理想。为了解决这些问题,本文首先提出了一种具有自适应局部保存的广义学习系统(BLS-ALP)。该方法在输出空间中采用自适应局域保持约束,确保相似样本共享相同的标签,迭代更新图结构。为了进一步提高BLS-ALP的性能,提出了一种增量集成框架(IBLS-ALP)。该框架利用多个随机子空间代替原有的高维空间,有效地减轻了冗余和噪声特征的影响。此外,IBLS-ALP通过加入残差标签,提高了少量标签的利用率,从而显著提高了模型的整体性能。在各种高维小样本数据集上进行的大量实验表明,IBLS-ALP具有优越的性能。
{"title":"Incremental Semisupervised Learning With Adaptive Locality Preservation for High-Dimensional Data","authors":"Guojie Li;Zhiwen Yu;Kaixiang Yang;Ziwei Fan;C. L. Philip Chen","doi":"10.1109/TAI.2025.3560592","DOIUrl":"https://doi.org/10.1109/TAI.2025.3560592","url":null,"abstract":"Broad learning system (BLS) has been widely researched and applied in the field of semisupervised learning. However, current semisupervised BLS methods rely on predefined graph structures. High-dimensional small-sample data, characterized by abundant redundant and noisy features with complex distribution patterns, often leads to the construction of poor-quality predefined graphs, thereby constraining the model’s performance. Additionally, the random generation of feature and enhancement nodes in BLS, combined with limited data labels, results in suboptimal model performance. To address these issues, this article first proposes a broad learning system with adaptive locality preservation (BLS-ALP). This method employs adaptive locality preservation constraints in the output space to ensure that similar samples share the same label, iteratively updating the graph structure. To further enhance the performance of BLS-ALP, an incremental ensemble framework (IBLS-ALP) is proposed. This framework effectively mitigates the impact of redundant and noisy features by using multiple random subspaces instead of the original high-dimensional space. Additionally, IBLS-ALP enhances the utilization of a small number of labels by incorporating residual labels, thereby significantly improving the model’s overall performance. Extensive experiments conducted on various high-dimensional small-sample datasets demonstrate that IBLS-ALP exhibits superior performance.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 11","pages":"2990-3004"},"PeriodicalIF":0.0,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EEG Emotion Recognition Based on an Implicit Emotion Regulatory Mechanism 基于内隐情绪调节机制的脑电情绪识别
Pub Date : 2025-04-14 DOI: 10.1109/TAI.2025.3560593
Dongdong Li;Zhishuo Jin;Yujun Shen;Zhe Wang;Suo Jiang
One of the main challenges in electroencephalography (EEG) emotion recognition is the lack of understanding of the biological properties of the brain and how they relate to emotions. To address this issue, this article proposes an implicit emotion regulatory mechanism inspired contrastive learning framework (CLIER) for EEG emotion recognition. The framework simulates the complex relationship between emotions and the underlying neurobiological processes; to achieve this, the mechanism is mainly simulated through three parts. First, to leverage the interindividual variability of emotional expression, the emotion features of the individual are captured by a dynamic connection graph in the subject-dependent setting. Subsequently, reverse regulation is simulated by contrast learning based on label information and data augmentation to capture more biologically specific emotional features. Finally, caused by the asymmetry between the left and right hemispheres of the human brain in response to emotions, brain lateralization mutual learning facilitates the fusion of the hemispheres in determining emotions. Experiments on SEED, SEED-IV, SEED-V, and EREMUS datasets show impressive results: 93.4% accuracy on SEED, 90.2% on SEED-IV, 82.46% on SEED-V, and 41.63% on EREMUS. Employing an identical experimental protocol, our model demonstrated superior performance relative to the majority of existing methods, thus showcasing its effectiveness in the realm of EEG emotion recognition.
脑电图(EEG)情绪识别的主要挑战之一是缺乏对大脑生物学特性及其与情绪的关系的理解。为了解决这一问题,本文提出了一种基于内隐情绪调节机制的脑电情绪识别对比学习框架(CLIER)。该框架模拟了情绪与潜在神经生物学过程之间的复杂关系;为此,主要通过三个部分对该机制进行仿真。首先,为了利用情绪表达的个体间可变性,个体的情绪特征被主体依赖设置中的动态连接图捕获。随后,通过基于标签信息和数据增强的对比学习模拟反向调节,以捕获更多生物特异性情绪特征。最后,由于人类大脑左右半球对情绪的反应不对称,大脑侧化相互学习促进了左右半球在决定情绪时的融合。在SEED、SEED- iv、SEED- v和EREMUS数据集上的实验显示,SEED的准确率为93.4%,SEED- iv的准确率为90.2%,SEED- v的准确率为82.46%,EREMUS的准确率为41.63%。采用相同的实验方案,我们的模型相对于大多数现有方法显示出优越的性能,从而展示了其在EEG情感识别领域的有效性。
{"title":"EEG Emotion Recognition Based on an Implicit Emotion Regulatory Mechanism","authors":"Dongdong Li;Zhishuo Jin;Yujun Shen;Zhe Wang;Suo Jiang","doi":"10.1109/TAI.2025.3560593","DOIUrl":"https://doi.org/10.1109/TAI.2025.3560593","url":null,"abstract":"One of the main challenges in electroencephalography (EEG) emotion recognition is the lack of understanding of the biological properties of the brain and how they relate to emotions. To address this issue, this article proposes an implicit emotion regulatory mechanism inspired contrastive learning framework (CLIER) for EEG emotion recognition. The framework simulates the complex relationship between emotions and the underlying neurobiological processes; to achieve this, the mechanism is mainly simulated through three parts. First, to leverage the interindividual variability of emotional expression, the emotion features of the individual are captured by a dynamic connection graph in the subject-dependent setting. Subsequently, reverse regulation is simulated by contrast learning based on label information and data augmentation to capture more biologically specific emotional features. Finally, caused by the asymmetry between the left and right hemispheres of the human brain in response to emotions, brain lateralization mutual learning facilitates the fusion of the hemispheres in determining emotions. Experiments on SEED, SEED-IV, SEED-V, and EREMUS datasets show impressive results: 93.4% accuracy on SEED, 90.2% on SEED-IV, 82.46% on SEED-V, and 41.63% on EREMUS. Employing an identical experimental protocol, our model demonstrated superior performance relative to the majority of existing methods, thus showcasing its effectiveness in the realm of EEG emotion recognition.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 11","pages":"3005-3017"},"PeriodicalIF":0.0,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Residual Learning of a Probabilistic’ Partial Least Squares Model for Predictive Data Analytics 预测数据分析中概率偏最小二乘模型的深度残差学习
Pub Date : 2025-04-11 DOI: 10.1109/TAI.2025.3560248
Zhiqiang Ge;Duxin Chen;Wenwu Yu
Recently, probabilistic latent variable models have played an important role in data analytics in various industrial application scenarios, such as process monitoring, fault diagnosis, and soft sensing. Inspired by the idea of lightweight deep learning, this article proposes a new deep residual learning method for the probabilistic’ partial least squares (PLSs) model. First, layerwise probabilistic modeling is carried out to extract supervised latent variables in different hidden layers of the deep model using a well-designed expectation-maximization algorithm for parameter optimization. Through this layerwise residual learning process, more target-related latent variables can be extracted, which are supervised by the outputs of the predictive model. Next, an additional probabilistic model is constructed for information fusion and further extraction of supervised latent variables which are highly related to the modeling target. In fact, this step can be considered as an ensemble learning strategy, which has great potentials in decreasing modeling error and reducing prediction uncertainty. A soft-sensing strategy is then developed for online prediction of key variables. The performance is evaluated using two industrial examples. Compared to the shallow probabilistic model, the performance of the deep model has been improved by 10%–20%.
近年来,概率潜变量模型在过程监控、故障诊断和软测量等各种工业应用场景的数据分析中发挥了重要作用。受轻量级深度学习思想的启发,本文提出了一种新的概率偏最小二乘(pls)模型的深度残差学习方法。首先,利用设计良好的期望最大化算法进行参数优化,进行分层概率建模,提取深度模型不同隐藏层的有监督潜在变量。通过这种分层残差学习过程,可以提取更多与目标相关的潜在变量,这些潜在变量由预测模型的输出进行监督。其次,构建一个附加的概率模型,用于信息融合和进一步提取与建模目标高度相关的监督潜变量。实际上,这一步可以看作是一种集成学习策略,在减少建模误差和降低预测不确定性方面具有很大的潜力。然后开发了一种软测量策略,用于在线预测关键变量。用两个工业实例对其性能进行了评价。与浅层概率模型相比,深层模型的性能提高了10%-20%。
{"title":"Deep Residual Learning of a Probabilistic’ Partial Least Squares Model for Predictive Data Analytics","authors":"Zhiqiang Ge;Duxin Chen;Wenwu Yu","doi":"10.1109/TAI.2025.3560248","DOIUrl":"https://doi.org/10.1109/TAI.2025.3560248","url":null,"abstract":"Recently, probabilistic latent variable models have played an important role in data analytics in various industrial application scenarios, such as process monitoring, fault diagnosis, and soft sensing. Inspired by the idea of lightweight deep learning, this article proposes a new deep residual learning method for the probabilistic’ partial least squares (PLSs) model. First, layerwise probabilistic modeling is carried out to extract supervised latent variables in different hidden layers of the deep model using a well-designed expectation-maximization algorithm for parameter optimization. Through this layerwise residual learning process, more target-related latent variables can be extracted, which are supervised by the outputs of the predictive model. Next, an additional probabilistic model is constructed for information fusion and further extraction of supervised latent variables which are highly related to the modeling target. In fact, this step can be considered as an ensemble learning strategy, which has great potentials in decreasing modeling error and reducing prediction uncertainty. A soft-sensing strategy is then developed for online prediction of key variables. The performance is evaluated using two industrial examples. Compared to the shallow probabilistic model, the performance of the deep model has been improved by 10%–20%.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 11","pages":"2977-2989"},"PeriodicalIF":0.0,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parallel Inductive Shift Learning Based Recommendation System 基于并行感应移位学习的推荐系统
Pub Date : 2025-04-09 DOI: 10.1109/TAI.2025.3558183
Nilufar Zaman;Angshuman Jana
In today’s world, online services have revolutionized human activities and thus the consumers expect their service providers to make their online experiences more fruitful by recommending the relevant services to them. In this case, it becomes really challenging for the service providers to provide recommendation to a user whose information’s and preferences are unavailable. This issue is handled by cross-domain approach, which explores similar users across various domains in the same platform. However, the main concern with this cross-domain approach is that the information needs to be available in any domain of one platform. Thus, a multidomain recommendation is designed to optimize the recommendation system performance by analyzing the information obtained from multiple platforms. However, existing multidomain recommendation model has mainly two challenges. First, there are no overlapping users to understand the similarities between them. Second, the transfer learning approach in multidomain allows the transfer of information from only the source to the target domain. Therefore, our proposed approach consider the parallel inductive shift learning (PISL) model to address these two above-mentioned challenges. For the first challenge, we have focused to identify the similarities between user–user and user–item by considering various features of user and item. For the next challenge, our proposed model analyzes the source and the target domain simultaneously and thus does a parallel transfer of information from the source to the target domain and vice versa. We have tested our model for three real-life movie and book datasets i.e. for the movie dataset we have used Movielens, Amazon, and Netflix datasets. In contrast, for the book dataset, we have used the Amazon, Good Reads, and Book Crossing dataset, which proves to outperform the other state-of-the-art approaches.
在当今世界,在线服务已经彻底改变了人类的活动,因此消费者期望他们的服务提供商通过向他们推荐相关的服务来使他们的在线体验更加富有成效。在这种情况下,服务提供者向信息和偏好不可用的用户提供推荐变得非常具有挑战性。这个问题是通过跨域方法处理的,该方法在同一平台的不同域中探索类似的用户。然而,这种跨领域方法的主要问题是信息需要在一个平台的任何领域中可用。因此,我们设计了一个多领域推荐,通过分析多个平台获得的信息来优化推荐系统的性能。然而,现有的多领域推荐模型主要存在两方面的挑战。首先,没有重叠的用户来理解它们之间的相似之处。第二,多域迁移学习方法允许信息仅从源域迁移到目标域。因此,我们提出的方法考虑并行归纳移位学习(PISL)模型来解决上述两个挑战。对于第一个挑战,我们专注于通过考虑用户和物品的各种特征来识别用户-用户和用户-物品之间的相似性。对于下一个挑战,我们提出的模型同时分析源域和目标域,从而将信息从源域并行传输到目标域,反之亦然。我们已经为三个现实生活中的电影和书籍数据集测试了我们的模型,即对于电影数据集,我们使用了Movielens, Amazon和Netflix数据集。相比之下,对于图书数据集,我们使用了Amazon、Good Reads和book Crossing数据集,事实证明,这些数据集的性能优于其他最先进的方法。
{"title":"Parallel Inductive Shift Learning Based Recommendation System","authors":"Nilufar Zaman;Angshuman Jana","doi":"10.1109/TAI.2025.3558183","DOIUrl":"https://doi.org/10.1109/TAI.2025.3558183","url":null,"abstract":"In today’s world, online services have revolutionized human activities and thus the consumers expect their service providers to make their online experiences more fruitful by recommending the relevant services to them. In this case, it becomes really challenging for the service providers to provide recommendation to a user whose information’s and preferences are unavailable. This issue is handled by cross-domain approach, which explores similar users across various domains in the same platform. However, the main concern with this cross-domain approach is that the information needs to be available in any domain of one platform. Thus, a multidomain recommendation is designed to optimize the recommendation system performance by analyzing the information obtained from multiple platforms. However, existing multidomain recommendation model has mainly two challenges. First, there are no overlapping users to understand the similarities between them. Second, the transfer learning approach in multidomain allows the transfer of information from only the source to the target domain. Therefore, our proposed approach consider the parallel inductive shift learning (PISL) model to address these two above-mentioned challenges. For the first challenge, we have focused to identify the similarities between user–user and user–item by considering various features of user and item. For the next challenge, our proposed model analyzes the source and the target domain simultaneously and thus does a parallel transfer of information from the source to the target domain and vice versa. We have tested our model for three real-life movie and book datasets i.e. for the movie dataset we have used Movielens, Amazon, and Netflix datasets. In contrast, for the book dataset, we have used the Amazon, Good Reads, and Book Crossing dataset, which proves to outperform the other state-of-the-art approaches.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 11","pages":"2953-2965"},"PeriodicalIF":0.0,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145428951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE transactions on artificial intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1