首页 > 最新文献

IEEE transactions on artificial intelligence最新文献

英文 中文
Stabilizing Diffusion Model for Robotic Control With Dynamic Programming and Transition Feasibility 采用动态编程和过渡可行性的机器人控制稳定扩散模型
Pub Date : 2024-04-10 DOI: 10.1109/TAI.2024.3387401
Haoran Li;Yaocheng Zhang;Haowei Wen;Yuanheng Zhu;Dongbin Zhao
Due to its strong ability in distribution representation, the diffusion model has been incorporated into offline reinforcement learning (RL) to cover diverse trajectories of the complex behavior policy. However, this also causes several challenges. Training the diffusion model to imitate behavior from the collected trajectories suffers from limited stitching capability which derives better policies from suboptimal trajectories. Furthermore, the inherent randomness of the diffusion model can lead to unpredictable control and dangerous behavior for the robot. To address these concerns, we propose the value-learning-based decision diffuser (V-DD), which consists of the trajectory diffusion module (TDM) and the trajectory evaluation module (TEM). During the training process, the TDM combines the state-value and classifier-free guidance to bolster the ability to stitch suboptimal trajectories. During the inference process, we design the TEM to select a feasible trajectory generated by the diffusion model. Empirical results demonstrate that our method delivers competitive results on the D4RL benchmark and substantially outperforms current diffusion model-based methods on the real-world robot task.
由于扩散模型在分布表示方面的强大能力,它已被纳入离线强化学习(RL),以覆盖复杂行为政策的各种轨迹。然而,这也带来了一些挑战。从收集到的轨迹中训练扩散模型来模仿行为,会受到拼接能力的限制,从而从次优轨迹中得出更好的策略。此外,扩散模型固有的随机性可能会导致机器人无法预测的控制和危险行为。为了解决这些问题,我们提出了基于价值学习的决策扩散器(V-DD),它由轨迹扩散模块(TDM)和轨迹评估模块(TEM)组成。在训练过程中,TDM 结合了状态值和无分类器指导,以提高缝合次优轨迹的能力。在推理过程中,我们设计 TEM 来选择由扩散模型生成的可行轨迹。实证结果表明,我们的方法在 D4RL 基准测试中取得了具有竞争力的结果,并且在实际机器人任务中大大优于当前基于扩散模型的方法。
{"title":"Stabilizing Diffusion Model for Robotic Control With Dynamic Programming and Transition Feasibility","authors":"Haoran Li;Yaocheng Zhang;Haowei Wen;Yuanheng Zhu;Dongbin Zhao","doi":"10.1109/TAI.2024.3387401","DOIUrl":"https://doi.org/10.1109/TAI.2024.3387401","url":null,"abstract":"Due to its strong ability in distribution representation, the diffusion model has been incorporated into offline reinforcement learning (RL) to cover diverse trajectories of the complex behavior policy. However, this also causes several challenges. Training the diffusion model to imitate behavior from the collected trajectories suffers from limited stitching capability which derives better policies from suboptimal trajectories. Furthermore, the inherent randomness of the diffusion model can lead to unpredictable control and dangerous behavior for the robot. To address these concerns, we propose the value-learning-based decision diffuser (V-DD), which consists of the trajectory diffusion module (TDM) and the trajectory evaluation module (TEM). During the training process, the TDM combines the state-value and classifier-free guidance to bolster the ability to stitch suboptimal trajectories. During the inference process, we design the TEM to select a feasible trajectory generated by the diffusion model. Empirical results demonstrate that our method delivers competitive results on the D4RL benchmark and substantially outperforms current diffusion model-based methods on the real-world robot task.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Counterfactual Explanation of Graph Neural Networks via Generative Flow Network 通过生成流网络学习图神经网络的反事实解释
Pub Date : 2024-04-10 DOI: 10.1109/TAI.2024.3387406
Kangjia He;Li Liu;Youmin Zhang;Ye Wang;Qun Liu;Guoyin Wang
Counterfactual subgraphs explain graph neural networks (GNNs) by answering the question: “How would the prediction change if a certain subgraph were absent in the input instance?” The differentiable proxy adjacency matrix is prevalent in current counterfactual subgraph discovery studies due to its ability to avoid exhaustive edge searching. However, a prediction gap exists when feeding the proxy matrix with continuous values and the thresholded discrete adjacency matrix to GNNs, compromising the optimization of the subgraph generator. Furthermore, the end-to-end learning schema adopted in the subgraph generator limits the diversity of counterfactual subgraphs. To this end, we propose CF-GFNExplainer, a flow-based approach for learning counterfactual subgraphs. CF-GFNExplainer employs a policy network with a discrete edge removal schema to construct counterfactual subgraph generation trajectories. Additionally, we introduce a loss function designed to guide CF-GFNExplainer's optimization. The discrete adjacency matrix generated in each trajectory eliminates the prediction gap, enhancing the validity of the learned subgraphs. Furthermore, the multitrajectories sampling strategy adopted in CF-GFNExplainer results in diverse counterfactual subgraphs. Extensive experiments conducted on synthetic and real-world datasets demonstrate the effectiveness of the proposed method in terms of validity and diversity.
反事实子图通过回答以下问题来解释图神经网络(GNN):"如果输入实例中不存在某个子图,预测结果会发生怎样的变化?由于可微分代理邻接矩阵能够避免穷举式边缘搜索,因此在当前的反事实子图发现研究中非常普遍。然而,在将连续值的代理矩阵和阈值化的离散邻接矩阵输入 GNN 时,会出现预测差距,从而影响子图生成器的优化。此外,子图生成器采用的端到端学习模式限制了反事实子图的多样性。为此,我们提出了基于流的反事实子图学习方法 CF-GFNExplainer。CF-GFNExplainer 采用具有离散边缘移除模式的策略网络来构建反事实子图生成轨迹。此外,我们还引入了一个损失函数,旨在指导 CF-GFNExplainer 进行优化。在每个轨迹中生成的离散邻接矩阵消除了预测差距,增强了所学子图的有效性。此外,CF-GFNExplainer 采用的多轨迹采样策略还能生成多样化的反事实子图。在合成和真实世界数据集上进行的大量实验证明了所提方法在有效性和多样性方面的有效性。
{"title":"Learning Counterfactual Explanation of Graph Neural Networks via Generative Flow Network","authors":"Kangjia He;Li Liu;Youmin Zhang;Ye Wang;Qun Liu;Guoyin Wang","doi":"10.1109/TAI.2024.3387406","DOIUrl":"https://doi.org/10.1109/TAI.2024.3387406","url":null,"abstract":"Counterfactual subgraphs explain graph neural networks (GNNs) by answering the question: “How would the prediction change if a certain subgraph were absent in the input instance?” The differentiable proxy adjacency matrix is prevalent in current counterfactual subgraph discovery studies due to its ability to avoid exhaustive edge searching. However, a prediction gap exists when feeding the proxy matrix with continuous values and the thresholded discrete adjacency matrix to GNNs, compromising the optimization of the subgraph generator. Furthermore, the end-to-end learning schema adopted in the subgraph generator limits the diversity of counterfactual subgraphs. To this end, we propose CF-GFNExplainer, a flow-based approach for learning counterfactual subgraphs. CF-GFNExplainer employs a policy network with a discrete edge removal schema to construct counterfactual subgraph generation trajectories. Additionally, we introduce a loss function designed to guide CF-GFNExplainer's optimization. The discrete adjacency matrix generated in each trajectory eliminates the prediction gap, enhancing the validity of the learned subgraphs. Furthermore, the multitrajectories sampling strategy adopted in CF-GFNExplainer results in diverse counterfactual subgraphs. Extensive experiments conducted on synthetic and real-world datasets demonstrate the effectiveness of the proposed method in terms of validity and diversity.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142169597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incomplete Graph Learning via Partial Graph Convolutional Network 通过部分图卷积网络进行不完整图学习
Pub Date : 2024-04-10 DOI: 10.1109/TAI.2024.3386499
Ziyan Zhang;Bo Jiang;Jin Tang;Jinhui Tang;Bin Luo
Graph convolutional networks (GCNs) gain increasing attention on graph data learning tasks in recent years. However, in many applications, graph may come with an incomplete form where attributes of graph nodes are partially unknown/missing. Existing graph convolutions (GCs) are generally designed on complete graphs which cannot deal with attribute-incomplete graph data directly. To address this problem, in this article, we extend standard GC and develop an explicit Partial Graph Convolution (PaGC) for attribute-incomplete graph data. Our PaGC is derived based on the observation that the core neighborhood aggregator in GC operation can be equivalently viewed as an energy minimization model. Based on it, we can define a novel partial aggregation function and derive PaGC for incomplete graph data. Experiments demonstrate the effectiveness and efficiency of the proposed PaGCN.
近年来,图卷积网络(GCN)在图数据学习任务中越来越受到关注。然而,在许多应用中,图可能是不完整的,图节点的属性部分未知或缺失。现有的图卷积(GC)一般是针对完整图设计的,无法直接处理属性不完整的图数据。为了解决这个问题,我们在本文中扩展了标准图卷积,并开发了一种用于属性不完整图数据的显式部分图卷积(PaGC)。我们的 PaGC 是在观察到 GC 操作中的核心邻域聚合器可以等同于能量最小化模型的基础上推导出来的。在此基础上,我们可以定义一个新颖的部分聚合函数,并推导出适用于不完整图数据的 PaGC。实验证明了所提出的 PaGCN 的有效性和效率。
{"title":"Incomplete Graph Learning via Partial Graph Convolutional Network","authors":"Ziyan Zhang;Bo Jiang;Jin Tang;Jinhui Tang;Bin Luo","doi":"10.1109/TAI.2024.3386499","DOIUrl":"https://doi.org/10.1109/TAI.2024.3386499","url":null,"abstract":"Graph convolutional networks (GCNs) gain increasing attention on graph data learning tasks in recent years. However, in many applications, graph may come with an incomplete form where attributes of graph nodes are partially unknown/missing. Existing graph convolutions (GCs) are generally designed on complete graphs which cannot deal with attribute-incomplete graph data directly. To address this problem, in this article, we extend standard GC and develop an explicit Partial Graph Convolution (PaGC) for attribute-incomplete graph data. Our PaGC is derived based on the observation that the core neighborhood aggregator in GC operation can be equivalently viewed as an energy minimization model. Based on it, we can define a novel \u0000<italic>partial aggregation function</i>\u0000 and derive PaGC for incomplete graph data. Experiments demonstrate the effectiveness and efficiency of the proposed PaGCN.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Distributed Conditional Wasserstein Deep Convolutional Relativistic Loss Generative Adversarial Network With Improved Convergence 改进收敛性的分布式条件瓦瑟斯坦深度卷积相对损失生成对抗网络
Pub Date : 2024-04-09 DOI: 10.1109/TAI.2024.3386500
Arunava Roy;Dipankar Dasgupta
Generative adversarial networks (GANs) excel in diverse applications such as image enhancement, manipulation, and generating images and videos from text. Yet, training GANs with large datasets remains computationally intensive for standalone systems. Synchronization issues between the generator and discriminator lead to unstable training, poor convergence, vanishing, and exploding gradient challenges. In decentralized environments, standalone GANs struggle with distributed data on client machines. Researchers have turned to federated learning (FL) for distributed-GAN (D-GAN) implementations, but efforts often fall short due to training instability and poor synchronization within GAN components. In this study, we present DRL-GAN, a lightweight Wasserstein conditional distributed relativistic loss-GAN designed to overcome existing limitations. DRL-GAN ensures training stability in the face of nonconvex losses by employing a single global generator on the central server and a discriminator per client. Utilizing Wasserstein-1 for relativistic loss computation between real and fake samples, DRL-GAN effectively addresses issues, such as mode collapses, vanishing, and exploding gradients, accommodating both iid and non-iid private data in clients and fostering strong convergence. The absence of a robust conditional distributed-GAN model serves as another motivation for this work. We provide a comprehensive mathematical formulation of DRL-GAN and validate our claims empirically on CIFAR-10, MNIST, EuroSAT, and LSUN-Bedroom datasets.
生成式对抗网络(GAN)在图像增强、处理以及根据文本生成图像和视频等多种应用中表现出色。然而,对于独立系统而言,使用大型数据集训练生成式对抗网络仍然是一项计算密集型工作。生成器和判别器之间的同步问题会导致训练不稳定、收敛性差、消失和梯度爆炸等难题。在分散的环境中,独立的 GANs 难以处理客户端机器上的分布式数据。研究人员已将联合学习(FL)用于分布式 GAN(D-GAN)的实现,但由于 GAN 组件内的训练不稳定和同步性差,这些努力往往无法奏效。在本研究中,我们介绍了 DRL-GAN,它是一种轻量级的 Wasserstein 条件分布式相对论损失-GAN,旨在克服现有的局限性。DRL-GAN 通过在中央服务器上采用单个全局发生器和每个客户端采用一个判别器,确保了面对非凸损失时的训练稳定性。DRL-GAN 利用 Wasserstein-1 在真实样本和虚假样本之间进行相对损失计算,有效解决了模式坍塌、消失和梯度爆炸等问题,同时兼顾了客户机中的 iid 和非 iid 私有数据,并促进了强大的收敛性。缺乏稳健的条件分布式广义网络模型是这项工作的另一个动机。我们提供了 DRL-GAN 的全面数学表述,并在 CIFAR-10、MNIST、EuroSAT 和 LSUN-Bedroom 数据集上验证了我们的主张。
{"title":"A Distributed Conditional Wasserstein Deep Convolutional Relativistic Loss Generative Adversarial Network With Improved Convergence","authors":"Arunava Roy;Dipankar Dasgupta","doi":"10.1109/TAI.2024.3386500","DOIUrl":"https://doi.org/10.1109/TAI.2024.3386500","url":null,"abstract":"Generative adversarial networks (GANs) excel in diverse applications such as image enhancement, manipulation, and generating images and videos from text. Yet, training GANs with large datasets remains computationally intensive for standalone systems. Synchronization issues between the generator and discriminator lead to unstable training, poor convergence, vanishing, and exploding gradient challenges. In decentralized environments, standalone GANs struggle with distributed data on client machines. Researchers have turned to federated learning (FL) for distributed-GAN (D-GAN) implementations, but efforts often fall short due to training instability and poor synchronization within GAN components. In this study, we present DRL-GAN, a lightweight Wasserstein conditional distributed relativistic loss-GAN designed to overcome existing limitations. DRL-GAN ensures training stability in the face of nonconvex losses by employing a single global generator on the central server and a discriminator per client. Utilizing Wasserstein-1 for relativistic loss computation between real and fake samples, DRL-GAN effectively addresses issues, such as mode collapses, vanishing, and exploding gradients, accommodating both iid and non-iid private data in clients and fostering strong convergence. The absence of a robust conditional distributed-GAN model serves as another motivation for this work. We provide a comprehensive mathematical formulation of DRL-GAN and validate our claims empirically on CIFAR-10, MNIST, EuroSAT, and LSUN-Bedroom datasets.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Artificial Intelligence Publication Information IEEE Transactions on Artificial Intelligence 出版信息
Pub Date : 2024-04-09 DOI: 10.1109/TAI.2024.3382433
{"title":"IEEE Transactions on Artificial Intelligence Publication Information","authors":"","doi":"10.1109/TAI.2024.3382433","DOIUrl":"https://doi.org/10.1109/TAI.2024.3382433","url":null,"abstract":"","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10495742","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140540952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Applicable Shadow Resistant Neural Network Model for High-Efficiency Grid-Level Pavement Crack Detection 用于高效网格级路面裂缝检测的新型适用抗阴影神经网络模型
Pub Date : 2024-04-08 DOI: 10.1109/TAI.2024.3386149
Handuo Yang;Ju Huyan;Tao Ma;Yitao Song;Chengjia Han
To address two key challenges—limited grid-level detection capability and difficulty in detecting pavement cracks in complex environments, this study proposes a novel neural network model called CrackcellNet. This innovative model incorporates an output structure that enables end-to-end grid recognition and a module that enhances shadow image data to enhance crack detection. The model relies on the design of consecutive pooling layers to achieve adaptive target size grid output. By utilizing image fusion techniques, it enhances the quantity of shadow data in road surface detection. The results of ablation experiments indicate that the optimal configuration for CrackcellNet includes V-block and shadow augmentation operations, dilation rates of 1 or 2, and a convolutional layer in the CBA module. Through extensive experimentation, we have demonstrated that our model achieved an accuracy rate of 94.5% for grid-level crack detection and a F1 value of 0.839. Furthermore, practical engineering validation confirms the model's efficacy with an average PCIe of 0.045, providing valuable guidance for road maintenance decisions.
为了解决网格级检测能力有限和复杂环境下路面裂缝检测困难这两大难题,本研究提出了一种名为 CrackcellNet 的新型神经网络模型。这一创新模型包含一个可实现端到端网格识别的输出结构和一个可增强阴影图像数据以提高裂缝检测能力的模块。该模型依靠连续池化层的设计来实现自适应目标尺寸网格输出。通过利用图像融合技术,该模型增强了路面检测中阴影数据的数量。烧蚀实验结果表明,CrackcellNet 的最佳配置包括 V 块和阴影增强操作、1 或 2 的扩张率以及 CBA 模块中的卷积层。通过大量实验,我们证明了我们的模型在网格级裂纹检测方面达到了 94.5% 的准确率和 0.839 的 F1 值。此外,实际工程验证也证实了该模型的有效性,其平均 PCIe 为 0.045,为道路维护决策提供了宝贵的指导。
{"title":"A Novel Applicable Shadow Resistant Neural Network Model for High-Efficiency Grid-Level Pavement Crack Detection","authors":"Handuo Yang;Ju Huyan;Tao Ma;Yitao Song;Chengjia Han","doi":"10.1109/TAI.2024.3386149","DOIUrl":"https://doi.org/10.1109/TAI.2024.3386149","url":null,"abstract":"To address two key challenges—limited grid-level detection capability and difficulty in detecting pavement cracks in complex environments, this study proposes a novel neural network model called CrackcellNet. This innovative model incorporates an output structure that enables end-to-end grid recognition and a module that enhances shadow image data to enhance crack detection. The model relies on the design of consecutive pooling layers to achieve adaptive target size grid output. By utilizing image fusion techniques, it enhances the quantity of shadow data in road surface detection. The results of ablation experiments indicate that the optimal configuration for CrackcellNet includes V-block and shadow augmentation operations, dilation rates of 1 or 2, and a convolutional layer in the CBA module. Through extensive experimentation, we have demonstrated that our model achieved an accuracy rate of 94.5% for grid-level crack detection and a F1 value of 0.839. Furthermore, practical engineering validation confirms the model's efficacy with an average PCIe of 0.045, providing valuable guidance for road maintenance decisions.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prefetching-based Multiproposal Markov Chain Monte Carlo Algorithm 基于预取的多提案马尔可夫链蒙特卡洛算法
Pub Date : 2024-04-05 DOI: 10.1109/TAI.2024.3385384
Guifeng Ye;Shaowen Lu
Our proposed algorithm is a prefetching-based multiproposal Markov Chain Monte Carlo (PMP-MCMC) method that efficiently explores the target distribution by combining multiple proposals with the concept of prefetching. In our method, not all proposals are directly derived from the current state; some are derived from future states. This approach breaks through the inherent sequential characteristics of traditional MCMC algorithms. Compared with single-proposal and multiproposal methods, our approach speeds up by $K$ times and the burn-in period is reduced by a factor of $1/text{log}_{2}K$ maximally, where $K$ is the number of parallel computational units or processing cores. Compared with prefetching method, our method has increased the number of samples per iteration by a factor of $K/text{log}_{2}K$. Furthermore, the proposed method is general and can be integrated into MCMC variants such as Hamiltonian Monte Carlo (HMC). We have also applied this method to optimize the model parameters of neural networks and Bayesian inference and observed significant improvements in optimization performance.
我们提出的算法是一种基于预取的多方案马尔可夫链蒙特卡罗(PMP-MCMC)方法,通过将多个方案与预取概念相结合,有效地探索目标分布。在我们的方法中,并非所有建议都直接来自当前状态;有些建议来自未来状态。这种方法突破了传统 MCMC 算法固有的顺序特性。与单提案法和多提案法相比,我们的方法速度提高了 $K$ 倍,烧入期最大缩短了 1/text{log}_{2}K$ 倍,其中 $K$ 是并行计算单元或处理核心的数量。与预取方法相比,我们的方法将每次迭代的样本数量增加了 $K/text{log}_{2}K$。此外,我们提出的方法具有通用性,可以集成到 MCMC 变体中,如汉密尔顿蒙特卡罗(HMC)。我们还将这种方法应用于优化神经网络和贝叶斯推理的模型参数,并观察到优化性能的显著提高。
{"title":"Prefetching-based Multiproposal Markov Chain Monte Carlo Algorithm","authors":"Guifeng Ye;Shaowen Lu","doi":"10.1109/TAI.2024.3385384","DOIUrl":"https://doi.org/10.1109/TAI.2024.3385384","url":null,"abstract":"Our proposed algorithm is a prefetching-based multiproposal Markov Chain Monte Carlo (PMP-MCMC) method that efficiently explores the target distribution by combining multiple proposals with the concept of prefetching. In our method, not all proposals are directly derived from the current state; some are derived from future states. This approach breaks through the inherent sequential characteristics of traditional MCMC algorithms. Compared with single-proposal and multiproposal methods, our approach speeds up by \u0000<inline-formula><tex-math>$K$</tex-math></inline-formula>\u0000 times and the burn-in period is reduced by a factor of \u0000<inline-formula><tex-math>$1/text{log}_{2}K$</tex-math></inline-formula>\u0000 maximally, where \u0000<inline-formula><tex-math>$K$</tex-math></inline-formula>\u0000 is the number of parallel computational units or processing cores. Compared with prefetching method, our method has increased the number of samples per iteration by a factor of \u0000<inline-formula><tex-math>$K/text{log}_{2}K$</tex-math></inline-formula>\u0000. Furthermore, the proposed method is general and can be integrated into MCMC variants such as Hamiltonian Monte Carlo (HMC). We have also applied this method to optimize the model parameters of neural networks and Bayesian inference and observed significant improvements in optimization performance.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Retain and Adapt: Online Sequential EEG Classification With Subject Shift 保留和适应:带有受试者偏移的在线顺序脑电图分类
Pub Date : 2024-04-05 DOI: 10.1109/TAI.2024.3385390
Tiehang Duan;Zhenyi Wang;Li Shen;Gianfranco Doretto;Donald A. Adjeroh;Fang Li;Cui Tao
Large variance exists in Electroencephalogram (EEG) signals with its pattern differing significantly across subjects. It is a challenging problem to perform online sequential decoding of EEG signals across different subjects, where a sequence of subjects arrive in temporal order and no signal data is jointly available beforehand. The challenges include the following two aspects: 1) the knowledge learned from previous subjects does not readily fit to future subjects, and fast adaptation is needed in the process; and 2) the EEG classifier could drastically erase information of learnt subjects as learning progresses, namely catastrophic forgetting. Most existing EEG decoding explorations use sizable data for pretraining purposes, and to the best of our knowledge we are the first to tackle this challenging online sequential decoding setting. In this work, we propose a unified bi-level meta-learning framework that enables the EEG decoder to simultaneously perform fast adaptation on future subjects and retain knowledge of previous subjects. In addition, we extend to the more general subject-agnostic scenario and propose a subject shift detection algorithm for situations that subject identity and the occurrence of subject shifts are unknown. We conducted experiments on three public EEG datasets for both subject-aware and subject-agnostic scenarios. The proposed method demonstrates its effectiveness in most of the ablation settings, e.g. an improvement of 5.73% for forgetting mitigation and 3.50% for forward adaptation on SEED dataset for subject agnostic scenarios.
脑电图(EEG)信号存在很大差异,不同受试者的脑电图模式也大不相同。要对不同受试者的脑电信号进行在线顺序解码是一个极具挑战性的问题,因为受试者会按时间顺序依次到达,而事先并没有共同的信号数据。挑战包括以下两个方面:1) 从以前的受试者身上学到的知识并不容易适用于未来的受试者,因此在这一过程中需要快速适应;以及 2) 随着学习的进行,脑电图分类器可能会大幅删除所学受试者的信息,即灾难性遗忘。现有的脑电解码探索大多使用大量数据进行预训练,据我们所知,我们是第一个解决这种具有挑战性的在线顺序解码设置的人。在这项工作中,我们提出了一个统一的双层元学习框架,使脑电解码器能够同时对未来的研究对象进行快速适应,并保留以前研究对象的知识。此外,我们还将其扩展到更一般的主体不可知场景,并针对主体身份和主体偏移发生情况未知的情况提出了主体偏移检测算法。我们在三个公共脑电图数据集上进行了主体感知和主体无关场景的实验。所提出的方法在大多数消融设置中都证明了其有效性,例如,在不考虑主体的情况下,SEED 数据集的遗忘缓解率提高了 5.73%,前向适配率提高了 3.50%。
{"title":"Retain and Adapt: Online Sequential EEG Classification With Subject Shift","authors":"Tiehang Duan;Zhenyi Wang;Li Shen;Gianfranco Doretto;Donald A. Adjeroh;Fang Li;Cui Tao","doi":"10.1109/TAI.2024.3385390","DOIUrl":"https://doi.org/10.1109/TAI.2024.3385390","url":null,"abstract":"Large variance exists in Electroencephalogram (EEG) signals with its pattern differing significantly across subjects. It is a challenging problem to perform online sequential decoding of EEG signals across different subjects, where a sequence of subjects arrive in temporal order and no signal data is jointly available beforehand. The challenges include the following two aspects: 1) the knowledge learned from previous subjects does not readily fit to future subjects, and fast adaptation is needed in the process; and 2) the EEG classifier could drastically erase information of learnt subjects as learning progresses, namely catastrophic forgetting. Most existing EEG decoding explorations use sizable data for pretraining purposes, and to the best of our knowledge we are the first to tackle this challenging online sequential decoding setting. In this work, we propose a unified bi-level meta-learning framework that enables the EEG decoder to simultaneously perform fast adaptation on future subjects and retain knowledge of previous subjects. In addition, we extend to the more general subject-agnostic scenario and propose a subject shift detection algorithm for situations that subject identity and the occurrence of subject shifts are unknown. We conducted experiments on three public EEG datasets for both subject-aware and subject-agnostic scenarios. The proposed method demonstrates its effectiveness in most of the ablation settings, e.g. an improvement of 5.73% for forgetting mitigation and 3.50% for forward adaptation on SEED dataset for subject agnostic scenarios.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142169671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Shuffled Grouping Cross-Channel Attention-Based Bilateral-Filter-Interpolation Deformable ConvNet With Applications to Benthonic Organism Detection 基于洗牌分组跨信道注意力的双边滤波插值变形 ConvNet 在底栖生物检测中的应用
Pub Date : 2024-04-05 DOI: 10.1109/TAI.2024.3385387
Tingkai Chen;Ning Wang
In this article, to holistically tackle underwater detection degradation due to unknown geometric variation arising from scale, pose, viewpoint, and occlusion under low-contrast and color-distortion circumstances, a shuffled grouping cross-channel attention-based bilateral-filter-interpolation deformable ConvNet (SGCA-BDC) framework is established for benthonic organism detection (BOD). Main contributions are as follows: 1) By comprehensively considering spatial and feature similarities between offset and integral coordinate positions, the BDC with modulation weight mechanism is created, such that sampling ability of convolutional kernel for BO with unknown geometric variation can be adaptively augmented from spatial perspective; 2) By utilizing 1-D convolution to recalibrate channel weight for grouped subfeature via information entropy statistic technique, an SGCA module is innovated, such that seabed background noise can be suppressed from channel aspect; 3) The proposed SGCA-BDC scheme is eventually built in an organic manner by incorporating BDC and SGCA modules. Comprehensive experiments and comparisons demonstrate that the SGCA-BDC scheme remarkably outperforms typical detection approaches including Faster RCNN, SSD, YOLOv6, YOLOv7, YOLOv8, RetinaNet, and CenterNet in terms of mean average precision by 8.54%, 4.4%, 5.18%, 3.1%, 3.01%, 12.53%, and 7.09%, respectively.
本文从整体上解决了在低对比度和色彩失真情况下,由于尺度、姿态、视角和遮挡等未知几何变化引起的水下检测退化问题,建立了一种基于洗牌分组跨信道注意力的双边滤波插值可变形 ConvNet(SGCA-BDC)框架,用于底栖生物检测(BOD)。主要贡献如下1) 通过综合考虑偏移和积分坐标位置之间的空间和特征相似性,创建了具有调制权重机制的 BDC,从而可以从空间角度自适应地增强卷积核对未知几何变化的 BO 的采样能力;2) 利用一维卷积,通过信息熵统计技术重新校准分组子特征的信道权重,创新出 SGCA 模块,从而从信道方面抑制海底背景噪声; 3) 将 BDC 和 SGCA 模块有机结合,最终构建出 SGCA-BDC 方案。综合实验和比较表明,SGCA-BDC 方案的平均精度分别为 8.54%、4.4%、5.18%、3.1%、3.01%、12.53% 和 7.09%,明显优于 Faster RCNN、SSD、YOLOv6、YOLOv7、YOLOv8、RetinaNet 和 CenterNet 等典型检测方法。
{"title":"Shuffled Grouping Cross-Channel Attention-Based Bilateral-Filter-Interpolation Deformable ConvNet With Applications to Benthonic Organism Detection","authors":"Tingkai Chen;Ning Wang","doi":"10.1109/TAI.2024.3385387","DOIUrl":"https://doi.org/10.1109/TAI.2024.3385387","url":null,"abstract":"In this article, to holistically tackle underwater detection degradation due to unknown geometric variation arising from scale, pose, viewpoint, and occlusion under low-contrast and color-distortion circumstances, a shuffled grouping cross-channel attention-based bilateral-filter-interpolation deformable ConvNet (SGCA-BDC) framework is established for benthonic organism detection (BOD). Main contributions are as follows: 1) By comprehensively considering spatial and feature similarities between offset and integral coordinate positions, the BDC with modulation weight mechanism is created, such that sampling ability of convolutional kernel for BO with unknown geometric variation can be adaptively augmented from spatial perspective; 2) By utilizing 1-D convolution to recalibrate channel weight for grouped subfeature via information entropy statistic technique, an SGCA module is innovated, such that seabed background noise can be suppressed from channel aspect; 3) The proposed SGCA-BDC scheme is eventually built in an organic manner by incorporating BDC and SGCA modules. Comprehensive experiments and comparisons demonstrate that the SGCA-BDC scheme remarkably outperforms typical detection approaches including Faster RCNN, SSD, YOLOv6, YOLOv7, YOLOv8, RetinaNet, and CenterNet in terms of mean average precision by 8.54%, 4.4%, 5.18%, 3.1%, 3.01%, 12.53%, and 7.09%, respectively.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Unbiased Fuzzy Weighted Relative Error Support Vector Machine for Reverse Prediction of Concrete Components 用于反向预测混凝土构件的无偏模糊加权相对误差支持向量机
Pub Date : 2024-04-05 DOI: 10.1109/TAI.2024.3385386
Zongwen Fan;Jin Gou;Shaoyuan Weng
Concrete is a vital component in modern construction, prized for its strength, durability, and versatility. Accurately determining the quantities of concrete components is crucial in civil engineering applications to optimize resources (e.g., manpower and financial resources). In this article, we propose an unbiased fuzzy-weighted relative error support vector machine (UFW-RE-SVM) for reverse prediction of concrete components. First, we add an unbiased term to the target function of UFW-RE-SVM for obtaining an unbiased model. Second, we design a fuzzy-weighted operation to indicate sample importance by incorporating the fuzzy membership values into the UFW-RE-SVM. The $n$th root operation is introduced to address the exponential explosion issue in the fuzzy-weighted operation. Finally, considering the UFW-RE-SVM is sensitive to its hyperparameters for multioutput prediction, the whale optimization algorithm (WOA) is utilized for hyperparameter optimization for its effectiveness in optimization tasks. We design the fitness function based on the results from multiple components to balance the performance of multioutput predictions. Experimental results show that the performance of our proposed model outperforms existing works in predicting concrete components in terms of mean absolute relative error, standard deviation, and root mean square error. Further, the statistical test shows the WOA and two other metaheuristics can significantly improve the prediction performance. This indicates that the unbiased term, fuzzy-weighted operation, and WOA are effective for improving the proposed model for reverse prediction concrete components. With these promising results, the proposed model could provide decision-makers with a valuable tool for determining concrete component quantities based on desired concrete qualities.
混凝土是现代建筑的重要组成部分,因其强度、耐久性和多功能性而备受推崇。在土木工程应用中,准确确定混凝土构件的数量对于优化资源(如人力和财力)至关重要。在本文中,我们提出了一种用于反向预测混凝土构件的无偏模糊加权相对误差支持向量机(UFW-RE-SVM)。首先,我们在 UFW-RE-SVM 的目标函数中添加了一个无偏项,以获得一个无偏模型。其次,我们设计了一种模糊加权运算,通过将模糊成员值纳入 UFW-RE-SVM 来表示样本的重要性。为了解决模糊加权运算中的指数爆炸问题,我们引入了 $n$th 根运算。最后,考虑到 UFW-RE-SVM 对多输出预测的超参数很敏感,我们利用鲸鱼优化算法(WOA)进行超参数优化,以提高其在优化任务中的有效性。我们根据多个组件的结果设计拟合函数,以平衡多输出预测的性能。实验结果表明,在平均绝对相对误差、标准偏差和均方根误差方面,我们提出的模型在预测具体组件方面的性能优于现有的工作。此外,统计测试表明,WOA 和其他两种元启发式方法可以显著提高预测性能。这表明,无偏项、模糊加权运算和 WOA 对于改进反向预测混凝土构件的拟议模型是有效的。有了这些可喜的结果,所提出的模型可以为决策者提供一个有价值的工具,帮助他们根据所需的混凝土质量确定混凝土成分的数量。
{"title":"An Unbiased Fuzzy Weighted Relative Error Support Vector Machine for Reverse Prediction of Concrete Components","authors":"Zongwen Fan;Jin Gou;Shaoyuan Weng","doi":"10.1109/TAI.2024.3385386","DOIUrl":"https://doi.org/10.1109/TAI.2024.3385386","url":null,"abstract":"Concrete is a vital component in modern construction, prized for its strength, durability, and versatility. Accurately determining the quantities of concrete components is crucial in civil engineering applications to optimize resources (e.g., manpower and financial resources). In this article, we propose an unbiased fuzzy-weighted relative error support vector machine (UFW-RE-SVM) for reverse prediction of concrete components. First, we add an unbiased term to the target function of UFW-RE-SVM for obtaining an unbiased model. Second, we design a fuzzy-weighted operation to indicate sample importance by incorporating the fuzzy membership values into the UFW-RE-SVM. The \u0000<inline-formula><tex-math>$n$</tex-math></inline-formula>\u0000th root operation is introduced to address the exponential explosion issue in the fuzzy-weighted operation. Finally, considering the UFW-RE-SVM is sensitive to its hyperparameters for multioutput prediction, the whale optimization algorithm (WOA) is utilized for hyperparameter optimization for its effectiveness in optimization tasks. We design the fitness function based on the results from multiple components to balance the performance of multioutput predictions. Experimental results show that the performance of our proposed model outperforms existing works in predicting concrete components in terms of mean absolute relative error, standard deviation, and root mean square error. Further, the statistical test shows the WOA and two other metaheuristics can significantly improve the prediction performance. This indicates that the unbiased term, fuzzy-weighted operation, and WOA are effective for improving the proposed model for reverse prediction concrete components. With these promising results, the proposed model could provide decision-makers with a valuable tool for determining concrete component quantities based on desired concrete qualities.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142165088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE transactions on artificial intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1