首页 > 最新文献

Neural Processing Letters最新文献

英文 中文
GPINN with Neural Tangent Kernel Technique for Nonlinear Two Point Boundary Value Problems 针对非线性两点边值问题的 GPINN 与神经切线核技术
IF 3.1 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-31 DOI: 10.1007/s11063-024-11644-7
Navnit Jha, Ekansh Mallik

Neural networks as differential equation solvers are a good choice of numerical technique because of their fast solutions and their nature in tackling some classical problems which traditional numerical solvers faced. In this article, we look at the famous gradient descent optimization technique, which trains the network by updating parameters which minimizes the loss function. We look at the theoretical part of gradient descent to understand why the network works great for some terms of the loss function and not so much for other terms. The loss function considered here is built in such a way that it incorporates the differential equation as well as the derivative of the differential equation. The fully connected feed-forward network is designed in such a way that, without training at boundary points, it automatically satisfies the boundary conditions. The neural tangent kernel for gradient enhanced physics informed neural networks is examined in this work, and we demonstrate how it may be used to generate a closed-form expression for the kernel function. We also provide numerical experiments demonstrating the effectiveness of the new approach for several two point boundary value problems. Our results suggest that the neural tangent kernel based approach can significantly improve the computational accuracy of the gradient enhanced physics informed neural network while reducing the computational cost of training these models.

作为微分方程求解器的神经网络是一种很好的数值技术选择,因为其求解速度快,而且能解决传统数值求解器面临的一些经典问题。在本文中,我们将探讨著名的梯度下降优化技术,该技术通过更新参数来训练网络,从而使损失函数最小化。我们将研究梯度下降的理论部分,以了解为什么该网络对损失函数的某些项效果很好,而对其他项效果不佳。这里考虑的损失函数的构建方式包含了微分方程以及微分方程的导数。全连接前馈网络的设计方式是,无需在边界点进行训练,它就能自动满足边界条件。本研究对梯度增强物理信息神经网络的神经正切核进行了研究,并演示了如何利用它生成核函数的闭式表达式。我们还提供了数值实验,证明了新方法对若干两点边界值问题的有效性。我们的结果表明,基于神经正切核的方法可以显著提高梯度增强物理信息神经网络的计算精度,同时降低训练这些模型的计算成本。
{"title":"GPINN with Neural Tangent Kernel Technique for Nonlinear Two Point Boundary Value Problems","authors":"Navnit Jha, Ekansh Mallik","doi":"10.1007/s11063-024-11644-7","DOIUrl":"https://doi.org/10.1007/s11063-024-11644-7","url":null,"abstract":"<p>Neural networks as differential equation solvers are a good choice of numerical technique because of their fast solutions and their nature in tackling some classical problems which traditional numerical solvers faced. In this article, we look at the famous gradient descent optimization technique, which trains the network by updating parameters which minimizes the loss function. We look at the theoretical part of gradient descent to understand why the network works great for some terms of the loss function and not so much for other terms. The loss function considered here is built in such a way that it incorporates the differential equation as well as the derivative of the differential equation. The fully connected feed-forward network is designed in such a way that, without training at boundary points, it automatically satisfies the boundary conditions. The neural tangent kernel for gradient enhanced physics informed neural networks is examined in this work, and we demonstrate how it may be used to generate a closed-form expression for the kernel function. We also provide numerical experiments demonstrating the effectiveness of the new approach for several two point boundary value problems. Our results suggest that the neural tangent kernel based approach can significantly improve the computational accuracy of the gradient enhanced physics informed neural network while reducing the computational cost of training these models.</p>","PeriodicalId":51144,"journal":{"name":"Neural Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141191931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multichannel Multimodal Emotion Analysis of Cross-Modal Feedback Interactions Based on Knowledge Graph 基于知识图谱的跨模态反馈互动的多通道多模态情感分析
IF 3.1 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-29 DOI: 10.1007/s11063-024-11641-w
Shaohua Dong, Xiaochao Fan, Xinchun Ma

Multimodal sentiment analysis is a downstream branch task of sentiment analysis with high attention at present. Previous work in multimodal sentiment analysis have focused on the representation and fusion of modalities, capturing the underlying semantic relationships between modalities by considering contextual information. While this approach is feasible for simple contextual comments, more complex comments require the integration of external knowledge to obtain more accurate sentiment information. However, incorporating external knowledge into sentiment analysis to enhance information complementarity has not been thoroughly investigated. To address this, we propose a multichannel cross-modal feedback interaction model that incorporates the knowledge graph into multimodal sentiment analysis. Our proposed model consists of two main components: the cross-modal feedback recurrent interaction module and the external knowledge module for capturing latent information. The cross-modal interaction employs a self-feedback mechanism during network training, extracting feature representations of each modality and using these representations to mask sensory inputs, allowing the model to perform feedback-based feature masking. The external knowledge graph captures potential semantic information representations in the textual data through knowledge graph embedding. Finally, a global feature fusion module is employed for multichannel multimodal information integration. On two publicly available datasets, our method demonstrates good performance in terms of accuracy and F1 scores, compared to state-of-the-art models and several baselines.

多模态情感分析是情感分析的下游分支任务,目前备受关注。以往的多模态情感分析工作侧重于模态的表示和融合,通过考虑上下文信息来捕捉模态之间的潜在语义关系。虽然这种方法对于简单的上下文评论是可行的,但更复杂的评论则需要整合外部知识才能获得更准确的情感信息。然而,将外部知识纳入情感分析以增强信息互补性的做法尚未得到深入研究。为此,我们提出了一种多渠道跨模态反馈交互模型,将知识图谱融入多模态情感分析中。我们提出的模型由两个主要部分组成:跨模态反馈循环交互模块和用于捕捉潜在信息的外部知识模块。跨模态交互模块在网络训练过程中采用自我反馈机制,提取每种模态的特征表征,并利用这些表征来屏蔽感官输入,从而使模型能够执行基于反馈的特征屏蔽。外部知识图谱通过知识图谱嵌入捕捉文本数据中潜在的语义信息表征。最后,全局特征融合模块用于多通道多模态信息整合。在两个公开可用的数据集上,与最先进的模型和几种基线相比,我们的方法在准确率和 F1 分数方面表现出色。
{"title":"Multichannel Multimodal Emotion Analysis of Cross-Modal Feedback Interactions Based on Knowledge Graph","authors":"Shaohua Dong, Xiaochao Fan, Xinchun Ma","doi":"10.1007/s11063-024-11641-w","DOIUrl":"https://doi.org/10.1007/s11063-024-11641-w","url":null,"abstract":"<p>Multimodal sentiment analysis is a downstream branch task of sentiment analysis with high attention at present. Previous work in multimodal sentiment analysis have focused on the representation and fusion of modalities, capturing the underlying semantic relationships between modalities by considering contextual information. While this approach is feasible for simple contextual comments, more complex comments require the integration of external knowledge to obtain more accurate sentiment information. However, incorporating external knowledge into sentiment analysis to enhance information complementarity has not been thoroughly investigated. To address this, we propose a multichannel cross-modal feedback interaction model that incorporates the knowledge graph into multimodal sentiment analysis. Our proposed model consists of two main components: the cross-modal feedback recurrent interaction module and the external knowledge module for capturing latent information. The cross-modal interaction employs a self-feedback mechanism during network training, extracting feature representations of each modality and using these representations to mask sensory inputs, allowing the model to perform feedback-based feature masking. The external knowledge graph captures potential semantic information representations in the textual data through knowledge graph embedding. Finally, a global feature fusion module is employed for multichannel multimodal information integration. On two publicly available datasets, our method demonstrates good performance in terms of accuracy and F1 scores, compared to state-of-the-art models and several baselines.</p>","PeriodicalId":51144,"journal":{"name":"Neural Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141170614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantized Iterative Learning Bipartite Containment Tracking Control for Unknown Nonlinear Multi-agent Systems 针对未知非线性多代理系统的量化迭代学习双方包含跟踪控制
IF 3.1 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-29 DOI: 10.1007/s11063-024-11649-2
Ruikun Zhang, Shangyu Sang, Jingyuan Zhang, Xue Lin

This paper proposes a quantized model-free adaptive iterative learning control (MFAILC) algorithm to solve the bipartite containment tracking problem of unknown nonlinear multi-agent systems, where the interactions between agents include cooperation and antagonistic interactions. To design the controller, the agent’s dynamics is transformed into the linear data model based on the dynamic linearization method, and then a quantized MFAILC algorithm is established based on the quantized values of the relative output measurements. The designed controller only depends on the input and output data of the agent. We prove that under the quantized MFAILC algorithm, the multi-agent systems can achieve the bipartite containment, that is, the output trajectories of followers converge to the convex hull formed by the leaders’ trajectories and the leaders’ symmetric trajectories. Finally, we provide simulations to illustrate the effectiveness of our theoretical results.

本文提出了一种量化的无模型自适应迭代学习控制(MFAILC)算法,以解决未知非线性多代理系统的两方包含跟踪问题,其中代理之间的相互作用包括合作和对抗性相互作用。在设计控制器时,首先根据动态线性化方法将代理的动力学特性转化为线性数据模型,然后根据相对输出测量值的量化值建立量化 MFAILC 算法。所设计的控制器只取决于代理的输入和输出数据。我们证明,在量化的 MFAILC 算法下,多代理系统可以实现两方包含,即跟随者的输出轨迹收敛于领导者轨迹和领导者对称轨迹形成的凸壳。最后,我们通过模拟来说明理论结果的有效性。
{"title":"Quantized Iterative Learning Bipartite Containment Tracking Control for Unknown Nonlinear Multi-agent Systems","authors":"Ruikun Zhang, Shangyu Sang, Jingyuan Zhang, Xue Lin","doi":"10.1007/s11063-024-11649-2","DOIUrl":"https://doi.org/10.1007/s11063-024-11649-2","url":null,"abstract":"<p>This paper proposes a quantized model-free adaptive iterative learning control (MFAILC) algorithm to solve the bipartite containment tracking problem of unknown nonlinear multi-agent systems, where the interactions between agents include cooperation and antagonistic interactions. To design the controller, the agent’s dynamics is transformed into the linear data model based on the dynamic linearization method, and then a quantized MFAILC algorithm is established based on the quantized values of the relative output measurements. The designed controller only depends on the input and output data of the agent. We prove that under the quantized MFAILC algorithm, the multi-agent systems can achieve the bipartite containment, that is, the output trajectories of followers converge to the convex hull formed by the leaders’ trajectories and the leaders’ symmetric trajectories. Finally, we provide simulations to illustrate the effectiveness of our theoretical results.</p>","PeriodicalId":51144,"journal":{"name":"Neural Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141187764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural Circuit Policies for Virtual Character Control 虚拟字符控制的神经回路策略
IF 3.1 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-28 DOI: 10.1007/s11063-024-11640-x
Waleed Razzaq, Kashif Raza

The development of high-stakes decision-making neural agents that interact with complex environments, such as video games, is an important aspect of AI research with numerous potential applications. Reinforcement learning combined with deep learning architectures (DRL) has shown remarkable success in various genres of games. The performance of DRL is heavily dependent upon the neural networks resides within them. Although these algorithms perform well in offline testing but the performance deteriorates in noisy and sub-optimal conditions, creating safety and security issues. To address these, we propose a hybrid deep learning architecture that combines a traditional convolutional neural network with worm brain-inspired neural circuit policies. This allows the agent to learn key coherent features from the environment and interpret its dynamics. The obtained DRL agent was not only able to achieve an optimal policy quickly, but it was also the most noise-resilient with the highest success rate. Our research indicates that only 20 control neurons (12 inter-neurons and 8 command neurons) are sufficient to achieve competitive results. We implemented and analyzed the agent in the popular video game Doom, demonstrating its effectiveness in practical applications.

开发与复杂环境(如视频游戏)互动的高风险决策神经代理是人工智能研究的一个重要方面,具有众多潜在应用。强化学习与深度学习架构(DRL)相结合,在各种类型的游戏中取得了显著的成功。DRL 的性能在很大程度上取决于其中的神经网络。虽然这些算法在离线测试中表现良好,但在嘈杂和次优条件下性能会下降,从而产生安全和保安问题。为了解决这些问题,我们提出了一种混合深度学习架构,将传统卷积神经网络与蠕虫大脑启发神经回路策略相结合。这样,代理就能从环境中学习关键的一致性特征,并解释其动态变化。获得的 DRL 代理不仅能快速实现最优策略,而且抗噪能力最强,成功率最高。我们的研究表明,只需 20 个控制神经元(12 个中间神经元和 8 个指令神经元)就足以实现有竞争力的结果。我们在流行的视频游戏 Doom 中实施并分析了该代理,证明了它在实际应用中的有效性。
{"title":"Neural Circuit Policies for Virtual Character Control","authors":"Waleed Razzaq, Kashif Raza","doi":"10.1007/s11063-024-11640-x","DOIUrl":"https://doi.org/10.1007/s11063-024-11640-x","url":null,"abstract":"<p>The development of high-stakes decision-making neural agents that interact with complex environments, such as video games, is an important aspect of AI research with numerous potential applications. Reinforcement learning combined with deep learning architectures (DRL) has shown remarkable success in various genres of games. The performance of DRL is heavily dependent upon the neural networks resides within them. Although these algorithms perform well in offline testing but the performance deteriorates in noisy and sub-optimal conditions, creating safety and security issues. To address these, we propose a hybrid deep learning architecture that combines a traditional convolutional neural network with worm brain-inspired neural circuit policies. This allows the agent to learn key coherent features from the environment and interpret its dynamics. The obtained DRL agent was not only able to achieve an optimal policy quickly, but it was also the most noise-resilient with the highest success rate. Our research indicates that only 20 control neurons (12 inter-neurons and 8 command neurons) are sufficient to achieve competitive results. We implemented and analyzed the agent in the popular video game Doom, demonstrating its effectiveness in practical applications.\u0000</p>","PeriodicalId":51144,"journal":{"name":"Neural Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141170874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sample-Adaptive Classification Inference Network 样本自适应分类推理网络
IF 3.1 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-28 DOI: 10.1007/s11063-024-11629-6
Juan Yang, Guanghong Zhou, Ronggui Wang, Lixia Xue

Existing pre-trained models have yielded promising results in terms of computational time reduction. However, these models only focus on pruning simple sentences or less salient words, while neglecting the treatment of relatively complex sentences. It is frequently these sentences that cause the loss of model accuracy. This shows that the adaptation of the existing models is one-sided. To address this issue, in this paper, we propose a sample-adaptive training and inference model. Specifically, complex samples are extracted from the training datasets and a dedicated data augmentation module is trained to extract global and local semantic information of complex samples. During inference, simple samples can exit the model via the Sample Adaptive Exit Mechanism, Normal samples pass through the whole backbone model before inference, while complex samples are processed by the Characteristic Enhancement Module after passing through the backbone model. In this way, all samples are processed adaptively. Our extensive experiments on classification tasks datasets in the field of Natural Language Processing demonstrate that our method enhances model accuracy and reduces model inference time for multiple datasets. Moreover, our method is transferable and can be applied to multiple pre-trained models.

现有的预训练模型在减少计算时间方面取得了可喜的成果。然而,这些模型只注重剪切简单句或不太突出的词,而忽略了对相对复杂句子的处理。往往正是这些句子导致了模型准确性的下降。这说明现有模型的适应性是片面的。针对这一问题,本文提出了一种样本自适应训练和推理模型。具体来说,从训练数据集中提取复杂样本,并训练一个专门的数据增强模块来提取复杂样本的全局和局部语义信息。在推理过程中,简单样本可通过样本自适应退出机制退出模型,正常样本在推理前通过整个骨干模型,而复杂样本在通过骨干模型后由特征增强模块处理。这样,所有样本都能得到自适应处理。我们在自然语言处理领域的分类任务数据集上进行的大量实验表明,我们的方法提高了模型的准确性,并缩短了多个数据集的模型推理时间。此外,我们的方法具有可移植性,可应用于多个预训练模型。
{"title":"Sample-Adaptive Classification Inference Network","authors":"Juan Yang, Guanghong Zhou, Ronggui Wang, Lixia Xue","doi":"10.1007/s11063-024-11629-6","DOIUrl":"https://doi.org/10.1007/s11063-024-11629-6","url":null,"abstract":"<p>Existing pre-trained models have yielded promising results in terms of computational time reduction. However, these models only focus on pruning simple sentences or less salient words, while neglecting the treatment of relatively complex sentences. It is frequently these sentences that cause the loss of model accuracy. This shows that the adaptation of the existing models is one-sided. To address this issue, in this paper, we propose a sample-adaptive training and inference model. Specifically, complex samples are extracted from the training datasets and a dedicated data augmentation module is trained to extract global and local semantic information of complex samples. During inference, simple samples can exit the model via the Sample Adaptive Exit Mechanism, Normal samples pass through the whole backbone model before inference, while complex samples are processed by the Characteristic Enhancement Module after passing through the backbone model. In this way, all samples are processed adaptively. Our extensive experiments on classification tasks datasets in the field of Natural Language Processing demonstrate that our method enhances model accuracy and reduces model inference time for multiple datasets. Moreover, our method is transferable and can be applied to multiple pre-trained models.</p>","PeriodicalId":51144,"journal":{"name":"Neural Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141170615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Observer-Based Topology Identification and Synchronization in Finite Time for Fractional Singularly Perturbed Complex Networks via Dynamic Event-Triggered Control 基于观测器的拓扑识别和有限时间内通过动态事件触发控制实现分数奇异扰动复杂网络的同步
IF 3.1 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-28 DOI: 10.1007/s11063-024-11648-3
Lingyan Wang, Huaiqin Wu, Jinde Cao

This paper investigates the topology identification and synchronization in finite time for fractional singularly perturbed complex networks (FSPCNs). Firstly, a convergence principle is developed for continuously differential functions. Secondly, a dynamic event-triggered mechanism (DETM) is designed to achieve the network synchronization, and a topology observer is developed to identify the network topology. Thirdly, under the designed DETM, by constructing a Lyapunov functional and applying the inequality analysis technique, the topology identification and synchronization condition in finite time is established in the forms of the matrix inequality. In addition, it is proved that the Zeno behavior can be effectively excluded. Finally, the effectiveness of the main results is verified by an application example.

本文研究了分数奇异扰动复杂网络(FSPCN)的拓扑识别和有限时间内的同步问题。首先,本文提出了连续微分函数的收敛原理。其次,设计了一个动态事件触发机制(DETM)来实现网络同步,并开发了一个拓扑观测器来识别网络拓扑。第三,在设计的 DETM 下,通过构建 Lyapunov 函数和应用不等式分析技术,以矩阵不等式的形式确定了有限时间内的拓扑识别和同步条件。此外,还证明了可以有效地排除 Zeno 行为。最后,通过一个应用实例验证了主要结果的有效性。
{"title":"An Observer-Based Topology Identification and Synchronization in Finite Time for Fractional Singularly Perturbed Complex Networks via Dynamic Event-Triggered Control","authors":"Lingyan Wang, Huaiqin Wu, Jinde Cao","doi":"10.1007/s11063-024-11648-3","DOIUrl":"https://doi.org/10.1007/s11063-024-11648-3","url":null,"abstract":"<p>This paper investigates the topology identification and synchronization in finite time for fractional singularly perturbed complex networks (FSPCNs). Firstly, a convergence principle is developed for continuously differential functions. Secondly, a dynamic event-triggered mechanism (DETM) is designed to achieve the network synchronization, and a topology observer is developed to identify the network topology. Thirdly, under the designed DETM, by constructing a Lyapunov functional and applying the inequality analysis technique, the topology identification and synchronization condition in finite time is established in the forms of the matrix inequality. In addition, it is proved that the Zeno behavior can be effectively excluded. Finally, the effectiveness of the main results is verified by an application example.</p>","PeriodicalId":51144,"journal":{"name":"Neural Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141170610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual-Channel Autoencoder with Key Region Feature Enhancement for Video Anomalous Event Detection 利用关键区域特征增强的双通道自动编码器进行视频异常事件检测
IF 3.1 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-28 DOI: 10.1007/s11063-024-11634-9
Qing Ye, Zihan Song, Yuqi Zhao, Yongmei Zhang

Video anomaly event detection is crucial for analyzing surveillance videos. Existing methods have limitations: frame-level detection fails to remove background interference, and object-level methods overlook object-environment interaction. To address these issues, this paper proposes a novel video anomaly event detection algorithm based on a dual-channel autoencoder with key region feature enhancement. The goal is to preserve valuable information in the global context while focusing on regions with a high anomaly occurrence. Firstly, a key region extraction network is proposed to perform foreground segmentation on video frames, eliminating background redundancy. Secondly, a dual-channel autoencoder is designed to enhance the features of key regions, enabling the model to extract more representative features. Finally, channel attention modules are inserted between each deconvolution layer of the decoder to enhance the model’s perception and discrimination of valuable information. Compared to existing methods, our approach accurately locates and focuses on regions with a high anomaly occurrence, improving the accuracy of anomaly event detection. Extensive experiments are conducted on the UCSD ped2, CUHK Avenue, and SHTech Campus datasets, and the results validate the effectiveness of the proposed method.

视频异常事件检测对于分析监控视频至关重要。现有方法存在局限性:帧级检测无法去除背景干扰,对象级方法忽略了对象与环境的交互作用。为了解决这些问题,本文提出了一种基于双通道自动编码器和关键区域特征增强的新型视频异常事件检测算法。该算法的目标是在关注异常事件高发区域的同时,保留有价值的全局信息。首先,提出了一个关键区域提取网络,用于对视频帧进行前景分割,消除背景冗余。其次,设计了一种双通道自动编码器来增强关键区域的特征,使模型能够提取更具代表性的特征。最后,在解码器的每个解卷积层之间插入通道注意模块,以增强模型对有价值信息的感知和辨别能力。与现有方法相比,我们的方法能准确定位并关注异常发生率较高的区域,提高了异常事件检测的准确性。我们在 UCSD ped2、CUHK Avenue 和 SHTech Campus 数据集上进行了广泛的实验,结果验证了所提方法的有效性。
{"title":"Dual-Channel Autoencoder with Key Region Feature Enhancement for Video Anomalous Event Detection","authors":"Qing Ye, Zihan Song, Yuqi Zhao, Yongmei Zhang","doi":"10.1007/s11063-024-11634-9","DOIUrl":"https://doi.org/10.1007/s11063-024-11634-9","url":null,"abstract":"<p>Video anomaly event detection is crucial for analyzing surveillance videos. Existing methods have limitations: frame-level detection fails to remove background interference, and object-level methods overlook object-environment interaction. To address these issues, this paper proposes a novel video anomaly event detection algorithm based on a dual-channel autoencoder with key region feature enhancement. The goal is to preserve valuable information in the global context while focusing on regions with a high anomaly occurrence. Firstly, a key region extraction network is proposed to perform foreground segmentation on video frames, eliminating background redundancy. Secondly, a dual-channel autoencoder is designed to enhance the features of key regions, enabling the model to extract more representative features. Finally, channel attention modules are inserted between each deconvolution layer of the decoder to enhance the model’s perception and discrimination of valuable information. Compared to existing methods, our approach accurately locates and focuses on regions with a high anomaly occurrence, improving the accuracy of anomaly event detection. Extensive experiments are conducted on the UCSD ped2, CUHK Avenue, and SHTech Campus datasets, and the results validate the effectiveness of the proposed method.</p>","PeriodicalId":51144,"journal":{"name":"Neural Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141170612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Document Information Selection Through Multi-Granularity Responses for Dialogue Generation 通过多粒度响应加强对话生成中的文件信息选择
IF 3.1 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-28 DOI: 10.1007/s11063-024-11633-w
Meiqi Wang, Kangyu Qiao, Shuyue Xing, Caixia Yuan, Xiaojie Wang

Document information selection is an essential part of document-grounded dialogue tasks, and more accurate information selection results can provide more appropriate dialogue responses. Existing works have achieved excellent results by employing multi-granularity of dialogue history information, indicating the effectiveness of multi-level historical information. However, these works often focus on exploring the hierarchical information of dialogue history, while neglecting the multi-granularity utilization in response, important information that holds an impact on the decoding process. Therefore, this paper proposes a model for document information selection based on multi-granularity responses. By integrating the document selection results at the response word level and semantic unit level, the model enhances its capability in knowledge selection and produces better responses. For the division at the semantic unit level of the response, we propose two semantic unit division methods, static and dynamic. Experiments on two public datasets show that our models combining static or dynamic semantic unit levels significantly outperform baseline models.

文档信息选择是以文档为基础的对话任务的重要组成部分,更准确的信息选择结果可以提供更恰当的对话回应。现有研究通过采用多粒度的对话历史信息取得了很好的效果,显示了多层次历史信息的有效性。然而,这些研究往往只关注了对话历史信息的层次性,而忽视了多粒度信息在应答中的利用,而这一重要信息对解码过程具有影响。因此,本文提出了一种基于多粒度响应的文档信息选择模型。该模型通过整合响应词层面和语义单元层面的文档选择结果,增强了知识选择能力,并产生了更好的响应。对于回复语义单位层面的划分,我们提出了静态和动态两种语义单位划分方法。在两个公共数据集上进行的实验表明,我们的模型结合了静态或动态语义单元级别,其效果明显优于基线模型。
{"title":"Enhancing Document Information Selection Through Multi-Granularity Responses for Dialogue Generation","authors":"Meiqi Wang, Kangyu Qiao, Shuyue Xing, Caixia Yuan, Xiaojie Wang","doi":"10.1007/s11063-024-11633-w","DOIUrl":"https://doi.org/10.1007/s11063-024-11633-w","url":null,"abstract":"<p>Document information selection is an essential part of document-grounded dialogue tasks, and more accurate information selection results can provide more appropriate dialogue responses. Existing works have achieved excellent results by employing multi-granularity of dialogue history information, indicating the effectiveness of multi-level historical information. However, these works often focus on exploring the hierarchical information of dialogue history, while neglecting the multi-granularity utilization in response, important information that holds an impact on the decoding process. Therefore, this paper proposes a model for document information selection based on multi-granularity responses. By integrating the document selection results at the response word level and semantic unit level, the model enhances its capability in knowledge selection and produces better responses. For the division at the semantic unit level of the response, we propose two semantic unit division methods, static and dynamic. Experiments on two public datasets show that our models combining static or dynamic semantic unit levels significantly outperform baseline models.</p>","PeriodicalId":51144,"journal":{"name":"Neural Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141170972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Point-Cluster-Partition Architecture for Weighted Clustering Ensemble 加权聚类组合的点-聚类-分区架构
IF 3.1 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-27 DOI: 10.1007/s11063-024-11618-9
Na Li, Sen Xu, Heyang Xu, Xiufang Xu, Naixuan Guo, Na Cai

Clustering ensembles can obtain more superior final results by combining multiple different clustering results. The qualities of the points, clusters, and partitions play crucial roles in the consistency of the clustering process. However, existing methods mostly focus on one or two aspects of them, without a comprehensive consideration of the three aspects. This paper proposes a three-level weighted clustering ensemble algorithm namely unified point-cluser-partition algorithm (PCPA). The first step of the PCPA is to generate the adjacency matrix by base clusterings. Then, the central step is to obtain the weighted adjacency matrix by successively weighting three layers, i.e., points, clusters, and partitions. Finally, the consensus clustering is obtained by the average link method. Three performance indexes, namely F, NMI, and ARI, are used to evaluate the accuracy of the proposed method. The experimental results show that: Firstly, as expected, the proposed three-layer weighted clustering ensemble can improve the accuracy of each evaluation index by an average value of 22.07% compared with the direct clustering ensemble without weighting; Secondly, compared with seven other methods, PCPA can achieve better clustering results and the proportion that PCPA ranks first is 28/33.

聚类集合可以通过组合多个不同的聚类结果,获得更优越的最终结果。点、簇和分区的质量对聚类过程的一致性起着至关重要的作用。然而,现有的方法大多只关注其中的一两个方面,而没有综合考虑这三个方面。本文提出了一种三级加权聚类集合算法,即统一点-排序-分区算法(PCPA)。PCPA 的第一步是通过基础聚类生成邻接矩阵。然后,中心步骤是通过对点、聚类和分区三层连续加权得到加权邻接矩阵。最后,通过平均链接法获得共识聚类。使用三个性能指标,即 F、NMI 和 ARI,来评价所提方法的准确性。实验结果表明首先,正如预期的那样,与不加权的直接聚类集合相比,所提出的三层加权聚类集合能提高各评价指标的准确度,平均值为 22.07%;其次,与其他七种方法相比,PCPA 能取得更好的聚类结果,PCPA 排名第一的比例为 28/33。
{"title":"A Point-Cluster-Partition Architecture for Weighted Clustering Ensemble","authors":"Na Li, Sen Xu, Heyang Xu, Xiufang Xu, Naixuan Guo, Na Cai","doi":"10.1007/s11063-024-11618-9","DOIUrl":"https://doi.org/10.1007/s11063-024-11618-9","url":null,"abstract":"<p>Clustering ensembles can obtain more superior final results by combining multiple different clustering results. The qualities of the points, clusters, and partitions play crucial roles in the consistency of the clustering process. However, existing methods mostly focus on one or two aspects of them, without a comprehensive consideration of the three aspects. This paper proposes a three-level weighted clustering ensemble algorithm namely unified point-cluser-partition algorithm (PCPA). The first step of the PCPA is to generate the adjacency matrix by base clusterings. Then, the central step is to obtain the weighted adjacency matrix by successively weighting three layers, i.e., points, clusters, and partitions. Finally, the consensus clustering is obtained by the average link method. Three performance indexes, namely F, NMI, and ARI, are used to evaluate the accuracy of the proposed method. The experimental results show that: Firstly, as expected, the proposed three-layer weighted clustering ensemble can improve the accuracy of each evaluation index by an average value of 22.07% compared with the direct clustering ensemble without weighting; Secondly, compared with seven other methods, PCPA can achieve better clustering results and the proportion that PCPA ranks first is 28/33.</p>","PeriodicalId":51144,"journal":{"name":"Neural Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141170545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical Reinforcement Learning from Demonstration via Reachability-Based Reward Shaping 通过基于可达性的奖励塑造,从示范中进行分层强化学习
IF 3.1 4区 计算机科学 Q2 Computer Science Pub Date : 2024-05-27 DOI: 10.1007/s11063-024-11632-x
Xiaozhu Gao, Jinhui Liu, Bo Wan, Lingling An

Hierarchical reinforcement learning (HRL) has achieved remarkable success and significant progress in complex and long-term decision-making problems. However, HRL training typically entails substantial computational costs and an enormous number of samples. One effective approach to tackle this challenge is hierarchical reinforcement learning from demonstrations (HRLfD), which leverages demonstrations to expedite the training process of HRL. The effectiveness of HRLfD is contingent upon the quality of the demonstrations; hence, suboptimal demonstrations may impede efficient learning. To address this issue, this paper proposes a reachability-based reward shaping (RbRS) method to alleviate the negative interference of suboptimal demonstrations for the HRL agent. The novel HRLfD algorithm based on RbRS is named HRLfD-RbRS, which incorporates the RbRS method to enhance the learning efficiency of HRLfD. Moreover, with the help of this method, the learning agent can explore better policies under the guidance of the suboptimal demonstration. We evaluate the proposed HRLfD-RbRS algorithm on various complex robotic tasks, and the experimental results demonstrate that our method outperforms current state-of-the-art HRLfD algorithms.

分层强化学习(HRL)在复杂和长期决策问题上取得了显著成功和重大进展。然而,HRL 的训练通常需要大量的计算成本和大量的样本。应对这一挑战的一种有效方法是通过示范进行分层强化学习(HRLfD),它利用示范来加快 HRL 的训练过程。HRLfD 的有效性取决于示范的质量;因此,次优示范可能会阻碍高效学习。针对这一问题,本文提出了一种基于可达性的奖励塑造(RbRS)方法,以减轻次优示范对 HRL 代理的负面干扰。基于 RbRS 的新型 HRLfD 算法被命名为 HRLfD-RbRS,它结合了 RbRS 方法来提高 HRLfD 的学习效率。此外,在该方法的帮助下,学习代理可以在次优示范的指导下探索更好的策略。我们在各种复杂的机器人任务中评估了所提出的 HRLfD-RbRS 算法,实验结果表明我们的方法优于目前最先进的 HRLfD 算法。
{"title":"Hierarchical Reinforcement Learning from Demonstration via Reachability-Based Reward Shaping","authors":"Xiaozhu Gao, Jinhui Liu, Bo Wan, Lingling An","doi":"10.1007/s11063-024-11632-x","DOIUrl":"https://doi.org/10.1007/s11063-024-11632-x","url":null,"abstract":"<p>Hierarchical reinforcement learning (HRL) has achieved remarkable success and significant progress in complex and long-term decision-making problems. However, HRL training typically entails substantial computational costs and an enormous number of samples. One effective approach to tackle this challenge is hierarchical reinforcement learning from demonstrations (HRLfD), which leverages demonstrations to expedite the training process of HRL. The effectiveness of HRLfD is contingent upon the quality of the demonstrations; hence, suboptimal demonstrations may impede efficient learning. To address this issue, this paper proposes a reachability-based reward shaping (RbRS) method to alleviate the negative interference of suboptimal demonstrations for the HRL agent. The novel HRLfD algorithm based on RbRS is named HRLfD-RbRS, which incorporates the RbRS method to enhance the learning efficiency of HRLfD. Moreover, with the help of this method, the learning agent can explore better policies under the guidance of the suboptimal demonstration. We evaluate the proposed HRLfD-RbRS algorithm on various complex robotic tasks, and the experimental results demonstrate that our method outperforms current state-of-the-art HRLfD algorithms.</p>","PeriodicalId":51144,"journal":{"name":"Neural Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141170611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Neural Processing Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1