首页 > 最新文献

Neural Networks最新文献

英文 中文
Complete synchronization of discrete-time fractional-order BAM neural networks with leakage and discrete delays 具有泄漏和离散延迟的离散时间分数阶 BAM 神经网络的完全同步化
IF 6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-05 DOI: 10.1016/j.neunet.2024.106705

This paper concerns complete synchronization (CS) problem of discrete-time fractional-order BAM neural networks (BAMNNs) with leakage and discrete delays. Firstly, on the basis of Caputo fractional difference theory and nabla l-Laplace transform, two equations about the nabla sum are strictly proved. Secondly, two extended Halanay inequalities that are suitable for discrete-time fractional difference inequations with arbitrary initial time and multiple types of delays are introduced. In addition, through applying Caputo fractional difference theory and combining with inequalities gained from this paper, some sufficient CS criteria of discrete-time fractional-order BAMNNs with leakage and discrete delays are established under adaptive controller. Finally, one numerical simulation is utilized to certify the effectiveness of the obtained theoretical results.

本文涉及具有泄漏和离散延迟的离散时间分数阶 BAM 神经网络(BAMNN)的完全同步(CS)问题。首先,在 Caputo 分数差分理论和 nabla l-Laplace 变换的基础上,严格证明了关于 nabla 和的两个方程。其次,引入了两个适用于具有任意初始时间和多种延迟的离散时间分数差分不等式的扩展哈拉内不等式。此外,通过应用 Caputo 分数差分理论并结合本文获得的不等式,建立了自适应控制器下具有泄漏和离散延迟的离散时间分数阶 BAMNN 的一些充分 CS 准则。最后,本文利用数值模拟验证了所获理论结果的有效性。
{"title":"Complete synchronization of discrete-time fractional-order BAM neural networks with leakage and discrete delays","authors":"","doi":"10.1016/j.neunet.2024.106705","DOIUrl":"10.1016/j.neunet.2024.106705","url":null,"abstract":"<div><p>This paper concerns complete synchronization (CS) problem of discrete-time fractional-order BAM neural networks (BAMNNs) with leakage and discrete delays. Firstly, on the basis of Caputo fractional difference theory and nabla <span><math><mi>l</mi></math></span>-Laplace transform, two equations about the nabla sum are strictly proved. Secondly, two extended Halanay inequalities that are suitable for discrete-time fractional difference inequations with arbitrary initial time and multiple types of delays are introduced. In addition, through applying Caputo fractional difference theory and combining with inequalities gained from this paper, some sufficient CS criteria of discrete-time fractional-order BAMNNs with leakage and discrete delays are established under adaptive controller. Finally, one numerical simulation is utilized to certify the effectiveness of the obtained theoretical results.</p></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142157633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GFANC-RL: Reinforcement Learning-based Generative Fixed-filter Active Noise Control GFANC-RL:基于强化学习的生成式固定滤波主动噪声控制
IF 6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-05 DOI: 10.1016/j.neunet.2024.106687
The recent Generative Fixed-filter Active Noise Control (GFANC) method achieves a good trade-off between noise reduction performance and system stability. However, labelling noise data for training the Convolutional Neural Network (CNN) in GFANC is typically resource-consuming. Even worse, labelling errors will degrade the CNN’s filter-generation accuracy. Therefore, this paper proposes a novel Reinforcement Learning-based GFANC (GFANC-RL) approach that omits the labelling process by leveraging the exploring property of Reinforcement Learning (RL). The CNN’s parameters are automatically updated through the interaction between the RL agent and the environment. Moreover, the RL algorithm solves the non-differentiability issue caused by using binary combination weights in GFANC. Simulation results demonstrate the effectiveness and transferability of the GFANC-RL method in handling real-recorded noises across different acoustic paths.2
最新的生成固定滤波主动噪声控制(GFANC)方法在降噪性能和系统稳定性之间实现了良好的权衡。然而,为训练 GFANC 中的卷积神经网络(CNN)而标注噪声数据通常非常耗费资源。更糟糕的是,标记错误会降低 CNN 的滤波器生成精度。因此,本文提出了一种新颖的基于强化学习的 GFANC(GFANC-RL)方法,利用强化学习(RL)的探索特性,省略了标记过程。CNN 的参数通过 RL 代理与环境之间的交互自动更新。此外,RL 算法还解决了 GFANC 中使用二进制组合权重所导致的不可区分性问题。仿真结果表明,GFANC-RL 方法在处理不同声学路径上的真实录音噪声时非常有效,并具有可移植性。
{"title":"GFANC-RL: Reinforcement Learning-based Generative Fixed-filter Active Noise Control","authors":"","doi":"10.1016/j.neunet.2024.106687","DOIUrl":"10.1016/j.neunet.2024.106687","url":null,"abstract":"<div><div>The recent Generative Fixed-filter Active Noise Control (GFANC) method achieves a good trade-off between noise reduction performance and system stability. However, labelling noise data for training the Convolutional Neural Network (CNN) in GFANC is typically resource-consuming. Even worse, labelling errors will degrade the CNN’s filter-generation accuracy. Therefore, this paper proposes a novel Reinforcement Learning-based GFANC (GFANC-RL) approach that omits the labelling process by leveraging the exploring property of Reinforcement Learning (RL). The CNN’s parameters are automatically updated through the interaction between the RL agent and the environment. Moreover, the RL algorithm solves the non-differentiability issue caused by using binary combination weights in GFANC. Simulation results demonstrate the effectiveness and transferability of the GFANC-RL method in handling real-recorded noises across different acoustic paths.<span><span><sup>2</sup></span></span></div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142312163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Kolmogorov n-widths for multitask physics-informed machine learning (PIML) methods: Towards robust metrics 多任务物理信息机器学习(PIML)方法的柯尔莫哥洛夫 n 宽:实现稳健度量
IF 6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-04 DOI: 10.1016/j.neunet.2024.106703

Physics-informed machine learning (PIML) as a means of solving partial differential equations (PDEs) has garnered much attention in the Computational Science and Engineering (CS&E) world. This topic encompasses a broad array of methods and models aimed at solving a single or a collection of PDE problems, called multitask learning. PIML is characterized by the incorporation of physical laws into the training process of machine learning models in lieu of large data when solving PDE problems. Despite the overall success of this collection of methods, it remains incredibly difficult to analyze, benchmark, and generally compare one approach to another. Using Kolmogorov n-widths as a measure of effectiveness of approximating functions, we judiciously apply this metric in the comparison of various multitask PIML architectures. We compute lower accuracy bounds and analyze the model’s learned basis functions on various PDE problems. This is the first objective metric for comparing multitask PIML architectures and helps remove uncertainty in model validation from selective sampling and overfitting. We also identify avenues of improvement for model architectures, such as the choice of activation function, which can drastically affect model generalization to “worst-case” scenarios, which is not observed when reporting task-specific errors. We also incorporate this metric into the optimization process through regularization, which improves the models’ generalizability over the multitask PDE problem.

物理信息机器学习(PIML)作为求解偏微分方程(PDE)的一种手段,在计算科学与工程(CS&E)领域备受关注。这一主题包括一系列广泛的方法和模型,旨在解决单个或一系列 PDE 问题,即所谓的多任务学习。PIML 的特点是在解决 PDE 问题时,将物理规律纳入机器学习模型的训练过程,以代替大数据。尽管这一系列方法总体上取得了成功,但要对一种方法与另一种方法进行分析、基准测试和总体比较,仍然非常困难。我们使用 Kolmogorov n 宽作为近似函数有效性的衡量标准,在比较各种多任务 PIML 架构时明智地应用了这一指标。我们计算了精度下限,并分析了模型在各种 PDE 问题上学习到的基函数。这是首个用于比较多任务 PIML 架构的客观指标,有助于消除选择性采样和过度拟合带来的模型验证不确定性。我们还确定了模型架构的改进途径,例如激活函数的选择,这会极大地影响模型对 "最坏情况 "场景的泛化,而在报告特定任务误差时却观察不到这一点。我们还通过正则化将这一指标纳入优化过程,从而提高了模型对多任务 PDE 问题的泛化能力。
{"title":"Kolmogorov n-widths for multitask physics-informed machine learning (PIML) methods: Towards robust metrics","authors":"","doi":"10.1016/j.neunet.2024.106703","DOIUrl":"10.1016/j.neunet.2024.106703","url":null,"abstract":"<div><p>Physics-informed machine learning (PIML) as a means of solving partial differential equations (PDEs) has garnered much attention in the Computational Science and Engineering (CS&amp;E) world. This topic encompasses a broad array of methods and models aimed at solving a single or a collection of PDE problems, called multitask learning. PIML is characterized by the incorporation of physical laws into the training process of machine learning models in lieu of large data when solving PDE problems. Despite the overall success of this collection of methods, it remains incredibly difficult to analyze, benchmark, and generally compare one approach to another. Using Kolmogorov n-widths as a measure of effectiveness of approximating functions, we judiciously apply this metric in the comparison of various multitask PIML architectures. We compute lower accuracy bounds and analyze the model’s learned basis functions on various PDE problems. This is the first objective metric for comparing multitask PIML architectures and helps remove uncertainty in model validation from selective sampling and overfitting. We also identify avenues of improvement for model architectures, such as the choice of activation function, which can drastically affect model generalization to “worst-case” scenarios, which is not observed when reporting task-specific errors. We also incorporate this metric into the optimization process through regularization, which improves the models’ generalizability over the multitask PDE problem.</p></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142243882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
State transition learning with limited data for safe control of switched nonlinear systems 利用有限数据进行状态转换学习,实现开关非线性系统的安全控制
IF 6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-03 DOI: 10.1016/j.neunet.2024.106695

Switching dynamics are prevalent in real-world systems, arising from either intrinsic changes or responses to external influences, which can be appropriately modeled by switched systems. Control synthesis for switched systems, especially integrating safety constraints, is recognized as a significant and challenging topic. This study focuses on devising a learning-based control strategy for switched nonlinear systems operating under arbitrary switching law. It aims to maintain stability and uphold safety constraints despite limited system data. To achieve these goals, we employ the control barrier function method and Lyapunov theory to synthesize a controller that delivers both safety and stability performance. To overcome the difficulties associated with constructing the specific control barrier and Lyapunov function and take advantage of switching characteristics, we create a neural control barrier function and a neural Lyapunov function separately for control policies through a state transition learning approach. These neural barrier and Lyapunov functions facilitate the design of the safe controller. The corresponding control policy is governed by learning from two components: policy loss and forward state estimation. The effectiveness of the developing scheme is verified through simulation examples.

开关动态在现实世界的系统中非常普遍,它产生于内在变化或对外部影响的反应,可以用开关系统进行适当建模。开关系统的控制合成,尤其是集成安全约束的控制合成,被认为是一个重要而具有挑战性的课题。本研究的重点是为在任意开关规律下运行的开关非线性系统设计一种基于学习的控制策略。其目的是在系统数据有限的情况下,保持稳定性并维护安全约束。为了实现这些目标,我们采用了控制障碍函数法和 Lyapunov 理论来合成一个既能保证安全性又能保证稳定性的控制器。为了克服构建特定控制障壁函数和 Lyapunov 函数的困难,并利用开关特性,我们通过状态转换学习方法为控制策略分别创建了神经控制障壁函数和神经 Lyapunov 函数。这些神经屏障和 Lyapunov 函数有助于设计安全控制器。相应的控制策略受两部分学习的支配:策略损失和前向状态估计。通过仿真实例验证了开发方案的有效性。
{"title":"State transition learning with limited data for safe control of switched nonlinear systems","authors":"","doi":"10.1016/j.neunet.2024.106695","DOIUrl":"10.1016/j.neunet.2024.106695","url":null,"abstract":"<div><p>Switching dynamics are prevalent in real-world systems, arising from either intrinsic changes or responses to external influences, which can be appropriately modeled by switched systems. Control synthesis for switched systems, especially integrating safety constraints, is recognized as a significant and challenging topic. This study focuses on devising a learning-based control strategy for switched nonlinear systems operating under arbitrary switching law. It aims to maintain stability and uphold safety constraints despite limited system data. To achieve these goals, we employ the control barrier function method and Lyapunov theory to synthesize a controller that delivers both safety and stability performance. To overcome the difficulties associated with constructing the specific control barrier and Lyapunov function and take advantage of switching characteristics, we create a neural control barrier function and a neural Lyapunov function separately for control policies through a state transition learning approach. These neural barrier and Lyapunov functions facilitate the design of the safe controller. The corresponding control policy is governed by learning from two components: policy loss and forward state estimation. The effectiveness of the developing scheme is verified through simulation examples.</p></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0893608024006191/pdfft?md5=d2ae98134957c6fcb8db6c8185b3a468&pid=1-s2.0-S0893608024006191-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142173235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards a configurable and non-hierarchical search space for NAS 为 NAS 开发可配置的非等级搜索空间
IF 6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-03 DOI: 10.1016/j.neunet.2024.106700

Neural Architecture Search (NAS) outperforms handcrafted Neural Network (NN) design. However, current NAS methods generally use hard-coded search spaces, and predefined hierarchical architectures. As a consequence, adapting them to a new problem can be cumbersome, and it is hard to know which of the NAS algorithm or the predefined hierarchical structure impacts performance the most. To improve flexibility, and be less reliant on expert knowledge, this paper proposes a NAS methodology in which the search space is easily customizable, and allows for full network search. NAS is performed with Gaussian Process (GP)-based Bayesian Optimization (BO) in a continuous architecture embedding space. This embedding is built upon a Wasserstein Autoencoder, regularized by both a Maximum Mean Discrepancy (MMD) penalization and a Fully Input Convex Neural Network (FICNN) latent predictor, trained to infer the parameter count of architectures. This paper first assesses the embedding’s suitability for optimization by solving 2 computationally inexpensive problems: minimizing the number of parameters, and maximizing a zero-shot accuracy proxy. Then, two variants of complexity-aware NAS are performed on CIFAR-10 and STL-10, based on two different search spaces, providing competitive NN architectures with limited model sizes.

神经架构搜索(NAS)优于手工神经网络(NN)设计。然而,目前的 NAS 方法通常使用硬编码搜索空间和预定义分层架构。因此,根据新问题调整这些方法可能很麻烦,而且很难知道 NAS 算法和预定义分层结构中哪个对性能影响最大。为了提高灵活性,减少对专家知识的依赖,本文提出了一种 NAS 方法,其中的搜索空间可轻松定制,并允许进行全网搜索。在连续架构嵌入空间中,使用基于高斯过程(GP)的贝叶斯优化(BO)来执行 NAS。这种嵌入建立在瓦瑟斯坦自动编码器(Wasserstein Autoencoder)的基础上,并通过最大均值差异(MMD)惩罚和全输入凸神经网络(FICNN)潜在预测器对其进行正则化,以推断出架构的参数数量。本文首先通过解决两个计算成本低廉的问题来评估嵌入是否适合优化:最小化参数数和最大化零次精度代理。然后,基于两个不同的搜索空间,在 CIFAR-10 和 STL-10 上执行了复杂性感知 NAS 的两个变体,在有限的模型大小下提供了有竞争力的 NN 架构。
{"title":"Towards a configurable and non-hierarchical search space for NAS","authors":"","doi":"10.1016/j.neunet.2024.106700","DOIUrl":"10.1016/j.neunet.2024.106700","url":null,"abstract":"<div><p>Neural Architecture Search (NAS) outperforms handcrafted Neural Network (NN) design. However, current NAS methods generally use hard-coded search spaces, and predefined hierarchical architectures. As a consequence, adapting them to a new problem can be cumbersome, and it is hard to know which of the NAS algorithm or the predefined hierarchical structure impacts performance the most. To improve flexibility, and be less reliant on expert knowledge, this paper proposes a NAS methodology in which the search space is easily customizable, and allows for full network search. NAS is performed with Gaussian Process (GP)-based Bayesian Optimization (BO) in a continuous architecture embedding space. This embedding is built upon a Wasserstein Autoencoder, regularized by both a Maximum Mean Discrepancy (MMD) penalization and a Fully Input Convex Neural Network (FICNN) latent predictor, trained to infer the parameter count of architectures. This paper first assesses the embedding’s suitability for optimization by solving 2 computationally inexpensive problems: minimizing the number of parameters, and maximizing a zero-shot accuracy proxy. Then, two variants of complexity-aware NAS are performed on CIFAR-10 and STL-10, based on two different search spaces, providing competitive NN architectures with limited model sizes.</p></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142243603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sample selection of adversarial attacks against traffic signs 针对交通标志的对抗性攻击样本选择
IF 6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-03 DOI: 10.1016/j.neunet.2024.106698

In the real world, the correct recognition of traffic signs plays a crucial role in vehicle autonomous driving and traffic monitoring. The research on its adversarial attack can test the security of vehicle autonomous driving system and provide enlightenment for improving the recognition algorithm. However, with the development of transportation infrastructure, new traffic signs may be introduced. The adversarial attack model for traffic signs needs to adapt to the addition of new types. Based on this, class incremental learning for traffic sign adversarial attacks has become an interesting research field. We propose a class incremental learning method for adversarial attacks on traffic signs. First, this method uses Pinpoint Region Probability Estimation Network (PRPEN) to predict the probability of each pixel being attacked in old samples. It helps to identify the high attack probability regions of the samples. Subsequently, based on the size of high probability pixel concentration area, the replay sample set is constructed. Old samples with smaller concentration areas receive higher priority and are prioritized for incremental learning. The experimental results show that compared with other sample selection methods, our method selects more representative samples and can train PRPEN more effectively to generate probability maps, thereby better generating adversarial attacks on traffic signs.

在现实世界中,正确识别交通标志对车辆自动驾驶和交通监控起着至关重要的作用。对其进行对抗性攻击的研究可以检验车辆自动驾驶系统的安全性,并为改进识别算法提供启示。然而,随着交通基础设施的发展,可能会引入新的交通标志。针对交通标志的对抗攻击模型需要适应新类型的增加。基于此,针对交通标志对抗攻击的类增量学习已成为一个有趣的研究领域。我们提出了一种针对交通标志对抗攻击的类增量学习方法。首先,该方法使用针点区域概率估计网络(PRPEN)来预测旧样本中每个像素被攻击的概率。这有助于识别样本中的高攻击概率区域。随后,根据高概率像素集中区域的大小,构建重放样本集。集中区域较小的旧样本会获得更高的优先级,并优先用于增量学习。实验结果表明,与其他样本选择方法相比,我们的方法选择的样本更具代表性,能更有效地训练 PRPEN 生成概率图,从而更好地生成针对交通标志的对抗性攻击。
{"title":"Sample selection of adversarial attacks against traffic signs","authors":"","doi":"10.1016/j.neunet.2024.106698","DOIUrl":"10.1016/j.neunet.2024.106698","url":null,"abstract":"<div><p>In the real world, the correct recognition of traffic signs plays a crucial role in vehicle autonomous driving and traffic monitoring. The research on its adversarial attack can test the security of vehicle autonomous driving system and provide enlightenment for improving the recognition algorithm. However, with the development of transportation infrastructure, new traffic signs may be introduced. The adversarial attack model for traffic signs needs to adapt to the addition of new types. Based on this, class incremental learning for traffic sign adversarial attacks has become an interesting research field. We propose a class incremental learning method for adversarial attacks on traffic signs. First, this method uses Pinpoint Region Probability Estimation Network (PRPEN) to predict the probability of each pixel being attacked in old samples. It helps to identify the high attack probability regions of the samples. Subsequently, based on the size of high probability pixel concentration area, the replay sample set is constructed. Old samples with smaller concentration areas receive higher priority and are prioritized for incremental learning. The experimental results show that compared with other sample selection methods, our method selects more representative samples and can train PRPEN more effectively to generate probability maps, thereby better generating adversarial attacks on traffic signs.</p></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142232537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Local artifacts amplification for deepfakes augmentation 局部伪影放大,用于深层伪影增强。
IF 6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-03 DOI: 10.1016/j.neunet.2024.106692

With the rapid and continuous development of AIGC, It is becoming increasingly difficult to distinguish between real and forged facial images, which calls for efficient forgery detection systems. Although many detection methods have noticed the importance of local artifacts, there has been a lack of in-depth discussion regarding the selection of locations and their effective utilization. Besides, the traditional image augmentation methods that are widely used have limited improvements for forgery detection tasks and require more specialized augmentation methods specifically designed for forgery detection tasks. In this paper, this study proposes Local Artifacts Amplification for Deepfakes Augmentation, which amplifies the local artifacts on the forged faces. Furthermore, this study incorporates prior knowledge about similar facial features into the model. This means that within the facial regions defined in this work, forged features exhibit similar patterns. By aggregating the results from all facial regions, the study can enhance the overall performance of the model. The evaluation experiments conducted in this research, achieving an AUC of 93.40% and an Acc of 87.03% in the challenging WildDeepfake dataset, demonstrate a promising improvement in accuracy compared to traditional image augmentation methods and achieve superior performance on intra-dataset evaluation. The cross-dataset evaluation also showed that the method presented in this study has strong generalization abilities.

随着 AIGC 的快速和持续发展,区分真实和伪造的面部图像变得越来越困难,这就需要高效的伪造检测系统。尽管许多检测方法都注意到了局部伪影的重要性,但对于如何选择局部伪影的位置并有效利用这些伪影,一直缺乏深入的探讨。此外,目前广泛使用的传统图像增强方法对伪造检测任务的改进有限,需要专门针对伪造检测任务设计更专业的增强方法。在本文中,本研究提出了用于深度伪造增强的局部人工痕迹放大法,它可以放大伪造人脸上的局部人工痕迹。此外,本研究还将类似面部特征的先验知识纳入模型。这意味着,在本研究定义的面部区域内,伪造特征表现出相似的模式。通过汇总所有面部区域的结果,本研究可以提高模型的整体性能。与传统的图像增强方法相比,本研究进行的评估实验在具有挑战性的 WildDeepfake 数据集上取得了 93.40% 的 AUC 和 87.03% 的 Acc,表明准确率有了可喜的提高,并在数据集内评估中取得了优异的表现。跨数据集评估也表明,本研究提出的方法具有很强的泛化能力。
{"title":"Local artifacts amplification for deepfakes augmentation","authors":"","doi":"10.1016/j.neunet.2024.106692","DOIUrl":"10.1016/j.neunet.2024.106692","url":null,"abstract":"<div><p>With the rapid and continuous development of AIGC, It is becoming increasingly difficult to distinguish between real and forged facial images, which calls for efficient forgery detection systems. Although many detection methods have noticed the importance of local artifacts, there has been a lack of in-depth discussion regarding the selection of locations and their effective utilization. Besides, the traditional image augmentation methods that are widely used have limited improvements for forgery detection tasks and require more specialized augmentation methods specifically designed for forgery detection tasks. In this paper, this study proposes Local Artifacts Amplification for Deepfakes Augmentation, which amplifies the local artifacts on the forged faces. Furthermore, this study incorporates prior knowledge about similar facial features into the model. This means that within the facial regions defined in this work, forged features exhibit similar patterns. By aggregating the results from all facial regions, the study can enhance the overall performance of the model. The evaluation experiments conducted in this research, achieving an AUC of 93.40% and an Acc of 87.03% in the challenging WildDeepfake dataset, demonstrate a promising improvement in accuracy compared to traditional image augmentation methods and achieve superior performance on intra-dataset evaluation. The cross-dataset evaluation also showed that the method presented in this study has strong generalization abilities.</p></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142146653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved region proposal network for enhanced few-shot object detection 改进的区域建议网络,用于增强少镜头物体检测。
IF 6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-03 DOI: 10.1016/j.neunet.2024.106699

Despite significant success of deep learning in object detection tasks, the standard training of deep neural networks requires access to a substantial quantity of annotated images across all classes. Data annotation is an arduous and time-consuming endeavor, particularly when dealing with infrequent objects. Few-shot object detection (FSOD) methods have emerged as a solution to the limitations of classic object detection approaches based on deep learning. FSOD methods demonstrate remarkable performance by achieving robust object detection using a significantly smaller amount of training data. A challenge for FSOD is that instances from novel classes that do not belong to the fixed set of training classes appear in the background and the base model may pick them up as potential objects. These objects behave similarly to label noise because they are classified as one of the training dataset classes, leading to FSOD performance degradation. We develop a semi-supervised algorithm to detect and then utilize these unlabeled novel objects as positive samples during the FSOD training stage to improve FSOD performance. Specifically, we develop a hierarchical ternary classification region proposal network (HTRPN) to localize the potential unlabeled novel objects and assign them new objectness labels to distinguish these objects from the base training dataset classes. Our improved hierarchical sampling strategy for the region proposal network (RPN) also boosts the perception ability of the object detection model for large objects. We test our approach and COCO and PASCAL VOC baselines that are commonly used in FSOD literature. Our experimental results indicate that our method is effective and outperforms the existing state-of-the-art (SOTA) FSOD methods. Our implementation is provided as a supplement to support reproducibility of the results https://github.com/zshanggu/HTRPN.1

尽管深度学习在物体检测任务中取得了巨大成功,但深度神经网络的标准训练需要获取大量标注了所有类别的图像。数据注释是一项艰巨而耗时的工作,尤其是在处理不常见的物体时。针对基于深度学习的经典物体检测方法的局限性,一种名为 "少镜头物体检测(FSOD)"的方法应运而生。FSOD 方法使用更少的训练数据量就能实现稳健的物体检测,表现出卓越的性能。FSOD 面临的一个挑战是,不属于固定训练类别集的新类别实例会出现在背景中,基础模型可能会将其作为潜在的对象。这些对象的行为类似于标签噪声,因为它们被归类为训练数据集类别之一,从而导致 FSOD 性能下降。我们开发了一种半监督算法,在 FSOD 训练阶段检测并利用这些未标记的新物体作为正样本,从而提高 FSOD 性能。具体来说,我们开发了一种分层三元分类区域建议网络(HTRPN)来定位潜在的未标记新物体,并为其分配新的对象性标签,以将这些物体与基础训练数据集类别区分开来。我们改进的区域建议网络(RPN)分层采样策略也提高了物体检测模型对大型物体的感知能力。我们测试了我们的方法以及 FSOD 文献中常用的 COCO 和 PASCAL VOC 基线。实验结果表明,我们的方法非常有效,优于现有的最先进(SOTA)FSOD 方法。我们的实现方法作为补充提供,以支持结果的可重复性 https://github.com/zshanggu/HTRPN.1。
{"title":"Improved region proposal network for enhanced few-shot object detection","authors":"","doi":"10.1016/j.neunet.2024.106699","DOIUrl":"10.1016/j.neunet.2024.106699","url":null,"abstract":"<div><p>Despite significant success of deep learning in object detection tasks, the standard training of deep neural networks requires access to a substantial quantity of annotated images across all classes. Data annotation is an arduous and time-consuming endeavor, particularly when dealing with infrequent objects. Few-shot object detection (FSOD) methods have emerged as a solution to the limitations of classic object detection approaches based on deep learning. FSOD methods demonstrate remarkable performance by achieving robust object detection using a significantly smaller amount of training data. A challenge for FSOD is that instances from novel classes that do not belong to the fixed set of training classes appear in the background and the base model may pick them up as potential objects. These objects behave similarly to label noise because they are classified as one of the training dataset classes, leading to FSOD performance degradation. We develop a semi-supervised algorithm to detect and then utilize these unlabeled novel objects as positive samples during the FSOD training stage to improve FSOD performance. Specifically, we develop a hierarchical ternary classification region proposal network (HTRPN) to localize the potential unlabeled novel objects and assign them new objectness labels to distinguish these objects from the base training dataset classes. Our improved hierarchical sampling strategy for the region proposal network (RPN) also boosts the perception ability of the object detection model for large objects. We test our approach and COCO and PASCAL VOC baselines that are commonly used in FSOD literature. Our experimental results indicate that our method is effective and outperforms the existing state-of-the-art (SOTA) FSOD methods. Our implementation is provided as a supplement to support reproducibility of the results <span><span>https://github.com/zshanggu/HTRPN</span><svg><path></path></svg></span>.<span><span><sup>1</sup></span></span></p></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142146649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing SNN-based spatio-temporal learning: A benchmark dataset and Cross-Modality Attention model 加强基于 SNN 的时空学习:基准数据集和跨模态注意力模型
IF 6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-03 DOI: 10.1016/j.neunet.2024.106677

Spiking Neural Networks (SNNs), renowned for their low power consumption, brain-inspired architecture, and spatio-temporal representation capabilities, have garnered considerable attention in recent years. Similar to Artificial Neural Networks (ANNs), high-quality benchmark datasets are of great importance to the advances of SNNs. However, our analysis indicates that many prevalent neuromorphic datasets lack strong temporal correlation, preventing SNNs from fully exploiting their spatio-temporal representation capabilities. Meanwhile, the integration of event and frame modalities offers more comprehensive visual spatio-temporal information. Yet, the SNN-based cross-modality fusion remains underexplored.

In this work, we present a neuromorphic dataset called DVS-SLR that can better exploit the inherent spatio-temporal properties of SNNs. Compared to existing datasets, it offers advantages in terms of higher temporal correlation, larger scale, and more varied scenarios. In addition, our neuromorphic dataset contains corresponding frame data, which can be used for developing SNN-based fusion methods. By virtue of the dual-modal feature of the dataset, we propose a Cross-Modality Attention (CMA) based fusion method. The CMA model efficiently utilizes the unique advantages of each modality, allowing for SNNs to learn both temporal and spatial attention scores from the spatio-temporal features of event and frame modalities, subsequently allocating these scores across modalities to enhance their synergy. Experimental results demonstrate that our method not only improves recognition accuracy but also ensures robustness across diverse scenarios.

尖峰神经网络(SNN)以其低功耗、大脑启发式架构和时空表示能力而著称,近年来备受关注。与人工神经网络(ANN)类似,高质量的基准数据集对 SNNs 的发展也非常重要。然而,我们的分析表明,许多流行的神经形态数据集缺乏较强的时间相关性,从而阻碍了 SNN 充分发挥其时空表示能力。同时,事件模态和帧模态的整合提供了更全面的视觉时空信息。在这项工作中,我们提出了一种名为 DVS-SLR 的神经形态数据集,它能更好地利用 SNN 固有的时空特性。与现有数据集相比,它具有更高的时间相关性、更大的规模和更多的场景等优势。此外,我们的神经形态数据集包含相应的帧数据,可用于开发基于 SNN 的融合方法。凭借数据集的双模态特征,我们提出了一种基于跨模态注意(CMA)的融合方法。CMA 模型有效地利用了每种模态的独特优势,允许 SNN 从事件模态和帧模态的时空特征中学习时间和空间注意力分数,然后将这些分数分配给不同的模态,以增强它们之间的协同作用。实验结果表明,我们的方法不仅提高了识别准确率,还确保了在不同场景下的鲁棒性。
{"title":"Enhancing SNN-based spatio-temporal learning: A benchmark dataset and Cross-Modality Attention model","authors":"","doi":"10.1016/j.neunet.2024.106677","DOIUrl":"10.1016/j.neunet.2024.106677","url":null,"abstract":"<div><p>Spiking Neural Networks (SNNs), renowned for their low power consumption, brain-inspired architecture, and spatio-temporal representation capabilities, have garnered considerable attention in recent years. Similar to Artificial Neural Networks (ANNs), high-quality benchmark datasets are of great importance to the advances of SNNs. However, our analysis indicates that many prevalent neuromorphic datasets lack strong temporal correlation, preventing SNNs from fully exploiting their spatio-temporal representation capabilities. Meanwhile, the integration of event and frame modalities offers more comprehensive visual spatio-temporal information. Yet, the SNN-based cross-modality fusion remains underexplored.</p><p>In this work, we present a neuromorphic dataset called DVS-SLR that can better exploit the inherent spatio-temporal properties of SNNs. Compared to existing datasets, it offers advantages in terms of higher temporal correlation, larger scale, and more varied scenarios. In addition, our neuromorphic dataset contains corresponding frame data, which can be used for developing SNN-based fusion methods. By virtue of the dual-modal feature of the dataset, we propose a Cross-Modality Attention (CMA) based fusion method. The CMA model efficiently utilizes the unique advantages of each modality, allowing for SNNs to learn both temporal and spatial attention scores from the spatio-temporal features of event and frame modalities, subsequently allocating these scores across modalities to enhance their synergy. Experimental results demonstrate that our method not only improves recognition accuracy but also ensures robustness across diverse scenarios.</p></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0893608024006014/pdfft?md5=98c81b95c17e5a1fd818cad27f177fa6&pid=1-s2.0-S0893608024006014-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-optimal open-loop set stabilization of Boolean control networks 布尔控制网络的时间最优开环集稳定
IF 6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-09-03 DOI: 10.1016/j.neunet.2024.106694

We show that for stabilization of Boolean control networks (BCNs) with unobservable initial states, open-loop control and close-loop control are not equivalent. An example is given to illustrate the nonequivalence. Enlightened by the nonequivalence, we explore open-loop set stabilization of BCNs with unobservable initial states. More specifically, this issue is to investigate that for a given BCN, whether there exists a unified free control sequence that is effective for all initial states of the system to stabilize the system states to a given set. The criteria for open-loop set stabilization is derived and for any open-loop set stabilizable BCN, every time-optimal open-loop set stabilizer is proposed. Besides, we obtain the least upper bounds of two integers, which are respectively related to the global stabilization and partial stabilization of BCNs in the results of two literature articles. Using the methods in the two literature articles, the least upper bounds of the two integers cannot be obtained.

我们证明,对于初始状态不可观测的布尔控制网络(BCN)的稳定问题,开环控制和闭环控制并不等同。我们举例说明了不等价性。在非等价性的启发下,我们探讨了具有不可观测初始状态的布里控制网络的开环集稳定问题。更具体地说,这个问题是要研究对于给定的 BCN,是否存在一个统一的自由控制序列,该序列对系统的所有初始状态都有效,能将系统状态稳定在给定的集合上。我们推导了开环集稳定的标准,并针对任何可开环集稳定的 BCN,提出了每一个时间最优开环集稳定器。此外,我们还得到了两个整数的最小上界,这两个整数分别与两篇文献结果中 BCN 的全局稳定和局部稳定有关。使用这两篇文献中的方法,无法得到这两个整数的最小上界。
{"title":"Time-optimal open-loop set stabilization of Boolean control networks","authors":"","doi":"10.1016/j.neunet.2024.106694","DOIUrl":"10.1016/j.neunet.2024.106694","url":null,"abstract":"<div><p>We show that for stabilization of Boolean control networks (BCNs) with unobservable initial states, open-loop control and close-loop control are not equivalent. An example is given to illustrate the nonequivalence. Enlightened by the nonequivalence, we explore open-loop set stabilization of BCNs with unobservable initial states. More specifically, this issue is to investigate that for a given BCN, whether there exists a unified free control sequence that is effective for all initial states of the system to stabilize the system states to a given set. The criteria for open-loop set stabilization is derived and for any open-loop set stabilizable BCN, every time-optimal open-loop set stabilizer is proposed. Besides, we obtain the least upper bounds of two integers, which are respectively related to the global stabilization and partial stabilization of BCNs in the results of two literature articles. Using the methods in the two literature articles, the least upper bounds of the two integers cannot be obtained.</p></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":null,"pages":null},"PeriodicalIF":6.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142243880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Neural Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1