首页 > 最新文献

Neurocomputing最新文献

英文 中文
Polycentric intuitionistic fuzzy weighted least squares twin SVMs 多中心直觉模糊加权最小二乘法孪生 SVM
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-29 DOI: 10.1016/j.neucom.2024.128475

The classification of data with outliers and noise has always been one of the principal challenges within machine learning. The previous unicentric-based fuzzy twin support vector machines (SVMs) typically allot the membership through proximity to the center of the samples, which neglects the global structural information and the local neighborhood information and potentially causes confusion between fringe support vectors and outliers. In this paper, a polycentric intuitionistic fuzzy weighted least squares twin SVMs (PIFW-LSTSVM) is presented to alleviate the above issue. Concretely, the PIFW-LSTSVM model simultaneously assigns membership and nonmembership to each sample, where the membership is determined by the sample proximity to the corresponding nearest center, and nonmembership is identified by neighborhood entropy. Benefiting from the novel polycentric weighting strategy, the PIFW-LSTSVM model mitigates the impact of outliers and noise and reduces the confusion between fringe support vectors and outliers or noise, thereby boosting the generalization ability. The experiments, conducted on both artificial and real-world benchmark datasets, comprehensively demonstrate the effectiveness and superiority of the PIFW-LSTSVM model compared to other state-of-the-art models.

对存在异常值和噪声的数据进行分类一直是机器学习领域的主要挑战之一。以往基于单中心的模糊孪生支持向量机(SVM)通常通过接近样本中心来分配成员资格,这忽略了全局结构信息和局部邻域信息,有可能造成边缘支持向量和离群值之间的混淆。本文提出了一种多中心直觉模糊加权最小二乘孪生 SVM(PIFW-LSTSVM)来缓解上述问题。具体来说,PIFW-LSTSVM 模型同时为每个样本分配成员和非成员身份,其中成员身份由样本与相应最近中心的接近程度决定,而非成员身份则由邻域熵确定。得益于新颖的多中心加权策略,PIFW-LSTSVM 模型减轻了异常值和噪声的影响,减少了边缘支持向量与异常值或噪声之间的混淆,从而提高了泛化能力。在人工数据集和真实基准数据集上进行的实验全面证明了 PIFW-LSTSVM 模型的有效性和优越性。
{"title":"Polycentric intuitionistic fuzzy weighted least squares twin SVMs","authors":"","doi":"10.1016/j.neucom.2024.128475","DOIUrl":"10.1016/j.neucom.2024.128475","url":null,"abstract":"<div><p>The classification of data with outliers and noise has always been one of the principal challenges within machine learning. The previous unicentric-based fuzzy twin support vector machines (SVMs) typically allot the membership through proximity to the center of the samples, which neglects the global structural information and the local neighborhood information and potentially causes confusion between fringe support vectors and outliers. In this paper, a polycentric intuitionistic fuzzy weighted least squares twin SVMs (PIFW-LSTSVM) is presented to alleviate the above issue. Concretely, the PIFW-LSTSVM model simultaneously assigns membership and nonmembership to each sample, where the membership is determined by the sample proximity to the corresponding nearest center, and nonmembership is identified by neighborhood entropy. Benefiting from the novel polycentric weighting strategy, the PIFW-LSTSVM model mitigates the impact of outliers and noise and reduces the confusion between fringe support vectors and outliers or noise, thereby boosting the generalization ability. The experiments, conducted on both artificial and real-world benchmark datasets, comprehensively demonstrate the effectiveness and superiority of the PIFW-LSTSVM model compared to other state-of-the-art models.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142097467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Meta-Adaptable-Adapter: Efficient adaptation of self-supervised models for low-resource speech recognition 元适应适配器:为低资源语音识别高效调整自监督模型
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-29 DOI: 10.1016/j.neucom.2024.128493

Self-supervised models have demonstrated remarkable performance in speech processing by learning latent representations from large amounts of unlabeled data. Adapting these models to low-resource languages yields promising results, but the computational cost of fine-tuning all model parameters is prohibitively high. Adapters offer a solution by introducing lightweight bottleneck structures into pre-trained models for downstream tasks, enabling efficient parameter adaptation. However, randomly initialized adapters often underperform in extremely low-resource scenarios. To address this issue, we explore the Meta-Adapter for self-supervised models and analyzed some limitations of Meta-Adapter including poor learning in language-specific knowledge and meta-overfitting problems. To relieve these problems, we propose the Meta-Adaptable-Adapter (MAA), a new meta leaning algorithm that adapts to low-resource languages quickly and effectively. MAA learns task-specific adapters for feature extraction, and task-independent adapters for feature combination. The experiments on three datasets show superior performance on 31 low-resource languages across seven different language families compared to other adapters, showing better generalization and extensibility.

自监督模型通过从大量无标注数据中学习潜在表征,在语音处理中表现出了卓越的性能。将这些模型适用于低资源语言会产生很好的效果,但微调所有模型参数的计算成本过高。适配器提供了一种解决方案,即在预训练模型中引入轻量级瓶颈结构,用于下游任务,从而实现高效的参数调整。然而,随机初始化的适配器在资源极度匮乏的情况下往往表现不佳。为了解决这个问题,我们探索了用于自监督模型的元适配器,并分析了元适配器的一些局限性,包括对特定语言知识的学习能力差和元过拟合问题。为了解决这些问题,我们提出了元适应适配器(MAA),一种能快速有效地适应低资源语言的新型元倾斜算法。MAA 在特征提取时学习特定任务的适配器,在特征组合时学习与任务无关的适配器。在三个数据集上进行的实验表明,与其他适配器相比,MAA 在 7 个不同语系的 31 种低资源语言上表现出更优越的性能,显示出更好的泛化和可扩展性。
{"title":"Meta-Adaptable-Adapter: Efficient adaptation of self-supervised models for low-resource speech recognition","authors":"","doi":"10.1016/j.neucom.2024.128493","DOIUrl":"10.1016/j.neucom.2024.128493","url":null,"abstract":"<div><p>Self-supervised models have demonstrated remarkable performance in speech processing by learning latent representations from large amounts of unlabeled data. Adapting these models to low-resource languages yields promising results, but the computational cost of fine-tuning all model parameters is prohibitively high. Adapters offer a solution by introducing lightweight bottleneck structures into pre-trained models for downstream tasks, enabling efficient parameter adaptation. However, randomly initialized adapters often underperform in extremely low-resource scenarios. To address this issue, we explore the Meta-Adapter for self-supervised models and analyzed some limitations of Meta-Adapter including poor learning in language-specific knowledge and meta-overfitting problems. To relieve these problems, we propose the Meta-Adaptable-Adapter (MAA), a new meta leaning algorithm that adapts to low-resource languages quickly and effectively. MAA learns task-specific adapters for feature extraction, and task-independent adapters for feature combination. The experiments on three datasets show superior performance on 31 low-resource languages across seven different language families compared to other adapters, showing better generalization and extensibility.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142097386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parallel hyperparameter optimization of spiking neural networks 尖峰神经网络的并行超参数优化
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 DOI: 10.1016/j.neucom.2024.128483

Hyperparameter optimization of spiking neural networks (SNNs) is a difficult task which has not yet been deeply investigated in the literature. In this work, we designed a scalable constrained Bayesian based optimization algorithm that prevents sampling in non-spiking areas of an efficient high dimensional search space. These search spaces contain infeasible solutions that output no or only a few spikes during the training or testing phases, we call such a mode a “silent network”. Finding them is difficult, as many hyperparameters are highly correlated to the architecture and to the dataset. We leverage silent networks by designing a spike-based early stopping criterion to accelerate the optimization process of SNNs trained by spike timing dependent plasticity and surrogate gradient. We parallelized the optimization algorithm asynchronously, and ran large-scale experiments on heterogeneous multi-GPU Petascale architecture. Results show that by considering silent networks, we can design more flexible high-dimensional search spaces while maintaining a good efficacy. The optimization algorithm was able to focus on networks with high performances by preventing costly and worthless computation of silent networks.

尖峰神经网络(SNN)的超参数优化是一项艰巨的任务,文献中尚未对此进行深入研究。在这项工作中,我们设计了一种基于贝叶斯的可扩展约束优化算法,该算法可防止在高效高维搜索空间的非尖峰区域进行采样。这些搜索空间包含不可行的解决方案,它们在训练或测试阶段不输出或仅输出少量尖峰,我们称这种模式为 "沉默网络"。由于许多超参数与架构和数据集高度相关,因此找到它们非常困难。我们利用沉默网络,设计了基于尖峰的早期停止准则,以加速通过尖峰时序可塑性和替代梯度训练的 SNN 的优化过程。我们对优化算法进行了异步并行化,并在异构多 GPU Petascale 架构上进行了大规模实验。结果表明,通过考虑沉默网络,我们可以设计出更灵活的高维搜索空间,同时保持良好的效率。通过避免对沉默网络进行高成本和无价值的计算,优化算法能够专注于具有高性能的网络。
{"title":"Parallel hyperparameter optimization of spiking neural networks","authors":"","doi":"10.1016/j.neucom.2024.128483","DOIUrl":"10.1016/j.neucom.2024.128483","url":null,"abstract":"<div><p>Hyperparameter optimization of spiking neural networks (SNNs) is a difficult task which has not yet been deeply investigated in the literature. In this work, we designed a scalable constrained Bayesian based optimization algorithm that prevents sampling in non-spiking areas of an efficient high dimensional search space. These search spaces contain infeasible solutions that output no or only a few spikes during the training or testing phases, we call such a mode a “silent network”. Finding them is difficult, as many hyperparameters are highly correlated to the architecture and to the dataset. We leverage silent networks by designing a spike-based early stopping criterion to accelerate the optimization process of SNNs trained by spike timing dependent plasticity and surrogate gradient. We parallelized the optimization algorithm asynchronously, and ran large-scale experiments on heterogeneous multi-GPU Petascale architecture. Results show that by considering silent networks, we can design more flexible high-dimensional search spaces while maintaining a good efficacy. The optimization algorithm was able to focus on networks with high performances by preventing costly and worthless computation of silent networks.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0925231224012542/pdfft?md5=5f10e1f57ae6d5786e814d6c1fdcb989&pid=1-s2.0-S0925231224012542-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142128529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fully-inductive link prediction with path-based graph neural network: A comparative analysis 基于路径的图神经网络的全归纳链接预测:比较分析
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 DOI: 10.1016/j.neucom.2024.128484

Recently, fully-inductive link prediction in knowledge graphs (KGs) has aimed to predict missing links between unseen–unseen entities, independently completing evolving KGs. The latest literature emphasizes path-based graph neural network (GNN) methods, which combine traditional path-based methods with popular GNN methods, possessing generalization capability, interpretability, scalability, and high model capacity under the inductive setting. This paper presents the first comparative analysis of fully-inductive link prediction using path-based GNNs. First, we comprehensively review and summarize the research of six relevant models, divided into relational digraph-based models and Bellman–Ford algorithm-based models. Based on this, we conduct a comprehensive analysis of these models in terms of effectiveness and efficiency (including runtime, memory, and learning curves), and compared them with two subgraph-based models. Furthermore, we delve into the impact of factors such as message functions, aggregation functions, and negative sampling in the loss function on path-based GNNs. Finally, we provide an outlook on future research directions.

最近,知识图谱(KG)中的全归纳链接预测旨在预测未见-未见实体之间的缺失链接,独立完成不断演化的知识图谱。最新的文献强调基于路径的图神经网络(GNN)方法,它将传统的基于路径的方法与流行的 GNN 方法相结合,在归纳设置下具有泛化能力、可解释性、可扩展性和高模型容量。本文首次使用基于路径的 GNN 对全归纳链接预测进行了比较分析。首先,我们全面回顾和总结了六种相关模型的研究,分为基于关系数图的模型和基于贝尔曼-福特算法的模型。在此基础上,我们对这些模型的有效性和效率(包括运行时间、内存和学习曲线)进行了全面分析,并将它们与两个基于子图的模型进行了比较。此外,我们还深入研究了信息函数、聚合函数和损失函数中的负采样等因素对基于路径的 GNN 的影响。最后,我们对未来的研究方向进行了展望。
{"title":"Fully-inductive link prediction with path-based graph neural network: A comparative analysis","authors":"","doi":"10.1016/j.neucom.2024.128484","DOIUrl":"10.1016/j.neucom.2024.128484","url":null,"abstract":"<div><p>Recently, fully-inductive link prediction in knowledge graphs (KGs) has aimed to predict missing links between unseen–unseen entities, independently completing evolving KGs. The latest literature emphasizes path-based graph neural network (GNN) methods, which combine traditional path-based methods with popular GNN methods, possessing generalization capability, interpretability, scalability, and high model capacity under the inductive setting. This paper presents the first comparative analysis of fully-inductive link prediction using path-based GNNs. First, we comprehensively review and summarize the research of six relevant models, divided into relational digraph-based models and Bellman–Ford algorithm-based models. Based on this, we conduct a comprehensive analysis of these models in terms of effectiveness and efficiency (including runtime, memory, and learning curves), and compared them with two subgraph-based models. Furthermore, we delve into the impact of factors such as message functions, aggregation functions, and negative sampling in the loss function on path-based GNNs. Finally, we provide an outlook on future research directions.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142097391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MaskRecon: High-quality human reconstruction via masked autoencoders using a single RGB-D image MaskRecon:通过使用单一 RGB-D 图像的屏蔽自动编码器实现高质量人体重建
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 DOI: 10.1016/j.neucom.2024.128487

In this paper, we explore reconstructing high-quality clothed 3D humans from a single RGB-D image, assuming that virtual humans can be represented by front-view and back-view depths. Due to the scarcity of captured real RGB-D human images, we employ rendered images to train our method. However, rendered images lack background with significant depth variation in silhouettes, leading to shape prediction inaccuracies and noise. To mitigate this issue, we introduce a pseudo-multi-task framework, which incorporates a Conditional Generative Adversarial Network (CGAN) to infer back-view RGB-D images and a self-supervised Masked Autoencoder (MAE) to capture latent structural information of the human body. Additionally, we propose a Multi-scale Feature Fusion (MFF) module to effectively merge structural information and conditional features at various scales. Our method surpasses many existing techniques, as demonstrated through evaluations on the Thuman, RenderPeople, and BUFF datasets. Notably, our approach excels in reconstructing high-quality human models, even under challenging conditions such as complex poses and loose clothing, both on rendered and real-world images. Codes are available at https://github.com/Archaic-Atom/MaskRecon.

在本文中,我们假设虚拟人可以用前视和后视深度来表示,探索从单张 RGB-D 图像中重建高质量的穿衣三维人体。由于拍摄到的真实 RGB-D 人体图像很少,我们采用了渲染图像来训练我们的方法。然而,渲染图像缺乏背景,剪影的深度变化较大,导致形状预测不准确和噪声。为了缓解这一问题,我们引入了一个伪多任务框架,其中包含一个条件生成对抗网络(CGAN)来推断后视 RGB-D 图像,以及一个自监督掩码自动编码器(MAE)来捕捉人体的潜在结构信息。此外,我们还提出了多尺度特征融合(MFF)模块,以有效融合不同尺度的结构信息和条件特征。通过对 Thuman、RenderPeople 和 BUFF 数据集的评估,我们的方法超越了许多现有技术。值得注意的是,我们的方法在重建高质量人体模型方面表现出色,即使在复杂姿势和宽松衣物等具有挑战性的条件下,也能在渲染图像和真实世界图像上重建高质量人体模型。代码见 https://github.com/Archaic-Atom/MaskRecon。
{"title":"MaskRecon: High-quality human reconstruction via masked autoencoders using a single RGB-D image","authors":"","doi":"10.1016/j.neucom.2024.128487","DOIUrl":"10.1016/j.neucom.2024.128487","url":null,"abstract":"<div><p>In this paper, we explore reconstructing high-quality clothed 3D humans from a single RGB-D image, assuming that virtual humans can be represented by front-view and back-view depths. Due to the scarcity of captured real RGB-D human images, we employ rendered images to train our method. However, rendered images lack background with significant depth variation in silhouettes, leading to shape prediction inaccuracies and noise. To mitigate this issue, we introduce a pseudo-multi-task framework, which incorporates a Conditional Generative Adversarial Network (CGAN) to infer back-view RGB-D images and a self-supervised Masked Autoencoder (MAE) to capture latent structural information of the human body. Additionally, we propose a Multi-scale Feature Fusion (MFF) module to effectively merge structural information and conditional features at various scales. Our method surpasses many existing techniques, as demonstrated through evaluations on the Thuman, RenderPeople, and BUFF datasets. Notably, our approach excels in reconstructing high-quality human models, even under challenging conditions such as complex poses and loose clothing, both on rendered and real-world images. Codes are available at <span><span>https://github.com/Archaic-Atom/MaskRecon</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142097389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inductive relation prediction with information bottleneck 存在信息瓶颈的归纳关系预测
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 DOI: 10.1016/j.neucom.2024.128503

Inductive relation prediction is an important learning task for knowledge graph completion that aims to infer new facts from existing ones. Previous works that focus on path-based are naturally limited in expressive. The methods based on graph neural network framework consider all paths thus improving the performance. However, fusing all paths information may extract features that are spuriously correlated with the prediction. By analogy to the human reasoning process, we observe that only a small subset of the critical paths determine the prediction. In this work, we propose a novel framework that extracts such critical paths to make inductive relation prediction on Knowledge Graph with Graph Information Bottleneck (KG-GIB). KG-GIB is the first attempt to advance the Graph Information Bottleneck (GIB) for inductive relation prediction. Derived from the GIB principle, KG-GIB extracts critical paths which preserves task-relevant paths and blocks information from task-irrelevant paths. The extracted critical paths are expected to be more generalizable and interpretable. Extensive experiments on both synthetic and real-world datasets demonstrate the effectiveness of KG-GIB.

归纳关系预测是完成知识图谱的一项重要学习任务,其目的是从现有事实中推断出新事实。以往基于路径的工作在表达能力上自然受到限制。基于图神经网络框架的方法考虑了所有路径,从而提高了性能。然而,融合所有路径信息可能会提取出与预测虚假相关的特征。类比人类的推理过程,我们发现只有一小部分关键路径能决定预测结果。在这项工作中,我们提出了一个新颖的框架,它能提取这样的关键路径,从而在存在图信息瓶颈的知识图谱(KG-GIB)上进行归纳关系预测。KG-GIB 是利用图信息瓶颈(GIB)进行归纳关系预测的首次尝试。根据 GIB 原理,KG-GIB 提取关键路径,保留与任务相关的路径,屏蔽与任务无关的路径信息。提取的临界路径有望更具通用性和可解释性。在合成数据集和真实数据集上进行的大量实验证明了 KG-GIB 的有效性。
{"title":"Inductive relation prediction with information bottleneck","authors":"","doi":"10.1016/j.neucom.2024.128503","DOIUrl":"10.1016/j.neucom.2024.128503","url":null,"abstract":"<div><p>Inductive relation prediction is an important learning task for knowledge graph completion that aims to infer new facts from existing ones. Previous works that focus on path-based are naturally limited in expressive. The methods based on graph neural network framework consider all paths thus improving the performance. However, fusing all paths information may extract features that are spuriously correlated with the prediction. By analogy to the human reasoning process, we observe that only a small subset of the critical paths determine the prediction. In this work, we propose a novel framework that extracts such critical paths to make inductive relation prediction on <strong>K</strong>nowledge <strong>G</strong>raph with <strong>G</strong>raph <strong>I</strong>nformation <strong>B</strong>ottleneck (<strong>KG-GIB</strong>). KG-GIB is the first attempt to advance the Graph Information Bottleneck (GIB) for inductive relation prediction. Derived from the GIB principle, KG-GIB extracts critical paths which preserves task-relevant paths and blocks information from task-irrelevant paths. The extracted critical paths are expected to be more generalizable and interpretable. Extensive experiments on both synthetic and real-world datasets demonstrate the effectiveness of KG-GIB.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142128523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive structure generation and neuronal differentiation for memory encoding in SNNs 自适应结构生成和神经元分化促进 SNN 的记忆编码
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 DOI: 10.1016/j.neucom.2024.128470

Memory is the core of cognition. The exploration of the memory encoding mechanism or the representation mechanism of information in the Spiking Neural Network (SNN) is the basis for the in-depth study of memory. In this paper, we study the memory encoding mechanism of multilayer SNN models from a biomimetic perspective and explore a method using the high biological likelihood of SNN to enable the network to effectively simulate memory effects. We proposed a series of heuristic neuron-growing connection algorithms and supervised network weight learning algorithms, which were applied to the unsupervised and supervised training process of the presentation layer. These methods optimized the structure of the representation layer, achieved functional differentiation of neurons, and enabled the network to generate differentiated representations for different data modes. Under our algorithm, the proposed model achieves stable convergence with identical pattern inputs, demonstrating distinct representations and sensitivities to different visual modalities. To achieve stable information expression within the network, we conducted various comparative experiments to determine diverse parameters of the complex network. This paper contributes to the development of Brain-inspired Intelligence by bridging the gap between computer science and neuroscience by using simulations to validate biological hypotheses and guide machine learning.

记忆是认知的核心。探索尖峰神经网络(SNN)的记忆编码机制或信息表征机制是深入研究记忆的基础。本文从生物仿真的角度研究了多层 SNN 模型的记忆编码机制,并探索了一种利用 SNN 的高生物可能性使网络有效模拟记忆效应的方法。我们提出了一系列启发式神经元生长连接算法和监督式网络权重学习算法,并将其应用于呈现层的无监督和有监督训练过程。这些方法优化了表征层的结构,实现了神经元的功能分化,并使网络能够针对不同的数据模式生成差异化的表征。在我们的算法下,所提出的模型在相同模式输入的情况下实现了稳定的收敛,对不同的视觉模式表现出不同的表征和敏感性。为了在网络中实现稳定的信息表达,我们进行了各种对比实验,以确定复杂网络的各种参数。本文通过利用模拟验证生物学假设并指导机器学习,在计算机科学和神经科学之间架起了一座桥梁,为脑启发智能的发展做出了贡献。
{"title":"Adaptive structure generation and neuronal differentiation for memory encoding in SNNs","authors":"","doi":"10.1016/j.neucom.2024.128470","DOIUrl":"10.1016/j.neucom.2024.128470","url":null,"abstract":"<div><p>Memory is the core of cognition. The exploration of the memory encoding mechanism or the representation mechanism of information in the Spiking Neural Network (SNN) is the basis for the in-depth study of memory. In this paper, we study the memory encoding mechanism of multilayer SNN models from a biomimetic perspective and explore a method using the high biological likelihood of SNN to enable the network to effectively simulate memory effects. We proposed a series of heuristic neuron-growing connection algorithms and supervised network weight learning algorithms, which were applied to the unsupervised and supervised training process of the presentation layer. These methods optimized the structure of the representation layer, achieved functional differentiation of neurons, and enabled the network to generate differentiated representations for different data modes. Under our algorithm, the proposed model achieves stable convergence with identical pattern inputs, demonstrating distinct representations and sensitivities to different visual modalities. To achieve stable information expression within the network, we conducted various comparative experiments to determine diverse parameters of the complex network. This paper contributes to the development of Brain-inspired Intelligence by bridging the gap between computer science and neuroscience by using simulations to validate biological hypotheses and guide machine learning.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0925231224012414/pdfft?md5=3e719464f846393c8980b4a91dc9f801&pid=1-s2.0-S0925231224012414-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142149017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An open chest X-ray dataset with benchmarks for automatic radiology report generation in French 开放式胸部 X 射线数据集及法文放射学报告自动生成基准
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 DOI: 10.1016/j.neucom.2024.128478

Medical report generation (MRG), which aims to automatically generate a textual description of a specific medical image (e.g., a chest X-ray), has recently received increasing research interest. Building on the success of image captioning, MRG has become achievable. However, generating language-specific radiology reports poses a challenge for data-driven models due to their reliance on paired image-report chest X-ray datasets, which are labor-intensive, time-consuming, and costly. In this paper, we introduce a chest X-ray benchmark dataset, namely CASIA-CXR, consisting of high-resolution chest radiographs accompanied by narrative reports originally written in French. To the best of our knowledge, this is the first public chest radiograph dataset with medical reports in this particular language. Importantly, we propose a simple yet effective multimodal encoder–decoder contextually-guided framework for medical report generation in French. We validated our framework through intra-language and cross-language contextual analysis, supplemented by expert evaluation performed by radiologists. The dataset is freely available at: https://www.casia-cxr.net/.

医学报告生成(MRG)旨在自动生成特定医学影像(如胸部 X 光片)的文本描述,近来受到越来越多的研究关注。在图像字幕成功的基础上,MRG 已经可以实现。然而,生成特定语言的放射学报告对数据驱动模型构成了挑战,因为它们依赖于成对的图像-报告胸部 X 光数据集,而这些数据集耗费大量人力、时间和成本。在本文中,我们介绍了一个胸部 X 光基准数据集,即 CASIA-CXR,该数据集由高分辨率胸部 X 光片组成,并附有最初以法语撰写的叙述性报告。据我们所知,这是首个以这种特殊语言编写医疗报告的公开胸部 X 光片数据集。重要的是,我们提出了一个简单而有效的多模态编码器-解码器语境引导框架,用于生成法语医疗报告。我们通过语言内和跨语言语境分析验证了我们的框架,并由放射科专家进行了专家评估。数据集可在以下网址免费获取:https://www.casia-cxr.net/。
{"title":"An open chest X-ray dataset with benchmarks for automatic radiology report generation in French","authors":"","doi":"10.1016/j.neucom.2024.128478","DOIUrl":"10.1016/j.neucom.2024.128478","url":null,"abstract":"<div><p>Medical report generation (MRG), which aims to automatically generate a textual description of a specific medical image (e.g., a chest X-ray), has recently received increasing research interest. Building on the success of image captioning, MRG has become achievable. However, generating language-specific radiology reports poses a challenge for data-driven models due to their reliance on paired image-report chest X-ray datasets, which are labor-intensive, time-consuming, and costly. In this paper, we introduce a chest X-ray benchmark dataset, namely <span>CASIA-CXR</span>, consisting of high-resolution chest radiographs accompanied by narrative reports originally written in French. To the best of our knowledge, this is the first public chest radiograph dataset with medical reports in this particular language. Importantly, we propose a simple yet effective multimodal encoder–decoder contextually-guided framework for medical report generation in French. We validated our framework through intra-language and cross-language contextual analysis, supplemented by expert evaluation performed by radiologists. The dataset is freely available at: <span><span>https://www.casia-cxr.net/</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142097392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semi-supervised accuracy predictor-based multi-objective neural architecture search 基于精度预测器的半监督多目标神经架构搜索
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 DOI: 10.1016/j.neucom.2024.128472

The rise of neural architecture search (NAS) demonstrates the deep exploration between the neural network architecture and its performance (e.g., accuracy). Many NAS methods are inefficient because they train all candidates from scratch to obtain their accuracies. Although predictor-based NAS algorithms have been vigorously developed to efficiently and accurately evaluate the performance of candidate architectures, the training of accuracy predictors still require hundreds of architectures with ground truth. To overcome this shortcoming, this paper investigates an evolutionary-based NAS method, which constructs a semi-supervised accuracy predictor to efficiently and accurately evaluate candidate architectures. A one-time extractor and strong regressors are implemented to further enhance the prediction performance of the semi-supervised accuracy predictor. Furthermore, a multi-objective approach is developed to find architectures with high ground truth in a tradeoff between high prediction accuracy and prediction confidence. Experimental results demonstrate the strong competitiveness of the proposed approach on NAS benchmarks. The code is available at https://github.com/outofstyle/SAPMNAS.

神经架构搜索(NAS)的兴起证明了神经网络架构与其性能(如准确率)之间的深入探索。许多 NAS 方法效率低下,因为它们要从头开始训练所有候选者以获得准确率。虽然基于预测器的 NAS 算法已被大力开发,以高效、准确地评估候选架构的性能,但准确率预测器的训练仍需要数百个具有地面实况的架构。为了克服这一缺陷,本文研究了一种基于进化的 NAS 方法,该方法构建了一个半监督精度预测器,以高效、准确地评估候选架构。为了进一步提高半监督准确度预测器的预测性能,本文采用了一次性提取器和强回归器。此外,还开发了一种多目标方法,以便在高预测精度和预测置信度之间权衡,找到具有高地面真实度的架构。实验结果表明,所提出的方法在 NAS 基准上具有很强的竞争力。代码见 https://github.com/outofstyle/SAPMNAS。
{"title":"Semi-supervised accuracy predictor-based multi-objective neural architecture search","authors":"","doi":"10.1016/j.neucom.2024.128472","DOIUrl":"10.1016/j.neucom.2024.128472","url":null,"abstract":"<div><p>The rise of neural architecture search (NAS) demonstrates the deep exploration between the neural network architecture and its performance (e.g., accuracy). Many NAS methods are inefficient because they train all candidates from scratch to obtain their accuracies. Although predictor-based NAS algorithms have been vigorously developed to efficiently and accurately evaluate the performance of candidate architectures, the training of accuracy predictors still require hundreds of architectures with ground truth. To overcome this shortcoming, this paper investigates an evolutionary-based NAS method, which constructs a semi-supervised accuracy predictor to efficiently and accurately evaluate candidate architectures. A one-time extractor and strong regressors are implemented to further enhance the prediction performance of the semi-supervised accuracy predictor. Furthermore, a multi-objective approach is developed to find architectures with high ground truth in a tradeoff between high prediction accuracy and prediction confidence. Experimental results demonstrate the strong competitiveness of the proposed approach on NAS benchmarks. The code is available at <span><span>https://github.com/outofstyle/SAPMNAS</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142136962","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gated parametric neuron for spike-based audio recognition 用于基于尖峰的音频识别的门控参数神经元
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 DOI: 10.1016/j.neucom.2024.128477

Spiking neural networks (SNNs) aim to simulate real neural networks in the human brain with biologically plausible neurons. The leaky integrate-and-fire (LIF) neuron is one of the most widely studied SNN architectures. However, it has the vanishing gradient problem when trained with backpropagation. Additionally, its neuronal parameters are often manually specified and fixed, in contrast to the heterogeneity of real neurons in the human brain. This paper proposes a gated parametric neuron (GPN) to process spatio-temporal information effectively with the gating mechanism. Compared with the LIF neuron, the GPN has two distinguishing advantages: (1) it copes well with the vanishing gradients by improving the flow of gradient propagation; and, (2) it learns spatio-temporal heterogeneous neuronal parameters automatically. Additionally, we use the same gate structure to eliminate initial neuronal parameter selection and design a hybrid recurrent neural network-SNN structure. Experiments on two spike-based audio datasets demonstrated that the GPN network outperformed several state-of-the-art SNNs, could mitigate vanishing gradients, and had spatio-temporal heterogeneous parameters. Our work shows the ability of SNNs to handle long-term dependencies and achieve high performance simultaneously.

尖峰神经网络(SNN)旨在用生物学上可信的神经元模拟人脑中的真实神经网络。泄漏整合-发射(LIF)神经元是研究最广泛的 SNN 架构之一。然而,在使用反向传播训练时,它存在梯度消失问题。此外,它的神经元参数通常是手动指定和固定的,这与人脑中真实神经元的异质性形成了鲜明对比。本文提出了一种门控参数神经元(GPN),利用门控机制有效处理时空信息。与 LIF 神经元相比,GPN 有两个显著的优点:(1)通过改善梯度传播的流量,它能很好地应对消失的梯度;(2)它能自动学习时空异构神经元参数。此外,我们使用相同的门结构来消除初始神经元参数选择,并设计了一种混合递归神经网络-SNN 结构。在两个基于尖峰的音频数据集上进行的实验表明,GPN 网络的性能优于几种最先进的 SNN,可以缓解梯度消失,并具有时空异构参数。我们的工作表明,SNN 有能力同时处理长期依赖性和实现高性能。
{"title":"Gated parametric neuron for spike-based audio recognition","authors":"","doi":"10.1016/j.neucom.2024.128477","DOIUrl":"10.1016/j.neucom.2024.128477","url":null,"abstract":"<div><p>Spiking neural networks (SNNs) aim to simulate real neural networks in the human brain with biologically plausible neurons. The leaky integrate-and-fire (LIF) neuron is one of the most widely studied SNN architectures. However, it has the vanishing gradient problem when trained with backpropagation. Additionally, its neuronal parameters are often manually specified and fixed, in contrast to the heterogeneity of real neurons in the human brain. This paper proposes a gated parametric neuron (GPN) to process spatio-temporal information effectively with the gating mechanism. Compared with the LIF neuron, the GPN has two distinguishing advantages: (1) it copes well with the vanishing gradients by improving the flow of gradient propagation; and, (2) it learns spatio-temporal heterogeneous neuronal parameters automatically. Additionally, we use the same gate structure to eliminate initial neuronal parameter selection and design a hybrid recurrent neural network-SNN structure. Experiments on two spike-based audio datasets demonstrated that the GPN network outperformed several state-of-the-art SNNs, could mitigate vanishing gradients, and had spatio-temporal heterogeneous parameters. Our work shows the ability of SNNs to handle long-term dependencies and achieve high performance simultaneously.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142097476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Neurocomputing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1