首页 > 最新文献

Neurocomputing最新文献

英文 中文
Nearest neighbors and density-based undersampling for imbalanced data classification with class overlap 基于近邻和密度的欠采样,用于有类别重叠的不平衡数据分类
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 DOI: 10.1016/j.neucom.2024.128492

While addressing the problem of imbalanced data classification, most existing resampling methods primarily focus on balancing class distribution. However, they often overlook class overlap and fail to adequately consider the feature distributions of different classes. Consequently, when resampling is performed under such conditions, samples within areas of overlap remain susceptible to misclassification, failing to substantially improve overall performance. To address these shortcomings, we propose a novel data resampling technique, Nearest Neighbors and Density-based Undersampling (NDU). This method employs within-class k-nearest neighbors and between-class probability densities to design a weight assignment strategy. Leveraging this strategy, we establish an exclusive metric, the F_factor, to evaluate the importance of majority class samples in overlap areas. Subsequently, NDU promotes a gradient-based segmented undersampling strategy, which applies varying degrees of undersampling to majority class samples across segmented regions. Through experiments on binary imbalanced datasets with class overlap, we evaluate the efficiency of diverse resampling methods concerning classification performance. The results demonstrate that our proposed method effectively addresses class overlap challenges.

在解决不平衡数据分类问题时,大多数现有的重采样方法主要侧重于平衡类的分布。然而,这些方法往往忽略了类的重叠,未能充分考虑不同类的特征分布。因此,在这种情况下进行重采样时,重叠区域内的样本仍然容易被错误分类,无法大幅提高整体性能。为了解决这些缺陷,我们提出了一种新颖的数据重采样技术--近邻和基于密度的下采样(NDU)。这种方法利用类内 k 近邻和类间概率密度来设计权重分配策略。利用这一策略,我们建立了一个专属指标--F_因子,用于评估重叠区域中多数类样本的重要性。随后,NDU 推广了一种基于梯度的分段欠采样策略,该策略在分段区域内对多数类样本进行不同程度的欠采样。通过在具有类重叠的二元不平衡数据集上进行实验,我们评估了不同重采样方法在分类性能方面的效率。结果表明,我们提出的方法能有效解决类重叠难题。
{"title":"Nearest neighbors and density-based undersampling for imbalanced data classification with class overlap","authors":"","doi":"10.1016/j.neucom.2024.128492","DOIUrl":"10.1016/j.neucom.2024.128492","url":null,"abstract":"<div><p>While addressing the problem of imbalanced data classification, most existing resampling methods primarily focus on balancing class distribution. However, they often overlook class overlap and fail to adequately consider the feature distributions of different classes. Consequently, when resampling is performed under such conditions, samples within areas of overlap remain susceptible to misclassification, failing to substantially improve overall performance. To address these shortcomings, we propose a novel data resampling technique, Nearest Neighbors and Density-based Undersampling (NDU). This method employs within-class k-nearest neighbors and between-class probability densities to design a weight assignment strategy. Leveraging this strategy, we establish an exclusive metric, the F_factor, to evaluate the importance of majority class samples in overlap areas. Subsequently, NDU promotes a gradient-based segmented undersampling strategy, which applies varying degrees of undersampling to majority class samples across segmented regions. Through experiments on binary imbalanced datasets with class overlap, we evaluate the efficiency of diverse resampling methods concerning classification performance. The results demonstrate that our proposed method effectively addresses class overlap challenges.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142097394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed nonlinear fusion filtering for multi-sensor networked systems with random varying parameter matrix and missing measurements 针对具有随机变化参数矩阵和缺失测量的多传感器网络系统的分布式非线性融合滤波
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-27 DOI: 10.1016/j.neucom.2024.128491

This paper is concerned with the distributed fusion filtering algorithm design problem for multi-sensor nonlinear networked systems (MSNNSs) subject to multiplicative noises, random varying parameter matrix and missing measurements (MMs). In particular, we utilize the Bernoulli random variable with certain statistical features to describe and characterize the MMs phenomenon. By introduce a fictitious noise, the effects from process noise as well as random varying parameter matrix are addressed and a new nonlinear stochastic networked system is obtained. The primary purpose of this paper is to develop a novel fusion filtering scheme of the distributed way and provide the corresponding boundedness evaluation criterion. Firstly, specific upper bounds of filtering error covariance (FEC) are identified and locally minimized at each sampling instant. Subsequently, based on the obtained local filters, a distributed fusion filtering algorithm is designed via adopting the inverse covariance intersection (ICI) fusion idea. Furthermore, the analysis with respect to the upper bound of local FEC is discussed and examined by proposing a sufficient condition under certain constraints regarding the related parameters. Eventually, with the help of the simulation experiments, the usefulness of the proposed fusion filtering algorithm is illustrated.

本文关注的是多传感器非线性网络系统(MSNNS)的分布式融合滤波算法设计问题,该系统受到乘法噪声、随机变化的参数矩阵和缺失测量(MMs)的影响。特别是,我们利用具有一定统计特征的伯努利随机变量来描述和表征 MMs 现象。通过引入虚构噪声,解决了过程噪声和随机变化参数矩阵的影响,并得到了一个新的非线性随机网络系统。本文的主要目的是开发一种新颖的分布式融合滤波方案,并提供相应的有界性评估准则。首先,确定滤波误差协方差(FEC)的具体上限,并在每个采样瞬间局部最小化。随后,根据得到的局部滤波器,采用反协方差交集(ICI)融合思想设计了一种分布式融合滤波算法。此外,通过在相关参数的某些约束条件下提出一个充分条件,讨论和研究了有关局部 FEC 上限的分析。最后,在仿真实验的帮助下,说明了所提出的融合滤波算法的实用性。
{"title":"Distributed nonlinear fusion filtering for multi-sensor networked systems with random varying parameter matrix and missing measurements","authors":"","doi":"10.1016/j.neucom.2024.128491","DOIUrl":"10.1016/j.neucom.2024.128491","url":null,"abstract":"<div><p>This paper is concerned with the distributed fusion filtering algorithm design problem for multi-sensor nonlinear networked systems (MSNNSs) subject to multiplicative noises, random varying parameter matrix and missing measurements (MMs). In particular, we utilize the Bernoulli random variable with certain statistical features to describe and characterize the MMs phenomenon. By introduce a fictitious noise, the effects from process noise as well as random varying parameter matrix are addressed and a new nonlinear stochastic networked system is obtained. The primary purpose of this paper is to develop a novel fusion filtering scheme of the distributed way and provide the corresponding boundedness evaluation criterion. Firstly, specific upper bounds of filtering error covariance (FEC) are identified and locally minimized at each sampling instant. Subsequently, based on the obtained local filters, a distributed fusion filtering algorithm is designed via adopting the inverse covariance intersection (ICI) fusion idea. Furthermore, the analysis with respect to the upper bound of local FEC is discussed and examined by proposing a sufficient condition under certain constraints regarding the related parameters. Eventually, with the help of the simulation experiments, the usefulness of the proposed fusion filtering algorithm is illustrated.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142164300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A general framework for multi-step ahead adaptive conformal heteroscedastic time series forecasting 多步超前自适应保形异速时间序列预测的一般框架
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-27 DOI: 10.1016/j.neucom.2024.128434

This paper introduces a novel model-agnostic algorithm called adaptive ensemble batch multi-input multi-output conformalized quantile regression (AEnbMIMOCQR) that enables forecasters to generate multi-step ahead prediction intervals for a fixed pre-specified miscoverage rate α in a distribution-free manner. Our method is grounded on conformal prediction principles, however, it does not require data splitting and provides close to exact coverage even when the data is not exchangeable. Moreover, the resulting prediction intervals, besides being empirically valid along the forecast horizon, do not neglect heteroscedasticity. AEnbMIMOCQR is designed to be robust to distribution shifts, which means that its prediction intervals remain reliable over an unlimited period of time, without entailing retraining or imposing unrealistic strict assumptions on the data-generating process. Through methodically experimentation, we demonstrate that our approach outperforms other competitive methods on both real-world and synthetic datasets. The code used in the experimental part and a tutorial on how to use AEnbMIMOCQR can be found at the following GitHub repository: https://github.com/Quilograma/AEnbMIMOCQR.

本文介绍了一种称为自适应集合批量多输入多输出保形量化回归(AEnbMIMOCQR)的新型模型无关算法,它能让预测人员以一种无分布的方式为固定的预设误覆盖率α生成多步超前预测区间。我们的方法以保形预测原理为基础,但不需要数据分割,即使在数据不可交换的情况下,也能提供接近精确的覆盖率。此外,由此得出的预测区间除了在预测范围内经验上有效外,还不会忽略异方差性。AEnbMIMOCQR 的设计对分布变化具有鲁棒性,这意味着它的预测区间在无限长的时间内都能保持可靠,而无需重新训练或对数据生成过程施加不切实际的严格假设。通过有条不紊的实验,我们证明了我们的方法在现实世界和合成数据集上都优于其他竞争方法。实验部分使用的代码以及如何使用 AEnbMIMOCQR 的教程可在以下 GitHub 代码库中找到:https://github.com/Quilograma/AEnbMIMOCQR。
{"title":"A general framework for multi-step ahead adaptive conformal heteroscedastic time series forecasting","authors":"","doi":"10.1016/j.neucom.2024.128434","DOIUrl":"10.1016/j.neucom.2024.128434","url":null,"abstract":"<div><p>This paper introduces a novel model-agnostic algorithm called adaptive ensemble batch multi-input multi-output conformalized quantile regression (<span>AEnbMIMOCQR</span>) that enables forecasters to generate multi-step ahead prediction intervals for a fixed pre-specified miscoverage rate <span><math><mi>α</mi></math></span> in a distribution-free manner. Our method is grounded on conformal prediction principles, however, it does not require data splitting and provides close to exact coverage even when the data is not exchangeable. Moreover, the resulting prediction intervals, besides being empirically valid along the forecast horizon, do not neglect heteroscedasticity. <span>AEnbMIMOCQR</span> is designed to be robust to distribution shifts, which means that its prediction intervals remain reliable over an unlimited period of time, without entailing retraining or imposing unrealistic strict assumptions on the data-generating process. Through methodically experimentation, we demonstrate that our approach outperforms other competitive methods on both real-world and synthetic datasets. The code used in the experimental part and a tutorial on how to use <span>AEnbMIMOCQR</span> can be found at the following GitHub repository: <span><span>https://github.com/Quilograma/AEnbMIMOCQR</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0925231224012050/pdfft?md5=2f884ccf3a69148127232428fbf2042a&pid=1-s2.0-S0925231224012050-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142083216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recent progress, challenges and future prospects of applied deep reinforcement learning : A practical perspective in path planning 应用深度强化学习的最新进展、挑战和未来展望:路径规划中的实用视角
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-26 DOI: 10.1016/j.neucom.2024.128423

Path planning is one of the most crucial elements in the field of robotics, such as autonomous driving, minimally invasive surgery and logistics distribution. This review begins by summarizing the limitations of conventional path planning methods and recent work on DRL-based path planning methods. Subsequently, the paper systematically reviews the construction of key elements of DRL methods in recent work, with the aim of assisting readers in comprehending the foundation of DRL research, along with the underlying logic and considerations from a practical perspective. Facing issues of sparse rewards and the exploration–exploitation balance during the practical training process, the paper reviews enhancement methods for training efficiency and optimization results in DRL path planning. In the end, the paper summarizes the current research limitations and challenges in practical path planning applications, followed by future research directions.

路径规划是自动驾驶、微创手术和物流配送等机器人技术领域最关键的要素之一。本综述首先总结了传统路径规划方法的局限性以及基于 DRL 的路径规划方法的最新研究成果。随后,本文系统回顾了近期工作中 DRL 方法关键要素的构建,旨在帮助读者理解 DRL 研究的基础,以及从实用角度出发的基本逻辑和考虑因素。面对实际训练过程中奖励稀疏和探索-开发平衡的问题,论文回顾了 DRL 路径规划中训练效率和优化结果的增强方法。最后,本文总结了当前研究的局限性和实际路径规划应用中的挑战,并提出了未来的研究方向。
{"title":"Recent progress, challenges and future prospects of applied deep reinforcement learning : A practical perspective in path planning","authors":"","doi":"10.1016/j.neucom.2024.128423","DOIUrl":"10.1016/j.neucom.2024.128423","url":null,"abstract":"<div><p>Path planning is one of the most crucial elements in the field of robotics, such as autonomous driving, minimally invasive surgery and logistics distribution. This review begins by summarizing the limitations of conventional path planning methods and recent work on DRL-based path planning methods. Subsequently, the paper systematically reviews the construction of key elements of DRL methods in recent work, with the aim of assisting readers in comprehending the foundation of DRL research, along with the underlying logic and considerations from a practical perspective. Facing issues of sparse rewards and the exploration–exploitation balance during the practical training process, the paper reviews enhancement methods for training efficiency and optimization results in DRL path planning. In the end, the paper summarizes the current research limitations and challenges in practical path planning applications, followed by future research directions.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142076672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Alternating nonnegative least squares-incorporated regularized symmetric latent factor analysis for undirected weighted networks 无向加权网络的交替非负最小二乘法--纳入正则化的对称潜因子分析
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-25 DOI: 10.1016/j.neucom.2024.128440

An Undirected Weighted Network (UWN) can be precisely quantified as an adjacency matrix whose inherent characteristics are fully considered in a Symmetric Nonnegative Latent Factor (SNLF) model for its good representation accuracy. However, an SNLF model uses a sole latent factor matrix to precisely describe the topological characteristic of a UWN, i.e., symmetry, thereby impairing its representation learning ability. Aiming at addressing this issue, this paper proposes an Alternating nonnegative least squares-incorporated Regularized Symmetric Latent factor analysis (ARSL) model. First of all, equation constraints composed of multiple matrices are built in its learning objective for well describing the symmetry of a UWN. Note that it adopts an L2-norm-based regularization scheme to relax such constraints for making such a symmetry-aware learning objective solvable. Then, it designs an alternating nonnegative least squares-incorporated algorithm for optimizing its parameters efficiently. Empirical studies on four UWNs demonstrate that an ARSL model outperforms the state-of-the-art models in terms of representation accuracy, as well as achieves promising computational efficiency.

无向加权网络(UWN)可精确量化为邻接矩阵,其固有特征在对称非负潜因(SNLF)模型中得到充分考虑,从而获得良好的表示精度。然而,SNLF 模型使用唯一的潜因矩阵来精确描述 UWN 的拓扑特征,即对称性,从而削弱了其表示学习能力。针对这一问题,本文提出了一种交替非负最小二乘法(Alternating nonnegative least squares-incorporated Regularized Symmetric Latent factor analysis,ARSL)模型。首先,在其学习目标中建立了由多个矩阵组成的等式约束,以很好地描述 UWN 的对称性。需要注意的是,它采用了一种基于 L2 规范的正则化方案来放松这些约束,从而使这种对称感知学习目标变得可解。然后,它设计了一种交替非负最小二乘法算法来有效优化其参数。对四种 UWN 的实证研究表明,ARSL 模型在表示精度方面优于最先进的模型,而且计算效率也很高。
{"title":"Alternating nonnegative least squares-incorporated regularized symmetric latent factor analysis for undirected weighted networks","authors":"","doi":"10.1016/j.neucom.2024.128440","DOIUrl":"10.1016/j.neucom.2024.128440","url":null,"abstract":"<div><p>An <u>U</u>ndirected <u>W</u>eighted <u>N</u>etwork (UWN) can be precisely quantified as an adjacency matrix whose inherent characteristics are fully considered in a <u>S</u>ymmetric <u>N</u>onnegative <u>L</u>atent <u>F</u>actor (SNLF) model for its good representation accuracy. However, an SNLF model uses a sole latent factor matrix to precisely describe the topological characteristic of a UWN, i.e., symmetry, thereby impairing its representation learning ability. Aiming at addressing this issue, this paper proposes an <u>A</u>lternating nonnegative least squares-incorporated Regularized <u>S</u>ymmetric <u>L</u>atent factor analysis (ARSL) model. First of all, equation constraints composed of multiple matrices are built in its learning objective for well describing the symmetry of a UWN. Note that it adopts an <em>L</em><sub><em>2</em></sub>-norm-based regularization scheme to relax such constraints for making such a symmetry-aware learning objective solvable. Then, it designs an alternating nonnegative least squares-incorporated algorithm for optimizing its parameters efficiently. Empirical studies on four UWNs demonstrate that an ARSL model outperforms the state-of-the-art models in terms of representation accuracy, as well as achieves promising computational efficiency.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142097637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A prediction method of diabetes comorbidity based on non-negative latent features 基于非负潜特征的糖尿病合并症预测方法
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-23 DOI: 10.1016/j.neucom.2024.128447

In this paper, we present a novel network-based approach, namely Inherently Non-negative Latent Feature Analysis for Diabetes Mellitus Comorbidity Detection (INDM), to enhance the detection and analysis of comorbidities associated with diabetes mellitus. Different from existing methods, INDM is the first computational approach that integrates comorbidity networks of the chronic disease spectrum with patient clinical characteristics. To perform the analytical tasks, the proposed INDM adopts the following core components. First, comorbidity networks representing patients diagnosed solely with hypertension and those with hypertension and diabetes are constructed, following the case-control design that establishes a 1:1 matching in age and gender between two cohorts. Subsequently, the disease set is modeled in the comorbidity network according to the relative risk methodology. This enables nodes and edges in the comorbidity network to represent disease interactions that are derived from the patient-disease bipartite graph. Second, a nonlinear loss function with the capability of inherently non-negative latent feature analysis followed by a comorbidity classifier is adopted to uncover the patterns indicating the diabetes comorbidity in the comorbidity network. The proposed INDM has been rigorously tested on actual diabetes comorbidity datasets. The notable results demonstrate that INDM exhibits superior detection accuracy. Furthermore, the topological structure discovered by the proposed INDM can provide a profound insight into hypertension comorbidity in both the case and control groups.

本文提出了一种基于网络的新方法,即用于糖尿病合并症检测的固有非负潜特征分析法(INDM),以加强对糖尿病相关合并症的检测和分析。与现有方法不同,INDM 是第一种将慢性疾病谱的合并症网络与患者临床特征相结合的计算方法。为了完成分析任务,拟议的 INDM 采用了以下核心组件。首先,根据病例对照设计,在两个队列之间建立 1:1 的年龄和性别匹配,构建代表单纯高血压患者和高血压合并糖尿病患者的合并症网络。随后,根据相对风险方法在合并症网络中对疾病集进行建模。这样,合并症网络中的节点和边就能代表从患者-疾病双向图中得出的疾病相互作用。其次,采用一种非线性损失函数,该函数具有固有的非负潜特征分析能力,然后采用一种合并症分类器来揭示合并症网络中表明糖尿病合并症的模式。在实际的糖尿病合并症数据集上对所提出的 INDM 进行了严格测试。测试结果表明,INDM 具有极高的检测准确率。此外,所提出的 INDM 发现的拓扑结构可以为病例组和对照组的高血压合并症提供深刻的见解。
{"title":"A prediction method of diabetes comorbidity based on non-negative latent features","authors":"","doi":"10.1016/j.neucom.2024.128447","DOIUrl":"10.1016/j.neucom.2024.128447","url":null,"abstract":"<div><p>In this paper, we present a novel network-based approach, namely Inherently Non-negative Latent Feature Analysis for Diabetes Mellitus Comorbidity Detection (INDM), to enhance the detection and analysis of comorbidities associated with diabetes mellitus. Different from existing methods, INDM is the first computational approach that integrates comorbidity networks of the chronic disease spectrum with patient clinical characteristics. To perform the analytical tasks, the proposed INDM adopts the following core components. First, comorbidity networks representing patients diagnosed solely with hypertension and those with hypertension and diabetes are constructed, following the case-control design that establishes a 1:1 matching in age and gender between two cohorts. Subsequently, the disease set is modeled in the comorbidity network according to the relative risk methodology. This enables nodes and edges in the comorbidity network to represent disease interactions that are derived from the patient-disease bipartite graph. Second, a nonlinear loss function with the capability of inherently non-negative latent feature analysis followed by a comorbidity classifier is adopted to uncover the patterns indicating the diabetes comorbidity in the comorbidity network. The proposed INDM has been rigorously tested on actual diabetes comorbidity datasets. The notable results demonstrate that INDM exhibits superior detection accuracy. Furthermore, the topological structure discovered by the proposed INDM can provide a profound insight into hypertension comorbidity in both the case and control groups.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142097465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MGFNet: Cross-scene crowd counting via multistage gated fusion network MGFNet:通过多级门控融合网络进行跨场景人群计数
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-23 DOI: 10.1016/j.neucom.2024.128431

Existing crowd counting methods are mainly trained and tested in similar scenarios. When the testing and training scenarios of the model are different, the counting accuracy of these methods will sharply decrease, which seriously limits their practical application. To address this problem, we propose a multistage gated fusion network (MGFNet) for cross-scene crowd counting. MGFNet is primarily composed of dynamic gated convolution units (DGCU) and multilevel scale attention blocks (MSAB) modules. Specifically, DGCU uses a dynamic gating path to supplement detailed information to reduce the loss of crowd information and overestimation of background in different scenarios. MSAB calibrates crowd information at different scales and perspectives in different scenes by generating attention maps with discriminative information. In addition, we used a new global local consistency loss to optimize the model to adapt to changes in crowd density and distribution. Extensive experiments on four different types of scene counting benchmarks show that the proposed MGFNet achieves superior cross-scene counting performance.

现有的人群计数方法主要是在相似的场景下进行训练和测试。当模型的测试和训练场景不同时,这些方法的计数精度会急剧下降,严重限制了其实际应用。针对这一问题,我们提出了一种用于跨场景人群计数的多级门控融合网络(MGFNet)。MGFNet 主要由动态门控卷积单元(DGCU)和多级标度注意力模块(MSAB)组成。具体来说,DGCU 使用动态门控路径来补充详细信息,以减少不同场景中人群信息的丢失和背景信息的高估。MSAB 通过生成具有判别信息的注意力图,校准不同场景下不同尺度和视角的人群信息。此外,我们还使用了一种新的全局局部一致性损失来优化模型,以适应人群密度和分布的变化。在四种不同类型的场景计数基准上进行的广泛实验表明,所提出的 MGFNet 实现了卓越的跨场景计数性能。
{"title":"MGFNet: Cross-scene crowd counting via multistage gated fusion network","authors":"","doi":"10.1016/j.neucom.2024.128431","DOIUrl":"10.1016/j.neucom.2024.128431","url":null,"abstract":"<div><p>Existing crowd counting methods are mainly trained and tested in similar scenarios. When the testing and training scenarios of the model are different, the counting accuracy of these methods will sharply decrease, which seriously limits their practical application. To address this problem, we propose a multistage gated fusion network (MGFNet) for cross-scene crowd counting. MGFNet is primarily composed of dynamic gated convolution units (DGCU) and multilevel scale attention blocks (MSAB) modules. Specifically, DGCU uses a dynamic gating path to supplement detailed information to reduce the loss of crowd information and overestimation of background in different scenarios. MSAB calibrates crowd information at different scales and perspectives in different scenes by generating attention maps with discriminative information. In addition, we used a new global local consistency loss to optimize the model to adapt to changes in crowd density and distribution. Extensive experiments on four different types of scene counting benchmarks show that the proposed MGFNet achieves superior cross-scene counting performance.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142075895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A lightweight Transformer-based visual question answering network with Weight-Sharing Hybrid Attention 基于变压器的轻量级视觉问题解答网络与分权混合注意力
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-23 DOI: 10.1016/j.neucom.2024.128460

Recent advances show that Transformer-based models and object detection-based models play an indispensable role in VQA. However, object detection-based models have significant limitations due to their redundant and complex detection box generation process. In contrast, Visual and Language Pre-training (VLP) models can achieve better performance, but require high computing power. To this end, we present Weight-Sharing Hybrid Attention Network (WHAN), a lightweight Transformer-based VQA model. In WHAN, we replace the object detection network with Transformer encoder and use LoRA to solve the problem that the language model cannot adapt to interrogative sentences. We propose Weight-Sharing Hybrid Attention (WHA) module with parallel residual adapters, which can significantly reduce the trainable parameters of the model and we design DWA and BVA modules that can allow the model to perform attention operations from different scales. Experiments on VQA-v2, COCO-QA, GQA, and CLEVR datasets show that WHAN achieves competitive performance with far fewer trainable parameters.

最新进展表明,基于变换器的模型和基于物体检测的模型在 VQA 中发挥着不可或缺的作用。然而,基于物体检测的模型由于其冗余和复杂的检测框生成过程而具有很大的局限性。相比之下,视觉和语言预训练(VLP)模型可以获得更好的性能,但需要较高的计算能力。为此,我们提出了基于变压器的轻量级 VQA 模型--分权混合注意力网络(WHAN)。在 WHAN 中,我们用 Transformer 编码器取代了对象检测网络,并使用 LoRA 解决了语言模型无法适应疑问句的问题。我们提出了具有并行残差适配器的分权混合注意力(WHA)模块,它可以显著减少模型的可训练参数,我们还设计了 DWA 和 BVA 模块,可以让模型从不同尺度执行注意力操作。在 VQA-v2、COCO-QA、GQA 和 CLEVR 数据集上的实验表明,WHAN 能以更少的可训练参数获得具有竞争力的性能。
{"title":"A lightweight Transformer-based visual question answering network with Weight-Sharing Hybrid Attention","authors":"","doi":"10.1016/j.neucom.2024.128460","DOIUrl":"10.1016/j.neucom.2024.128460","url":null,"abstract":"<div><p>Recent advances show that Transformer-based models and object detection-based models play an indispensable role in VQA. However, object detection-based models have significant limitations due to their redundant and complex detection box generation process. In contrast, Visual and Language Pre-training (VLP) models can achieve better performance, but require high computing power. To this end, we present Weight-Sharing Hybrid Attention Network (WHAN), a lightweight Transformer-based VQA model. In WHAN, we replace the object detection network with Transformer encoder and use LoRA to solve the problem that the language model cannot adapt to interrogative sentences. We propose Weight-Sharing Hybrid Attention (WHA) module with parallel residual adapters, which can significantly reduce the trainable parameters of the model and we design DWA and BVA modules that can allow the model to perform attention operations from different scales. Experiments on VQA-v2, COCO-QA, GQA, and CLEVR datasets show that WHAN achieves competitive performance with far fewer trainable parameters.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142083220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Difficulty level-based knowledge distillation 基于难度水平的知识提炼
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-23 DOI: 10.1016/j.neucom.2024.128375

Knowledge distillation (KD) enables a simple model (student model) to perform as a complex model (teacher model) by distilling the knowledge from a pre-trained teacher model. Existing soft-label distillation methods often use a fixed temperature value in the softmax function to prevent overconfidence in the distillation process. However, this approach can lead to the suppression of important ‘dark knowledge’ for non-target classes in difficult samples, while also over-smoothing the confidence values for easier samples. To address this issue, we propose a novel approach called difficulty level-based knowledge distillation (DLKD), which considers the difficulty level of each sample to distill refined knowledge with high or low confidence, depending on the sample’s complexity. Our method calculates the difficulty level based on the Euclidean distance between the teacher model’s predictions and the pruned teacher model’s predictions. Experimental results demonstrate that our DLKD method outperforms state-of-the-art methods on challenging samples, including those with noisy labels or augmented data, achieving superior results on CIFAR-100, FGVR, and ImageNet datasets for image classification.

知识蒸馏(KD)通过从预先训练好的教师模型中蒸馏知识,使简单模型(学生模型)发挥复杂模型(教师模型)的作用。现有的软标签蒸馏方法通常在 softmax 函数中使用一个固定的温度值,以防止在蒸馏过程中过度自信。然而,这种方法会导致在困难样本中抑制非目标类别的重要 "暗知识",同时也会过度平滑较容易样本的置信度值。为了解决这个问题,我们提出了一种称为基于难度等级的知识提炼(DLKD)的新方法,它考虑了每个样本的难度等级,根据样本的复杂程度提炼出高置信度或低置信度的精炼知识。我们的方法根据教师模型预测与剪枝后教师模型预测之间的欧氏距离计算难度级别。实验结果表明,我们的 DLKD 方法在具有挑战性的样本(包括具有噪声标签或增强数据的样本)上优于最先进的方法,在 CIFAR-100、FGVR 和 ImageNet 数据集的图像分类上取得了优异的成绩。
{"title":"Difficulty level-based knowledge distillation","authors":"","doi":"10.1016/j.neucom.2024.128375","DOIUrl":"10.1016/j.neucom.2024.128375","url":null,"abstract":"<div><p>Knowledge distillation (KD) enables a simple model (student model) to perform as a complex model (teacher model) by distilling the knowledge from a pre-trained teacher model. Existing soft-label distillation methods often use a fixed temperature value in the softmax function to prevent overconfidence in the distillation process. However, this approach can lead to the suppression of important ‘dark knowledge’ for non-target classes in difficult samples, while also over-smoothing the confidence values for easier samples. To address this issue, we propose a novel approach called difficulty level-based knowledge distillation (DLKD), which considers the difficulty level of each sample to distill refined knowledge with high or low confidence, depending on the sample’s complexity. Our method calculates the difficulty level based on the Euclidean distance between the teacher model’s predictions and the pruned teacher model’s predictions. Experimental results demonstrate that our DLKD method outperforms state-of-the-art methods on challenging samples, including those with noisy labels or augmented data, achieving superior results on CIFAR-100, FGVR, and ImageNet datasets for image classification.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142049947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A lightweight multi-scale multi-angle dynamic interactive transformer-CNN fusion model for 3D medical image segmentation 用于三维医学图像分割的轻量级多尺度多角度动态交互变换器-CNN 融合模型
IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-23 DOI: 10.1016/j.neucom.2024.128417

Combining Convolutional Neural Network(CNN) and Transformer has become one of the mainstream methods for three-dimensional (3D) medical image segmentation. However, the complexity and diversity of target forms in 3D medical images require models to capture complex feature information for segmentation, resulting in an excessive number of parameters which are not conducive to training and deployment. Therefore, we have developed a lightweight 3D multi-target semantic segmentation model. In order to enhance contextual texture connections and reinforce the expression of detailed feature information, we designed a multi-scale and multi-angle feature interaction module to enhance feature representation by interacting multi-scale features from different perspectives. To address the issue of attention collapse in Transformers, leading to the neglect of other detailed feature learning, we utilized local features as dynamic parameters to interact with global features, dynamically grouping and learning critical features from global features, thereby enhancing the model's ability to learn detailed features. While ensuring the segmentation capability of the model, we aimed to keep the model lightweight, resulting in a total of 9.63 M parameters. Extensive experiments were conducted on public datasets ACDC and Brats2018, as well as a private dataset, Temporal Bone CT. The results indicate that our proposed model is more competitive compared to the latest techniques in 3D medical image segmentation.

卷积神经网络(CNN)与变换器的结合已成为三维(3D)医学图像分割的主流方法之一。然而,由于三维医学图像中目标形态的复杂性和多样性,需要模型捕捉复杂的特征信息进行分割,导致参数过多,不利于训练和部署。因此,我们开发了一种轻量级三维多目标语义分割模型。为了增强上下文纹理联系,强化细节特征信息的表达,我们设计了多尺度、多角度特征交互模块,通过不同视角的多尺度特征交互来增强特征表达。针对变形金刚中注意力崩溃导致忽略其他细节特征学习的问题,我们利用局部特征作为动态参数与全局特征交互,动态分组并从全局特征中学习关键特征,从而增强模型学习细节特征的能力。在确保模型细分能力的同时,我们力求保持模型的轻量级,因此模型的参数总数为 963 万个。我们在公共数据集 ACDC 和 Brats2018 以及私有数据集 Temporal Bone CT 上进行了广泛的实验。结果表明,与最新的三维医学图像分割技术相比,我们提出的模型更具竞争力。
{"title":"A lightweight multi-scale multi-angle dynamic interactive transformer-CNN fusion model for 3D medical image segmentation","authors":"","doi":"10.1016/j.neucom.2024.128417","DOIUrl":"10.1016/j.neucom.2024.128417","url":null,"abstract":"<div><p>Combining Convolutional Neural Network(CNN) and Transformer has become one of the mainstream methods for three-dimensional (3D) medical image segmentation. However, the complexity and diversity of target forms in 3D medical images require models to capture complex feature information for segmentation, resulting in an excessive number of parameters which are not conducive to training and deployment. Therefore, we have developed a lightweight 3D multi-target semantic segmentation model. In order to enhance contextual texture connections and reinforce the expression of detailed feature information, we designed a multi-scale and multi-angle feature interaction module to enhance feature representation by interacting multi-scale features from different perspectives. To address the issue of attention collapse in Transformers, leading to the neglect of other detailed feature learning, we utilized local features as dynamic parameters to interact with global features, dynamically grouping and learning critical features from global features, thereby enhancing the model's ability to learn detailed features. While ensuring the segmentation capability of the model, we aimed to keep the model lightweight, resulting in a total of 9.63 M parameters. Extensive experiments were conducted on public datasets ACDC and Brats2018, as well as a private dataset, Temporal Bone CT. The results indicate that our proposed model is more competitive compared to the latest techniques in 3D medical image segmentation.</p></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142050350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Neurocomputing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1