首页 > 最新文献

Information Sciences最新文献

英文 中文
Maximum local density-driven non-overlapping radial basis function support kernel neural network 最大局部密度驱动的非重叠径向基函数支持核神经网络
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-02 DOI: 10.1016/j.ins.2024.121421

The learning and optimization of kernels in the radial basis function neural network (RBFNN) are crucial. However, in existing methods, there are issues of overfitting when learning kernel parameters. The learned kernels are also sensitive to outliers. This paper proposes a general kernel learning strategy for RBFNN called non-overlapping maximum local density support kernel learning (MLD-SKL), which contains two modules, the non-overlapping maximum local density (MLD) kernel learning module and support kernel learning (SKL) module. In the MLD kernel learning stage, the candidate set of kernels is incrementally determined based on the local density of samples. Meanwhile, it is required that the coverage ranges of kernels from different classes do not overlap with each other. This module is effective in reducing the impact of outliers. In the SKL stage, kernel importance indicator is defined to measure the importance of kernels. The learned support kernels are utilized to construct a maximum local density-driven non-overlapping radial basis function support kernel neural network (MLD-RBFSKNN). The RBFNN constructed through MLD-SKL exhibits a more compact structure. The experiments demonstrate that the proposed MLD-RBFSKNN improves accuracy in recognition task. Furthermore, while achieving superior recognition performance, the final constructed network also has the minimum number of kernels.

径向基函数神经网络(RBFNN)内核的学习和优化至关重要。然而,现有方法在学习核参数时存在过拟合问题。学习到的内核对异常值也很敏感。本文提出了一种通用的 RBFNN 内核学习策略,称为非重叠最大局部密度支持内核学习(MLD-SKL),它包含两个模块,即非重叠最大局部密度(MLD)内核学习模块和支持内核学习(SKL)模块。在 MLD 内核学习阶段,根据样本的局部密度逐步确定候选内核集。同时,要求不同类别的内核覆盖范围不能相互重叠。该模块能有效减少异常值的影响。在 SKL 阶段,定义了内核重要性指标来衡量内核的重要性。利用学习到的支持核构建最大局部密度驱动的非重叠径向基函数支持核神经网络(MLD-RBFSKNN)。通过 MLD-SKL 构建的 RBFNN 结构更为紧凑。实验证明,所提出的 MLD-RBFSKNN 提高了识别任务的准确性。此外,在实现卓越识别性能的同时,最终构建的网络还具有最少的核数。
{"title":"Maximum local density-driven non-overlapping radial basis function support kernel neural network","authors":"","doi":"10.1016/j.ins.2024.121421","DOIUrl":"10.1016/j.ins.2024.121421","url":null,"abstract":"<div><p>The learning and optimization of kernels in the radial basis function neural network (RBFNN) are crucial. However, in existing methods, there are issues of overfitting when learning kernel parameters. The learned kernels are also sensitive to outliers. This paper proposes a general kernel learning strategy for RBFNN called non-overlapping maximum local density support kernel learning (MLD-SKL), which contains two modules, the non-overlapping maximum local density (MLD) kernel learning module and support kernel learning (SKL) module. In the MLD kernel learning stage, the candidate set of kernels is incrementally determined based on the local density of samples. Meanwhile, it is required that the coverage ranges of kernels from different classes do not overlap with each other. This module is effective in reducing the impact of outliers. In the SKL stage, kernel importance indicator is defined to measure the importance of kernels. The learned support kernels are utilized to construct a maximum local density-driven non-overlapping radial basis function support kernel neural network (MLD-RBFSKNN). The RBFNN constructed through MLD-SKL exhibits a more compact structure. The experiments demonstrate that the proposed MLD-RBFSKNN improves accuracy in recognition task. Furthermore, while achieving superior recognition performance, the final constructed network also has the minimum number of kernels.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142137297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extracting attribute implications from a formal context: Unifying the basic approaches 从形式语境中提取属性含义:统一基本方法
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-02 DOI: 10.1016/j.ins.2024.121419

There have been several pioneering approaches to the extraction of attribute implications from a formal context, dating from the 1980's: the one of Guigues and Duquenne based on so-called non-redundancy nodes, another one proposed by Ganter highlighting the concept of pseudo-closed set, and the best-known one relying on the recursive computation of so-called pseudo-intents in the book by Ganter and Wille. The Guigues and Duquenne approach has never been compared in detail in the literature with the other two, although they turn out to be equivalent. This paper tries to fill this gap, proposing a unified view, hopefully more easy to grasp.

自 20 世纪 80 年代以来,从形式语境中抽取属性含义的方法有几种开创性的方法:Guigues 和 Duquenne 提出的基于所谓非冗余节点的方法、Ganter 提出的强调伪封闭集概念的另一种方法,以及 Ganter 和 Wille 著作中最著名的依赖于所谓伪意图递归计算的方法。Guigues 和 Duquenne 方法与其他两种方法虽然等价,但从未在文献中进行过详细比较。本文试图填补这一空白,提出一种统一的观点,希望更易于理解。
{"title":"Extracting attribute implications from a formal context: Unifying the basic approaches","authors":"","doi":"10.1016/j.ins.2024.121419","DOIUrl":"10.1016/j.ins.2024.121419","url":null,"abstract":"<div><p>There have been several pioneering approaches to the extraction of attribute implications from a formal context, dating from the 1980's: the one of Guigues and Duquenne based on so-called non-redundancy nodes, another one proposed by Ganter highlighting the concept of pseudo-closed set, and the best-known one relying on the recursive computation of so-called pseudo-intents in the book by Ganter and Wille. The Guigues and Duquenne approach has never been compared in detail in the literature with the other two, although they turn out to be equivalent. This paper tries to fill this gap, proposing a unified view, hopefully more easy to grasp.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0020025524013331/pdfft?md5=2ee861abae43526eb060a092a824f4dc&pid=1-s2.0-S0020025524013331-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142149162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SFP: Similarity-based filter pruning for deep neural networks SFP:基于相似性的深度神经网络滤波器剪枝
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-31 DOI: 10.1016/j.ins.2024.121418

Convolutional neural networks have exhibited exceptional performance in various artificial intelligence domains, particularly in large-scale image processing tasks. However, the proliferation of network parameters and computational requirements has emerged as a significant bottleneck for the practical deployment of CNNs. In this paper, we propose a novel similarity-based filter pruning (SFP) approach for compressing convolutional neural networks, which is different from the traditional pruning method. The existing pruning methods eliminate the unimportant parameters but ignore the duplication of the reserved convolutional kernels. In the proposed SFP, kernels are clustered first according to their similarity, then the unimportant and redundant kernels are pruned in each class, which is more efficient than traditional pruning methods only based on the importance criterion. Furthermore, this paper introduces the concept of Kernel Dispersion to evaluate sparsity across distinct network layers, and proposes Distillation Fine-Tuning with Variable Temperature Coefficient to expedite convergence and enhance accuracy. The performance of the proposed similarity-based filter pruning approach is evaluated on different datasets, including CIFAR10, CIFAR100, ImageNet, and VOC. The experimental results indicate that the proposed SFP achieves approximately 1% higher accuracy at a comparable pruning rate compared to traditional state-of-the-art pruning methods.

卷积神经网络在各种人工智能领域,尤其是大规模图像处理任务中表现出了卓越的性能。然而,网络参数和计算要求的激增已成为实际部署卷积神经网络的一个重要瓶颈。在本文中,我们提出了一种新颖的基于相似性的滤波器剪枝(SFP)方法,用于压缩卷积神经网络,它有别于传统的剪枝方法。现有的剪枝方法消除了不重要的参数,但忽略了保留卷积核的重复。在所提出的 SFP 中,首先根据内核的相似性对内核进行聚类,然后在每一类中剪枝不重要和多余的内核,这比传统的只根据重要性准则进行剪枝的方法更有效。此外,本文还引入了 "内核分散 "的概念来评估不同网络层的稀疏性,并提出了具有可变温度系数的蒸馏微调方法,以加快收敛速度并提高准确性。我们在不同的数据集(包括 CIFAR10、CIFAR100、ImageNet 和 VOC)上评估了所提出的基于相似性的滤波器剪枝方法的性能。实验结果表明,与传统的先进剪枝方法相比,所提出的 SFP 在剪枝率相当的情况下,准确率提高了约 1%。
{"title":"SFP: Similarity-based filter pruning for deep neural networks","authors":"","doi":"10.1016/j.ins.2024.121418","DOIUrl":"10.1016/j.ins.2024.121418","url":null,"abstract":"<div><p>Convolutional neural networks have exhibited exceptional performance in various artificial intelligence domains, particularly in large-scale image processing tasks. However, the proliferation of network parameters and computational requirements has emerged as a significant bottleneck for the practical deployment of CNNs. In this paper, we propose a novel similarity-based filter pruning (SFP) approach for compressing convolutional neural networks, which is different from the traditional pruning method. The existing pruning methods eliminate the unimportant parameters but ignore the duplication of the reserved convolutional kernels. In the proposed SFP, kernels are clustered first according to their similarity, then the unimportant and redundant kernels are pruned in each class, which is more efficient than traditional pruning methods only based on the importance criterion. Furthermore, this paper introduces the concept of Kernel Dispersion to evaluate sparsity across distinct network layers, and proposes Distillation Fine-Tuning with Variable Temperature Coefficient to expedite convergence and enhance accuracy. The performance of the proposed similarity-based filter pruning approach is evaluated on different datasets, including CIFAR10, CIFAR100, ImageNet, and VOC. The experimental results indicate that the proposed SFP achieves approximately 1% higher accuracy at a comparable pruning rate compared to traditional state-of-the-art pruning methods.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142137301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CATIL: Customized adversarial training based on instance loss CATIL:基于实例损失的定制对抗训练
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-30 DOI: 10.1016/j.ins.2024.121420

Adversarial training is one of the most effective adversarial defense methods currently recognized. It enhances the robustness of deep neural network (DNN) classifiers by generating adversarial samples. However, current adversarial training methods cannot effectively trade off the robust accuracy and natural accuracy when training DNN classifiers, and are prone to overfit. To solve these problems, we propose Customized Adversarial Training based on Instance Loss (CATIL), aiming to improve robust accuracy and natural accuracy while alleviating overfitting. We first comprehensively consider the influencing factors of adversarial training and propose the comprehensive customization strategy (CCS), which crafts unique attack strategies for each sample, fine-tunes the classifier's decision boundary, and boosts the robustness of the DNN classifier without compromising its natural accuracy. Second, the loss adjustment strategy (LAS) is designed to update the attack strategy according to the loss value. This increases the fitting difficulty of the DNN classifier and alleviates the overfitting problem. Finally, numerous experiments show that CATIL can effectively enhance robust accuracy and alleviate the overfitting problem without significantly compromising natural accuracy. When evaluating CIFAR-10 on Wide ResNet, CATIL achieves the best performance in both natural and robust accuracy compared to all benchmarks.

对抗训练是目前公认的最有效的对抗防御方法之一。它通过生成对抗样本来增强深度神经网络(DNN)分类器的鲁棒性。然而,目前的对抗训练方法在训练 DNN 分类器时无法有效地权衡鲁棒精度和自然精度,容易出现过拟合。为了解决这些问题,我们提出了基于实例损失的定制对抗训练(CATIL),旨在提高鲁棒精度和自然精度,同时缓解过拟合问题。首先,我们综合考虑了对抗训练的影响因素,提出了全面定制策略(CCS),为每个样本制定独特的攻击策略,微调分类器的决策边界,在不影响自然精度的前提下提高 DNN 分类器的鲁棒性。其次,损失调整策略(LAS)旨在根据损失值更新攻击策略。这增加了 DNN 分类器的拟合难度,缓解了过拟合问题。最后,大量实验表明,CATIL 可以有效提高鲁棒性精度,缓解过拟合问题,而不会明显影响自然精度。在 Wide ResNet 上评估 CIFAR-10 时,与所有基准相比,CATIL 在自然准确率和鲁棒准确率方面都取得了最佳性能。
{"title":"CATIL: Customized adversarial training based on instance loss","authors":"","doi":"10.1016/j.ins.2024.121420","DOIUrl":"10.1016/j.ins.2024.121420","url":null,"abstract":"<div><p>Adversarial training is one of the most effective adversarial defense methods currently recognized. It enhances the robustness of deep neural network (DNN) classifiers by generating adversarial samples. However, current adversarial training methods cannot effectively trade off the robust accuracy and natural accuracy when training DNN classifiers, and are prone to overfit. To solve these problems, we propose Customized Adversarial Training based on Instance Loss (CATIL), aiming to improve robust accuracy and natural accuracy while alleviating overfitting. We first comprehensively consider the influencing factors of adversarial training and propose the comprehensive customization strategy (CCS), which crafts unique attack strategies for each sample, fine-tunes the classifier's decision boundary, and boosts the robustness of the DNN classifier without compromising its natural accuracy. Second, the loss adjustment strategy (LAS) is designed to update the attack strategy according to the loss value. This increases the fitting difficulty of the DNN classifier and alleviates the overfitting problem. Finally, numerous experiments show that CATIL can effectively enhance robust accuracy and alleviate the overfitting problem without significantly compromising natural accuracy. When evaluating CIFAR-10 on Wide ResNet, CATIL achieves the best performance in both natural and robust accuracy compared to all benchmarks.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142137298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A tensor recommendation method based on HMM network and meta-path 基于 HMM 网络和元路径的张量推荐方法
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-30 DOI: 10.1016/j.ins.2024.121412

The main approach to capturing user latent preferences in recommendation systems (RS) is through high-order tensor decomposition and the deep-walk method. Several key issues, if solved, could improve the performance of RS. These include enforcing the interpretation of RS in the context of sparse data completion, cold start, and interpretability, mining user latent preferences with a tensor constructed from a user-item rating matrix (RM) and a preference match mechanism based on K-nearest neighbor (KNN) similar users. In this paper, a method that integrates a hidden Markov model, meta-path, and third-order tensor (HMM-MP-TOT) is proposed. An HMM, based on the user-item RM and latent preferences from KNN users is constructed. Subsequently, the Viterbi and deep-walk methods are used to obtain a series of user-item two-dimensional MPs. Then, truncated − singular value decomposition (t-SVD) is applied to a user-item-KNN third-order tensor to obtain a better recommendation result. On average, HMM-MP-TOT obtains 94.7% precision, 80.2% recall, and 96.4% diversity.

在推荐系统(RS)中捕捉用户潜在偏好的主要方法是高阶张量分解和深度漫步法。如果能解决几个关键问题,就能提高 RS 的性能。这些问题包括在稀疏数据完成、冷启动和可解释性的背景下强制解释 RS,使用由用户-项目评级矩阵(RM)构建的张量挖掘用户潜在偏好,以及基于 K-nearest neighbor(KNN)相似用户的偏好匹配机制。本文提出了一种集成了隐马尔可夫模型、元路径和三阶张量(HMM-MP-TOT)的方法。基于用户-项目 RM 和 KNN 用户的潜在偏好构建了一个 HMM。随后,使用 Viterbi 和深度漫步方法获得一系列用户项目二维 MP。然后,对用户-项目-KNN 三阶张量进行截断-奇异值分解(t-SVD),以获得更好的推荐结果。平均而言,HMM-MP-TOT 可获得 94.7% 的精确度、80.2% 的召回率和 96.4% 的多样性。
{"title":"A tensor recommendation method based on HMM network and meta-path","authors":"","doi":"10.1016/j.ins.2024.121412","DOIUrl":"10.1016/j.ins.2024.121412","url":null,"abstract":"<div><p>The main approach to capturing user latent preferences in recommendation systems (RS) is through high-order tensor decomposition and the deep-walk method. Several key issues, if solved, could improve the performance of RS. These include enforcing the interpretation of RS in the context of sparse data completion, cold start, and interpretability, mining user latent preferences with a tensor constructed from a user-item rating matrix (RM) and a preference match mechanism based on K-nearest neighbor (KNN) similar users. In this paper, a method that integrates a hidden Markov model, <em>meta</em>-path, and third-order tensor (HMM-MP-TOT) is proposed. An HMM, based on the user-item RM and latent preferences from KNN users is constructed. Subsequently, the Viterbi and deep-walk methods are used to obtain a series of user-item two-dimensional MPs. Then, truncated − singular value decomposition (t-SVD) is applied to a user-item-KNN third-order tensor to obtain a better recommendation result. On average, HMM-MP-TOT obtains 94.7% precision, 80.2% recall, and 96.4% diversity.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142149163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EHPR: Learning evolutionary hierarchy perception representation based on quaternion for temporal knowledge graph completion EHPR:基于四元数的学习进化层次感知表示法,用于完成时态知识图谱
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-30 DOI: 10.1016/j.ins.2024.121409

Research on temporal knowledge graphs garners attention due to the intricate connection between facts and dynamic temporal factors. However, existing research uses timestamp as auxiliary data for representation learning and directly integrate it into facts, resulting in the inability to capture the intrinsic connections between relations under time evolution. To handle these challenges, we propose the Evolutionary Hierarchy Perception Representation (EHPR), which first leverages the Hamilton product to perform rotational transformations on relation and entity over time, aiming to learn temporal relation and temporal entity with close interactions with time information. Later, EHPR is divided into two modules: (a) Rotating the head entity towards the tail entity using temporal relation through Hamilton product to model complex patterns with quaternion rotation capabilities. (b) Adopting an evolutionary hierarchical factor to capture the differences in modulus distribution between the temporal head entity and the temporal tail entity, aiming to manage the evolutionary hierarchical information between different temporal entities. In this way, EHPR not only utilizes the rich quaternion rotation capabilities to model various relation patterns but also further enables modeling of evolutionary hierarchical patterns through evolutionary hierarchy factors. Experiments show that EHPR achieves remarkable performance on six mature benchmarks compared to state-of-the-art models. Furthermore, we successfully transferred the core idea of EHPR into complex embeddings, showcasing the framework's adaptability. Compared to complex embedding models, EHPR also demonstrates stronger expressive abilities with the Hamilton operator, surpassing the performance of complex Hermitian operator.

时态知识图谱研究因事实与动态时态因素之间错综复杂的联系而备受关注。然而,现有研究将时间戳作为表征学习的辅助数据,并直接将其整合到事实中,导致无法捕捉时间演化下关系之间的内在联系。为了应对这些挑战,我们提出了进化层次感知表征(EHPR),它首先利用汉密尔顿积对关系和实体随时间进行旋转变换,旨在学习与时间信息交互密切的时间关系和时间实体。之后,EHPR 又分为两个模块:(a) 通过汉密尔顿乘积,利用时间关系将头部实体向尾部实体旋转,从而利用四元数旋转能力为复杂模式建模。(b) 采用进化层次因子捕捉时空头部实体和时空尾部实体之间模量分布的差异,旨在管理不同时空实体之间的进化层次信息。这样,EHPR 不仅能利用丰富的四元数旋转功能对各种关系模式进行建模,还能通过进化层次因子进一步对进化层次模式进行建模。实验表明,与最先进的模型相比,EHPR 在六个成熟基准上取得了显著的性能。此外,我们还成功地将 EHPR 的核心思想移植到了复杂嵌入中,从而展示了该框架的适应性。与复杂嵌入模型相比,EHPR 在汉密尔顿算子方面也表现出了更强的表达能力,超越了复杂赫尔墨斯算子。
{"title":"EHPR: Learning evolutionary hierarchy perception representation based on quaternion for temporal knowledge graph completion","authors":"","doi":"10.1016/j.ins.2024.121409","DOIUrl":"10.1016/j.ins.2024.121409","url":null,"abstract":"<div><p>Research on temporal knowledge graphs garners attention due to the intricate connection between facts and dynamic temporal factors. However, existing research uses timestamp as auxiliary data for representation learning and directly integrate it into facts, resulting in the inability to capture the intrinsic connections between relations under time evolution. To handle these challenges, we propose the <strong>E</strong>volutionary <strong>H</strong>ierarchy <strong>P</strong>erception <strong>R</strong>epresentation (EHPR), which first leverages the Hamilton product to perform rotational transformations on relation and entity over time, aiming to learn temporal relation and temporal entity with close interactions with time information. Later, EHPR is divided into two modules: (a) Rotating the head entity towards the tail entity using temporal relation through Hamilton product to model complex patterns with quaternion rotation capabilities. (b) Adopting an evolutionary hierarchical factor to capture the differences in modulus distribution between the temporal head entity and the temporal tail entity, aiming to manage the evolutionary hierarchical information between different temporal entities. In this way, EHPR not only utilizes the rich quaternion rotation capabilities to model various relation patterns but also further enables modeling of evolutionary hierarchical patterns through evolutionary hierarchy factors. Experiments show that EHPR achieves remarkable performance on six mature benchmarks compared to state-of-the-art models. Furthermore, we successfully transferred the core idea of EHPR into complex embeddings, showcasing the framework's adaptability. Compared to complex embedding models, EHPR also demonstrates stronger expressive abilities with the Hamilton operator, surpassing the performance of complex Hermitian operator.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142122592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Experimental evaluation of the effect of community structures on link prediction 社区结构对链接预测影响的实验评估
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-30 DOI: 10.1016/j.ins.2024.121394

Link prediction involves assessing the likelihood of connections between node pairs based on various structural properties. The effectiveness of link predictors can be influenced by complex structures such as communities. Since the community structure itself has different properties that describes its characteristics, measuring the impact of these properties on the performance of link predictors presents a challenge. In this work, we aim to uncover the role of community properties and the identification of community structures on the performance of link predictors. We propose a comprehensive experimental setup to evaluate the performance of twenty-nine link predictors on real-world networks with diverse topological features, as well as on synthetic networks where we control community-dependent properties such as cohesiveness and size. We assess the performance differences between network-wide and per-community link prediction to determine whether identifying communities aids in link prediction. The results indicate that link prediction is more accurate in networks with well-defined, disjoint communities, even when these communities are not explicitly identified. Additionally, the size of the communities can influence link prediction performance if the communities are identified.

链接预测包括根据各种结构特性评估节点对之间的连接可能性。链接预测器的有效性会受到复杂结构(如群落)的影响。由于社群结构本身具有描述其特征的不同属性,因此衡量这些属性对链接预测器性能的影响是一项挑战。在这项工作中,我们旨在揭示社群属性和社群结构的识别对链接预测器性能的影响。我们提出了一个全面的实验设置,以评估 29 种链接预测器在具有不同拓扑特征的真实世界网络以及合成网络上的性能,在合成网络上,我们控制了内聚度和大小等依赖于群落的属性。我们评估了全网链接预测和每个群落链接预测的性能差异,以确定识别群落是否有助于链接预测。结果表明,在具有定义明确、互不关联的群落的网络中,链接预测更为准确,即使这些群落未被明确识别。此外,如果确定了群落,群落的大小也会影响链接预测的性能。
{"title":"Experimental evaluation of the effect of community structures on link prediction","authors":"","doi":"10.1016/j.ins.2024.121394","DOIUrl":"10.1016/j.ins.2024.121394","url":null,"abstract":"<div><p>Link prediction involves assessing the likelihood of connections between node pairs based on various structural properties. The effectiveness of link predictors can be influenced by complex structures such as communities. Since the community structure itself has different properties that describes its characteristics, measuring the impact of these properties on the performance of link predictors presents a challenge. In this work, we aim to uncover the role of community properties and the identification of community structures on the performance of link predictors. We propose a comprehensive experimental setup to evaluate the performance of twenty-nine link predictors on real-world networks with diverse topological features, as well as on synthetic networks where we control community-dependent properties such as cohesiveness and size. We assess the performance differences between network-wide and per-community link prediction to determine whether identifying communities aids in link prediction. The results indicate that link prediction is more accurate in networks with well-defined, disjoint communities, even when these communities are not explicitly identified. Additionally, the size of the communities can influence link prediction performance if the communities are identified.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142129000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asymmetry index for data and its verification in dimensionality reduction and data visualization 数据不对称指数及其在降维和数据可视化中的验证
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-30 DOI: 10.1016/j.ins.2024.121405

We propose an asymmetry index as a measure of degree of asymmetry of a given dataset. It provides an additional information on a dataset allowing to guide and improve any further analysis. The index reflects the intensity of the asymmetric relationships among data resulting from hierarchical data structure. Using the information retrieved by our asymmetry index, one obtains a justification and explanation of the effectiveness of the subsequent asymmetric data analysis methods, as well as helpful preparation to asymmetrizing the tools for the further analysis. The asymmetry index is based on the k-nearest neighbors graph representing the considered data. Therefore, it uses the intrinsic geometry-based information on the data, in this way, providing an insight into the data structure. Our experiments on real data are designed to verify the usefulness of the asymmetry index and the correctness of its theoretical fundamentals. In our empirical validation, we employ the symmetric and asymmetric dimensionality reduction algorithms and evaluate their results on the basis of clustering in the 2-dimensional visualization space. We test, whether our index indeed predicts the level of superiority of the asymmetric methods over their symmetric counterparts.

我们提出了一种不对称指数,用来衡量给定数据集的不对称程度。它为数据集提供了额外的信息,可用于指导和改进任何进一步的分析。该指数反映了分层数据结构导致的数据间不对称关系的强度。利用我们的不对称指数所检索到的信息,可以为后续的不对称数据分析方法的有效性提供理由和解释,并为进一步分析工具的不对称化做好准备。不对称指数基于代表所考虑数据的 k 近邻图。因此,它使用了数据的内在几何信息,从而提供了对数据结构的洞察力。我们在真实数据上的实验旨在验证不对称指数的实用性及其理论基础的正确性。在实证验证中,我们采用了对称和非对称降维算法,并在二维可视化空间聚类的基础上对其结果进行了评估。我们检验了我们的指数是否确实预测了非对称方法优于对称方法的程度。
{"title":"Asymmetry index for data and its verification in dimensionality reduction and data visualization","authors":"","doi":"10.1016/j.ins.2024.121405","DOIUrl":"10.1016/j.ins.2024.121405","url":null,"abstract":"<div><p>We propose an asymmetry index as a measure of degree of asymmetry of a given dataset. It provides an additional information on a dataset allowing to guide and improve any further analysis. The index reflects the intensity of the asymmetric relationships among data resulting from hierarchical data structure. Using the information retrieved by our asymmetry index, one obtains a justification and explanation of the effectiveness of the subsequent asymmetric data analysis methods, as well as helpful preparation to asymmetrizing the tools for the further analysis. The asymmetry index is based on the <em>k</em>-nearest neighbors graph representing the considered data. Therefore, it uses the intrinsic geometry-based information on the data, in this way, providing an insight into the data structure. Our experiments on real data are designed to verify the usefulness of the asymmetry index and the correctness of its theoretical fundamentals. In our empirical validation, we employ the symmetric and asymmetric dimensionality reduction algorithms and evaluate their results on the basis of clustering in the 2-dimensional visualization space. We test, whether our index indeed predicts the level of superiority of the asymmetric methods over their symmetric counterparts.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142129002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online learning from incomplete data streams with partial labels for multi-classification 从带有部分标签的不完整数据流中进行在线学习,以实现多重分类
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-30 DOI: 10.1016/j.ins.2024.121411

Online learning from the data streams is a research hotspot due to adaptive responses to real-time data arrival and fleeting. Existing approaches can only handle real-world scenarios partially due to constraints such as binary classification, complete labels, and fixed feature spaces. Learning from incomplete data streams with partial labels for multi-classification is crucial but rarely investigated due to its complexity and variability. To address this issue, we propose a novel Online Learning approach from Incomplete Data Streams with Partial Labels for Multi-classification, named OLIDSPLM. OLIDSPLM includes three main ideas: a) exploiting feature similarity to re-weight the most informative features in incomplete feature space (IFS) to avoid bias caused by filling in missing features, b) using self-train to label unlabeled instances and filter outliers, and c) utilizing the difference in the distribution between instances and model generated to detect concept drifts adaptively. We experimentally evaluated OLIDSPLM and its rivals in handling the IFS, partial labels, and concept drifts to validate its effectiveness. The code is released at https://github.com/youdianlong/OLIDSPLM.

从数据流中在线学习是一个研究热点,因为它能对实时数据到达和转瞬即逝做出自适应响应。由于二元分类、完整标签和固定特征空间等限制,现有方法只能部分处理现实世界的场景。从带有部分标签的不完整数据流中学习进行多重分类至关重要,但由于其复杂性和多变性,很少有人对其进行研究。为了解决这个问题,我们提出了一种新颖的从带有部分标签的不完整数据流中进行多重分类的在线学习方法,命名为 OLIDSPLM。OLIDSPLM 包括三个主要思想:a) 利用特征相似性对不完整特征空间(IFS)中信息量最大的特征进行重新加权,以避免填补缺失特征造成的偏差;b) 利用自我训练来标记未标记的实例并过滤异常值;c) 利用实例和模型生成分布之间的差异来自适应地检测概念漂移。我们在实验中评估了 OLIDSPLM 及其对手在处理 IFS、部分标签和概念漂移方面的能力,以验证其有效性。代码发布于 https://github.com/youdianlong/OLIDSPLM。
{"title":"Online learning from incomplete data streams with partial labels for multi-classification","authors":"","doi":"10.1016/j.ins.2024.121411","DOIUrl":"10.1016/j.ins.2024.121411","url":null,"abstract":"<div><p>Online learning from the data streams is a research hotspot due to adaptive responses to real-time data arrival and fleeting. Existing approaches can only handle real-world scenarios partially due to constraints such as binary classification, complete labels, and fixed feature spaces. Learning from incomplete data streams with partial labels for multi-classification is crucial but rarely investigated due to its complexity and variability. To address this issue, we propose a novel <u><strong>O</strong></u>nline <u><strong>L</strong></u>earning approach from <u><strong>I</strong></u>ncomplete <u><strong>D</strong></u>ata <u><strong>S</strong></u>treams with <u><strong>P</strong></u>artial <u><strong>L</strong></u>abels for <u><strong>M</strong></u>ulti-classification, named OLIDS<sub>PLM</sub>. OLIDS<sub>PLM</sub> includes three main ideas: a) exploiting feature similarity to re-weight the most informative features in incomplete feature space (IFS) to avoid bias caused by filling in missing features, b) using self-train to label unlabeled instances and filter outliers, and c) utilizing the difference in the distribution between instances and model generated to detect concept drifts adaptively. We experimentally evaluated OLIDS<sub>PLM</sub> and its rivals in handling the IFS, partial labels, and concept drifts to validate its effectiveness. The code is released at <span><span>https://github.com/youdianlong/OLIDSPLM</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142162224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimisation of multiple clustering based undersampling using artificial bee colony: Application to improved detection of obfuscated patterns without adversarial training 利用人工蜂群优化基于多重聚类的欠采样:应用于改进无对抗训练的混淆模式检测
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-29 DOI: 10.1016/j.ins.2024.121407

Attack detection is one of the main features required in modern defence systems. Despite the ongoing research, it remains challenging for a typical mechanism like network-based intrusion detection system (NIDS) to catch up with evolving adversarial attacks. They specifically aim to confuse a machine-learning based predictor. Without the knowledge of adversarial patterns, the best approach is generalising signatures learned from a dataset of legitimate connections and known intrusions. This work focuses on analysing non-payload traffics so that the resulting techniques can be exploited to a range of network-based applications. It investigates a novel means to deal with the problem of imbalanced classes. An optimised undersampling method is introduced to select a subset of majority-class representatives initially created through an ensemble clustering procedure. A weighted combination of criteria representing distributions within and between classes is proposed as the objective function for a global optimisation using the artificial bee colony (ABC). This approach usually outperforms its baselines and other state-of-the-art undersampling models, with ABC being more effective using the global best strategy than a random selection of solutions or an iterative greedy search. The paper also details the parameter analysis offering a heuristic guide for potential taking up of the proposed techniques.

攻击检测是现代防御系统所需的主要功能之一。尽管研究工作一直在进行,但对于像基于网络的入侵检测系统(NIDS)这样的典型机制来说,要赶上不断发展的对抗性攻击仍是一项挑战。它们的具体目标是迷惑基于机器学习的预测器。在不了解对抗模式的情况下,最好的方法就是从合法连接和已知入侵的数据集中归纳出特征。这项工作的重点是分析非负载流量,以便将由此产生的技术用于一系列基于网络的应用。它研究了一种处理不平衡类问题的新方法。它引入了一种优化的欠采样方法,用于选择最初通过集合聚类程序创建的多数类代表子集。我们提出了代表类内和类间分布的标准加权组合,作为使用人工蜂群(ABC)进行全局优化的目标函数。这种方法通常优于其基线和其他最先进的欠采样模型,与随机选择解决方案或迭代贪婪搜索相比,人工蜂群采用全局最佳策略更为有效。论文还详细介绍了参数分析,为可能采用所建议的技术提供了启发式指导。
{"title":"Optimisation of multiple clustering based undersampling using artificial bee colony: Application to improved detection of obfuscated patterns without adversarial training","authors":"","doi":"10.1016/j.ins.2024.121407","DOIUrl":"10.1016/j.ins.2024.121407","url":null,"abstract":"<div><p>Attack detection is one of the main features required in modern defence systems. Despite the ongoing research, it remains challenging for a typical mechanism like network-based intrusion detection system (NIDS) to catch up with evolving adversarial attacks. They specifically aim to confuse a machine-learning based predictor. Without the knowledge of adversarial patterns, the best approach is generalising signatures learned from a dataset of legitimate connections and known intrusions. This work focuses on analysing non-payload traffics so that the resulting techniques can be exploited to a range of network-based applications. It investigates a novel means to deal with the problem of imbalanced classes. An optimised undersampling method is introduced to select a subset of majority-class representatives initially created through an ensemble clustering procedure. A weighted combination of criteria representing distributions within and between classes is proposed as the objective function for a global optimisation using the artificial bee colony (ABC). This approach usually outperforms its baselines and other state-of-the-art undersampling models, with ABC being more effective using the global best strategy than a random selection of solutions or an iterative greedy search. The paper also details the parameter analysis offering a heuristic guide for potential taking up of the proposed techniques.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0020025524013215/pdfft?md5=b856cbb95efe9512350b17efd109b0af&pid=1-s2.0-S0020025524013215-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information Sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1