首页 > 最新文献

Information Sciences最新文献

英文 中文
Optimization test function synthesis with generative adversarial networks and adaptive neuro-fuzzy systems 利用生成式对抗网络和自适应神经模糊系统优化测试函数合成
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-28 DOI: 10.1016/j.ins.2024.121371

This paper presents an approach to synthesizing optimization test functions that couples generative adversarial networks and adaptive neuro-fuzzy systems. A generative adversarial network produces optimization landscapes from a database of known optimization test functions, and an adaptive neuro-fuzzy system performs regression on the generated landscapes to provide closed-form expressions. These expressions can be implemented as fuzzy basis function expansions. Eight databases of two-dimensional optimization landscapes reported in the literature are used to train the generative network. Exploratory landscape analysis over the generated samples reveals that the network can lead to new optimization landscapes with features of interest. In addition, fuzzy basis function expansions provide the best approximation results when compared against two symbolic regression frameworks over several selected landscapes. Examples are used to illustrate the ability of these functions to model complex surface artifacts such as plateaus. The proposed approach can be used as a mathematical collaboration tool that couples generative artificial and computational intelligence techniques to formulate high-dimensional optimization test problems from two-dimensional synthesized functions.

本文介绍了一种结合生成式对抗网络和自适应神经模糊系统的优化测试函数合成方法。生成式对抗网络从已知优化测试函数数据库中生成优化景观,而自适应神经模糊系统则对生成的景观进行回归,以提供闭式表达式。这些表达式可以作为模糊基函数展开式来实现。文献中报道的八个二维优化景观数据库用于训练生成网络。对生成样本的探索性景观分析表明,该网络可以生成具有相关特征的新优化景观。此外,在几个选定的景观中,与两个符号回归框架相比,模糊基函数展开提供了最佳的近似结果。示例说明了这些函数对高原等复杂表面特征的建模能力。所提出的方法可作为一种数学协作工具,结合生成人工智能和计算智能技术,从二维合成函数中提出高维优化测试问题。
{"title":"Optimization test function synthesis with generative adversarial networks and adaptive neuro-fuzzy systems","authors":"","doi":"10.1016/j.ins.2024.121371","DOIUrl":"10.1016/j.ins.2024.121371","url":null,"abstract":"<div><p>This paper presents an approach to synthesizing optimization test functions that couples generative adversarial networks and adaptive neuro-fuzzy systems. A generative adversarial network produces optimization landscapes from a database of known optimization test functions, and an adaptive neuro-fuzzy system performs regression on the generated landscapes to provide closed-form expressions. These expressions can be implemented as fuzzy basis function expansions. Eight databases of two-dimensional optimization landscapes reported in the literature are used to train the generative network. Exploratory landscape analysis over the generated samples reveals that the network can lead to new optimization landscapes with features of interest. In addition, fuzzy basis function expansions provide the best approximation results when compared against two symbolic regression frameworks over several selected landscapes. Examples are used to illustrate the ability of these functions to model complex surface artifacts such as plateaus. The proposed approach can be used as a mathematical collaboration tool that couples generative artificial and computational intelligence techniques to formulate high-dimensional optimization test problems from two-dimensional synthesized functions.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142095180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A low-rank support tensor machine for multi-classification 用于多重分类的低阶支持张量机
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-28 DOI: 10.1016/j.ins.2024.121398

In recent decades, there has been an increasing demand for effectively handling high-dimensional multi-channel tensor data. Due to the inability to utilize internal structural information, Support Vector Machine (SVM) and its variations struggle to classify flattened tensor data, consequently resulting in the ‘curse of dimensionality’ issue. Furthermore, most of these methods can not directly apply to multiclass datasets. To overcome these challenges, we have developed a novel classification method called Multiclass Low-Rank Support Tensor Machine (MLRSTM). Our method is inspired by the well-established low-rank tensor hypothesis, which suggests a correlation between each channel of the feature tensor. Specifically, MLRSTM adopts the hinge loss function and introduces a convex approximation of tensor rank, the order-d Tensor Nuclear Norm (order-d TNN), in the regularization term. By leveraging the order-d TNN, MLRSTM effectively exploits the inherent structural information in tensor data to enhance generalization performance and avoid the curse of dimensionality. Moreover, we develop the Alternating Direction Method of Multipliers (ADMM) algorithm to optimize the convex problem inherent in training MLRSTM. Finally, comprehensive experiments validate the excellent performance of MLRSTM in tensor multi-classification tasks, showcasing its potential and efficacy in handling high-dimensional multi-channel tensor data.

近几十年来,有效处理高维多通道张量数据的需求日益增长。由于无法利用内部结构信息,支持向量机(SVM)及其变体很难对扁平化的张量数据进行分类,从而导致了 "维度诅咒 "问题。此外,这些方法大多不能直接应用于多类数据集。为了克服这些挑战,我们开发了一种新的分类方法,称为多类低张量支持张量机(MLRSTM)。我们的方法受成熟的低阶张量假说启发,该假说认为特征张量的每个通道之间存在相关性。具体来说,MLRSTM 采用了铰链损失函数,并在正则项中引入了张量秩的凸近似值--阶-d 张量核规范(阶-d TNN)。通过利用阶d TNN,MLRSTM 有效地利用了张量数据的固有结构信息,从而提高了泛化性能,避免了维度诅咒。此外,我们还开发了交替方向乘法(ADMM)算法,以优化训练 MLRSTM 所固有的凸问题。最后,综合实验验证了 MLRSTM 在张量多分类任务中的卓越性能,展示了它在处理高维多通道张量数据方面的潜力和功效。
{"title":"A low-rank support tensor machine for multi-classification","authors":"","doi":"10.1016/j.ins.2024.121398","DOIUrl":"10.1016/j.ins.2024.121398","url":null,"abstract":"<div><p>In recent decades, there has been an increasing demand for effectively handling high-dimensional multi-channel tensor data. Due to the inability to utilize internal structural information, Support Vector Machine (SVM) and its variations struggle to classify flattened tensor data, consequently resulting in the ‘curse of dimensionality’ issue. Furthermore, most of these methods can not directly apply to multiclass datasets. To overcome these challenges, we have developed a novel classification method called Multiclass Low-Rank Support Tensor Machine (MLRSTM). Our method is inspired by the well-established low-rank tensor hypothesis, which suggests a correlation between each channel of the feature tensor. Specifically, MLRSTM adopts the hinge loss function and introduces a convex approximation of tensor rank, the order-<em>d</em> Tensor Nuclear Norm (order-<em>d</em> TNN), in the regularization term. By leveraging the order-<em>d</em> TNN, MLRSTM effectively exploits the inherent structural information in tensor data to enhance generalization performance and avoid the curse of dimensionality. Moreover, we develop the Alternating Direction Method of Multipliers (ADMM) algorithm to optimize the convex problem inherent in training MLRSTM. Finally, comprehensive experiments validate the excellent performance of MLRSTM in tensor multi-classification tasks, showcasing its potential and efficacy in handling high-dimensional multi-channel tensor data.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142122696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A triple population adaptive differential evolution 三重种群适应性差异进化
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-28 DOI: 10.1016/j.ins.2024.121401

The Differential Evolution (DE) algorithm is one of the most efficient algorithms for complex numerical optimization. However, the nature of differential mutation and crossover hinders the individuals from a major change and always guides them toward their superior neighbors. There's a lack of useful directional information to help the population escape from early convergence. To solve the above problem, this paper proposes a novel Triple-population-based Adaptive Differential Evolution (TPADE) to enhance the evolutionary efficiency in solving various complex numerical optimization problems. First, a population division method with symmetrical linear reduction is designed to divide the parent population of each iteration into three sub-populations of different sizes, i.e., superior sub-population, medium sub-population, and inferior sub-population. Each sub-population adopts distinct differential mutation and crossover operators to maintain balanced search directions. Second, a superior-trial-preserved selection mechanism is proposed to screen useful directional information to guide the next iteration of evolution. Third, an effective parameter adaptation strategy is designed with the linear population size reduction strategy to avoid redundant search. Experiments are then conducted to show that the TPADE exhibits well performance compared with eleven state-of-the-art DE variants, CEC winners, and their variants on the CEC'2014, CEC'2017, and CEC'2022 benchmark suites. The C++ source code of TPADE can be downloaded from https://github.com/DoubleGong/TPADE.

差分进化(DE)算法是复杂数值优化最有效的算法之一。然而,差分突变和交叉的性质阻碍了个体发生重大变化,并总是引导它们向其优越的邻居靠拢。缺乏有用的方向信息来帮助种群摆脱早期收敛。为了解决上述问题,本文提出了一种新颖的基于三种群的自适应差分进化(TPADE),以提高解决各种复杂数值优化问题的进化效率。首先,设计了一种对称线性削减的种群划分方法,将每次迭代的父种群划分为三个不同大小的子种群,即优等子种群、中等子种群和劣等子种群。每个子群采用不同的差分突变和交叉算子,以保持搜索方向的平衡。其次,提出了一种上等试验保留选择机制,以筛选有用的方向信息,指导下一次迭代进化。第三,设计了一种有效的参数适应策略和线性种群规模缩小策略,以避免冗余搜索。随后进行的实验表明,在CEC'2014、CEC'2017和CEC'2022基准套件上,TPADE与11种最先进的DE变体、CEC优胜者及其变体相比,表现出良好的性能。TPADE 的 C++ 源代码可从 https://github.com/DoubleGong/TPADE 下载。
{"title":"A triple population adaptive differential evolution","authors":"","doi":"10.1016/j.ins.2024.121401","DOIUrl":"10.1016/j.ins.2024.121401","url":null,"abstract":"<div><p>The Differential Evolution (DE) algorithm is one of the most efficient algorithms for complex numerical optimization. However, the nature of differential mutation and crossover hinders the individuals from a major change and always guides them toward their superior neighbors. There's a lack of useful directional information to help the population escape from early convergence. To solve the above problem, this paper proposes a novel Triple-population-based Adaptive Differential Evolution (TPADE) to enhance the evolutionary efficiency in solving various complex numerical optimization problems. First, a population division method with symmetrical linear reduction is designed to divide the parent population of each iteration into three sub-populations of different sizes, i.e., superior sub-population, medium sub-population, and inferior sub-population. Each sub-population adopts distinct differential mutation and crossover operators to maintain balanced search directions. Second, a superior-trial-preserved selection mechanism is proposed to screen useful directional information to guide the next iteration of evolution. Third, an effective parameter adaptation strategy is designed with the linear population size reduction strategy to avoid redundant search. Experiments are then conducted to show that the TPADE exhibits well performance compared with eleven state-of-the-art DE variants, CEC winners, and their variants on the CEC'2014, CEC'2017, and CEC'2022 benchmark suites. The C++ source code of TPADE can be downloaded from <span><span>https://github.com/DoubleGong/TPADE</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142122518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Path detectability verification for time-dependent systems with application to flexible manufacturing systems 应用于柔性制造系统的时间相关系统路径可探测性验证
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-28 DOI: 10.1016/j.ins.2024.121404

This paper addresses the path detectability verification problem for time-dependent systems modeled by time labeled Petri nets (TLPNs). To capture the information precisely, it may not be sufficient to estimate the current state by resorting to the partial system observation, and it is usually crucial to decide the path of a system to reach the current state. Path detectability characterizes a time-dependent system whose current state and the corresponding path can be uniquely determined after a real-time observation (RTO). Revised state class graphs (RSCGs) are proposed to capture the time information for the evolution of the RTO in a TLPN system. We demonstrate the time information overlap problem in the RSCG, i.e., several paths are associated with the same observable events and the same time instants, which leads to such paths that cannot be distinguished. The nodes required to be computed in the proposed RSCGs are always less or equal to those of the modified state class graphs reported in the literature, since the enumeration of all the states is avoided. Based on the RSCG, an RSCG observer is formulated to address the time information overlap problem and capture the number of such paths in the TLPN system. The efficiency analysis of this verification method is provided. In this paper, the results are applied to a real production system, exposing the practical value of the reported method.

本文探讨了用时间标注 Petri 网(TLPN)建模的时变系统的路径可探测性验证问题。要精确捕捉信息,仅靠部分系统观测来估计当前状态可能是不够的,决定系统到达当前状态的路径通常至关重要。路径可探测性表征了一个随时间变化的系统,经过实时观测(RTO)后,该系统的当前状态和相应路径可以唯一确定。我们提出了修正状态类图(RSCG)来捕捉 TLPN 系统中 RTO 演化的时间信息。我们证明了 RSCG 中的时间信息重叠问题,即多条路径与相同的可观测事件和相同的时间时刻相关联,从而导致这些路径无法区分。由于避免了对所有状态的枚举,建议的 RSCG 中需要计算的节点总是少于或等于文献中报道的修正状态类图。在 RSCG 的基础上,提出了一种 RSCG 观察器来解决时间信息重叠问题,并捕捉 TLPN 系统中此类路径的数量。本文对这种验证方法进行了效率分析。本文将结果应用于实际生产系统,从而揭示了所报告方法的实用价值。
{"title":"Path detectability verification for time-dependent systems with application to flexible manufacturing systems","authors":"","doi":"10.1016/j.ins.2024.121404","DOIUrl":"10.1016/j.ins.2024.121404","url":null,"abstract":"<div><p>This paper addresses the path detectability verification problem for time-dependent systems modeled by time labeled Petri nets (TLPNs). To capture the information precisely, it may not be sufficient to estimate the current state by resorting to the partial system observation, and it is usually crucial to decide the path of a system to reach the current state. Path detectability characterizes a time-dependent system whose current state and the corresponding path can be uniquely determined after a real-time observation (RTO). Revised state class graphs (RSCGs) are proposed to capture the time information for the evolution of the RTO in a TLPN system. We demonstrate the time information overlap problem in the RSCG, i.e., several paths are associated with the same observable events and the same time instants, which leads to such paths that cannot be distinguished. The nodes required to be computed in the proposed RSCGs are always less or equal to those of the modified state class graphs reported in the literature, since the enumeration of all the states is avoided. Based on the RSCG, an RSCG observer is formulated to address the time information overlap problem and capture the number of such paths in the TLPN system. The efficiency analysis of this verification method is provided. In this paper, the results are applied to a real production system, exposing the practical value of the reported method.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142129001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Consistent and specific multi-view multi-label learning with correlation information 利用相关信息进行一致而具体的多视角多标签学习
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-28 DOI: 10.1016/j.ins.2024.121395

In multi-view multi-label (MVML) learning, each sample is represented by several heterogeneous distinct feature representations while associated with a set of class labels simultaneously. To achieve MVML learning, most of the existing methods contribute to the recovery of a consistent subspace, i.e., a shared feature representation, among multiple views. Nevertheless, each view has its inherent specific properties used in the discrimination process of labels. These methods lose sight in the specific information exploitation, and therefore are easily trapped in a sub-optimal result. In this study, we present an optimization framework CSVL to solve the learning problem. The main technical contribution in CSVL is a formulation for MVML learning while consistent subspace across views, specific subspace for each view, and the correlations among labels are taken into account. Specifically, consistent subspace is recovered by imposing a low-rank constraint among multiple views, and specific subspace of each view is extra generated with Frobenius norm. To further improve model generalization capability, we preserve both feature manifolds from multiple views and label correlations from multiple labels. Extensive experiments on 7 benchmark datasets show that our proposal CSVL has the advantages in MVML learning.

在多视图多标签(MVML)学习中,每个样本都由多个不同的异构特征表征来表示,同时与一组类标签相关联。为了实现 MVML 学习,大多数现有方法都有助于在多个视图之间恢复一致的子空间,即共享特征表示。然而,每个视图都有其固有的特定属性,用于标签的判别过程。这些方法忽视了对特定信息的利用,因此很容易陷入次优结果的困境。在本研究中,我们提出了一个优化框架 CSVL 来解决学习问题。CSVL 的主要技术贡献是提出了一种 MVML 学习方法,同时考虑了跨视图的一致子空间、每个视图的特定子空间以及标签之间的相关性。具体来说,一致子空间是通过在多个视图之间施加低秩约束来恢复的,而每个视图的特定子空间则是通过 Frobenius 准则额外生成的。为了进一步提高模型的泛化能力,我们同时保留了来自多个视图的特征流形和来自多个标签的标签相关性。在 7 个基准数据集上进行的广泛实验表明,我们提出的 CSVL 在 MVML 学习中具有优势。
{"title":"Consistent and specific multi-view multi-label learning with correlation information","authors":"","doi":"10.1016/j.ins.2024.121395","DOIUrl":"10.1016/j.ins.2024.121395","url":null,"abstract":"<div><p>In multi-view multi-label (MVML) learning, each sample is represented by several heterogeneous distinct feature representations while associated with a set of class labels simultaneously. To achieve MVML learning, most of the existing methods contribute to the recovery of a consistent subspace, i.e., a shared feature representation, among multiple views. Nevertheless, each view has its inherent specific properties used in the discrimination process of labels. These methods lose sight in the specific information exploitation, and therefore are easily trapped in a sub-optimal result. In this study, we present an optimization framework CSVL to solve the learning problem. The main technical contribution in CSVL is a formulation for MVML learning while consistent subspace across views, specific subspace for each view, and the correlations among labels are taken into account. Specifically, consistent subspace is recovered by imposing a low-rank constraint among multiple views, and specific subspace of each view is extra generated with <span><math><mi>F</mi><mi>r</mi><mi>o</mi><mi>b</mi><mi>e</mi><mi>n</mi><mi>i</mi><mi>u</mi><mi>s</mi></math></span> norm. To further improve model generalization capability, we preserve both feature manifolds from multiple views and label correlations from multiple labels. Extensive experiments on 7 benchmark datasets show that our proposal CSVL has the advantages in MVML learning.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tackling biased complementary label learning with large margin 利用大余量解决有偏差的互补标签学习问题
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-28 DOI: 10.1016/j.ins.2024.121400

Complementary Label Learning (CLL) is a typical weakly supervised learning protocol, where each instance is associated with one complementary label to specify a class that the instance does not belong to. Current CLL approaches assume that complementary labels are uniformly sampled from all non-ground-truth labels, so as to implicitly and locally share complementary labels by solely reducing the logit of complementary label in one way or another. In this paper, we point out that, when the uniform assumption does not hold, existing CLL methods are weakened their ability to share complementary labels and fail in creating classifiers with large logit margin (LM), resulting in a significant performance drop. To address these issues, we instead present complementary logit margin (CLM) and empirically prove that increasing CLM contributes to the share of complementary labels under the biased CLL setting. Accordingly, we propose a surrogate complementary one-versus-rest loss (COVR) and demonstrate that optimization on COVR can effectively increase CLM with both theoretical and empirical evidences. Extensive experiments verify that the proposed COVR exhibits substantial improvement for both the biased CLL and even a more practical CLL setting: instance-dependent complementary label learning.

互补标签学习(CLL)是一种典型的弱监督学习协议,每个实例都与一个互补标签相关联,以指定该实例不属于的类别。目前的互补标签学习方法假定互补标签是从所有非地面真实标签中统一采样的,因此只需通过某种方式降低互补标签的对数,就能隐式地局部共享互补标签。本文指出,当均匀假设不成立时,现有的 CLL 方法共享互补标签的能力会被削弱,无法创建具有较大对数差(LM)的分类器,从而导致性能显著下降。为了解决这些问题,我们提出了互补对数边际(CLM),并通过经验证明,在有偏差的 CLL 设置下,增加 CLM 有助于提高互补标签的份额。因此,我们提出了一种替代的互补单对单损失(COVR),并通过理论和实证证明对 COVR 的优化可以有效提高 CLM。广泛的实验验证了所提出的 COVR 在有偏差的 CLL 以及更实用的 CLL 设置(依赖实例的互补标签学习)中都有显著的改进。
{"title":"Tackling biased complementary label learning with large margin","authors":"","doi":"10.1016/j.ins.2024.121400","DOIUrl":"10.1016/j.ins.2024.121400","url":null,"abstract":"<div><p>Complementary Label Learning (CLL) is a typical weakly supervised learning protocol, where each instance is associated with one complementary label to specify a class that the instance does not belong to. Current CLL approaches assume that complementary labels are uniformly sampled from all non-ground-truth labels, so as to implicitly and locally share complementary labels by solely reducing the logit of complementary label in one way or another. In this paper, we point out that, when the uniform assumption does not hold, existing CLL methods are weakened their ability to share complementary labels and fail in creating classifiers with large logit margin (LM), resulting in a significant performance drop. To address these issues, we instead present complementary logit margin (CLM) and empirically prove that increasing CLM contributes to the share of complementary labels under the biased CLL setting. Accordingly, we propose a surrogate complementary one-versus-rest loss (COVR) and demonstrate that optimization on COVR can effectively increase CLM with both theoretical and empirical evidences. Extensive experiments verify that the proposed COVR exhibits substantial improvement for both the biased CLL and even a more practical CLL setting: instance-dependent complementary label learning.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-view clustering via double spaces structure learning and adaptive multiple projection regression learning 通过双空间结构学习和自适应多重投影回归学习进行多视角聚类
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-28 DOI: 10.1016/j.ins.2024.121396

Multi-view clustering aims to group objects with high similarity into one group according to the heterogeneous features of different views. The graph-based clustering methods have obtained excellent results. However, there remain a few common drawbacks. For example, some methods do not consider graphs' high-order structure information. Thus, fuller data information cannot be obtained. In addition, some methods remove noise, outliers, and redundant information in the graph learning phase, resulting in the loss of graph information. Furthermore, using predefined graphs cannot exploit complementary information between views. A triple strategy-based multi-view clustering method is presented to solve the above issues. First, Laplacian graphs are used for fusion learning, and the underlying first-order and second-order structure information among views are explored simultaneously. Then, a label fusion scheme is designed to eliminate noise, outliers, and redundant information and to mine the intrinsic characteristics of data labels. Besides, the consistent label matrix in adaptive regression learning is used to explore complementary information between views in a mutually guided learning way. Finally, the objective function is solved by using an efficient iterative method. Six types of experiments are conducted on eleven real-world multi-view datasets, and the conclusions that can be drawn are: (1) the proposed algorithm achieves the best results in terms of clustering accuracy on ten datasets with an average accuracy improvement of 5.11% compared to other algorithms. Specifically, the accuracy improved by 9.05% on dataset HW and 10.95% on dataset Reuters compared to the second results; (2) The ablation experiments confirm that the different learning strategies included in the proposed algorithm allow it to achieve better clustering performance.

多视图聚类的目的是根据不同视图的异质特征,将相似度高的物体归为一组。基于图的聚类方法取得了很好的效果。然而,这些方法仍然存在一些共同的缺点。例如,有些方法没有考虑图的高阶结构信息。因此,无法获得更全面的数据信息。此外,有些方法在图学习阶段会去除噪声、异常值和冗余信息,导致图信息丢失。此外,使用预定义图形无法利用视图之间的互补信息。本文提出了一种基于三重策略的多视图聚类方法来解决上述问题。首先,利用拉普拉斯图进行融合学习,同时探索视图之间的一阶和二阶结构信息。然后,设计一种标签融合方案来消除噪声、异常值和冗余信息,并挖掘数据标签的内在特征。此外,利用自适应回归学习中的一致标签矩阵,以相互引导的学习方式探索视图间的互补信息。最后,利用高效的迭代方法求解目标函数。在 11 个真实世界的多视图数据集上进行了六种类型的实验,得出的结论是(1)在十个数据集上,所提出的算法在聚类精度方面取得了最佳结果,与其他算法相比,平均精度提高了 5.11%。具体来说,与第二次结果相比,HW 数据集的准确率提高了 9.05%,Reuters 数据集的准确率提高了 10.95%;(2)消融实验证实,拟议算法中包含的不同学习策略使其能够实现更好的聚类性能。
{"title":"Multi-view clustering via double spaces structure learning and adaptive multiple projection regression learning","authors":"","doi":"10.1016/j.ins.2024.121396","DOIUrl":"10.1016/j.ins.2024.121396","url":null,"abstract":"<div><p>Multi-view clustering aims to group objects with high similarity into one group according to the heterogeneous features of different views. The graph-based clustering methods have obtained excellent results. However, there remain a few common drawbacks. For example, some methods do not consider graphs' high-order structure information. Thus, fuller data information cannot be obtained. In addition, some methods remove noise, outliers, and redundant information in the graph learning phase, resulting in the loss of graph information. Furthermore, using predefined graphs cannot exploit complementary information between views. A triple strategy-based multi-view clustering method is presented to solve the above issues. First, Laplacian graphs are used for fusion learning, and the underlying first-order and second-order structure information among views are explored simultaneously. Then, a label fusion scheme is designed to eliminate noise, outliers, and redundant information and to mine the intrinsic characteristics of data labels. Besides, the consistent label matrix in adaptive regression learning is used to explore complementary information between views in a mutually guided learning way. Finally, the objective function is solved by using an efficient iterative method. Six types of experiments are conducted on eleven real-world multi-view datasets, and the conclusions that can be drawn are: (1) the proposed algorithm achieves the best results in terms of clustering accuracy on ten datasets with an average accuracy improvement of 5.11% compared to other algorithms. Specifically, the accuracy improved by 9.05% on dataset HW and 10.95% on dataset Reuters compared to the second results; (2) The ablation experiments confirm that the different learning strategies included in the proposed algorithm allow it to achieve better clustering performance.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142122593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent fault diagnosis for unbalanced battery data using adversarial domain expansion and enhanced stochastic configuration networks 利用对抗域扩展和增强型随机配置网络对不平衡电池数据进行智能故障诊断
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-28 DOI: 10.1016/j.ins.2024.121399

An accurate and efficient fault diagnosis method for battery systems is crucial to ensuring the safety of battery packs. Addressing the issue of insufficient actual fault data in battery operations, this paper proposes an intelligent fault diagnosis method based on feature-enhanced stochastic configuration networks and adversarial domain expansion of imbalanced battery fault data (AFDEM-FESCN). Firstly, we designed an adversarial fault domain data expansion method (AFDEM). By learning the distribution of fault data through adversarial training, the distribution of sample domains is balanced, thereby reducing model bias. Subsequently, we adjusted the distribution of SCN iterative parameters and added a linear feature layer. This enhances the feature extraction capability of the network through distribution overlay, enabling fault diagnosis. Finally, the effectiveness and feasibility of the proposed method were validated through a practical battery system fault diagnosis case, achieving a diagnostic accuracy of 92.1%. Experimental results demonstrate that the AFDEM-FESCN method exhibits good accuracy in battery system fault diagnosis, providing an effective solution to the challenge of imbalanced data in intelligent fault diagnosis.

准确高效的电池系统故障诊断方法对于确保电池组的安全性至关重要。针对电池运行中实际故障数据不足的问题,本文提出了一种基于特征增强随机配置网络和不平衡电池故障数据对抗性域扩展(AFDEM-FESCN)的智能故障诊断方法。首先,我们设计了一种对抗性故障域数据扩展方法(AFDEM)。通过对抗训练学习故障数据的分布,平衡样本域的分布,从而减少模型偏差。随后,我们调整了 SCN 迭代参数的分布,并添加了线性特征层。这就通过分布叠加增强了网络的特征提取能力,从而实现故障诊断。最后,通过一个实际的电池系统故障诊断案例验证了所提方法的有效性和可行性,诊断准确率达到 92.1%。实验结果表明,AFDEM-FESCN 方法在电池系统故障诊断中表现出良好的准确性,为智能故障诊断中的不平衡数据挑战提供了有效的解决方案。
{"title":"Intelligent fault diagnosis for unbalanced battery data using adversarial domain expansion and enhanced stochastic configuration networks","authors":"","doi":"10.1016/j.ins.2024.121399","DOIUrl":"10.1016/j.ins.2024.121399","url":null,"abstract":"<div><p>An accurate and efficient fault diagnosis method for battery systems is crucial to ensuring the safety of battery packs. Addressing the issue of insufficient actual fault data in battery operations, this paper proposes an intelligent fault diagnosis method based on feature-enhanced stochastic configuration networks and adversarial domain expansion of imbalanced battery fault data (AFDEM-FESCN). Firstly, we designed an adversarial fault domain data expansion method (AFDEM). By learning the distribution of fault data through adversarial training, the distribution of sample domains is balanced, thereby reducing model bias. Subsequently, we adjusted the distribution of SCN iterative parameters and added a linear feature layer. This enhances the feature extraction capability of the network through distribution overlay, enabling fault diagnosis. Finally, the effectiveness and feasibility of the proposed method were validated through a practical battery system fault diagnosis case, achieving a diagnostic accuracy of 92.1%. Experimental results demonstrate that the AFDEM-FESCN method exhibits good accuracy in battery system fault diagnosis, providing an effective solution to the challenge of imbalanced data in intelligent fault diagnosis.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Surrogate-Assisted Differential Evolution with multiple sampling mechanisms for high-dimensional expensive problems 针对高维昂贵问题的具有多重采样机制的代用辅助差分进化论
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-28 DOI: 10.1016/j.ins.2024.121408

Recently, Surrogate-Assisted Evolutionary Algorithms (SAEAs) have been widely employed in solving Expensive Optimization Problems (EOPs) due to their efficiency in obtaining satisfactory solutions with limited resources. By leveraging historical data to construct surrogate models for approximation, SAEAs can significantly reduce the number of expensive fitness evaluations for EOPs. A hierarchical SAEA optimization framework based on evolutionary sampling methods has achieved remarkable success in High-dimensional EOPs (HEOPs), which can effectively balance the exploration and exploitation capabilities. However, the majority of existing hierarchical SAEAs focus on either switching between different sampling strategies or enhancing a single sampling strategy, potentially overlooking the potential to improve multiple sampling strategies simultaneously. In this paper, we propose a Surrogate-Assisted Differential Evolution with Multiple Sampling Mechanisms (SADE-MSM) to tackle HEOPs, incorporating three sampling strategies with different mechanisms. The contributions of SADE-MSM are summarized as follows: 1) A centroid sampling method is applied before iterative optimization to enhance the early exploration ability; 2) An improved global prescreening sampling strategy is introduced to balance the exploration and exploitation capabilities; 3) A local search sampling with the adaptive optimal region strategy is proposed, significantly improving the exploitation ability. To validate the performance of SADE-MSM, we compared it with the state-of-the-art SAEAs on benchmark problems with dimensions ranging from 30 to 500. Experimental results demonstrate that SADE-MSM has a significant performance superiority.

最近,代用辅助进化算法(SAEAs)被广泛用于解决昂贵的优化问题(EOPs),因为它们能在有限的资源下高效地获得令人满意的解决方案。通过利用历史数据来构建近似代用模型,SAEAs 可以显著减少 EOPs 中昂贵的合适度评估次数。基于进化抽样方法的分层 SAEA 优化框架在高维 EOPs(HEOPs)中取得了显著的成功,可以有效地平衡探索和利用能力。然而,现有的分层 SAEA 大多侧重于在不同采样策略之间切换或增强单一采样策略,可能忽略了同时改进多种采样策略的潜力。在本文中,我们提出了一种具有多种采样机制的代理辅助差分进化(SADE-MSM)来解决 HEOPs 问题,其中包含三种具有不同机制的采样策略。SADE-MSM 的贡献概述如下:1) 在迭代优化之前采用中心点采样方法,增强了早期探索能力;2) 引入了改进的全局预筛选采样策略,平衡了探索和利用能力;3) 提出了具有自适应最优区域策略的局部搜索采样,显著提高了利用能力。为了验证 SADE-MSM 的性能,我们在维度为 30 到 500 的基准问题上将其与最先进的 SAEA 进行了比较。实验结果表明,SADE-MSM 具有显著的性能优势。
{"title":"Surrogate-Assisted Differential Evolution with multiple sampling mechanisms for high-dimensional expensive problems","authors":"","doi":"10.1016/j.ins.2024.121408","DOIUrl":"10.1016/j.ins.2024.121408","url":null,"abstract":"<div><p>Recently, Surrogate-Assisted Evolutionary Algorithms (SAEAs) have been widely employed in solving Expensive Optimization Problems (EOPs) due to their efficiency in obtaining satisfactory solutions with limited resources. By leveraging historical data to construct surrogate models for approximation, SAEAs can significantly reduce the number of expensive fitness evaluations for EOPs. A hierarchical SAEA optimization framework based on evolutionary sampling methods has achieved remarkable success in High-dimensional EOPs (HEOPs), which can effectively balance the exploration and exploitation capabilities. However, the majority of existing hierarchical SAEAs focus on either switching between different sampling strategies or enhancing a single sampling strategy, potentially overlooking the potential to improve multiple sampling strategies simultaneously. In this paper, we propose a Surrogate-Assisted Differential Evolution with Multiple Sampling Mechanisms (SADE-MSM) to tackle HEOPs, incorporating three sampling strategies with different mechanisms. The contributions of SADE-MSM are summarized as follows: 1) A centroid sampling method is applied before iterative optimization to enhance the early exploration ability; 2) An improved global prescreening sampling strategy is introduced to balance the exploration and exploitation capabilities; 3) A local search sampling with the adaptive optimal region strategy is proposed, significantly improving the exploitation ability. To validate the performance of SADE-MSM, we compared it with the state-of-the-art SAEAs on benchmark problems with dimensions ranging from 30 to 500. Experimental results demonstrate that SADE-MSM has a significant performance superiority.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sequential three-way group decision-making for double hierarchy hesitant fuzzy linguistic term set 双层次犹豫模糊语言术语集的顺序三向群体决策
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-08-28 DOI: 10.1016/j.ins.2024.121403

Group decision-making (GDM) characterized by complexity and uncertainty is an essential part of various life scenarios. Most existing researches lack tools to fuse information quickly and interpret decision results for partially formed decisions. This limitation is particularly noticeable when there is a need to improve the efficiency of GDM. To address this issue, a novel multi-level sequential three-way decision for group decision-making (S3W-GDM) method is constructed from the perspective of granular computing. This method simultaneously considers the vagueness, hesitation, and variation of GDM problems under double hierarchy hesitant fuzzy linguistic term sets (DHHFLTS) environment. First, for fusing information efficiently, a novel multi-level expert information fusion method is proposed, and the concepts of expert decision table and the extraction/aggregation of decision-leveled information based on the multi-level granularity are defined. Second, the neighborhood theory, outranking relation and regret theory (RT) are utilized to redesign the calculations of conditional probability and relative loss function. Then, the granular structure of DHHFLTS based on the sequential three-way decision (S3WD) is defined to improve the decision-making efficiency, and the decision-making strategy and interpretation of each decision-level are proposed. Furthermore, the algorithm of S3W-GDM is given. Finally, an illustrative example of diagnosis is presented, and the comparative and sensitivity analysis with other methods are performed to verify the efficiency and rationality of the proposed method.

以复杂性和不确定性为特征的群体决策(GDM)是各种生活场景的重要组成部分。大多数现有研究都缺乏快速融合信息和解释部分决策结果的工具。当需要提高 GDM 的效率时,这种局限性尤为明显。为解决这一问题,我们从粒度计算的角度出发,构建了一种新颖的多层次群体决策顺序三向决策(S3W-GDM)方法。该方法同时考虑了双层次犹豫模糊语言项集(DHHFLTS)环境下 GDM 问题的模糊性、犹豫性和变异性。首先,为了有效地融合信息,提出了一种新颖的多级专家信息融合方法,并定义了专家决策表和基于多级粒度的决策级信息提取/聚合的概念。其次,利用邻域理论、排名关系和后悔理论(RT)重新设计了条件概率和相对损失函数的计算方法。然后,定义了基于顺序三向决策(S3WD)的 DHHFLTS 的粒度结构,提高了决策效率,并提出了各决策层的决策策略和解释。此外,还给出了 S3W-GDM 算法。最后,给出了一个诊断实例,并与其他方法进行了比较和敏感性分析,以验证所提方法的效率和合理性。
{"title":"Sequential three-way group decision-making for double hierarchy hesitant fuzzy linguistic term set","authors":"","doi":"10.1016/j.ins.2024.121403","DOIUrl":"10.1016/j.ins.2024.121403","url":null,"abstract":"<div><p>Group decision-making (GDM) characterized by complexity and uncertainty is an essential part of various life scenarios. Most existing researches lack tools to fuse information quickly and interpret decision results for partially formed decisions. This limitation is particularly noticeable when there is a need to improve the efficiency of GDM. To address this issue, a novel multi-level sequential three-way decision for group decision-making (S3W-GDM) method is constructed from the perspective of granular computing. This method simultaneously considers the vagueness, hesitation, and variation of GDM problems under double hierarchy hesitant fuzzy linguistic term sets (DHHFLTS) environment. First, for fusing information efficiently, a novel multi-level expert information fusion method is proposed, and the concepts of expert decision table and the extraction/aggregation of decision-leveled information based on the multi-level granularity are defined. Second, the neighborhood theory, outranking relation and regret theory (RT) are utilized to redesign the calculations of conditional probability and relative loss function. Then, the granular structure of DHHFLTS based on the sequential three-way decision (S3WD) is defined to improve the decision-making efficiency, and the decision-making strategy and interpretation of each decision-level are proposed. Furthermore, the algorithm of S3W-GDM is given. Finally, an illustrative example of diagnosis is presented, and the comparative and sensitivity analysis with other methods are performed to verify the efficiency and rationality of the proposed method.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information Sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1