首页 > 最新文献

Information Sciences最新文献

英文 中文
Ex-RL: Experience-based reinforcement learning
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-16 DOI: 10.1016/j.ins.2024.121479

Reinforcement learning (RL) has achieved significant success across various tasks. However, generalizing RL for similar tasks remains a challenge. This study leverages expertise from related tasks to introduce a novel algorithm, Ex-RL, for executing transfer learning in tabular RL. The methodology concentrates on abstracting previous experiences into descriptive data and utilizing such data for similar tasks. The research focuses on classic RL solutions for balancing and anti-balancing, which improve the sample efficiency of the learning process. Studies indicate that weak learners, such as Q-learning, require fewer learning episodes, resulting in a 50% improvement and a higher success rate in the learning process. An online virtual lab was developed to facilitate the execution of the experiments. The code is available at Github.

{"title":"Ex-RL: Experience-based reinforcement learning","authors":"","doi":"10.1016/j.ins.2024.121479","DOIUrl":"10.1016/j.ins.2024.121479","url":null,"abstract":"<div><p>Reinforcement learning (RL) has achieved significant success across various tasks. However, generalizing RL for similar tasks remains a challenge. This study leverages expertise from related tasks to introduce a novel algorithm, Ex-RL, for executing transfer learning in tabular RL. The methodology concentrates on abstracting previous experiences into descriptive data and utilizing such data for similar tasks. The research focuses on classic RL solutions for balancing and anti-balancing, which improve the sample efficiency of the learning process. Studies indicate that weak learners, such as Q-learning, require fewer learning episodes, resulting in a 50% improvement and a higher success rate in the learning process. An online virtual lab was developed to facilitate the execution of the experiments. The code is available at <span><span>Github</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142240482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint consensus kernel learning and adaptive hypergraph regularization for graph-based clustering
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-13 DOI: 10.1016/j.ins.2024.121468

Recent advancements in multiple kernel learning-based graph clustering methods have demonstrated significant promise in effectively learning a consensus kernel matrix from various candidate kernel matrices. This approach enables the creation of a low-dimensional representation in the kernel space of high-dimensional data through self-expressiveness. However, a key challenge remains in capturing the latent geometric properties embedded within different kernel matrices to enhance data representation. In this paper, we propose a novel method called joint consensus kernel learning and adaptive hypergraph regularization for graph-based clustering (JKHR). Our approach integrates an innovative adaptive hypergraph Laplacian regularizer, which is characterized by the fusion of multiple nearest neighbor kernel graphs, into the multiple kernel learning-based graph clustering framework. JKHR jointly and adaptively optimizes both the consensus kernel matrix and the hypergraph Laplacian regularizer to achieve a low-dimensional representation that effectively preserves the intrinsic geometry of the data. Experimental results on both synthetic and real benchmark datasets demonstrate that JKHR outperforms state-of-the-art self-expressiveness-based graph clustering methods as well as traditional clustering techniques.

{"title":"Joint consensus kernel learning and adaptive hypergraph regularization for graph-based clustering","authors":"","doi":"10.1016/j.ins.2024.121468","DOIUrl":"10.1016/j.ins.2024.121468","url":null,"abstract":"<div><p>Recent advancements in multiple kernel learning-based graph clustering methods have demonstrated significant promise in effectively learning a consensus kernel matrix from various candidate kernel matrices. This approach enables the creation of a low-dimensional representation in the kernel space of high-dimensional data through self-expressiveness. However, a key challenge remains in capturing the latent geometric properties embedded within different kernel matrices to enhance data representation. In this paper, we propose a novel method called joint consensus kernel learning and adaptive hypergraph regularization for graph-based clustering (JKHR). Our approach integrates an innovative adaptive hypergraph Laplacian regularizer, which is characterized by the fusion of multiple nearest neighbor kernel graphs, into the multiple kernel learning-based graph clustering framework. JKHR jointly and adaptively optimizes both the consensus kernel matrix and the hypergraph Laplacian regularizer to achieve a low-dimensional representation that effectively preserves the intrinsic geometry of the data. Experimental results on both synthetic and real benchmark datasets demonstrate that JKHR outperforms state-of-the-art self-expressiveness-based graph clustering methods as well as traditional clustering techniques.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142240474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RT-DIFTWD: A novel data-driven intuitionistic fuzzy three-way decision model with regret theory
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-13 DOI: 10.1016/j.ins.2024.121471

With the rapid advancement of Web 3.0 and the digital transformation of the tourism industry, online reviews have emerged as a new source of information for potential tourists, making data-driven multi-attribute decision-making feasible. However, the vast number of online reviews significantly increases the complexity of tourists' decision-making. Recognizing the gaps in the current literature, particularly the lack of consideration of tourists' psychological behaviours in their decision-making processes and the inadequate handling of ambiguity and uncertainty in reviews, this study proposes a data-driven intuitionistic fuzzy regret-based three-way decision model (RT-DIFTWD). Specifically, after online reviews are crawled, a satisfaction function based on absolute and relative rationality scenarios with intuitionistic fuzzy sets is established by combining sentiment analysis and regret theory. Moreover, two attribute weight calculation methods that are based on frequency and importance conditions are proposed. A flexible three-way multi-attribute decision-making framework that is suitable for different MADM methods is subsequently proposed for deducing the prioritization and classification of alternatives. Finally, we demonstrate our proposed method through a real application of tourism selection in the Chengdu–Chongqing region. The stability, effectiveness and superiority of the presented method are validated by corresponding experimental studies and a comparative analysis.

{"title":"RT-DIFTWD: A novel data-driven intuitionistic fuzzy three-way decision model with regret theory","authors":"","doi":"10.1016/j.ins.2024.121471","DOIUrl":"10.1016/j.ins.2024.121471","url":null,"abstract":"<div><p>With the rapid advancement of Web 3.0 and the digital transformation of the tourism industry, online reviews have emerged as a new source of information for potential tourists, making data-driven multi-attribute decision-making feasible. However, the vast number of online reviews significantly increases the complexity of tourists' decision-making. Recognizing the gaps in the current literature, particularly the lack of consideration of tourists' psychological behaviours in their decision-making processes and the inadequate handling of ambiguity and uncertainty in reviews, this study proposes a data-driven intuitionistic fuzzy regret-based three-way decision model (RT-DIFTWD). Specifically, after online reviews are crawled, a satisfaction function based on absolute and relative rationality scenarios with intuitionistic fuzzy sets is established by combining sentiment analysis and regret theory. Moreover, two attribute weight calculation methods that are based on frequency and importance conditions are proposed. A flexible three-way multi-attribute decision-making framework that is suitable for different MADM methods is subsequently proposed for deducing the prioritization and classification of alternatives. Finally, we demonstrate our proposed method through a real application of tourism selection in the Chengdu–Chongqing region. The stability, effectiveness and superiority of the presented method are validated by corresponding experimental studies and a comparative analysis.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142240476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Granular correlation-based label-specific feature augmentation for multi-label classification
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-13 DOI: 10.1016/j.ins.2024.121473

Multi-label classification is an extension of single-label classification with generations of multi-output for unseen instances. Label correlation is an essential component in constructing multi-label classifiers. How to optimize the representation of label correlation while preserving the semantics of label-specific remains an uncertain issue. Instead of estimating label correlation by a holistic feature representation, we present an augmented label correlation model by generating multi-granularity label-specific features. Firstly, we devise a mixture distance measure to characterize the closeness of an instance by weighing the Pearson correlation coefficient with cosine similarity. Secondly, we explore the local label-specific relative discrimination by leveraging from both the instance-level and class-level correlation distribution within k nearest neighborhood. Finally, we conduct an information fusion strategy to comprehensively integrate the positive and the negative tendencies at the neighborhood level. Instances with salient positive tendency and compact neighborhood structure receive larger values while receiving smaller values with salient negative tendency and sparse neighborhood structure. With the concatenation of original features and augmented features, we examine the classification performance of the proposed granule correlation-based feature augmentation (GOFA) on well-established second-order multi-label classification methods. Extensive comparisons on thirteen benchmarks demonstrate the statistical superiority of GOFA over state-of-the-art multi-label classifications.

{"title":"Granular correlation-based label-specific feature augmentation for multi-label classification","authors":"","doi":"10.1016/j.ins.2024.121473","DOIUrl":"10.1016/j.ins.2024.121473","url":null,"abstract":"<div><p>Multi-label classification is an extension of single-label classification with generations of multi-output for unseen instances. Label correlation is an essential component in constructing multi-label classifiers. How to optimize the representation of label correlation while preserving the semantics of label-specific remains an uncertain issue. Instead of estimating label correlation by a holistic feature representation, we present an augmented label correlation model by generating multi-granularity label-specific features. Firstly, we devise a mixture distance measure to characterize the closeness of an instance by weighing the Pearson correlation coefficient with cosine similarity. Secondly, we explore the local label-specific relative discrimination by leveraging from both the instance-level and class-level correlation distribution within <em>k</em> nearest neighborhood. Finally, we conduct an information fusion strategy to comprehensively integrate the positive and the negative tendencies at the neighborhood level. Instances with salient positive tendency and compact neighborhood structure receive larger values while receiving smaller values with salient negative tendency and sparse neighborhood structure. With the concatenation of original features and augmented features, we examine the classification performance of the proposed granule correlation-based feature augmentation (GOFA) on well-established second-order multi-label classification methods. Extensive comparisons on thirteen benchmarks demonstrate the statistical superiority of GOFA over state-of-the-art multi-label classifications.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142240483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A mean shift algorithm incorporating reachable distance for spatial clustering
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-12 DOI: 10.1016/j.ins.2024.121456

Spatial clustering is a widely used technique in spatial analysis that groups similar objects together based on their proximity in space. However, traditional clustering algorithms often fail to ensure the accessibility of cluster centers, which limits their validity in practical applications such as facility location problems. To address this issue, this article introduces a novel Mean Shift algorithm that incorporates reachable distance and an iterative mechanism to accurately locate cluster centers. The proposed algorithm initially labels clustering elements with road network coordinates to facilitate the calculation of reachable distance and the cluster center iterative mechanism. Subsequently, the mean shift vector function is modified to employ reachable distance as the measure of geographic reachable similarity. Unlike existing algorithms, our approach allows for cluster centers to be positioned independently of the clustering elements, guaranteeing geographical accessibility. Through simulation experiments, we demonstrate that our proposed algorithm not only outperforms existing methods in terms of solution quality, but also effectively addresses the limitations of disregarding geographical obstacles and unreachable cluster centers.

{"title":"A mean shift algorithm incorporating reachable distance for spatial clustering","authors":"","doi":"10.1016/j.ins.2024.121456","DOIUrl":"10.1016/j.ins.2024.121456","url":null,"abstract":"<div><p>Spatial clustering is a widely used technique in spatial analysis that groups similar objects together based on their proximity in space. However, traditional clustering algorithms often fail to ensure the accessibility of cluster centers, which limits their validity in practical applications such as facility location problems. To address this issue, this article introduces a novel Mean Shift algorithm that incorporates reachable distance and an iterative mechanism to accurately locate cluster centers. The proposed algorithm initially labels clustering elements with road network coordinates to facilitate the calculation of reachable distance and the cluster center iterative mechanism. Subsequently, the mean shift vector function is modified to employ reachable distance as the measure of geographic reachable similarity. Unlike existing algorithms, our approach allows for cluster centers to be positioned independently of the clustering elements, guaranteeing geographical accessibility. Through simulation experiments, we demonstrate that our proposed algorithm not only outperforms existing methods in terms of solution quality, but also effectively addresses the limitations of disregarding geographical obstacles and unreachable cluster centers.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142240481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KPI-oriented process monitoring based on causal-weighted partial least squares 基于因果加权偏最小二乘法的以 KPI 为导向的流程监控
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-12 DOI: 10.1016/j.ins.2024.121470

With the increasing demand for product quality, Key performance indicator (KPI)-oriented process monitoring plays an important role in modern industrial. Partial least squares (PLS)-based methods are widely adopted for KPI-oriented process monitoring, in which the process variable space is decomposed into a principal component subspace that has a strong correlation with KPIs and a residual subspace unrelated to KPIs, and the two spaces are monitored separately. However, if a KPI-unrelated fault occurs in a variable that has no causal relation with any KPI, but has a spurious correlation with some KPIs because of the existence of confounders, PLS based methods may mis-regard the KPI-unrelated fault as a KPI-related fault. This occurs because as a correlation analysis-based method, PLS cannot discriminate whether a variable is a real cause of KPIs or has a spurious correlation with KPIs. Motivated by this, this paper proposes two causal-weighted PLS methods for KPI-oriented process monitoring by combining the LiNGAM-based causal discovery with PLS, which first calculate causal weights based on the modified LiNGAM algorithm and bootstrap strategy, and then employ the causal weights to reweight the weight vectors of PLS to enhance the influence of causal variables in the KPI-related principal subspace and reduce the influence of spurious correlations. Case studies based on a simulated dataset, Tennessee Eastman Process dataset and field data from finishing rolling mill process show that the proposed methods can significantly reduce the false alarm rate for KPI-unrelated faults (i.e., the probability that a KPI-unrelated fault sample is mis-regarded as a KPI-related fault sample) caused by spurious correlation without significantly compromising the fault detection rate of KPI-related faults.

随着产品质量要求的不断提高,以关键绩效指标(KPI)为导向的过程监控在现代工业中发挥着重要作用。基于偏最小二乘法(PLS)的方法被广泛应用于以 KPI 为导向的过程监控,即把过程变量空间分解为与 KPI 高度相关的主成分子空间和与 KPI 无关的残差子空间,并分别对这两个空间进行监控。然而,如果一个与任何 KPI 都没有因果关系的变量出现了与 KPI 无关的故障,但由于混杂因素的存在,该变量与某些 KPI 存在虚假相关性,则基于 PLS 的方法可能会将与 KPI 无关的故障误认为是与 KPI 相关的故障。出现这种情况的原因是,作为一种基于相关性分析的方法,PLS 无法区分变量是 KPI 的真正原因还是与 KPI 存在虚假相关性。受此启发,本文通过将基于 LiNGAM 的因果发现与 PLS 结合,提出了两种面向 KPI 过程监控的因果加权 PLS 方法,它们首先基于改进的 LiNGAM 算法和引导策略计算因果权重,然后利用因果权重对 PLS 的权重向量进行重新加权,以增强因果变量在 KPI 相关主子空间中的影响力并降低虚假相关性的影响。基于模拟数据集、田纳西州伊士曼工艺数据集和精轧机工艺现场数据的案例研究表明,所提出的方法可以显著降低由虚假相关性引起的 KPI 不相关故障的误报率(即 KPI 不相关故障样本被误认为 KPI 相关故障样本的概率),而不会明显影响 KPI 相关故障的检测率。
{"title":"KPI-oriented process monitoring based on causal-weighted partial least squares","authors":"","doi":"10.1016/j.ins.2024.121470","DOIUrl":"10.1016/j.ins.2024.121470","url":null,"abstract":"<div><p>With the increasing demand for product quality, Key performance indicator (KPI)-oriented process monitoring plays an important role in modern industrial. Partial least squares (PLS)-based methods are widely adopted for KPI-oriented process monitoring, in which the process variable space is decomposed into a principal component subspace that has a strong correlation with KPIs and a residual subspace unrelated to KPIs, and the two spaces are monitored separately. However, if a KPI-unrelated fault occurs in a variable that has no causal relation with any KPI, but has a spurious correlation with some KPIs because of the existence of confounders, PLS based methods may mis-regard the KPI-unrelated fault as a KPI-related fault. This occurs because as a correlation analysis-based method, PLS cannot discriminate whether a variable is a real cause of KPIs or has a spurious correlation with KPIs. Motivated by this, this paper proposes two causal-weighted PLS methods for KPI-oriented process monitoring by combining the LiNGAM-based causal discovery with PLS, which first calculate causal weights based on the modified LiNGAM algorithm and bootstrap strategy, and then employ the causal weights to reweight the weight vectors of PLS to enhance the influence of causal variables in the KPI-related principal subspace and reduce the influence of spurious correlations. Case studies based on a simulated dataset, Tennessee Eastman Process dataset and field data from finishing rolling mill process show that the proposed methods can significantly reduce the false alarm rate for KPI-unrelated faults (i.e., the probability that a KPI-unrelated fault sample is mis-regarded as a KPI-related fault sample) caused by spurious correlation without significantly compromising the fault detection rate of KPI-related faults.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142229872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incremental attribute reduction for dynamic fuzzy decision information systems based on fuzzy knowledge granularity 基于模糊知识粒度的动态模糊决策信息系统的增量属性缩减
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-12 DOI: 10.1016/j.ins.2024.121467

Rough set-based attribute reduction is a powerful technique for data preprocessing in data mining. Knowledge granularity, as a reliable measure for assessing uncertainty in decision information systems (DIS), finds applicability in attribute reduction within such systems. Nevertheless, the limitation arises from the fact that static attribute reduction methods fail to effectively utilize the information contained in acquired data and promptly update knowledge due to the continuous evolution of data. In addition, existing incremental methods based on knowledge granularity are designed exclusively for symbolic data and lack the capability to handle real-valued data. Inspired by this, our study focuses on the attribute reduction approach for fuzzy decision information systems (FDIS) that encompass object variations by utilizing fuzzy knowledge granularity. Firstly, fuzzy knowledge granulation is defined to quantify uncertainty within FDIS, and utilized to determine the importance of attributes for attribute reduction. Additionally, the incremental mechanisms and attribute reduction algorithms are investigated for adding an object and an object set to FDIS, respectively. Moreover, an explication of how the incremental mechanism for increasing an object set can be viewed as a generalization of the mechanism used for a single object is provided. Finally, comparative experiments on various datasets are conducted to validate the effectiveness and efficiency of the proposed incremental algorithms. The results demonstrate that our algorithms achieve superior classification accuracy and while requiring minimal computing time when compared to the comparative algorithms.

基于粗糙集的属性还原是数据挖掘中一种强大的数据预处理技术。知识粒度作为评估决策信息系统(DIS)中不确定性的一种可靠措施,适用于此类系统中的属性还原。然而,静态属性还原方法无法有效利用所获数据中包含的信息,也无法因数据的不断变化而及时更新知识,因此存在局限性。此外,现有的基于知识粒度的增量方法专门针对符号数据而设计,缺乏处理实值数据的能力。受此启发,我们的研究重点是利用模糊知识粒度,为包含对象变化的模糊决策信息系统(FDIS)提供属性缩减方法。首先,我们定义了模糊知识粒度来量化 FDIS 中的不确定性,并利用它来确定属性的重要性,从而减少属性。此外,还分别研究了在 FDIS 中添加对象和对象集的增量机制和属性缩减算法。此外,还阐述了如何将增加对象集的增量机制视为用于单个对象的机制的一般化。最后,我们在各种数据集上进行了对比实验,以验证所提出的增量算法的有效性和效率。实验结果表明,与其他算法相比,我们的算法具有更高的分类准确性,同时所需的计算时间也最少。
{"title":"Incremental attribute reduction for dynamic fuzzy decision information systems based on fuzzy knowledge granularity","authors":"","doi":"10.1016/j.ins.2024.121467","DOIUrl":"10.1016/j.ins.2024.121467","url":null,"abstract":"<div><p>Rough set-based attribute reduction is a powerful technique for data preprocessing in data mining. Knowledge granularity, as a reliable measure for assessing uncertainty in decision information systems (DIS), finds applicability in attribute reduction within such systems. Nevertheless, the limitation arises from the fact that static attribute reduction methods fail to effectively utilize the information contained in acquired data and promptly update knowledge due to the continuous evolution of data. In addition, existing incremental methods based on knowledge granularity are designed exclusively for symbolic data and lack the capability to handle real-valued data. Inspired by this, our study focuses on the attribute reduction approach for fuzzy decision information systems (FDIS) that encompass object variations by utilizing fuzzy knowledge granularity. Firstly, fuzzy knowledge granulation is defined to quantify uncertainty within FDIS, and utilized to determine the importance of attributes for attribute reduction. Additionally, the incremental mechanisms and attribute reduction algorithms are investigated for adding an object and an object set to FDIS, respectively. Moreover, an explication of how the incremental mechanism for increasing an object set can be viewed as a generalization of the mechanism used for a single object is provided. Finally, comparative experiments on various datasets are conducted to validate the effectiveness and efficiency of the proposed incremental algorithms. The results demonstrate that our algorithms achieve superior classification accuracy and while requiring minimal computing time when compared to the comparative algorithms.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142229870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Order-sensitive competitive revenue maximization for viral marketing in social networks
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-12 DOI: 10.1016/j.ins.2024.121474

The Competitive Influence Maximization (CIM) problem is a critical issue in viral marketing, focusing on selecting a set of influential individuals, known as seed users, for competitors to maximize their revenue. These seed users have significant sway in social networks and serve as valuable marketing resources provided by the platform. They are often displayed in a certain order launched by the platform and the potential information hidden in the order can profoundly affect the final marketing outcomes. However, current CIM research predominantly emphasizes designing effective algorithms for seed selection while ignoring the impact of the seed order launched by the platform. Therefore, this paper focuses on identifying the optimal seed order to maximize platform revenue in a competitive market environment. Specifically, we introduce a new problem called Order-Sensitive Competitive Revenue Maximization (OSCRM) to investigate the CIM problem from a new practical perspective. We prove the problem to be NP-hard and present a simple greedy algorithm with a 1/3-approximate ratio. To address it more efficiently, we further propose an enhanced greedy algorithm called GMST. This algorithm leverages the maximum spanning tree (MST) and achieves a 1/2-approximate ratio. Extensive experiments on four real-world datasets demonstrate the effectiveness of our proposed GMST algorithm.

{"title":"Order-sensitive competitive revenue maximization for viral marketing in social networks","authors":"","doi":"10.1016/j.ins.2024.121474","DOIUrl":"10.1016/j.ins.2024.121474","url":null,"abstract":"<div><p>The Competitive Influence Maximization (CIM) problem is a critical issue in viral marketing, focusing on selecting a set of influential individuals, known as seed users, for competitors to maximize their revenue. These seed users have significant sway in social networks and serve as valuable marketing resources provided by the platform. They are often displayed in a certain order launched by the platform and the potential information hidden in the order can profoundly affect the final marketing outcomes. However, current CIM research predominantly emphasizes designing effective algorithms for seed selection while ignoring the impact of the seed order launched by the platform. Therefore, this paper focuses on identifying the optimal seed order to maximize platform revenue in a competitive market environment. Specifically, we introduce a new problem called Order-Sensitive Competitive Revenue Maximization (OSCRM) to investigate the CIM problem from a new practical perspective. We prove the problem to be NP-hard and present a simple greedy algorithm with a 1/3-approximate ratio. To address it more efficiently, we further propose an enhanced greedy algorithm called GMST. This algorithm leverages the maximum spanning tree (MST) and achieves a 1/2-approximate ratio. Extensive experiments on four real-world datasets demonstrate the effectiveness of our proposed GMST algorithm.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142240475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Interpretable sequence clustering 可解释序列聚类
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-12 DOI: 10.1016/j.ins.2024.121453

Categorical sequence clustering is vital across various domains; however, the interpretability of cluster assignments presents considerable challenges. Sequences inherently lack explicit features, and existing sequence clustering algorithms heavily rely on complex representations, which complicates the explanation of their outcomes. To address this issue, we propose a method called Interpretable Sequence Clustering Tree (ISCT), which combines sequential patterns with a concise and interpretable tree structure. ISCT leverages k1 patterns to generate k leaf nodes, corresponding to k clusters, which provides an intuitive explanation on how each cluster is formed. More precisely, ISCT first projects sequences into random subspaces and then utilizes the k-means algorithm to obtain high-quality initial cluster assignments. Subsequently, it constructs a pattern-based decision tree using a boosting strategy in which sequences are re-projected and re-clustered at each node before mining the top-1 discriminative splitting pattern. Experimental results on 14 real-world data sets demonstrate that our proposed method provides an interpretable tree structure while delivering fast and accurate cluster assignments.

分类序列聚类在各个领域都至关重要;然而,聚类分配的可解释性带来了相当大的挑战。序列本身缺乏明确的特征,而现有的序列聚类算法严重依赖于复杂的表示方法,这使得解释其结果变得更加复杂。为了解决这个问题,我们提出了一种名为可解释序列聚类树(ISCT)的方法,它将序列模式与简洁、可解释的树形结构相结合。ISCT 利用 k-1 个模式生成 k 个叶节点,对应 k 个聚类,从而直观地解释每个聚类是如何形成的。更确切地说,ISCT 首先将序列投影到随机子空间中,然后利用 k-means 算法获得高质量的初始聚类分配。随后,ISCT 利用提升策略构建基于模式的决策树,在每个节点上对序列进行重新投影和重新聚类,然后再挖掘前 1 位的判别分裂模式。在 14 个真实世界数据集上的实验结果表明,我们提出的方法能提供可解释的树形结构,同时提供快速准确的聚类分配。
{"title":"Interpretable sequence clustering","authors":"","doi":"10.1016/j.ins.2024.121453","DOIUrl":"10.1016/j.ins.2024.121453","url":null,"abstract":"<div><p>Categorical sequence clustering is vital across various domains; however, the interpretability of cluster assignments presents considerable challenges. Sequences inherently lack explicit features, and existing sequence clustering algorithms heavily rely on complex representations, which complicates the explanation of their outcomes. To address this issue, we propose a method called Interpretable Sequence Clustering Tree (ISCT), which combines sequential patterns with a concise and interpretable tree structure. ISCT leverages <span><math><mi>k</mi><mo>−</mo><mn>1</mn></math></span> patterns to generate <em>k</em> leaf nodes, corresponding to <em>k</em> clusters, which provides an intuitive explanation on how each cluster is formed. More precisely, ISCT first projects sequences into random subspaces and then utilizes the <em>k</em>-means algorithm to obtain high-quality initial cluster assignments. Subsequently, it constructs a pattern-based decision tree using a boosting strategy in which sequences are re-projected and re-clustered at each node before mining the top-1 discriminative splitting pattern. Experimental results on 14 real-world data sets demonstrate that our proposed method provides an interpretable tree structure while delivering fast and accurate cluster assignments.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142172778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reliability-driven large group consensus decision-making method with hesitant fuzzy linguistic information for the selection of hydrogen storage technology
IF 8.1 1区 计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-12 DOI: 10.1016/j.ins.2024.121457

The use of hydrogen storage technology (HST) as a bridge for producing and utilizing hydrogen energy in the hydrogen industry chain is significant, and its evaluation has attracted the interest of researchers. Since there are different types of HSTs, selecting the most appropriate one requires the participation of plenty of experts with different professional backgrounds, which makes this be modeled as a large group decision-making problem. This paper develops a reliability-driven large group consensus decision-making (LGCDM) method for HST selection using the hesitant fuzzy linguistic terms set (HFLTS) as the evaluation representation format. Specifically, the expertise level of individuals and the reliability of group opinions are measured based on the set variables, and then the dimensionality of large groups is reduced based on the reliability of subgroup opinions. Furthermore, an opinion reliability rating mechanism is designed and, when consensus is not satisfactory, a feedback recommendation mechanism and consensus optimization mechanism are developed for implementation. Finally, the proposed reliability-driven LGCDM approach is applied to the HST selection for THVOW Company, and the comparison with existent related approaches indicates that it not only is practical and reasonable, but also provides a technical path for relevant departments to make decisions on practical issues.

{"title":"Reliability-driven large group consensus decision-making method with hesitant fuzzy linguistic information for the selection of hydrogen storage technology","authors":"","doi":"10.1016/j.ins.2024.121457","DOIUrl":"10.1016/j.ins.2024.121457","url":null,"abstract":"<div><p>The use of hydrogen storage technology (HST) as a bridge for producing and utilizing hydrogen energy in the hydrogen industry chain is significant, and its evaluation has attracted the interest of researchers. Since there are different types of HSTs, selecting the most appropriate one requires the participation of plenty of experts with different professional backgrounds, which makes this be modeled as a large group decision-making problem. This paper develops a reliability-driven large group consensus decision-making (LGCDM) method for HST selection using the hesitant fuzzy linguistic terms set (HFLTS) as the evaluation representation format. Specifically, the expertise level of individuals and the reliability of group opinions are measured based on the set variables, and then the dimensionality of large groups is reduced based on the reliability of subgroup opinions. Furthermore, an opinion reliability rating mechanism is designed and, when consensus is not satisfactory, a feedback recommendation mechanism and consensus optimization mechanism are developed for implementation. Finally, the proposed reliability-driven LGCDM approach is applied to the HST selection for THVOW Company, and the comparison with existent related approaches indicates that it not only is practical and reasonable, but also provides a technical path for relevant departments to make decisions on practical issues.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142240484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information Sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1