{"title":"Multi-view clustering via double spaces structure learning and adaptive multiple projection regression learning","authors":"","doi":"10.1016/j.ins.2024.121396","DOIUrl":null,"url":null,"abstract":"<div><p>Multi-view clustering aims to group objects with high similarity into one group according to the heterogeneous features of different views. The graph-based clustering methods have obtained excellent results. However, there remain a few common drawbacks. For example, some methods do not consider graphs' high-order structure information. Thus, fuller data information cannot be obtained. In addition, some methods remove noise, outliers, and redundant information in the graph learning phase, resulting in the loss of graph information. Furthermore, using predefined graphs cannot exploit complementary information between views. A triple strategy-based multi-view clustering method is presented to solve the above issues. First, Laplacian graphs are used for fusion learning, and the underlying first-order and second-order structure information among views are explored simultaneously. Then, a label fusion scheme is designed to eliminate noise, outliers, and redundant information and to mine the intrinsic characteristics of data labels. Besides, the consistent label matrix in adaptive regression learning is used to explore complementary information between views in a mutually guided learning way. Finally, the objective function is solved by using an efficient iterative method. Six types of experiments are conducted on eleven real-world multi-view datasets, and the conclusions that can be drawn are: (1) the proposed algorithm achieves the best results in terms of clustering accuracy on ten datasets with an average accuracy improvement of 5.11% compared to other algorithms. Specifically, the accuracy improved by 9.05% on dataset HW and 10.95% on dataset Reuters compared to the second results; (2) The ablation experiments confirm that the different learning strategies included in the proposed algorithm allow it to achieve better clustering performance.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025524013100","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Multi-view clustering aims to group objects with high similarity into one group according to the heterogeneous features of different views. The graph-based clustering methods have obtained excellent results. However, there remain a few common drawbacks. For example, some methods do not consider graphs' high-order structure information. Thus, fuller data information cannot be obtained. In addition, some methods remove noise, outliers, and redundant information in the graph learning phase, resulting in the loss of graph information. Furthermore, using predefined graphs cannot exploit complementary information between views. A triple strategy-based multi-view clustering method is presented to solve the above issues. First, Laplacian graphs are used for fusion learning, and the underlying first-order and second-order structure information among views are explored simultaneously. Then, a label fusion scheme is designed to eliminate noise, outliers, and redundant information and to mine the intrinsic characteristics of data labels. Besides, the consistent label matrix in adaptive regression learning is used to explore complementary information between views in a mutually guided learning way. Finally, the objective function is solved by using an efficient iterative method. Six types of experiments are conducted on eleven real-world multi-view datasets, and the conclusions that can be drawn are: (1) the proposed algorithm achieves the best results in terms of clustering accuracy on ten datasets with an average accuracy improvement of 5.11% compared to other algorithms. Specifically, the accuracy improved by 9.05% on dataset HW and 10.95% on dataset Reuters compared to the second results; (2) The ablation experiments confirm that the different learning strategies included in the proposed algorithm allow it to achieve better clustering performance.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.