Zhiming Liu , Jinhai Li , Xiao Zhang , Xizhao Wang
{"title":"Multi-level information fusion for missing multi-label learning based on stochastic concept clustering","authors":"Zhiming Liu , Jinhai Li , Xiao Zhang , Xizhao Wang","doi":"10.1016/j.inffus.2024.102775","DOIUrl":null,"url":null,"abstract":"<div><div>Missing multi-label learning is to address the problem of missing labels in multi-label datasets for multi-label classification tasks. Notably, the complex dependencies that typically exist between labels make accurate classification particularly challenging in the presence of missing labels. Some existing missing multi-label classification models often utilize feature selection to effectively recognize the dependencies between labels and features. However, they are ineffective at capturing hierarchical relationships of feature information, probably leading to a decline in prediction performance. To address this problem, this paper proposes a missing multi-label classification model based on multi-level stochastic concept clustering (MML-MSCC) to make dependencies between features and labels recognized more accurately and prediction performance better. In our model, optimal granularity selection is achieved through the global mutual information between features and labels, which makes the study of stochastic granule concept across multiple granularities. Furthermore, we utilize a stochastic concept clustering method to combine similar feature information for the purpose of making the missing label completion more reasonable. Note that stochastic granule concept clustering is performed with cross-granularity, thereby effectively capturing hierarchical relationships among feature information. Finally, to evaluate the performance of our model, we compare the MML-MSCC model with 9 existing missing multi-label classification models on 12 open datasets in terms of six evaluation metrics.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"115 ","pages":"Article 102775"},"PeriodicalIF":14.7000,"publicationDate":"2024-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253524005530","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Missing multi-label learning is to address the problem of missing labels in multi-label datasets for multi-label classification tasks. Notably, the complex dependencies that typically exist between labels make accurate classification particularly challenging in the presence of missing labels. Some existing missing multi-label classification models often utilize feature selection to effectively recognize the dependencies between labels and features. However, they are ineffective at capturing hierarchical relationships of feature information, probably leading to a decline in prediction performance. To address this problem, this paper proposes a missing multi-label classification model based on multi-level stochastic concept clustering (MML-MSCC) to make dependencies between features and labels recognized more accurately and prediction performance better. In our model, optimal granularity selection is achieved through the global mutual information between features and labels, which makes the study of stochastic granule concept across multiple granularities. Furthermore, we utilize a stochastic concept clustering method to combine similar feature information for the purpose of making the missing label completion more reasonable. Note that stochastic granule concept clustering is performed with cross-granularity, thereby effectively capturing hierarchical relationships among feature information. Finally, to evaluate the performance of our model, we compare the MML-MSCC model with 9 existing missing multi-label classification models on 12 open datasets in terms of six evaluation metrics.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.