Yaojin Lin , Yulin Li , Shidong Lin , Lei Guo , Yu Mao
{"title":"Partial multi-label feature selection based on label distribution learning","authors":"Yaojin Lin , Yulin Li , Shidong Lin , Lei Guo , Yu Mao","doi":"10.1016/j.patcog.2025.111523","DOIUrl":null,"url":null,"abstract":"<div><div>Partial Multi-label Learning (PML) induces a multi-classifier in an imprecise supervised environment, where the candidate labels associated with each training sample are partially valid. The high-dimensional feature space, presented in PML data accompanied by ambiguous labeling information, is a significant challenge for learning. In this paper, we propose a PML feature selection method based on Label Distribution Learning (LDL), which handles the above challenges by correcting misleading and then selecting common and label-specific features. In the first procedure, the error distribution hypothesis is constructed, which divides the structure of ambiguous label information into minority and majority error distribution according to the error amount that may appear in the data annotation process. Under the analysis of the hypothesis, the label credibility distribution data (LCDD) was generated by identifying and correcting errors, where the fractional category of each label associated with each training sample describes the probability that the label belongs to that sample. In the second procedure, a discriminative feature subset is selected for PML based on LCDD by common and label-specific feature constraints. Experiments on three synthetic and five real PML datasets demonstrate the effectiveness of the proposed method.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"164 ","pages":"Article 111523"},"PeriodicalIF":7.5000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325001839","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Partial Multi-label Learning (PML) induces a multi-classifier in an imprecise supervised environment, where the candidate labels associated with each training sample are partially valid. The high-dimensional feature space, presented in PML data accompanied by ambiguous labeling information, is a significant challenge for learning. In this paper, we propose a PML feature selection method based on Label Distribution Learning (LDL), which handles the above challenges by correcting misleading and then selecting common and label-specific features. In the first procedure, the error distribution hypothesis is constructed, which divides the structure of ambiguous label information into minority and majority error distribution according to the error amount that may appear in the data annotation process. Under the analysis of the hypothesis, the label credibility distribution data (LCDD) was generated by identifying and correcting errors, where the fractional category of each label associated with each training sample describes the probability that the label belongs to that sample. In the second procedure, a discriminative feature subset is selected for PML based on LCDD by common and label-specific feature constraints. Experiments on three synthetic and five real PML datasets demonstrate the effectiveness of the proposed method.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.