Granular correlation-based label-specific feature augmentation for multi-label classification
Journal: Information Sciences (Impact Factor 8.1; JCR category: Computer Science, Information Systems)
Published: 2024-09-13
DOI: 10.1016/j.ins.2024.121473
URL: https://www.sciencedirect.com/science/article/pii/S0020025524013872
Citations: 0
Abstract
Multi-label classification extends single-label classification by generating multiple outputs for each unseen instance. Label correlation is an essential component in constructing multi-label classifiers. How to optimize the representation of label correlation while preserving label-specific semantics remains an open question. Instead of estimating label correlation from a holistic feature representation, we present an augmented label correlation model that generates multi-granularity label-specific features. First, we devise a mixture distance measure that characterizes the closeness of instances by weighting the Pearson correlation coefficient with cosine similarity. Second, we explore local label-specific relative discrimination by leveraging both the instance-level and class-level correlation distributions within the k-nearest neighborhood. Finally, we apply an information fusion strategy that integrates the positive and negative tendencies at the neighborhood level. Instances with a salient positive tendency and a compact neighborhood structure receive larger values, whereas instances with a salient negative tendency and a sparse neighborhood structure receive smaller values. Concatenating the original features with the augmented features, we evaluate the classification performance of the proposed granule correlation-based feature augmentation (GOFA) on well-established second-order multi-label classification methods. Extensive comparisons on thirteen benchmarks demonstrate the statistical superiority of GOFA over state-of-the-art multi-label classifiers.
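The abstract names the ingredients (a Pearson/cosine mixture measure, a k-nearest neighborhood, positive and negative tendencies, and concatenation with the original features) but gives no formulas. The following is therefore only a rough sketch of the general idea, not the authors' GOFA method. The mixing weight `alpha`, the mapping of similarities to weights in [0, 1], the signed-vote fusion rule, and the function names `mixture_similarity` and `augment_features` are all assumptions made for illustration.

```python
import numpy as np


def mixture_similarity(X, alpha=0.5):
    """Blend Pearson correlation and cosine similarity between rows of X.

    alpha is a hypothetical mixing weight; the abstract does not specify one.
    """
    X = np.asarray(X, dtype=float)
    eps = 1e-12  # guards against division by zero for constant or zero rows

    # Cosine similarity: dot products of the length-normalized rows.
    unit = X / (np.linalg.norm(X, axis=1, keepdims=True) + eps)
    cosine = unit @ unit.T

    # Pearson correlation: cosine similarity of the mean-centered rows.
    centered = X - X.mean(axis=1, keepdims=True)
    unit_c = centered / (np.linalg.norm(centered, axis=1, keepdims=True) + eps)
    pearson = unit_c @ unit_c.T

    return alpha * pearson + (1.0 - alpha) * cosine


def augment_features(X, Y, k=3, alpha=0.5):
    """Append one neighborhood-tendency feature per label to X.

    Assumed fusion rule (not taken from the paper): each of the k nearest
    neighbors votes +1 if it carries the label and -1 otherwise, weighted
    by its similarity rescaled to [0, 1]. Requires k < number of instances.
    """
    X = np.asarray(X, dtype=float)
    Y = np.asarray(Y, dtype=float)

    sim = mixture_similarity(X, alpha)
    np.fill_diagonal(sim, -np.inf)  # an instance is not its own neighbor

    # Indices and similarities of the k most similar instances per row.
    neighbors = np.argsort(-sim, axis=1)[:, :k]
    neighbor_sim = np.take_along_axis(sim, neighbors, axis=1)
    weights = (neighbor_sim + 1.0) / 2.0  # rescale [-1, 1] to [0, 1]

    # signed has shape (n, k, q): +1 for positive, -1 for negative neighbors.
    signed = 2.0 * Y[neighbors] - 1.0
    tendency = (weights[:, :, None] * signed).sum(axis=1)
    tendency /= weights.sum(axis=1, keepdims=True) + 1e-12

    # Concatenate original features with the augmented label-specific ones.
    return np.hstack([X, tendency])
```

Under these assumptions, an instance whose close neighbors mostly carry a label gets a value near +1 for that label, and one whose neighbors mostly lack it gets a value near -1. The augmented matrix can then be passed to any multi-label classifier in place of the original features.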
About the journal:
Information Sciences (Informatics and Computer Science, Intelligent Systems, Applications) is an international journal that publishes original and creative research findings in the field of information sciences. It also features a limited number of timely tutorial and survey contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.