{"title":"On learning discriminative embeddings for optimized top-k matching","authors":"Soumyadeep Ghosh , Mayank Vatsa , Richa Singh , Nalini Ratha","doi":"10.1016/j.patcog.2025.111341","DOIUrl":null,"url":null,"abstract":"<div><div>Optimizing overall classification accuracy in neural networks does not always yield the best top-<span><math><mi>k</mi></math></span> accuracy, a critical metric in many real-world applications. This discrepancy is particularly evident in scenarios where multiple classes exhibit high similarity and overlap in the embedding space, leading to class ambiguity during retrieval. Addressing this challenge, the paper proposes a novel method to enhance top-<span><math><mi>k</mi></math></span> matching performance by leveraging class relationships in the embedding space. The proposed approach first employs a clustering algorithm to group similar classes into superclusters, capturing their inherent similarity. Next, the compactness of these superclusters is optimized while preserving the discriminative properties of individual classes. This dual optimization improves the separability of classes within superclusters and enhances retrieval accuracy in ambiguous scenarios. Experimental results on diverse datasets, including STL-10, CIFAR-10, CIFAR-100, Stanford Online Products, CARS196, and SCface, demonstrate significant improvements in top-<span><math><mi>k</mi></math></span> accuracy, validating the effectiveness and generalizability of the proposed method.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"162 ","pages":"Article 111341"},"PeriodicalIF":7.5000,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325000019","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Optimizing overall classification accuracy in neural networks does not always yield the best top- accuracy, a critical metric in many real-world applications. This discrepancy is particularly evident in scenarios where multiple classes exhibit high similarity and overlap in the embedding space, leading to class ambiguity during retrieval. Addressing this challenge, the paper proposes a novel method to enhance top- matching performance by leveraging class relationships in the embedding space. The proposed approach first employs a clustering algorithm to group similar classes into superclusters, capturing their inherent similarity. Next, the compactness of these superclusters is optimized while preserving the discriminative properties of individual classes. This dual optimization improves the separability of classes within superclusters and enhances retrieval accuracy in ambiguous scenarios. Experimental results on diverse datasets, including STL-10, CIFAR-10, CIFAR-100, Stanford Online Products, CARS196, and SCface, demonstrate significant improvements in top- accuracy, validating the effectiveness and generalizability of the proposed method.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.