{"title":"基于聚类引导的截断哈希增强近似近邻搜索","authors":"Mingyang Liu;Zuyuan Yang;Wei Han;Shengli Xie","doi":"10.1109/LSP.2024.3509333","DOIUrl":null,"url":null,"abstract":"Hashing is essential for approximate nearest neighbor search by mapping high-dimensional data to compact binary codes. The balance between similarity preservation and code diversity is a key challenge. Existing projection-based methods often struggle with fitting binary codes to continuous space due to space heterogeneity. To address this, we propose a novel Cluster Guided Truncated Hashing (CGTH) method that uses latent cluster information to guide the binary learning process. By leveraging data clusters as anchor points and applying a truncated coding strategy, our method effectively maintains local similarity and code diversity. Experiments on benchmark datasets demonstrate that CGTH outperforms existing methods, achieving superior search performance.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"181-185"},"PeriodicalIF":3.2000,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cluster Guided Truncated Hashing for Enhanced Approximate Nearest Neighbor Search\",\"authors\":\"Mingyang Liu;Zuyuan Yang;Wei Han;Shengli Xie\",\"doi\":\"10.1109/LSP.2024.3509333\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hashing is essential for approximate nearest neighbor search by mapping high-dimensional data to compact binary codes. The balance between similarity preservation and code diversity is a key challenge. Existing projection-based methods often struggle with fitting binary codes to continuous space due to space heterogeneity. To address this, we propose a novel Cluster Guided Truncated Hashing (CGTH) method that uses latent cluster information to guide the binary learning process. By leveraging data clusters as anchor points and applying a truncated coding strategy, our method effectively maintains local similarity and code diversity. Experiments on benchmark datasets demonstrate that CGTH outperforms existing methods, achieving superior search performance.\",\"PeriodicalId\":13154,\"journal\":{\"name\":\"IEEE Signal Processing Letters\",\"volume\":\"32 \",\"pages\":\"181-185\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-11-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Signal Processing Letters\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10771637/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Signal Processing Letters","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10771637/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Cluster Guided Truncated Hashing for Enhanced Approximate Nearest Neighbor Search
Hashing is essential for approximate nearest neighbor search by mapping high-dimensional data to compact binary codes. The balance between similarity preservation and code diversity is a key challenge. Existing projection-based methods often struggle with fitting binary codes to continuous space due to space heterogeneity. To address this, we propose a novel Cluster Guided Truncated Hashing (CGTH) method that uses latent cluster information to guide the binary learning process. By leveraging data clusters as anchor points and applying a truncated coding strategy, our method effectively maintains local similarity and code diversity. Experiments on benchmark datasets demonstrate that CGTH outperforms existing methods, achieving superior search performance.
期刊介绍:
The IEEE Signal Processing Letters is a monthly, archival publication designed to provide rapid dissemination of original, cutting-edge ideas and timely, significant contributions in signal, image, speech, language and audio processing. Papers published in the Letters can be presented within one year of their appearance in signal processing conferences such as ICASSP, GlobalSIP and ICIP, and also in several workshop organized by the Signal Processing Society.