Consistent Spectral Clustering in Hyperbolic Spaces

Sagar Ghosh, Swagatam Das
{"title":"Consistent Spectral Clustering in Hyperbolic Spaces","authors":"Sagar Ghosh, Swagatam Das","doi":"arxiv-2409.09304","DOIUrl":null,"url":null,"abstract":"Clustering, as an unsupervised technique, plays a pivotal role in various\ndata analysis applications. Among clustering algorithms, Spectral Clustering on\nEuclidean Spaces has been extensively studied. However, with the rapid\nevolution of data complexity, Euclidean Space is proving to be inefficient for\nrepresenting and learning algorithms. Although Deep Neural Networks on\nhyperbolic spaces have gained recent traction, clustering algorithms or\nnon-deep machine learning models on non-Euclidean Spaces remain underexplored.\nIn this paper, we propose a spectral clustering algorithm on Hyperbolic Spaces\nto address this gap. Hyperbolic Spaces offer advantages in representing complex\ndata structures like hierarchical and tree-like structures, which cannot be\nembedded efficiently in Euclidean Spaces. Our proposed algorithm replaces the\nEuclidean Similarity Matrix with an appropriate Hyperbolic Similarity Matrix,\ndemonstrating improved efficiency compared to clustering in Euclidean Spaces.\nOur contributions include the development of the spectral clustering algorithm\non Hyperbolic Spaces and the proof of its weak consistency. We show that our\nalgorithm converges at least as fast as Spectral Clustering on Euclidean\nSpaces. To illustrate the efficacy of our approach, we present experimental\nresults on the Wisconsin Breast Cancer Dataset, highlighting the superior\nperformance of Hyperbolic Spectral Clustering over its Euclidean counterpart.\nThis work opens up avenues for utilizing non-Euclidean Spaces in clustering\nalgorithms, offering new perspectives for handling complex data structures and\nimproving clustering efficiency.","PeriodicalId":501340,"journal":{"name":"arXiv - STAT - Machine Learning","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09304","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Clustering, as an unsupervised technique, plays a pivotal role in various data analysis applications. Among clustering algorithms, Spectral Clustering on Euclidean Spaces has been extensively studied. However, with the rapid evolution of data complexity, Euclidean Space is proving to be inefficient for representing and learning algorithms. Although Deep Neural Networks on hyperbolic spaces have gained recent traction, clustering algorithms or non-deep machine learning models on non-Euclidean Spaces remain underexplored. In this paper, we propose a spectral clustering algorithm on Hyperbolic Spaces to address this gap. Hyperbolic Spaces offer advantages in representing complex data structures like hierarchical and tree-like structures, which cannot be embedded efficiently in Euclidean Spaces. Our proposed algorithm replaces the Euclidean Similarity Matrix with an appropriate Hyperbolic Similarity Matrix, demonstrating improved efficiency compared to clustering in Euclidean Spaces. Our contributions include the development of the spectral clustering algorithm on Hyperbolic Spaces and the proof of its weak consistency. We show that our algorithm converges at least as fast as Spectral Clustering on Euclidean Spaces. To illustrate the efficacy of our approach, we present experimental results on the Wisconsin Breast Cancer Dataset, highlighting the superior performance of Hyperbolic Spectral Clustering over its Euclidean counterpart. This work opens up avenues for utilizing non-Euclidean Spaces in clustering algorithms, offering new perspectives for handling complex data structures and improving clustering efficiency.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
双曲空间中的一致谱聚类
聚类作为一种无监督技术,在各种数据分析应用中发挥着举足轻重的作用。在聚类算法中,欧几里得空间谱聚类算法已被广泛研究。然而,随着数据复杂性的快速发展,欧几里得空间在表示和学习算法方面已被证明是低效的。尽管双曲空间上的深度神经网络最近获得了广泛关注,但非欧几里得空间上的聚类算法或非深度机器学习模型仍未得到充分探索。双曲空间在表示复杂数据结构(如层次结构和树状结构)方面具有优势,而这些结构无法有效嵌入欧几里得空间。我们提出的算法用适当的双曲相似矩阵取代了欧几里得相似矩阵,与在欧几里得空间中进行聚类相比,效率有所提高。我们证明,我们的算法收敛速度至少与欧几里得空间的光谱聚类算法一样快。为了说明我们方法的有效性,我们展示了在威斯康星乳腺癌数据集上的实验结果,凸显了超曲谱聚类比其欧几里得对应算法更优越的性能。这项工作开辟了在聚类算法中利用非欧几里得空间的途径,为处理复杂数据结构和提高聚类效率提供了新的视角。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Fitting Multilevel Factor Models Cartan moving frames and the data manifolds Symmetry-Based Structured Matrices for Efficient Approximately Equivariant Networks Recurrent Interpolants for Probabilistic Time Series Prediction PieClam: A Universal Graph Autoencoder Based on Overlapping Inclusive and Exclusive Communities
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1