Linear-time approximation scheme for k-means clustering of axis-parallel affine subspaces

IF 0.4 4区 计算机科学 Q4 MATHEMATICS Computational Geometry-Theory and Applications Pub Date : 2023-06-01 DOI:10.1016/j.comgeo.2023.101981
Kyungjin Cho, Eunjin Oh
{"title":"Linear-time approximation scheme for k-means clustering of axis-parallel affine subspaces","authors":"Kyungjin Cho,&nbsp;Eunjin Oh","doi":"10.1016/j.comgeo.2023.101981","DOIUrl":null,"url":null,"abstract":"<div><p>In this paper, we present a linear-time approximation scheme for <em>k</em>-means clustering of <em>incomplete</em> data points in <em>d</em>-dimensional Euclidean space. An <em>incomplete</em> data point with <span><math><mi>Δ</mi><mo>&gt;</mo><mn>0</mn></math></span><span><span> unspecified entries is represented as an axis-parallel affine subspace of dimension Δ. The distance between two incomplete data points is defined as the </span>Euclidean distance between two closest points in the axis-parallel affine subspaces corresponding to the data points. We present an algorithm for </span><em>k</em>-means clustering of <em>n</em> axis-parallel affine subspaces of dimension Δ that yields an <span><math><mo>(</mo><mn>1</mn><mo>+</mo><mi>ϵ</mi><mo>)</mo></math></span>-approximate solution in <span><math><mi>O</mi><mo>(</mo><mi>n</mi><mi>d</mi><mo>)</mo></math></span> time. The constants hidden behind <span><math><mi>O</mi><mo>(</mo><mo>⋅</mo><mo>)</mo></math></span> depend only on <span><math><mi>Δ</mi><mo>,</mo><mi>ϵ</mi></math></span> and <em>k</em>. This improves the <span><math><mi>O</mi><mo>(</mo><msup><mrow><mi>n</mi></mrow><mrow><mn>2</mn></mrow></msup><mi>d</mi><mo>)</mo></math></span>-time algorithm by Eiben et al. (2021) <span>[7]</span> by a factor of <em>n</em>.</p></div>","PeriodicalId":51001,"journal":{"name":"Computational Geometry-Theory and Applications","volume":null,"pages":null},"PeriodicalIF":0.4000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Geometry-Theory and Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925772123000019","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, we present a linear-time approximation scheme for k-means clustering of incomplete data points in d-dimensional Euclidean space. An incomplete data point with Δ>0 unspecified entries is represented as an axis-parallel affine subspace of dimension Δ. The distance between two incomplete data points is defined as the Euclidean distance between two closest points in the axis-parallel affine subspaces corresponding to the data points. We present an algorithm for k-means clustering of n axis-parallel affine subspaces of dimension Δ that yields an (1+ϵ)-approximate solution in O(nd) time. The constants hidden behind O() depend only on Δ,ϵ and k. This improves the O(n2d)-time algorithm by Eiben et al. (2021) [7] by a factor of n.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
轴平行仿射子空间k均值聚类的线性时间近似格式
本文给出了d维欧氏空间中不完全数据点的k均值聚类的线性时间近似方案。Δ>;0个未指定条目表示为维度为Δ的轴平行仿射子空间。两个不完全数据点之间的距离被定义为与数据点相对应的轴平行仿射子空间中的两个最近点之间的欧几里得距离。我们提出了一种对维度为Δ的n轴平行仿射子空间进行k均值聚类的算法,该算法在O(nd)时间内产生(1+)-近似解。隐藏在O(‧)后面的常数仅取决于Δ、Ş和k。这改进了Eiben等人的O(n2d)-时间算法。(2021)[7]的因子为n。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
1.60
自引率
16.70%
发文量
43
审稿时长
>12 weeks
期刊介绍: Computational Geometry is a forum for research in theoretical and applied aspects of computational geometry. The journal publishes fundamental research in all areas of the subject, as well as disseminating information on the applications, techniques, and use of computational geometry. Computational Geometry publishes articles on the design and analysis of geometric algorithms. All aspects of computational geometry are covered, including the numerical, graph theoretical and combinatorial aspects. Also welcomed are computational geometry solutions to fundamental problems arising in computer graphics, pattern recognition, robotics, image processing, CAD-CAM, VLSI design and geographical information systems. Computational Geometry features a special section containing open problems and concise reports on implementations of computational geometry tools.
期刊最新文献
Editorial Board Largest unit rectangles inscribed in a convex polygon Packing unequal disks in the Euclidean plane Editorial Board Improved approximation for two-dimensional vector multiple knapsack
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1