{"title":"Sublinear Time Approximation of the Cost of a Metric [math]-Nearest Neighbor Graph","authors":"Artur Czumaj, Christian Sohler","doi":"10.1137/22m1544105","DOIUrl":null,"url":null,"abstract":"SIAM Journal on Computing, Volume 53, Issue 2, Page 524-571, April 2024. <br/> Abstract. Let [math] be an [math]-point metric space. We assume that [math] is given in the distance oracle model, that is, [math] and for every pair of points [math] from [math] we can query their distance [math] in constant time. A [math]-nearest neighbor ([math]-NN) graph for [math] is a directed graph [math] that has an edge to each of [math]’s [math] nearest neighbors. We use [math] to denote the sum of edge weights of [math]. In this paper, we study the problem of approximating [math] in sublinear time when we are given oracle access to the metric space [math] that defines [math]. Our goal is to develop an algorithm that solves this problem faster than the time required to compute [math]. We first present an algorithm that in [math] time with probability at least [math] approximates [math] to within a factor of [math]. Next, we present a more elaborate sublinear algorithm that in time [math] computes an estimate [math] of [math] that satisfies with probability at least [math] [math], where [math] denotes the cost of the minimum spanning tree of [math]. Further, we complement these results with near matching lower bounds. We show that any algorithm that for a given metric space [math] of size [math], with probability at least [math], estimates [math] to within a [math] factor requires [math] time. Similarly, any algorithm that with probability at least [math] estimates [math] to within an additive error term [math] requires [math] time.","PeriodicalId":49532,"journal":{"name":"SIAM Journal on Computing","volume":"1 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2024-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SIAM Journal on Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1137/22m1544105","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
SIAM Journal on Computing, Volume 53, Issue 2, Page 524-571, April 2024. Abstract. Let [math] be an [math]-point metric space. We assume that [math] is given in the distance oracle model, that is, [math] and for every pair of points [math] from [math] we can query their distance [math] in constant time. A [math]-nearest neighbor ([math]-NN) graph for [math] is a directed graph [math] that has an edge to each of [math]’s [math] nearest neighbors. We use [math] to denote the sum of edge weights of [math]. In this paper, we study the problem of approximating [math] in sublinear time when we are given oracle access to the metric space [math] that defines [math]. Our goal is to develop an algorithm that solves this problem faster than the time required to compute [math]. We first present an algorithm that in [math] time with probability at least [math] approximates [math] to within a factor of [math]. Next, we present a more elaborate sublinear algorithm that in time [math] computes an estimate [math] of [math] that satisfies with probability at least [math] [math], where [math] denotes the cost of the minimum spanning tree of [math]. Further, we complement these results with near matching lower bounds. We show that any algorithm that for a given metric space [math] of size [math], with probability at least [math], estimates [math] to within a [math] factor requires [math] time. Similarly, any algorithm that with probability at least [math] estimates [math] to within an additive error term [math] requires [math] time.
期刊介绍:
The SIAM Journal on Computing aims to provide coverage of the most significant work going on in the mathematical and formal aspects of computer science and nonnumerical computing. Submissions must be clearly written and make a significant technical contribution. Topics include but are not limited to analysis and design of algorithms, algorithmic game theory, data structures, computational complexity, computational algebra, computational aspects of combinatorics and graph theory, computational biology, computational geometry, computational robotics, the mathematical aspects of programming languages, artificial intelligence, computational learning, databases, information retrieval, cryptography, networks, distributed computing, parallel algorithms, and computer architecture.