Victor Junqiu Wei, Raymond Chi-Wing Wong, Cheng Long, David Mount, Hanan Samet
{"title":"地形表面邻近查询","authors":"Victor Junqiu Wei, Raymond Chi-Wing Wong, Cheng Long, David Mount, Hanan Samet","doi":"https://dl.acm.org/doi/10.1145/3563773","DOIUrl":null,"url":null,"abstract":"<p>Due to the advance of the geo-spatial positioning and the computer graphics technology, digital terrain data has become increasingly popular nowadays. Query processing on terrain data has attracted considerable attention from both the academic and the industry communities.</p><p>Proximity queries such as the shortest path/distance query, <i>k</i> nearest/farthest neighbor query, and top-<i>k</i> closest/farthest pairs query are fundamental and important queries in the context of the terrain surfaces, and they have a lot of applications in Geographical Information System, 3D object feature vector construction, and 3D object data mining. In this article, we first study the most fundamental type of query, namely, shortest distance and path query, which is to find the shortest distance and path between two points of interest on the surface of the terrain. As observed by existing studies, computing the exact shortest distance/path is very expensive. Some existing studies proposed <i>ϵ</i>-approximate distance and path oracles, where <i>ϵ</i> is a non-negative real-valued error parameter. However, the best-known algorithm has a large oracle construction time, a large oracle size, and a large query time. Motivated by this, we propose a novel <i>ϵ</i>-approximate distance and path oracle called the <i><underline>S</underline>pace <underline>E</underline>fficient distance and path oracle (SE),</i> which has a small oracle construction time, a small oracle size, and a small distance and path query time, thanks to its compactness of storing concise information about pairwise distances between any two points-of-interest. Then, we propose several algorithms for the <i>k</i> nearest/farthest neighbor and top-<i>k</i> closest/farthest pairs queries with the assistance of our distance and path oracle <i>SE</i>. </p><p>Our experimental results show that the oracle construction time, the oracle size, and the distance and path query time of <i>SE</i> are up to two, three, and five orders of magnitude faster than the best-known algorithm, respectively. Besides, our algorithms for other proximity queries including <i>k</i> nearest/farthest neighbor queries and top-<i>k</i> closest/farthest pairs queries significantly outperform the state-of-the-art algorithms by up to two orders of magnitude.</p>","PeriodicalId":50915,"journal":{"name":"ACM Transactions on Database Systems","volume":"13 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Proximity Queries on Terrain Surface\",\"authors\":\"Victor Junqiu Wei, Raymond Chi-Wing Wong, Cheng Long, David Mount, Hanan Samet\",\"doi\":\"https://dl.acm.org/doi/10.1145/3563773\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Due to the advance of the geo-spatial positioning and the computer graphics technology, digital terrain data has become increasingly popular nowadays. Query processing on terrain data has attracted considerable attention from both the academic and the industry communities.</p><p>Proximity queries such as the shortest path/distance query, <i>k</i> nearest/farthest neighbor query, and top-<i>k</i> closest/farthest pairs query are fundamental and important queries in the context of the terrain surfaces, and they have a lot of applications in Geographical Information System, 3D object feature vector construction, and 3D object data mining. In this article, we first study the most fundamental type of query, namely, shortest distance and path query, which is to find the shortest distance and path between two points of interest on the surface of the terrain. As observed by existing studies, computing the exact shortest distance/path is very expensive. Some existing studies proposed <i>ϵ</i>-approximate distance and path oracles, where <i>ϵ</i> is a non-negative real-valued error parameter. However, the best-known algorithm has a large oracle construction time, a large oracle size, and a large query time. Motivated by this, we propose a novel <i>ϵ</i>-approximate distance and path oracle called the <i><underline>S</underline>pace <underline>E</underline>fficient distance and path oracle (SE),</i> which has a small oracle construction time, a small oracle size, and a small distance and path query time, thanks to its compactness of storing concise information about pairwise distances between any two points-of-interest. Then, we propose several algorithms for the <i>k</i> nearest/farthest neighbor and top-<i>k</i> closest/farthest pairs queries with the assistance of our distance and path oracle <i>SE</i>. </p><p>Our experimental results show that the oracle construction time, the oracle size, and the distance and path query time of <i>SE</i> are up to two, three, and five orders of magnitude faster than the best-known algorithm, respectively. Besides, our algorithms for other proximity queries including <i>k</i> nearest/farthest neighbor queries and top-<i>k</i> closest/farthest pairs queries significantly outperform the state-of-the-art algorithms by up to two orders of magnitude.</p>\",\"PeriodicalId\":50915,\"journal\":{\"name\":\"ACM Transactions on Database Systems\",\"volume\":\"13 1\",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2022-12-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Transactions on Database Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/https://dl.acm.org/doi/10.1145/3563773\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Database Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/https://dl.acm.org/doi/10.1145/3563773","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
随着地理空间定位技术和计算机图形技术的发展,数字地形数据日益普及。地形数据的查询处理已经引起了学术界和工业界的广泛关注。最近路径/距离查询、k个最近/最远邻居查询和top-k个最近/最远对查询等接近性查询是地形表面环境中最基本和重要的查询,在地理信息系统、三维物体特征向量构建和三维物体数据挖掘等方面有着广泛的应用。在本文中,我们首先研究了最基本的查询类型,即最短距离和路径查询,即在地形表面上查找两个兴趣点之间的最短距离和路径。根据现有的研究,计算精确的最短距离/路径是非常昂贵的。一些现有的研究提出了ϵ-approximate距离和路径预言器,其中的λ是一个非负的实值误差参数。然而,最著名的算法具有较大的oracle构建时间、较大的oracle大小和较大的查询时间。基于此,我们提出了一种新颖的ϵ-approximate距离路径oracle,称为空间高效距离路径oracle (Space Efficient distance and path oracle, SE),它具有较小的oracle构建时间,较小的oracle大小,以及较小的距离和路径查询时间,这得益于它的紧凑性,可以存储任何两个兴趣点之间的两两距离的简明信息。然后,在距离和路径oracle SE的帮助下,我们提出了k个最近/最远邻居和top-k个最近/最远对查询的几种算法。我们的实验结果表明,SE的oracle构建时间、oracle大小、距离和路径查询时间分别比最知名的算法快2、3和5个数量级。此外,我们的算法用于其他邻近查询,包括k最近/最远邻居查询和top-k最近/最远对查询,其性能明显优于最先进的算法,最高可达两个数量级。
Due to the advance of the geo-spatial positioning and the computer graphics technology, digital terrain data has become increasingly popular nowadays. Query processing on terrain data has attracted considerable attention from both the academic and the industry communities.
Proximity queries such as the shortest path/distance query, k nearest/farthest neighbor query, and top-k closest/farthest pairs query are fundamental and important queries in the context of the terrain surfaces, and they have a lot of applications in Geographical Information System, 3D object feature vector construction, and 3D object data mining. In this article, we first study the most fundamental type of query, namely, shortest distance and path query, which is to find the shortest distance and path between two points of interest on the surface of the terrain. As observed by existing studies, computing the exact shortest distance/path is very expensive. Some existing studies proposed ϵ-approximate distance and path oracles, where ϵ is a non-negative real-valued error parameter. However, the best-known algorithm has a large oracle construction time, a large oracle size, and a large query time. Motivated by this, we propose a novel ϵ-approximate distance and path oracle called the Space Efficient distance and path oracle (SE), which has a small oracle construction time, a small oracle size, and a small distance and path query time, thanks to its compactness of storing concise information about pairwise distances between any two points-of-interest. Then, we propose several algorithms for the k nearest/farthest neighbor and top-k closest/farthest pairs queries with the assistance of our distance and path oracle SE.
Our experimental results show that the oracle construction time, the oracle size, and the distance and path query time of SE are up to two, three, and five orders of magnitude faster than the best-known algorithm, respectively. Besides, our algorithms for other proximity queries including k nearest/farthest neighbor queries and top-k closest/farthest pairs queries significantly outperform the state-of-the-art algorithms by up to two orders of magnitude.
期刊介绍:
Heavily used in both academic and corporate R&D settings, ACM Transactions on Database Systems (TODS) is a key publication for computer scientists working in data abstraction, data modeling, and designing data management systems. Topics include storage and retrieval, transaction management, distributed and federated databases, semantics of data, intelligent databases, and operations and algorithms relating to these areas. In this rapidly changing field, TODS provides insights into the thoughts of the best minds in database R&D.