{"title":"A KD-Tree-Based Nearest Neighbor Search for Large Quantities of Data","authors":"Shwu-Huey Yen, Ya-ju Hsieh","doi":"10.3837/tiis.2013.03.003","DOIUrl":null,"url":null,"abstract":"The discovery of nearest neighbors, without training in advance, has many applications, such as the formation of mosaic images, image matching, image retrieval and image stitching. When the quantity of data is huge and the number of dimensions is high, the efficient identification of a nearest neighbor (NN) is very important. This study proposes a variation of the KD-tree - the arbitrary KD-tree (KDA) - which is constructed without the need to evaluate variances. Multiple KDAs can be constructed efficiently and possess independent tree structures, when the amount of data is large. Upon testing, using extended synthetic databases and real-world SIFT data, this study concludes that the KDA method increases computational efficiency and produces satisfactory accuracy, when solving NN problems.","PeriodicalId":49932,"journal":{"name":"KSII Transactions on Internet and Information Systems","volume":"14 1","pages":"459-470"},"PeriodicalIF":0.9000,"publicationDate":"2013-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"KSII Transactions on Internet and Information Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.3837/tiis.2013.03.003","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 2
Abstract
The discovery of nearest neighbors, without training in advance, has many applications, such as the formation of mosaic images, image matching, image retrieval and image stitching. When the quantity of data is huge and the number of dimensions is high, the efficient identification of a nearest neighbor (NN) is very important. This study proposes a variation of the KD-tree - the arbitrary KD-tree (KDA) - which is constructed without the need to evaluate variances. Multiple KDAs can be constructed efficiently and possess independent tree structures, when the amount of data is large. Upon testing, using extended synthetic databases and real-world SIFT data, this study concludes that the KDA method increases computational efficiency and produces satisfactory accuracy, when solving NN problems.
期刊介绍:
The KSII Transactions on Internet and Information Systems (TIIS) is online scholarly journal indexed in SCIE (Thomson Reuters) and SCOPUS (Elsevier) and published by KSII and supported by KETI. The Transactions is published every other month. The Transactions is designed to allow readers to obtain the most state of the art in a number of focusing areas related to wired & wireless internet and information systems. The technologies and applications of IT are very rapidly changing and updating. Thus quick publication and distribution to researchers, developers, deployment engineers, technical managers, and educators are crucial. Our most important aim is to publish the accepted papers quickly after receiving the manuscript.