Wenfeng Hou, Daiwei Li, Chao Xu, Haiqing Zhang, Tianrui Li
{"title":"An Advanced k Nearest Neighbor Classification Algorithm Based on KD-tree","authors":"Wenfeng Hou, Daiwei Li, Chao Xu, Haiqing Zhang, Tianrui Li","doi":"10.1109/IICSPI.2018.8690508","DOIUrl":null,"url":null,"abstract":"KNN (K Nearest-neighbor Classification) is a lazy learning classification algorithm, where it only memorizes the training dataset instead of providing a defined discriminative function. KNN tends to search the nearest neighbor(s) for a target in the entire training set, hence, the prediction step of KNN is quite time consuming. KD-tree (K Dimensional-tree) is a multi-dimensional binary tree, which is a specific storage structure for efficiently representing training data. Therefore, the paper takes the advantages of KNN and KD-tree and then proposes a new classification algorithm called KNN-KD-tree. Eleven datasets have been adopted to conduct experiments. The experiments have shown that the proposed KNN-KD-tree algorithm can efficiently reduce time complexity and significantly improve search performance.","PeriodicalId":6673,"journal":{"name":"2018 IEEE International Conference of Safety Produce Informatization (IICSPI)","volume":"3 1","pages":"902-905"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"31","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference of Safety Produce Informatization (IICSPI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IICSPI.2018.8690508","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 31
Abstract
KNN (K Nearest-neighbor Classification) is a lazy learning classification algorithm, where it only memorizes the training dataset instead of providing a defined discriminative function. KNN tends to search the nearest neighbor(s) for a target in the entire training set, hence, the prediction step of KNN is quite time consuming. KD-tree (K Dimensional-tree) is a multi-dimensional binary tree, which is a specific storage structure for efficiently representing training data. Therefore, the paper takes the advantages of KNN and KD-tree and then proposes a new classification algorithm called KNN-KD-tree. Eleven datasets have been adopted to conduct experiments. The experiments have shown that the proposed KNN-KD-tree algorithm can efficiently reduce time complexity and significantly improve search performance.