{"title":"Improving the Accuracy of Features Weighted k-Nearest Neighbor using Distance Weight","authors":"K. U. Syaliman, Ause Labellapansa, Ana Yulianti","doi":"10.5220/0009390903260330","DOIUrl":null,"url":null,"abstract":": FWk-NN is an improvement of k-NN, where FWk-NN gives weight to each data feature thereby reducing the influence of features that are less relevant to the target. Feature weighting is proven to be able to improve the accuracy of k-NN. However, the FWK-NN still uses the majority vote system for class determination to new data. Whereby the majority vote system is considered to have several weaknesses, it ignores the similarity between data and the possibility of a double majority class. To overcome the issue of vote majority at FWk-NN, the research will change the voting majority by using distance weight. This study uses a dataset obtained from the UCI repository and a water quality data set. The data used from the UCI repository are iris, ionosphere, hayes-Roth, and glass. Based on the tests carried out using UCI repository dataset it is proven that FWk-NN using distance weight has averaged an increase about2%, with the highest increase of accuracy of 4.23% in the glass dataset. In water quality data, FWk-NN using distance weight can achieve an accuracy of 92.58% or has increased 2% from FWk-NN. From all the data tested, it is proven that the distance weight is able to increase the accuracy of the FWk-NN with an average increase about 1.9%.","PeriodicalId":382428,"journal":{"name":"Proceedings of the Second International Conference on Science, Engineering and Technology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Second International Conference on Science, Engineering and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5220/0009390903260330","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
: FWk-NN is an improvement of k-NN, where FWk-NN gives weight to each data feature thereby reducing the influence of features that are less relevant to the target. Feature weighting is proven to be able to improve the accuracy of k-NN. However, the FWK-NN still uses the majority vote system for class determination to new data. Whereby the majority vote system is considered to have several weaknesses, it ignores the similarity between data and the possibility of a double majority class. To overcome the issue of vote majority at FWk-NN, the research will change the voting majority by using distance weight. This study uses a dataset obtained from the UCI repository and a water quality data set. The data used from the UCI repository are iris, ionosphere, hayes-Roth, and glass. Based on the tests carried out using UCI repository dataset it is proven that FWk-NN using distance weight has averaged an increase about2%, with the highest increase of accuracy of 4.23% in the glass dataset. In water quality data, FWk-NN using distance weight can achieve an accuracy of 92.58% or has increased 2% from FWk-NN. From all the data tested, it is proven that the distance weight is able to increase the accuracy of the FWk-NN with an average increase about 1.9%.