{"title":"Semi-supervised text classification using enhanced KNN algorithm","authors":"M. A. Wajeed, T. Adilakshmi","doi":"10.1109/WICT.2011.6141232","DOIUrl":null,"url":null,"abstract":"Due to the growth of information which has a great value, classifying the available information becomes inevitable so that navigation could be made easy. Many techniques of supervised learning and unsupervised learning do exist in the literature for data classification. Semi-supervised learning is halfway between the supervised and unsupervised learning. In addition to unlabeled data, the algorithm is provided with some supervision information but not necessarily for all example data. The paper explores the semi-supervised text classification which is applied to different types of vectors that are generated from the text documents. Enhancements in KNN algorithm are made to increase the accuracy performance of the classifier in the process of semi-supervised text classification, and results obtained are encouraging.","PeriodicalId":178645,"journal":{"name":"2011 World Congress on Information and Communication Technologies","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 World Congress on Information and Communication Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WICT.2011.6141232","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14
Abstract
Due to the growth of information which has a great value, classifying the available information becomes inevitable so that navigation could be made easy. Many techniques of supervised learning and unsupervised learning do exist in the literature for data classification. Semi-supervised learning is halfway between the supervised and unsupervised learning. In addition to unlabeled data, the algorithm is provided with some supervision information but not necessarily for all example data. The paper explores the semi-supervised text classification which is applied to different types of vectors that are generated from the text documents. Enhancements in KNN algorithm are made to increase the accuracy performance of the classifier in the process of semi-supervised text classification, and results obtained are encouraging.