大规模层次分类

Journal of Independent Studies and Research Computing Pub Date : 1900-01-01 DOI:10.31645/2013.11.2.3

Adarsh Khalique, Rahim Hasnani

{"title":"大规模层次分类","authors":"Adarsh Khalique, Rahim Hasnani","doi":"10.31645/2013.11.2.3","DOIUrl":null,"url":null,"abstract":"This study elucidates various algorithms used for document or text classification challenge. A sample data is used in this study on which various algorithms like Support Vector Machines (SVM), Naïve Bayes, Neural Networks and K-Nearest Neighbor are used in order to analyze their performances and accuracies. This study tries to identify the limitations and strength of these algorithms on the given sample data that how optimally they can perform classification. Different validations are used in this study to examine the accuracies regarding the classification can be identified. Validations include Split-Validation, X-Validation and Bootstrapping. Different ways and methods are discussed through which classification is made possible in large hierarchy. Finally this study concludes on the basis of results obtained that which machine learning technique or classifier performed excellent on the provided sample data set and achieved higher accuracy as compared to others.","PeriodicalId":412730,"journal":{"name":"Journal of Independent Studies and Research Computing","volume":"121 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Large Scale Hierarchical Classification\",\"authors\":\"Adarsh Khalique, Rahim Hasnani\",\"doi\":\"10.31645/2013.11.2.3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study elucidates various algorithms used for document or text classification challenge. A sample data is used in this study on which various algorithms like Support Vector Machines (SVM), Naïve Bayes, Neural Networks and K-Nearest Neighbor are used in order to analyze their performances and accuracies. This study tries to identify the limitations and strength of these algorithms on the given sample data that how optimally they can perform classification. Different validations are used in this study to examine the accuracies regarding the classification can be identified. Validations include Split-Validation, X-Validation and Bootstrapping. Different ways and methods are discussed through which classification is made possible in large hierarchy. Finally this study concludes on the basis of results obtained that which machine learning technique or classifier performed excellent on the provided sample data set and achieved higher accuracy as compared to others.\",\"PeriodicalId\":412730,\"journal\":{\"name\":\"Journal of Independent Studies and Research Computing\",\"volume\":\"121 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Independent Studies and Research Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.31645/2013.11.2.3\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Independent Studies and Research Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31645/2013.11.2.3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

本研究阐明了用于文档或文本分类挑战的各种算法。本研究使用样本数据，使用支持向量机(SVM)、Naïve贝叶斯(Bayes)、神经网络(Neural Networks)和k -最近邻(K-Nearest Neighbor)等各种算法，分析其性能和精度。本研究试图确定这些算法在给定样本数据上的局限性和强度，以及它们如何最优地执行分类。在本研究中使用不同的验证来检验关于分类可以识别的准确性。验证包括分割验证、x验证和引导。讨论了在大层次结构中实现分类的不同途径和方法。最后，本研究根据所获得的结果得出结论，哪种机器学习技术或分类器在提供的样本数据集上表现出色，并且相对于其他机器学习技术或分类器实现了更高的准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Large Scale Hierarchical Classification

This study elucidates various algorithms used for document or text classification challenge. A sample data is used in this study on which various algorithms like Support Vector Machines (SVM), Naïve Bayes, Neural Networks and K-Nearest Neighbor are used in order to analyze their performances and accuracies. This study tries to identify the limitations and strength of these algorithms on the given sample data that how optimally they can perform classification. Different validations are used in this study to examine the accuracies regarding the classification can be identified. Validations include Split-Validation, X-Validation and Bootstrapping. Different ways and methods are discussed through which classification is made possible in large hierarchy. Finally this study concludes on the basis of results obtained that which machine learning technique or classifier performed excellent on the provided sample data set and achieved higher accuracy as compared to others.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Independent Studies and Research Computing

自引率

0.00%

发文量