Fusion Approach for Document Classification using Random Forest and SVM

Sheelesh Kumar Sharma, Navel Sharma, Prem Prakash Potter
{"title":"Fusion Approach for Document Classification using Random Forest and SVM","authors":"Sheelesh Kumar Sharma, Navel Sharma, Prem Prakash Potter","doi":"10.1109/SMART50582.2020.9337131","DOIUrl":null,"url":null,"abstract":"Document classification is an important task due to its many potential applications. With the ever-increasing number of digital documents, it has become imperative to design efficient and accurate methods for document classification. When the categories of documents are already known, the problem can be solved using supervised learning approach. System can learn the traits of a document category from the labeled data and later on can be used as a predictor for the unseen data. Here, we discuss a fusion supervised learning approach for document classification. The power of Random forests and Support Vector Machine is harnessed for making a hybrid approach for the task of document categorization. The proposed approach categories documents into their respective categories and it works well various benchmarked datasets such as 20 Newsgroups, CMU and Classic Data Sets.","PeriodicalId":129946,"journal":{"name":"2020 9th International Conference System Modeling and Advancement in Research Trends (SMART)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 9th International Conference System Modeling and Advancement in Research Trends (SMART)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SMART50582.2020.9337131","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Document classification is an important task due to its many potential applications. With the ever-increasing number of digital documents, it has become imperative to design efficient and accurate methods for document classification. When the categories of documents are already known, the problem can be solved using supervised learning approach. System can learn the traits of a document category from the labeled data and later on can be used as a predictor for the unseen data. Here, we discuss a fusion supervised learning approach for document classification. The power of Random forests and Support Vector Machine is harnessed for making a hybrid approach for the task of document categorization. The proposed approach categories documents into their respective categories and it works well various benchmarked datasets such as 20 Newsgroups, CMU and Classic Data Sets.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于随机森林和支持向量机的文档分类融合方法
文档分类是一项重要的任务,因为它有许多潜在的应用。随着数字文档数量的不断增加,设计高效、准确的文档分类方法已成为当务之急。当已知文档的类别时,可以使用监督学习方法来解决问题。系统可以从标记的数据中学习文档类别的特征,然后可以用作未知数据的预测器。在这里,我们讨论一种用于文档分类的融合监督学习方法。利用随机森林和支持向量机的力量,为文档分类任务提供了一种混合方法。所提出的方法将文档分类到各自的类别中,并且它可以很好地工作于各种基准数据集,如20新闻组,CMU和经典数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Oral Disease Detection using Neural Network A Review on Effectiveness of Artificial Intelligence Techniques in the Detection of COVID-19 Accident Avoidance Simulation using SUMO Gesture-Based Model of Mixed Reality Human-Computer Interface The Survey of Digital Image Analysis with Artificial Intelligence- DCNN Technique
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1