Micro-blog category based on feature-words category dispersion

Yingyou Chen, Qing Wu
{"title":"Micro-blog category based on feature-words category dispersion","authors":"Yingyou Chen, Qing Wu","doi":"10.1109/CCIS.2012.6664377","DOIUrl":null,"url":null,"abstract":"The micro-blog information classification is an important pretreatment in micro-blog data processing work. Due to the unique properties of the micro-blog text, there are some limitations when use traditional classification to deal with it. Consider to a single microblog text brief which contains less effective feature-words, and the content compare spoken of the features, this paper proposed to use similar words and collocations to extend the text feature-words, reducing the possibility of feature loss. For the feature of information selection and weight calculation, proposed one kind text classification methods which based on the feature-words category dispersion and dispersion degree. The experiments show that the propose classification method achieves good effects in the classification of micro-blog text, and has better applicability in microblog text classification scene.","PeriodicalId":392558,"journal":{"name":"2012 IEEE 2nd International Conference on Cloud Computing and Intelligence Systems","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 2nd International Conference on Cloud Computing and Intelligence Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCIS.2012.6664377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The micro-blog information classification is an important pretreatment in micro-blog data processing work. Due to the unique properties of the micro-blog text, there are some limitations when use traditional classification to deal with it. Consider to a single microblog text brief which contains less effective feature-words, and the content compare spoken of the features, this paper proposed to use similar words and collocations to extend the text feature-words, reducing the possibility of feature loss. For the feature of information selection and weight calculation, proposed one kind text classification methods which based on the feature-words category dispersion and dispersion degree. The experiments show that the propose classification method achieves good effects in the classification of micro-blog text, and has better applicability in microblog text classification scene.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
微博分类基于特征词分类分散
微博信息分类是微博数据处理工作中重要的预处理工作。由于微博文本的独特属性,使用传统的分类方法对微博文本进行分类时存在一定的局限性。针对单个微博文本摘要中有效特征词较少,且内容比较讲特征的情况,本文提出使用相似词和搭配来扩展文本特征词,减少特征丢失的可能性。针对信息选择和权重计算的特点,提出了一种基于特征词类别离散度和离散度的文本分类方法。实验表明,本文提出的分类方法在微博文本分类中取得了较好的分类效果,在微博文本分类场景中具有较好的适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Design of household appliance control system based on Zigbee Secure cloud authentication using eIDs The research on the control algorithm of IOT based bicycle parking system Blind extraction algorithm of the harmonic signal based on the steady-state point capture in lorenz energy accumulation area Study on the modeling and analyzing of the role-based threats in the cyberspace
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1