支持向量机在推特上的泗水事件分类

Drajad Bima Ajipangestu, R. Sarno
{"title":"支持向量机在推特上的泗水事件分类","authors":"Drajad Bima Ajipangestu, R. Sarno","doi":"10.1109/iSemantic50169.2020.9234205","DOIUrl":null,"url":null,"abstract":"Twitter is a social media that is often used by many people in the world. The information is spread and obtained through social media. For example, there is a company that is organizing a new event that many people need to know. This allows the creation of a system that supports the presentation of user information by detecting certain events from Twitter's social media data. In this study, tweet data will be retrieved using Twitter API and stored in JSON format. Furthermore, there will be a pre-processing which includes the deletion of characters, number, URL, stemming, and lower case. Furthermore, feature extraction is performed using Global Vector for Word Representation. we will classify into four classes, which are Competitions, Seminars, Festivals, and Other events. The classification is using SVM to predict the type of event. There are three experimental methods used, there is SVM C, SVM linear, and SVM Nu. SVM Nu was conducted with changes in the SVC parameters in the form of kernel and Nu to produce the best accuracy. Based on the experiments we have done, the best results are obtained with an accuracy of 85.2% by classification using the NuSVC method with an RBF kernel and nu parameter of 0.2.","PeriodicalId":345558,"journal":{"name":"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Event Classification in Surabaya on Twitter with Support Vector Machine\",\"authors\":\"Drajad Bima Ajipangestu, R. Sarno\",\"doi\":\"10.1109/iSemantic50169.2020.9234205\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Twitter is a social media that is often used by many people in the world. The information is spread and obtained through social media. For example, there is a company that is organizing a new event that many people need to know. This allows the creation of a system that supports the presentation of user information by detecting certain events from Twitter's social media data. In this study, tweet data will be retrieved using Twitter API and stored in JSON format. Furthermore, there will be a pre-processing which includes the deletion of characters, number, URL, stemming, and lower case. Furthermore, feature extraction is performed using Global Vector for Word Representation. we will classify into four classes, which are Competitions, Seminars, Festivals, and Other events. The classification is using SVM to predict the type of event. There are three experimental methods used, there is SVM C, SVM linear, and SVM Nu. SVM Nu was conducted with changes in the SVC parameters in the form of kernel and Nu to produce the best accuracy. Based on the experiments we have done, the best results are obtained with an accuracy of 85.2% by classification using the NuSVC method with an RBF kernel and nu parameter of 0.2.\",\"PeriodicalId\":345558,\"journal\":{\"name\":\"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iSemantic50169.2020.9234205\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Seminar on Application for Technology of Information and Communication (iSemantic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iSemantic50169.2020.9234205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

推特是世界上许多人经常使用的社交媒体。信息通过社交媒体传播和获取。例如,有一家公司正在组织一个很多人都需要知道的新活动。这允许创建一个系统,该系统通过检测Twitter社交媒体数据中的某些事件来支持用户信息的表示。在本研究中,Twitter数据将使用Twitter API进行检索,并以JSON格式存储。此外,还会有一个预处理,包括删除字符、数字、URL、词干和小写字母。此外,使用全局向量进行特征提取。我们将分为四类,分别是竞赛、研讨会、节日和其他活动。分类是利用支持向量机来预测事件的类型。采用了三种实验方法,分别是SVM C、SVM线性和SVM Nu。以核和Nu的形式改变SVC参数进行SVM Nu,以获得最佳精度。根据实验结果,采用RBF核和nu参数为0.2的NuSVC方法进行分类,准确率达到85.2%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Event Classification in Surabaya on Twitter with Support Vector Machine
Twitter is a social media that is often used by many people in the world. The information is spread and obtained through social media. For example, there is a company that is organizing a new event that many people need to know. This allows the creation of a system that supports the presentation of user information by detecting certain events from Twitter's social media data. In this study, tweet data will be retrieved using Twitter API and stored in JSON format. Furthermore, there will be a pre-processing which includes the deletion of characters, number, URL, stemming, and lower case. Furthermore, feature extraction is performed using Global Vector for Word Representation. we will classify into four classes, which are Competitions, Seminars, Festivals, and Other events. The classification is using SVM to predict the type of event. There are three experimental methods used, there is SVM C, SVM linear, and SVM Nu. SVM Nu was conducted with changes in the SVC parameters in the form of kernel and Nu to produce the best accuracy. Based on the experiments we have done, the best results are obtained with an accuracy of 85.2% by classification using the NuSVC method with an RBF kernel and nu parameter of 0.2.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Independent Public Video Conference Network Sentiment Analysis of Yelp Review Extrovert and Introvert Classification based on Myers-Briggs Type Indicator(MBTI) using Support Vector Machine (SVM) Brain Segmentation using Adaptive Thresholding, K-Means Clustering and Mathematical Morphology in MRI Data Handwriting Recognition of Hiragana Characters using Convolutional Neural Network
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1