基于情感分析的文本数据年龄组预测

Divakar Yadav, Aarushi Gupta, Saumya Asati, Nikhil Choudhary, A. K. Yadav
{"title":"基于情感分析的文本数据年龄组预测","authors":"Divakar Yadav, Aarushi Gupta, Saumya Asati, Nikhil Choudhary, A. K. Yadav","doi":"10.1145/3439231.3439262","DOIUrl":null,"url":null,"abstract":"Social media platforms provide a large amount of textual data covering various topics to explore opinions and emotions, hidden in the content using sentiment analysis. The consumer perspective on the quality and popularity of a product can be deduced from the product reviews, available at social media platforms by performing sentiment analysis. Sentiment analysis tells about the polarity of a sentence whether positive, negative or neutral. It can be used to predict personality, age and gender, based on writing style using feature extraction on the labeled training data sets. Understanding human emotions and opinions from text is a difficult task and to make it easier, sentiment analyzers are used. This paper proposes a method for prediction of age groups namely teenagers, adults and senior citizens from textual data collected from twitter and compares performance of different classifiers such as K-Nearest Neighbor (KNN), Multi-layer Perceptron (MLP), Decision tree, Random forest and Support Vector Machine (SVM), based on certain performance metrics like f-score, precision, recall and accuracy. One of the basic applications of this work can be for web readability analysis of resources, available on Internet.","PeriodicalId":210400,"journal":{"name":"Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Age Group Prediction on Textual Data using Sentiment Analysis\",\"authors\":\"Divakar Yadav, Aarushi Gupta, Saumya Asati, Nikhil Choudhary, A. K. Yadav\",\"doi\":\"10.1145/3439231.3439262\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Social media platforms provide a large amount of textual data covering various topics to explore opinions and emotions, hidden in the content using sentiment analysis. The consumer perspective on the quality and popularity of a product can be deduced from the product reviews, available at social media platforms by performing sentiment analysis. Sentiment analysis tells about the polarity of a sentence whether positive, negative or neutral. It can be used to predict personality, age and gender, based on writing style using feature extraction on the labeled training data sets. Understanding human emotions and opinions from text is a difficult task and to make it easier, sentiment analyzers are used. This paper proposes a method for prediction of age groups namely teenagers, adults and senior citizens from textual data collected from twitter and compares performance of different classifiers such as K-Nearest Neighbor (KNN), Multi-layer Perceptron (MLP), Decision tree, Random forest and Support Vector Machine (SVM), based on certain performance metrics like f-score, precision, recall and accuracy. One of the basic applications of this work can be for web readability analysis of resources, available on Internet.\",\"PeriodicalId\":210400,\"journal\":{\"name\":\"Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3439231.3439262\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3439231.3439262","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

社交媒体平台提供了大量涵盖各种主题的文本数据,用于探索观点和情绪,并使用情感分析隐藏在内容中。消费者对产品质量和受欢迎程度的看法可以从社交媒体平台上的产品评论中推断出来,通过进行情感分析。情感分析告诉我们一个句子的极性是积极的、消极的还是中性的。它可以用来预测个性,年龄和性别,基于写作风格,使用标记训练数据集上的特征提取。从文本中理解人类的情感和观点是一项艰巨的任务,为了使其更容易,使用了情感分析工具。本文提出了一种从twitter收集的文本数据中预测青少年、成年人和老年人年龄组的方法,并比较了k -最近邻(KNN)、多层感知器(MLP)、决策树、随机森林和支持向量机(SVM)等不同分类器的性能,基于某些性能指标,如f分数、精度、召回率和准确性。本工作的一个基本应用是对Internet上的资源进行网页可读性分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Age Group Prediction on Textual Data using Sentiment Analysis
Social media platforms provide a large amount of textual data covering various topics to explore opinions and emotions, hidden in the content using sentiment analysis. The consumer perspective on the quality and popularity of a product can be deduced from the product reviews, available at social media platforms by performing sentiment analysis. Sentiment analysis tells about the polarity of a sentence whether positive, negative or neutral. It can be used to predict personality, age and gender, based on writing style using feature extraction on the labeled training data sets. Understanding human emotions and opinions from text is a difficult task and to make it easier, sentiment analyzers are used. This paper proposes a method for prediction of age groups namely teenagers, adults and senior citizens from textual data collected from twitter and compares performance of different classifiers such as K-Nearest Neighbor (KNN), Multi-layer Perceptron (MLP), Decision tree, Random forest and Support Vector Machine (SVM), based on certain performance metrics like f-score, precision, recall and accuracy. One of the basic applications of this work can be for web readability analysis of resources, available on Internet.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Can children of typical development benefit from inclusion intervention with Daisy Robot - a socially assistive robot? Pedagogical Triangulations: from the online forum to the e-magazine: a praxiological experience about school and its actor during COVID19 confinement CovidSense: A Smartphone-based Initiative for Fighting COVID-19 Spreading Apple Siri (input) + Voice Over (output) = a de facto marriage: An exploratory case study with blind people Is Simple English Wikipedia As Simple And Easy-to-Understand As We Expect It To Be?
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1