基于情感分析的文本数据年龄组预测

Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion Pub Date : 2020-12-02 DOI:10.1145/3439231.3439262

Divakar Yadav, Aarushi Gupta, Saumya Asati, Nikhil Choudhary, A. K. Yadav

{"title":"基于情感分析的文本数据年龄组预测","authors":"Divakar Yadav, Aarushi Gupta, Saumya Asati, Nikhil Choudhary, A. K. Yadav","doi":"10.1145/3439231.3439262","DOIUrl":null,"url":null,"abstract":"Social media platforms provide a large amount of textual data covering various topics to explore opinions and emotions, hidden in the content using sentiment analysis. The consumer perspective on the quality and popularity of a product can be deduced from the product reviews, available at social media platforms by performing sentiment analysis. Sentiment analysis tells about the polarity of a sentence whether positive, negative or neutral. It can be used to predict personality, age and gender, based on writing style using feature extraction on the labeled training data sets. Understanding human emotions and opinions from text is a difficult task and to make it easier, sentiment analyzers are used. This paper proposes a method for prediction of age groups namely teenagers, adults and senior citizens from textual data collected from twitter and compares performance of different classifiers such as K-Nearest Neighbor (KNN), Multi-layer Perceptron (MLP), Decision tree, Random forest and Support Vector Machine (SVM), based on certain performance metrics like f-score, precision, recall and accuracy. One of the basic applications of this work can be for web readability analysis of resources, available on Internet.","PeriodicalId":210400,"journal":{"name":"Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Age Group Prediction on Textual Data using Sentiment Analysis\",\"authors\":\"Divakar Yadav, Aarushi Gupta, Saumya Asati, Nikhil Choudhary, A. K. Yadav\",\"doi\":\"10.1145/3439231.3439262\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Social media platforms provide a large amount of textual data covering various topics to explore opinions and emotions, hidden in the content using sentiment analysis. The consumer perspective on the quality and popularity of a product can be deduced from the product reviews, available at social media platforms by performing sentiment analysis. Sentiment analysis tells about the polarity of a sentence whether positive, negative or neutral. It can be used to predict personality, age and gender, based on writing style using feature extraction on the labeled training data sets. Understanding human emotions and opinions from text is a difficult task and to make it easier, sentiment analyzers are used. This paper proposes a method for prediction of age groups namely teenagers, adults and senior citizens from textual data collected from twitter and compares performance of different classifiers such as K-Nearest Neighbor (KNN), Multi-layer Perceptron (MLP), Decision tree, Random forest and Support Vector Machine (SVM), based on certain performance metrics like f-score, precision, recall and accuracy. One of the basic applications of this work can be for web readability analysis of resources, available on Internet.\",\"PeriodicalId\":210400,\"journal\":{\"name\":\"Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3439231.3439262\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3439231.3439262","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

社交媒体平台提供了大量涵盖各种主题的文本数据，用于探索观点和情绪，并使用情感分析隐藏在内容中。消费者对产品质量和受欢迎程度的看法可以从社交媒体平台上的产品评论中推断出来，通过进行情感分析。情感分析告诉我们一个句子的极性是积极的、消极的还是中性的。它可以用来预测个性，年龄和性别，基于写作风格，使用标记训练数据集上的特征提取。从文本中理解人类的情感和观点是一项艰巨的任务，为了使其更容易，使用了情感分析工具。本文提出了一种从twitter收集的文本数据中预测青少年、成年人和老年人年龄组的方法，并比较了k -最近邻(KNN)、多层感知器(MLP)、决策树、随机森林和支持向量机(SVM)等不同分类器的性能，基于某些性能指标，如f分数、精度、召回率和准确性。本工作的一个基本应用是对Internet上的资源进行网页可读性分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Age Group Prediction on Textual Data using Sentiment Analysis

Social media platforms provide a large amount of textual data covering various topics to explore opinions and emotions, hidden in the content using sentiment analysis. The consumer perspective on the quality and popularity of a product can be deduced from the product reviews, available at social media platforms by performing sentiment analysis. Sentiment analysis tells about the polarity of a sentence whether positive, negative or neutral. It can be used to predict personality, age and gender, based on writing style using feature extraction on the labeled training data sets. Understanding human emotions and opinions from text is a difficult task and to make it easier, sentiment analyzers are used. This paper proposes a method for prediction of age groups namely teenagers, adults and senior citizens from textual data collected from twitter and compares performance of different classifiers such as K-Nearest Neighbor (KNN), Multi-layer Perceptron (MLP), Decision tree, Random forest and Support Vector Machine (SVM), based on certain performance metrics like f-score, precision, recall and accuracy. One of the basic applications of this work can be for web readability analysis of resources, available on Internet.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 9th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion

自引率

0.00%

发文量