Sentiment analysis of passenger feedback on U.S. airlines using machine learning classification methods

Md Nurul Raihen, Sultana Akter
{"title":"Sentiment analysis of passenger feedback on U.S. airlines using machine learning classification methods","authors":"Md Nurul Raihen, Sultana Akter","doi":"10.30574/wjarr.2024.23.1.2183","DOIUrl":null,"url":null,"abstract":"Twitter, a platform for micro-blogging, has contained as a novel information architecture. Everyday People worldwide publish about 200 million status messages, known as tweets. Twitter users express their opinions by posting concise text messages. Twitter data is useful for sentiment analysis and consumer feedback tweets. This study employed multi-class sentiment analysis to analyze tweets from 6 major US airlines (American, United, US Airways, Southwest, Delta and Virgin America). Airlines are essential for travel, and this study has helped people choose the best ones. Classification model with the lowest error rate could help airline companies improve their business by figuring out why information is being misclassified. This analysis of airline evaluations can help us identify good airlines and apply this model to our own journeys. This helps the airline identify its weaknesses so they can improve them. A technique of natural language processing (NLP) known as sentiment analysis (or opinion mining) classifies the tone of data as positive, negative, or neutral. The analysis was conducted with seven distinct classification strategies: Linear Discriminant Analysis, Quadratic Discriminant Analysis, Decision Tree, Random Forest, K-Nearest Neighbors, Gradient Boosting, and AdaBoost to utilize the split validation (80% as train data set, 20% as test data set) and 10 folds cross validation process. The suggested model demonstrates superior accuracy and efficiency compared to all others, achieving an accuracy score of 90.13% for the Random Forest with 10 folds cross validation approach. The project aims to utilize machine learning techniques to estimate the reasons for misclassified information since the lowest error rate means the airline sentiment provides less wrong prediction.","PeriodicalId":23739,"journal":{"name":"World Journal of Advanced Research and Reviews","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Journal of Advanced Research and Reviews","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30574/wjarr.2024.23.1.2183","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Twitter, a platform for micro-blogging, has contained as a novel information architecture. Everyday People worldwide publish about 200 million status messages, known as tweets. Twitter users express their opinions by posting concise text messages. Twitter data is useful for sentiment analysis and consumer feedback tweets. This study employed multi-class sentiment analysis to analyze tweets from 6 major US airlines (American, United, US Airways, Southwest, Delta and Virgin America). Airlines are essential for travel, and this study has helped people choose the best ones. Classification model with the lowest error rate could help airline companies improve their business by figuring out why information is being misclassified. This analysis of airline evaluations can help us identify good airlines and apply this model to our own journeys. This helps the airline identify its weaknesses so they can improve them. A technique of natural language processing (NLP) known as sentiment analysis (or opinion mining) classifies the tone of data as positive, negative, or neutral. The analysis was conducted with seven distinct classification strategies: Linear Discriminant Analysis, Quadratic Discriminant Analysis, Decision Tree, Random Forest, K-Nearest Neighbors, Gradient Boosting, and AdaBoost to utilize the split validation (80% as train data set, 20% as test data set) and 10 folds cross validation process. The suggested model demonstrates superior accuracy and efficiency compared to all others, achieving an accuracy score of 90.13% for the Random Forest with 10 folds cross validation approach. The project aims to utilize machine learning techniques to estimate the reasons for misclassified information since the lowest error rate means the airline sentiment provides less wrong prediction.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用机器学习分类方法对乘客对美国航空公司的反馈意见进行情感分析
推特(Twitter)是一个微型博客平台,包含了一种新颖的信息架构。每天,世界各地的人们都会发布约 2 亿条状态信息,即所谓的 "推文"。Twitter 用户通过发布简洁的文本信息来表达自己的观点。Twitter 数据可用于情感分析和消费者反馈推文。本研究采用多类情感分析法对美国 6 家主要航空公司(美国航空、美联航、全美航空、西南航空、达美航空和维珍美国航空)的推文进行了分析。航空公司是旅行的必备品,这项研究有助于人们选择最佳的航空公司。错误率最低的分类模型可以帮助航空公司找出信息被错误分类的原因,从而改善业务。对航空公司评价的分析可以帮助我们找出好的航空公司,并将这一模型应用到自己的旅行中。这有助于航空公司找出自己的弱点,从而加以改进。自然语言处理 (NLP) 的一种技术被称为情感分析(或意见挖掘),它将数据的语气分为正面、负面和中性。分析采用了七种不同的分类策略:线性判别分析、四元判别分析、决策树、随机森林、K-近邻、梯度提升和 AdaBoost,并利用拆分验证(80% 作为训练数据集,20% 作为测试数据集)和 10 次交叉验证过程。与所有其他方法相比,建议的模型具有更高的准确性和效率,随机森林和 10 次交叉验证方法的准确性达到了 90.13%。该项目旨在利用机器学习技术来估算信息分类错误的原因,因为最低的错误率意味着航空情感提供的错误预测较少。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The relationship between income from fish farming and pupil’s retention in primary schools in Amolator district, Uganda A GIS-based land suitability assessment of wheat production in Kano using selected physical parameters Nanotechnology applications in breast implant manufacturing for improved durability and functionality Challenges and perspectives of total hip arthroplasty in young adults: A study of 74 patients The influence of the workplace environment on the behavior of health workers who breastfeed in Kupang Regency
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1