Leveraging sentiment analysis of food delivery services reviews using deep learning and word embedding.

IF 2.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE PeerJ Computer Science Pub Date : 2025-02-19 eCollection Date: 2025-01-01 DOI:10.7717/peerj-cs.2669

Dheya Mustafa, Safaa M Khabour, Mousa Al-Kfairy, Ahmed Shatnawi

{"title":"Leveraging sentiment analysis of food delivery services reviews using deep learning and word embedding.","authors":"Dheya Mustafa, Safaa M Khabour, Mousa Al-Kfairy, Ahmed Shatnawi","doi":"10.7717/peerj-cs.2669","DOIUrl":null,"url":null,"abstract":"Companies that deliver food (food delivery services, or FDS) try to use customer feedback to identify aspects where the customer experience could be improved. Consumer feedback on purchasing and receiving goods via online platforms is a crucial tool for learning about a company's performance. Many English-language studies have been conducted on sentiment analysis (SA). Arabic is becoming one of the most extensively written languages on the World Wide Web, but because of its morphological and grammatical difficulty as well as the lack of openly accessible resources for Arabic SA, like as dictionaries and datasets, there has not been much research done on the language. Using a manually annotated FDS dataset, the current study conducts extensive sentiment analysis using reviews related to FDS that include Modern Standard Arabic and dialectal Arabic. It does this by utilizing word embedding models, deep learning techniques, and natural language processing to extract subjective opinions, determine polarity, and recognize customers' feelings in the FDS domain. Convolutional neural network (CNN), bidirectional long short-term memory recurrent neural network (BiLSTM), and an LSTM-CNN hybrid model were among the deep learning approaches to classification that we evaluated. In addition, the article investigated different effective approaches for word embedding and stemming techniques. Using a dataset of Modern Standard Arabic and dialectal Arabic corpus gathered from Talabat.com, we trained and evaluated our suggested models. Our best accuracy was approximately 84% for multiclass classification and 92.5% for binary classification on the FDS. To verify that the proposed approach is suitable for analyzing human perceptions in diversified domains, we designed and carried out excessive experiments on other existing Arabic datasets. The highest obtained multi-classification accuracy is 88.9% on the Hotels Arabic-Reviews Dataset (HARD) dataset, and the highest obtained binary classification accuracy is 97.2% on the same dataset.","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e2669"},"PeriodicalIF":2.5000,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11888876/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PeerJ Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.7717/peerj-cs.2669","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Companies that deliver food (food delivery services, or FDS) try to use customer feedback to identify aspects where the customer experience could be improved. Consumer feedback on purchasing and receiving goods via online platforms is a crucial tool for learning about a company's performance. Many English-language studies have been conducted on sentiment analysis (SA). Arabic is becoming one of the most extensively written languages on the World Wide Web, but because of its morphological and grammatical difficulty as well as the lack of openly accessible resources for Arabic SA, like as dictionaries and datasets, there has not been much research done on the language. Using a manually annotated FDS dataset, the current study conducts extensive sentiment analysis using reviews related to FDS that include Modern Standard Arabic and dialectal Arabic. It does this by utilizing word embedding models, deep learning techniques, and natural language processing to extract subjective opinions, determine polarity, and recognize customers' feelings in the FDS domain. Convolutional neural network (CNN), bidirectional long short-term memory recurrent neural network (BiLSTM), and an LSTM-CNN hybrid model were among the deep learning approaches to classification that we evaluated. In addition, the article investigated different effective approaches for word embedding and stemming techniques. Using a dataset of Modern Standard Arabic and dialectal Arabic corpus gathered from Talabat.com, we trained and evaluated our suggested models. Our best accuracy was approximately 84% for multiclass classification and 92.5% for binary classification on the FDS. To verify that the proposed approach is suitable for analyzing human perceptions in diversified domains, we designed and carried out excessive experiments on other existing Arabic datasets. The highest obtained multi-classification accuracy is 88.9% on the Hotels Arabic-Reviews Dataset (HARD) dataset, and the highest obtained binary classification accuracy is 97.2% on the same dataset.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用深度学习和词嵌入对送餐服务评论进行情感分析。

提供食品的公司（食品配送服务或FDS）尝试使用客户反馈来确定客户体验可以改进的方面。消费者通过在线平台购买和接收商品的反馈是了解公司业绩的重要工具。许多英语研究都是关于情感分析的。阿拉伯语正在成为万维网上最广泛的书面语言之一，但由于其形态学和语法上的困难，以及阿拉伯语SA缺乏公开可访问的资源，如字典和数据集，对该语言的研究还不多。目前的研究使用手动注释的FDS数据集，使用与FDS相关的评论（包括现代标准阿拉伯语和方言阿拉伯语）进行了广泛的情感分析。它通过使用词嵌入模型、深度学习技术和自然语言处理来提取主观意见，确定极性，并识别客户在FDS领域的感受。卷积神经网络（CNN）、双向长短期记忆递归神经网络（BiLSTM）和LSTM-CNN混合模型是我们评估的深度学习分类方法之一。此外，本文还研究了不同的有效的词嵌入和词干提取方法。使用从Talabat.com收集的现代标准阿拉伯语和方言阿拉伯语语料库数据集，我们训练并评估了我们建议的模型。我们在FDS上的多类分类的最佳准确率约为84%，二元分类的最佳准确率约为92.5%。为了验证所提出的方法适用于分析不同领域的人类感知，我们在其他现有的阿拉伯语数据集上设计并进行了大量实验。在酒店阿拉伯评论数据集（HARD）数据集上获得的最高多重分类准确率为88.9%，在同一数据集上获得的最高二元分类准确率为97.2%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

PeerJ Computer Science Computer Science-General Computer Science

CiteScore

6.10

自引率

5.30%

发文量

332

审稿时长

10 weeks

期刊介绍： PeerJ Computer Science is the new open access journal covering all subject areas in computer science, with the backing of a prestigious advisory board and more than 300 academic editors.