{"title":"Discovering a tourism destination with social media data: BERT-based sentiment analysis","authors":"M. Viñán-Ludeña, Luis M. de Campos","doi":"10.1108/jhtt-09-2021-0259","DOIUrl":null,"url":null,"abstract":"\nPurpose\nThe main purpose of this paper is to analyze a tourist destination using sentiment analysis techniques with data from Twitter and Instagram to find the most representative entities (or places) and perceptions (or aspects) of the users.\n\n\nDesign/methodology/approach\nThe authors used 90,725 Instagram posts and 235,755 Twitter tweets to analyze tourism in Granada (Spain) to identify the important places and perceptions mentioned by travelers on both social media sites. The authors used several approaches for sentiment classification for English and Spanish texts, including deep learning models.\n\n\nFindings\nThe best results in a test set were obtained using a bidirectional encoder representations from transformers (BERT) model for Spanish texts and Tweeteval for English texts, and these were subsequently used to analyze the data sets. It was then possible to identify the most important entities and aspects, and this, in turn, provided interesting insights for researchers, practitioners, travelers and tourism managers so that services could be improved and better marketing strategies formulated.\n\n\nResearch limitations/implications\nThe authors propose a Spanish-Tourism-BERT model for performing sentiment classification together with a process to find places through hashtags and to reveal the important negative aspects of each place.\n\n\nPractical implications\nThe study enables managers and practitioners to implement the Spanish-BERT model with our Spanish Tourism data set that the authors released for adoption in applications to find both positive and negative perceptions.\n\n\nOriginality/value\nThis study presents a novel approach on how to apply sentiment analysis in the tourism domain. First, the way to evaluate the different existing models and tools is presented; second, a model is trained using BERT (deep learning model); third, an approach of how to identify the acceptance of the places of a destination through hashtags is presented and, finally, the evaluation of why the users express positivity (negativity) through the identification of entities and aspects.\n","PeriodicalId":51611,"journal":{"name":"Journal of Hospitality and Tourism Technology","volume":" ","pages":""},"PeriodicalIF":5.3000,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hospitality and Tourism Technology","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1108/jhtt-09-2021-0259","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HOSPITALITY, LEISURE, SPORT & TOURISM","Score":null,"Total":0}
引用次数: 6
Abstract
Purpose
The main purpose of this paper is to analyze a tourist destination using sentiment analysis techniques with data from Twitter and Instagram to find the most representative entities (or places) and perceptions (or aspects) of the users.
Design/methodology/approach
The authors used 90,725 Instagram posts and 235,755 Twitter tweets to analyze tourism in Granada (Spain) to identify the important places and perceptions mentioned by travelers on both social media sites. The authors used several approaches for sentiment classification for English and Spanish texts, including deep learning models.
Findings
The best results in a test set were obtained using a bidirectional encoder representations from transformers (BERT) model for Spanish texts and Tweeteval for English texts, and these were subsequently used to analyze the data sets. It was then possible to identify the most important entities and aspects, and this, in turn, provided interesting insights for researchers, practitioners, travelers and tourism managers so that services could be improved and better marketing strategies formulated.
Research limitations/implications
The authors propose a Spanish-Tourism-BERT model for performing sentiment classification together with a process to find places through hashtags and to reveal the important negative aspects of each place.
Practical implications
The study enables managers and practitioners to implement the Spanish-BERT model with our Spanish Tourism data set that the authors released for adoption in applications to find both positive and negative perceptions.
Originality/value
This study presents a novel approach on how to apply sentiment analysis in the tourism domain. First, the way to evaluate the different existing models and tools is presented; second, a model is trained using BERT (deep learning model); third, an approach of how to identify the acceptance of the places of a destination through hashtags is presented and, finally, the evaluation of why the users express positivity (negativity) through the identification of entities and aspects.
期刊介绍:
The Journal of Hospitality and Tourism Technology is the only journal dedicated solely for research in technology and e-business in tourism and hospitality. It is a bridge between academia and industry through the intellectual exchange of ideas, trends and paradigmatic changes in the fields of hospitality, IT and e-business. It covers: -E-Marketplaces, electronic distribution channels, or e-Intermediaries -Internet or e-commerce business models -Self service technologies -E-Procurement -Social dynamics of e-communication -Relationship Development and Retention -E-governance -Security of transactions -Mobile/Wireless technologies in commerce -IT control and preparation for disaster -Virtual reality applications -Word of Mouth. -Cross-Cultural differences in IT use -GPS and Location-based services -Biometric applications -Business intelligence visualization -Radio Frequency Identification applications -Service-Oriented Architecture of business systems -Technology in New Product Development