{"title":"Bangkok Tours and Activities Data Analysis via User-Generated Content","authors":"Naina Chugh, N. Phumchusri","doi":"10.1109/iCCECE49321.2020.9231211","DOIUrl":null,"url":null,"abstract":"The overarching goal of this paper is to gain visibility on tourist preferences and whether or not the needs of tourists are being met. With the Travel and Tourism (T&T) sector being the backbone to the global economy and the sector becoming more saturated and competitive, insights on T&T are vital now, more than ever. The rise of social media and user-generated content has effectuated the opportunity for a systematic analysis of tourist preferences via user-generated content. This paper is focused on gaining insights of tourism in Bangkok, Thailand through user-generated content scraped from TripAdvisor's online reviews of tours and activities. In order to develop insights on tourist preferences and tourism trends in Bangkok, various analyses were implemented, including sentiment analysis to gather tourist point-of-view, association rules mining to find patterns of preferences, and natural language processing along with text frequency analysis to understand what features tourists are most frequently talking about. This paper also developed prediction models using logistic regression to forecast 5-start ratings and 1-star ratings of reviews - with the purpose of identifying factors that significantly affect position and negative sentiments on tours/activities.","PeriodicalId":413847,"journal":{"name":"2020 International Conference on Computing, Electronics & Communications Engineering (iCCECE)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Computing, Electronics & Communications Engineering (iCCECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iCCECE49321.2020.9231211","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The overarching goal of this paper is to gain visibility on tourist preferences and whether or not the needs of tourists are being met. With the Travel and Tourism (T&T) sector being the backbone to the global economy and the sector becoming more saturated and competitive, insights on T&T are vital now, more than ever. The rise of social media and user-generated content has effectuated the opportunity for a systematic analysis of tourist preferences via user-generated content. This paper is focused on gaining insights of tourism in Bangkok, Thailand through user-generated content scraped from TripAdvisor's online reviews of tours and activities. In order to develop insights on tourist preferences and tourism trends in Bangkok, various analyses were implemented, including sentiment analysis to gather tourist point-of-view, association rules mining to find patterns of preferences, and natural language processing along with text frequency analysis to understand what features tourists are most frequently talking about. This paper also developed prediction models using logistic regression to forecast 5-start ratings and 1-star ratings of reviews - with the purpose of identifying factors that significantly affect position and negative sentiments on tours/activities.