{"title":"Predicting restaurant survival using nationwide Google Maps data","authors":"Tomasz Starakiewicz, Piotr Wójcik","doi":"10.1016/j.knosys.2025.113198","DOIUrl":null,"url":null,"abstract":"<div><div>The restaurant sector is pivotal to firm exit research, which influences economic policy and managerial strategy recommendations. Recent studies using online data are based on geographically limited datasets and have largely omitted temporal dynamics in user interactions. Additionally, these studies rely on manual labeling for text analysis, a resource-intensive approach. Built upon the case of Poland, our study introduces the first comprehensive, nationwide analysis of restaurant survival using Google Maps data. We enhance predictive model performance by incorporating time-sensitive user interactions. Our model controls for established determinants of business exit and proves robust regarding data quality issues associated with user-provided business directories. We apply an efficient, label-free method for extracting semantic content from reviews, thereby creating useful features for firm exit prediction. Furthermore, we present an efficient feature selection strategy using hierarchical agglomerative clustering that retains predictive power while reducing the model complexity. Our model has broad applications ranging from credit scoring to early-warning systems for business closures, while our data collection method opens doors to large-scale firm exit studies in regions where official records are lacking and online sources used in previous studies are less prevalent.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"314 ","pages":"Article 113198"},"PeriodicalIF":7.2000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095070512500245X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The restaurant sector is pivotal to firm exit research, which influences economic policy and managerial strategy recommendations. Recent studies using online data are based on geographically limited datasets and have largely omitted temporal dynamics in user interactions. Additionally, these studies rely on manual labeling for text analysis, a resource-intensive approach. Built upon the case of Poland, our study introduces the first comprehensive, nationwide analysis of restaurant survival using Google Maps data. We enhance predictive model performance by incorporating time-sensitive user interactions. Our model controls for established determinants of business exit and proves robust regarding data quality issues associated with user-provided business directories. We apply an efficient, label-free method for extracting semantic content from reviews, thereby creating useful features for firm exit prediction. Furthermore, we present an efficient feature selection strategy using hierarchical agglomerative clustering that retains predictive power while reducing the model complexity. Our model has broad applications ranging from credit scoring to early-warning systems for business closures, while our data collection method opens doors to large-scale firm exit studies in regions where official records are lacking and online sources used in previous studies are less prevalent.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.