{"title":"Complementarities in learning from data: Insights from general search","authors":"Maximilian Schaefer , Geza Sapi","doi":"10.1016/j.infoecopol.2023.101063","DOIUrl":null,"url":null,"abstract":"<div><p>The ability to make accurate predictions relating to consumer preferences is a key factor of a digital firm's success. Examples include targeted advertisements and, more broadly, business models relying on capturing consumers' attention. The prediction technologies used to learn consumer preferences rely on consumer generated data. Despite the importance of data-driven technologies, there is a lack of knowledge about the precise role that data-scale plays for prediction accuracy. From a policy perspective, a better understanding about the role of data is needed to assess the risks that “big data” might pose for competition. This article highlights potential complementarities between different data dimensions in algorithmic learning. We analyze our hypothesis using search engine data from Yahoo! and provide evidence that more data in the <em>within-user</em> dimension enhances the efficiency of algorithmic learning in the <em>across-user</em> dimension. Our findings suggest that ignoring these complementarities might lead to underestimating scale advantages from data.</p></div>","PeriodicalId":47029,"journal":{"name":"Information Economics and Policy","volume":"65 ","pages":"Article 101063"},"PeriodicalIF":4.5000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Economics and Policy","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167624523000483","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 2
Abstract
The ability to make accurate predictions relating to consumer preferences is a key factor of a digital firm's success. Examples include targeted advertisements and, more broadly, business models relying on capturing consumers' attention. The prediction technologies used to learn consumer preferences rely on consumer generated data. Despite the importance of data-driven technologies, there is a lack of knowledge about the precise role that data-scale plays for prediction accuracy. From a policy perspective, a better understanding about the role of data is needed to assess the risks that “big data” might pose for competition. This article highlights potential complementarities between different data dimensions in algorithmic learning. We analyze our hypothesis using search engine data from Yahoo! and provide evidence that more data in the within-user dimension enhances the efficiency of algorithmic learning in the across-user dimension. Our findings suggest that ignoring these complementarities might lead to underestimating scale advantages from data.
期刊介绍:
IEP is an international journal that aims to publish peer-reviewed policy-oriented research about the production, distribution and use of information, including these subjects: the economics of the telecommunications, mass media, and other information industries, the economics of innovation and intellectual property, the role of information in economic development, and the role of information and information technology in the functioning of markets. The purpose of the journal is to provide an interdisciplinary and international forum for theoretical and empirical research that addresses the needs of other researchers, government, and professionals who are involved in the policy-making process. IEP publishes research papers, short contributions, and surveys.