{"title":"Correcting Sample Selection Bias of Historical Digital Trace Data: Inverse Probability Weighting (IPW) and Type II Tobit Model","authors":"Chankyung Pak, Kelley Cotter, Kjerstin Thorson","doi":"10.1080/19312458.2022.2037537","DOIUrl":null,"url":null,"abstract":"ABSTRACT Digital trace data have become one of the central pillars of media research methods. Despite the opportunities for better understanding individual users’ true behaviors in the personalized media environment, many scholars have pointed out the potential for bias in trace data collections, questioning the generalizability of findings based on them. In this study, we propose two statistical bias correction methods–Inverse Probability Weighting (IPW) and Type II Tobit, which are designed to remedy selection bias of inference from digital trace data donated by research participants. Applying these methods to Facebook take-out data, we demonstrate how the correction methods can change estimated effect sizes, which is important for the translation of academic findings into real-world impacts. We conduct two simulation studies, one under fully synthetic and another under partially simulated conditions, and find that Type II Tobit generally provides a more robust and cost-efficient correction method for digital trace data.","PeriodicalId":47552,"journal":{"name":"Communication Methods and Measures","volume":"16 1","pages":"134 - 155"},"PeriodicalIF":6.3000,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communication Methods and Measures","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1080/19312458.2022.2037537","RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMMUNICATION","Score":null,"Total":0}
引用次数: 0
Abstract
ABSTRACT Digital trace data have become one of the central pillars of media research methods. Despite the opportunities for better understanding individual users’ true behaviors in the personalized media environment, many scholars have pointed out the potential for bias in trace data collections, questioning the generalizability of findings based on them. In this study, we propose two statistical bias correction methods–Inverse Probability Weighting (IPW) and Type II Tobit, which are designed to remedy selection bias of inference from digital trace data donated by research participants. Applying these methods to Facebook take-out data, we demonstrate how the correction methods can change estimated effect sizes, which is important for the translation of academic findings into real-world impacts. We conduct two simulation studies, one under fully synthetic and another under partially simulated conditions, and find that Type II Tobit generally provides a more robust and cost-efficient correction method for digital trace data.
期刊介绍:
Communication Methods and Measures aims to achieve several goals in the field of communication research. Firstly, it aims to bring attention to and showcase developments in both qualitative and quantitative research methodologies to communication scholars. This journal serves as a platform for researchers across the field to discuss and disseminate methodological tools and approaches.
Additionally, Communication Methods and Measures seeks to improve research design and analysis practices by offering suggestions for improvement. It aims to introduce new methods of measurement that are valuable to communication scientists or enhance existing methods. The journal encourages submissions that focus on methods for enhancing research design and theory testing, employing both quantitative and qualitative approaches.
Furthermore, the journal is open to articles devoted to exploring the epistemological aspects relevant to communication research methodologies. It welcomes well-written manuscripts that demonstrate the use of methods and articles that highlight the advantages of lesser-known or newer methods over those traditionally used in communication.
In summary, Communication Methods and Measures strives to advance the field of communication research by showcasing and discussing innovative methodologies, improving research practices, and introducing new measurement methods.