(Mis)Measuring People's Attitudes from Social Media
Author: Indira Sen
Venue: CSCW '20 Companion: Conference Companion Publication of the 2020 Conference on Computer Supported Cooperative Work and Social Computing, October 17-21, 2020, Virtual Event, USA
Published: 2020-10-17
DOI: https://doi.org/10.1145/3406865.3418363
Abstract
Activities of people, recorded via digital devices or online environments, offer increasingly comprehensive pictures of both individual and group-level behavior, potentially allowing inferences within and outside the platforms. These digital traces often take the form of textual units such as tweets or Reddit posts and comments. Compared to solicited survey responses, social media posts are the organic, unsolicited thoughts of people on a variety of topics, and the language in these posts is a key to their attitudes, beliefs, and values. Notwithstanding the many promises of digital traces, recent studies have begun to discuss the errors that can occur when digital traces are used to learn about social phenomena. In this thesis, I propose, first, to diagnose and characterize issues in the measurement of people's attitudes at scale and, second, to mitigate these errors through theory-driven solutions. To critically study and record errors and biases in using digital traces for measuring human behavior, I propose a systematic framework named the 'Total Error Framework for Digital Traces' (TED). TED is inspired by and adapted from the Total Survey Error Framework, which was developed and is employed in survey methodology to assess the validity and reliability of survey-based studies. To mitigate errors unearthed by examining Computational Social Science through TED, I apply several domain-specific solutions, such as using linguistic theories to understand people's attitudes. This thesis contributes to improving the reliability and validity of attitude measurement from digital traces.