Shu-Mei Lai , Tso-Jung Yen , Ming-Yi Chang , Yang-chih Fu , Wei-Chung Liu
{"title":"Predicting network members from partial contact records on social media: A machine learning approach","authors":"Shu-Mei Lai , Tso-Jung Yen , Ming-Yi Chang , Yang-chih Fu , Wei-Chung Liu","doi":"10.1016/j.socnet.2024.08.004","DOIUrl":null,"url":null,"abstract":"<div><p>Surveys conducted on social groups often generate incomplete information due to imperfect response rates. Drawing on Facebook data from a nationally representative sample of graduating college students in Taiwan, we examined the extent to which partial contact records predict which Facebook users belong to a specific class. We first used data from classes with low to middle response rates to train a model for classmate prediction. Based on data from classes with high or perfect response rates, we simulated data by using four different sampling methods with various response rates, and applied the trained model on simulated data to classmate prediction. With a minimal response rate of 40 percent, we achieved an accuracy rate of 90 percent and a true positive rate of 86 percent. Chronological order sampling had the best prediction performance, followed closely by popularity sampling, then by random sampling, and lastly by unpopularity sampling.</p></div>","PeriodicalId":48353,"journal":{"name":"Social Networks","volume":"80 ","pages":"Pages 10-24"},"PeriodicalIF":2.9000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0378873324000455/pdfft?md5=cc4c9fafb61af2fdbf03825448a7f086&pid=1-s2.0-S0378873324000455-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Social Networks","FirstCategoryId":"90","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378873324000455","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ANTHROPOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Surveys conducted on social groups often generate incomplete information due to imperfect response rates. Drawing on Facebook data from a nationally representative sample of graduating college students in Taiwan, we examined the extent to which partial contact records predict which Facebook users belong to a specific class. We first used data from classes with low to middle response rates to train a model for classmate prediction. Based on data from classes with high or perfect response rates, we simulated data by using four different sampling methods with various response rates, and applied the trained model on simulated data to classmate prediction. With a minimal response rate of 40 percent, we achieved an accuracy rate of 90 percent and a true positive rate of 86 percent. Chronological order sampling had the best prediction performance, followed closely by popularity sampling, then by random sampling, and lastly by unpopularity sampling.
期刊介绍:
Social Networks is an interdisciplinary and international quarterly. It provides a common forum for representatives of anthropology, sociology, history, social psychology, political science, human geography, biology, economics, communications science and other disciplines who share an interest in the study of the empirical structure of social relations and associations that may be expressed in network form. It publishes both theoretical and substantive papers. Critical reviews of major theoretical or methodological approaches using the notion of networks in the analysis of social behaviour are also included, as are reviews of recent books dealing with social networks and social structure.