Harnessing Mobile Phone Social Network Topology to Infer Users Demographic Attributes

J. Brea, Javier Burroni, Martin Minnoni, Carlos Sarraute
{"title":"Harnessing Mobile Phone Social Network Topology to Infer Users Demographic Attributes","authors":"J. Brea, Javier Burroni, Martin Minnoni, Carlos Sarraute","doi":"10.1145/2659480.2659492","DOIUrl":null,"url":null,"abstract":"We study the structure of the social graph of mobile phone users in the country of Mexico, with a focus on demographic attributes of the users (more specifically the users' age). We examine assortativity patterns in the graph, and observe a strong age homophily in the communications preferences. We propose a graph based algorithm for the prediction of the age of mobile phone users. The algorithm exploits the topology of the mobile phone network, together with a subset of known users ages (seeds), to infer the age of remaining users. We provide the details of the methodology, and show experimental results on a network GT with more than 70 million users. By carefully examining the topological relations of the seeds to the rest of the nodes in GT, we find topological metrics which have a direct influence on the performance of the algorithm. In particular we characterize subsets of users for which the accuracy of the algorithm is 62% when predicting between 4 age categories (whereas a pure random guess would yield an accuracy of 25%). We also show that we can use the probabilistic information computed by the algorithm to further increase its inference power to 72% on a significant subset of users.","PeriodicalId":74521,"journal":{"name":"Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining","volume":"7 1","pages":"1:1-1:9"},"PeriodicalIF":0.0000,"publicationDate":"2014-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining. International Conference on Advances in Social Network Analysis and Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2659480.2659492","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

Abstract

We study the structure of the social graph of mobile phone users in the country of Mexico, with a focus on demographic attributes of the users (more specifically the users' age). We examine assortativity patterns in the graph, and observe a strong age homophily in the communications preferences. We propose a graph based algorithm for the prediction of the age of mobile phone users. The algorithm exploits the topology of the mobile phone network, together with a subset of known users ages (seeds), to infer the age of remaining users. We provide the details of the methodology, and show experimental results on a network GT with more than 70 million users. By carefully examining the topological relations of the seeds to the rest of the nodes in GT, we find topological metrics which have a direct influence on the performance of the algorithm. In particular we characterize subsets of users for which the accuracy of the algorithm is 62% when predicting between 4 age categories (whereas a pure random guess would yield an accuracy of 25%). We also show that we can use the probabilistic information computed by the algorithm to further increase its inference power to 72% on a significant subset of users.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用手机社交网络拓扑推断用户人口统计属性
我们研究了墨西哥手机用户的社交图谱结构,重点关注用户的人口统计属性(更具体地说,是用户的年龄)。我们研究了图中的分类模式,并观察到通信偏好中强烈的年龄同质性。我们提出了一种基于图的算法来预测手机用户的年龄。该算法利用移动电话网络的拓扑结构,以及已知用户年龄的子集(种子)来推断剩余用户的年龄。我们提供了方法的细节,并展示了在拥有超过7000万用户的网络GT上的实验结果。通过仔细检查种子到GT中其余节点的拓扑关系,我们发现拓扑指标对算法的性能有直接影响。特别是,我们描述了用户子集,当预测4个年龄类别时,算法的准确率为62%(而纯随机猜测的准确率为25%)。我们还表明,我们可以使用算法计算的概率信息进一步将其推断能力提高到72%,在一个重要的用户子集上。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
An ensemble transformer-based model for Arabic sentiment analysis Homophily and polarization on political twitter during the 2017 Norwegian election Perceptible sentiment analysis of students' WhatsApp group chats in valence, arousal, and dominance space A performant deep learning model for sentiment analysis of climate change DEES: a real-time system for event extraction from disaster-related web text
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1