Reproducibility of Survey Results: A New Method to Quantify Similarity of Human Subject Pools

A. R. Khamesi, Riccardo Musmeci, S. Silvestri, Denise A. Baker
{"title":"Reproducibility of Survey Results: A New Method to Quantify Similarity of Human Subject Pools","authors":"A. R. Khamesi, Riccardo Musmeci, S. Silvestri, Denise A. Baker","doi":"10.1109/GLOBECOM42002.2020.9348076","DOIUrl":null,"url":null,"abstract":"Smart Connected Communities (SCCs) is a novel paradigm that brings together multiple disciplines, including social-sciences, computer science, and engineering. Large-scale surveys are a fundamental tool to understand the needs and impact of new technologies to human populations, necessary to realize the SCC paradigm. However, there is a growing debate regarding the reproducibility of survey results. As an example, it has been shown that surveys may easily provide contradictory results, even if the subject populations are statistically equivalent from a demographic perspective. In this paper, we take the initial steps towards addressing the problem of reproducibility of survey results by providing formal methods to quantitatively justify apparently inconsistent results. Specifically, we define a new dissimilarity metric between two populations based on the users answers to non-demographic questions. To this purpose, we propose two algorithms based on submodular optimization and information theory, respectively, to select the most representative questions in a survey. Results show that our method effectively identifies and quantifies differences that are not evident from a purely demographic point of view.","PeriodicalId":12759,"journal":{"name":"GLOBECOM 2020 - 2020 IEEE Global Communications Conference","volume":"171 1","pages":"1-7"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"GLOBECOM 2020 - 2020 IEEE Global Communications Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GLOBECOM42002.2020.9348076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Smart Connected Communities (SCCs) is a novel paradigm that brings together multiple disciplines, including social-sciences, computer science, and engineering. Large-scale surveys are a fundamental tool to understand the needs and impact of new technologies to human populations, necessary to realize the SCC paradigm. However, there is a growing debate regarding the reproducibility of survey results. As an example, it has been shown that surveys may easily provide contradictory results, even if the subject populations are statistically equivalent from a demographic perspective. In this paper, we take the initial steps towards addressing the problem of reproducibility of survey results by providing formal methods to quantitatively justify apparently inconsistent results. Specifically, we define a new dissimilarity metric between two populations based on the users answers to non-demographic questions. To this purpose, we propose two algorithms based on submodular optimization and information theory, respectively, to select the most representative questions in a survey. Results show that our method effectively identifies and quantifies differences that are not evident from a purely demographic point of view.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
调查结果的再现性:一种量化人类受试者池相似性的新方法
智能互联社区(SCCs)是一种将社会科学、计算机科学和工程学等多学科结合在一起的新范式。大规模调查是了解新技术对人口的需求和影响的基本工具,是实现SCC范式所必需的。然而,关于调查结果的可重复性的争论越来越多。举个例子,调查很容易得出相互矛盾的结果,即使从人口统计学的角度来看,调查对象在统计上是相等的。在本文中,我们采取初步步骤,通过提供正式的方法来定量地证明明显不一致的结果,以解决调查结果的可重复性问题。具体来说,我们根据用户对非人口统计问题的回答定义了两个人群之间的新的不相似度度量。为此,我们分别提出了基于子模块优化和信息论的两种算法来选择调查中最具代表性的问题。结果表明,我们的方法有效地识别和量化了从纯粹人口统计学的角度来看并不明显的差异。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
AirID: Injecting a Custom RF Fingerprint for Enhanced UAV Identification using Deep Learning Oversampling Algorithm based on Reinforcement Learning in Imbalanced Problems FAST-RAM: A Fast AI-assistant Solution for Task Offloading and Resource Allocation in MEC Achieving Privacy-Preserving Vehicle Selection for Effective Content Dissemination in Smart Cities Age-optimal Transmission Policy for Markov Source with Differential Encoding
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1