Taking a Closer Look at the Bayesian Truth Serum.

IF 1.3 4区心理学 Q4 PSYCHOLOGY, EXPERIMENTAL Experimental psychology Pub Date : 2022-07-01 DOI:10.1027/1618-3169/a000558

Philipp Schoenegger, Steven Verheyen

{"title":"Taking a Closer Look at the Bayesian Truth Serum.","authors":"Philipp Schoenegger, Steven Verheyen","doi":"10.1027/1618-3169/a000558","DOIUrl":null,"url":null,"abstract":" Over the past few decades, psychology and its cognate disciplines have undergone substantial scientific reform, ranging from advances in statistical methodology to significant changes in academic norms. One aspect of experimental design that has received comparatively little attention is incentivization, i.e., the way that participants are rewarded and incentivized monetarily for their participation in experiments and surveys. While incentive-compatible designs are the norm in disciplines like economics, the majority of studies in psychology and experimental philosophy are constructed such that individuals' incentives to maximize their payoffs in many cases stand opposed to their incentives to state their true preferences honestly. This is in part because the subject matter is often self-report data about subjective topics, and the sample is drawn from online platforms like Prolific or MTurk where many participants are out to make a quick buck. One mechanism that allows for the introduction of an incentive-compatible design in such circumstances is the Bayesian Truth Serum (BTS; Prelec, 2004), which rewards participants based on how surprisingly common their answers are. Recently, Schoenegger (2021) applied this mechanism in the context of Likert-scale self-reports, finding that the introduction of this mechanism significantly altered response behavior. In this registered report, we further investigate this mechanism by (1) attempting to directly replicate the previous result and (2) analyzing if the Bayesian Truth Serum's effect is distinct from the effects of its constituent parts (increase in expected earnings and addition of prediction tasks). We fail to find significant differences in response behavior between participants who were simply paid for completing the study and participants who were incentivized with the BTS. Per our pre-registration, we regard this as evidence in favor of a null effect of up to V = .1 and a failure to replicate but reserve judgment as to whether the BTS mechanism should be adopted in social science fields that rely heavily on Likert-scale items reporting subjective data, seeing that smaller effect sizes might still be of practical interest and results may differ for items different from the ones we studied. Further, we provide weak evidence that the prediction task itself influences response distributions and that this task's effect is distinct from an increase in expected earnings, suggesting a complex interaction between the BTS' constituent parts and its truth-telling instructions.","PeriodicalId":12173,"journal":{"name":"Experimental psychology","volume":"69 4","pages":"226-239"},"PeriodicalIF":1.3000,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Experimental psychology","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1027/1618-3169/a000558","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

Abstract

Over the past few decades, psychology and its cognate disciplines have undergone substantial scientific reform, ranging from advances in statistical methodology to significant changes in academic norms. One aspect of experimental design that has received comparatively little attention is incentivization, i.e., the way that participants are rewarded and incentivized monetarily for their participation in experiments and surveys. While incentive-compatible designs are the norm in disciplines like economics, the majority of studies in psychology and experimental philosophy are constructed such that individuals' incentives to maximize their payoffs in many cases stand opposed to their incentives to state their true preferences honestly. This is in part because the subject matter is often self-report data about subjective topics, and the sample is drawn from online platforms like Prolific or MTurk where many participants are out to make a quick buck. One mechanism that allows for the introduction of an incentive-compatible design in such circumstances is the Bayesian Truth Serum (BTS; Prelec, 2004), which rewards participants based on how surprisingly common their answers are. Recently, Schoenegger (2021) applied this mechanism in the context of Likert-scale self-reports, finding that the introduction of this mechanism significantly altered response behavior. In this registered report, we further investigate this mechanism by (1) attempting to directly replicate the previous result and (2) analyzing if the Bayesian Truth Serum's effect is distinct from the effects of its constituent parts (increase in expected earnings and addition of prediction tasks). We fail to find significant differences in response behavior between participants who were simply paid for completing the study and participants who were incentivized with the BTS. Per our pre-registration, we regard this as evidence in favor of a null effect of up to V = .1 and a failure to replicate but reserve judgment as to whether the BTS mechanism should be adopted in social science fields that rely heavily on Likert-scale items reporting subjective data, seeing that smaller effect sizes might still be of practical interest and results may differ for items different from the ones we studied. Further, we provide weak evidence that the prediction task itself influences response distributions and that this task's effect is distinct from an increase in expected earnings, suggesting a complex interaction between the BTS' constituent parts and its truth-telling instructions.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

近距离观察贝叶斯吐真剂。

在过去的几十年里，心理学及其相关学科经历了重大的科学改革，从统计方法的进步到学术规范的重大变化。实验设计中相对较少受到关注的一个方面是激励，即参与者因参与实验和调查而获得奖励和金钱激励的方式。虽然激励相容的设计是经济学等学科的规范，但心理学和实验哲学的大多数研究都是这样构建的:在许多情况下，个人最大化其收益的动机与他们诚实地陈述其真实偏好的动机是对立的。这在一定程度上是因为研究的主题通常是关于主观话题的自我报告数据，样本来自多产或MTurk等在线平台，在这些平台上，许多参与者都想赚快钱。一种允许在这种情况下引入激励相容设计的机制是贝叶斯真值血清(BTS;Prelec, 2004)，该计划根据参与者的答案出奇地普遍来奖励他们。最近，Schoenegger(2021)将该机制应用于李克特自我报告中，发现该机制的引入显著改变了反应行为。在本注册报告中，我们通过(1)试图直接复制之前的结果和(2)分析贝叶斯真值血清的效果是否与其组成部分(预期收益的增加和预测任务的增加)的效果不同，进一步研究了这一机制。我们并没有发现仅仅因为完成研究而获得报酬的参与者和受到防弹少年团激励的参与者之间的反应行为有显著差异。根据我们的预注册，我们认为这是支持零效应高达V = 0.1的证据，并且无法复制，但对于是否应该在严重依赖李克特量表项目报告主观数据的社会科学领域采用BTS机制保留判断，因为较小的效应大小可能仍然具有实际意义，并且结果可能与我们研究的项目不同。此外，我们提供了微弱的证据，证明预测任务本身会影响反应分布，并且该任务的效果与预期收益的增加不同，这表明BTS的组成部分与其说实话指令之间存在复杂的相互作用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Experimental psychology PSYCHOLOGY, EXPERIMENTAL-

CiteScore

2.00

自引率

7.70%

发文量

期刊介绍： As its name implies, Experimental Psychology (ISSN 1618-3169) publishes innovative, original, high-quality experimental research in psychology — quickly! It aims to provide a particularly fast outlet for such research, relying heavily on electronic exchange of information which begins with the electronic submission of manuscripts, and continues throughout the entire review and production process. The scope of the journal is defined by the experimental method, and so papers based on experiments from all areas of psychology are published. In addition to research articles, Experimental Psychology includes occasional theoretical and review articles.