Early Experimental Psychology: How did Replication Work Before P-Hacking?

IF 3.6 2区心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Review of General Psychology Pub Date : 2022-02-04 DOI:10.1177/10892680211066468

Annette Mülberger

{"title":"Early Experimental Psychology: How did Replication Work Before P-Hacking?","authors":"Annette Mülberger","doi":"10.1177/10892680211066468","DOIUrl":null,"url":null,"abstract":"For many researchers, replication is still the “gold standard” that is crucial for verifying scientific findings (see, for example, Frank & Saxe, 2012; Iso-Ahola, 2020; Witte & Zenker, 2017). Indeed, Crandall and Sherman (2016) declared that: “[t]here is no controversy over the need for replication; virtually all scientists and philosophers of science endorse the notion that replication of one sort or another is absolutely essential” (p. 94). In recent decades, this has led to widespread concern because few experimental findings are actually being confirmed in this way (see, for example, Pashler & Wagenmakers, 2012; Reproducibility Project: Psychology; Wiggins & Chrisopherson, 2019). Before it is possible to plan how to remedy this situation, the reasons for the lack of replications must be identified. “Questionable research practices” such as p-hacking or post-hoc hypothesizing, the “file-drawer problem,” are often cited as contributing to the problem (Romero, 2019; Wiggins & Chrisopherson, 2019). These research practices are firmly embeddedwithin a scientific culture that is characterized by a highly competitive academic environment and a reward system that dissuades rather than encouraging replication (Crandall & Sherman, 2016; Romero, 2019). This setting fosters personal ambition, urging researchers to come up with innovative and ambitious projects continually and to publish as many papers as possible. Meanwhile, most journals only publish reports of original research offering statistically significant results, which has led to a “publication bias” (Romero, 2017). Replicability problems, as Pashler and Wagenmakers (2012) stated, “reflect deep-seated human biases and well entrenched incentives that shape the behavior of individuals and institutions” (p. 529). Fraud cases, such as that involving Diederik Stapel, show just how far a person might be willing to go when succumbing to such pressures (Stroebe, Postmes, & Spears, 2012; Derksen, 2021). Whether replication is really necessary and whether the problematic research practices mentioned above are due more to the present reward system, general human biases or an incorrect statistical or philosophical understanding are still open questions (Feest, 2019; Flis, 2019; Morawski, 2019). Given such uncertainties, it seems interesting to explore how research was undertaken in the past, when the current institutional conditions did not pertain—or at least, not yet fully. Stated differently: If the current replicability problem is related to recent research practices that have appeared as part of academic life in times of neoliberal capitalism and “big science,” then we might assume that replication worked differently in the past. Thus, in the present paper, I adopt a historical stance to reveal characteristics of nineteenth-century psychology experimental research practices and to describe the way research was replicated. The original experiments I present in this paper are well known, dating from 1860 to 1900, a period characterized by important changes in Europe, such as industrialization, workers movements, and the constitution of modern nation states. In this period, the Prussianmodel at German universities (Charle, 2004), offered a broad humanistic as well as a thorough experimental training in fields such as chemistry, physics, physiology, philosophy, and psychology, which was just emerging as an experimental sciencewith its own scientific community (Ash, 1980; Bringmann & Tweney, 1980; Rieber & Robinson, 2001). When the first replications were performed, neither young researchers nor senior professors suffered the pressure to “publish or perish,” and journals were not yet acting as gatekeepers taking decisions based on p-values. Often, researchers carried out years of patient experimenting before they published their results. Thus, what role did replication play within psychological research?","PeriodicalId":48306,"journal":{"name":"Review of General Psychology","volume":"26 1","pages":"131 - 145"},"PeriodicalIF":3.6000,"publicationDate":"2022-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Review of General Psychology","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1177/10892680211066468","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 2

Abstract

For many researchers, replication is still the “gold standard” that is crucial for verifying scientific findings (see, for example, Frank & Saxe, 2012; Iso-Ahola, 2020; Witte & Zenker, 2017). Indeed, Crandall and Sherman (2016) declared that: “[t]here is no controversy over the need for replication; virtually all scientists and philosophers of science endorse the notion that replication of one sort or another is absolutely essential” (p. 94). In recent decades, this has led to widespread concern because few experimental findings are actually being confirmed in this way (see, for example, Pashler & Wagenmakers, 2012; Reproducibility Project: Psychology; Wiggins & Chrisopherson, 2019). Before it is possible to plan how to remedy this situation, the reasons for the lack of replications must be identified. “Questionable research practices” such as p-hacking or post-hoc hypothesizing, the “file-drawer problem,” are often cited as contributing to the problem (Romero, 2019; Wiggins & Chrisopherson, 2019). These research practices are firmly embeddedwithin a scientific culture that is characterized by a highly competitive academic environment and a reward system that dissuades rather than encouraging replication (Crandall & Sherman, 2016; Romero, 2019). This setting fosters personal ambition, urging researchers to come up with innovative and ambitious projects continually and to publish as many papers as possible. Meanwhile, most journals only publish reports of original research offering statistically significant results, which has led to a “publication bias” (Romero, 2017). Replicability problems, as Pashler and Wagenmakers (2012) stated, “reflect deep-seated human biases and well entrenched incentives that shape the behavior of individuals and institutions” (p. 529). Fraud cases, such as that involving Diederik Stapel, show just how far a person might be willing to go when succumbing to such pressures (Stroebe, Postmes, & Spears, 2012; Derksen, 2021). Whether replication is really necessary and whether the problematic research practices mentioned above are due more to the present reward system, general human biases or an incorrect statistical or philosophical understanding are still open questions (Feest, 2019; Flis, 2019; Morawski, 2019). Given such uncertainties, it seems interesting to explore how research was undertaken in the past, when the current institutional conditions did not pertain—or at least, not yet fully. Stated differently: If the current replicability problem is related to recent research practices that have appeared as part of academic life in times of neoliberal capitalism and “big science,” then we might assume that replication worked differently in the past. Thus, in the present paper, I adopt a historical stance to reveal characteristics of nineteenth-century psychology experimental research practices and to describe the way research was replicated. The original experiments I present in this paper are well known, dating from 1860 to 1900, a period characterized by important changes in Europe, such as industrialization, workers movements, and the constitution of modern nation states. In this period, the Prussianmodel at German universities (Charle, 2004), offered a broad humanistic as well as a thorough experimental training in fields such as chemistry, physics, physiology, philosophy, and psychology, which was just emerging as an experimental sciencewith its own scientific community (Ash, 1980; Bringmann & Tweney, 1980; Rieber & Robinson, 2001). When the first replications were performed, neither young researchers nor senior professors suffered the pressure to “publish or perish,” and journals were not yet acting as gatekeepers taking decisions based on p-values. Often, researchers carried out years of patient experimenting before they published their results. Thus, what role did replication play within psychological research?

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

早期实验心理学：在P破解之前复制是如何工作的？

对许多研究人员来说，复制仍然是验证科学发现至关重要的“金标准”（例如，参见Frank和Saxe，2012；Iso Ahola，2020；Witte和Zenker，2017）。事实上，Crandall和Sherman（2016）宣称：“复制的必要性没有争议；几乎所有的科学家和科学哲学家都赞同这样或那样的复制是绝对必要的”（第94页）。近几十年来，这引起了人们的广泛关注，因为很少有实验发现能以这种方式得到证实（例如，参见Pashler&Wagenmakers，2012；再现性项目：心理学；威金斯和克里索弗森，2019）。在计划如何补救这种情况之前，必须确定缺少复制的原因。“有问题的研究实践”，如p-黑客攻击或事后假设，即“文件抽屉问题”，经常被认为是导致该问题的原因（Romero，2019；Wiggins&Chrispherson，2019）。这些研究实践牢牢植根于一种科学文化中，这种文化的特点是竞争激烈的学术环境和鼓励而非鼓励复制的奖励制度（Crandall&Sherman，2016；罗梅罗，2019）。这种环境培养了个人的雄心，促使研究人员不断提出创新和雄心勃勃的项目，并发表尽可能多的论文。与此同时，大多数期刊只发表具有统计意义的原始研究报告，这导致了“发表偏见”（Romero，2017）。正如Pashler和Wagenmakers（2012）所指出的，可复制性问题“反映了根深蒂固的人类偏见和根深蒂固的激励因素，这些因素塑造了个人和机构的行为”（第529页）。欺诈案件，如涉及Diederik Stapel的案件，表明了一个人在屈服于这种压力时可能愿意走多远（Stroebe，Postmes，&Spears，2012；Derksen，2021）。复制是否真的有必要，以及上述有问题的研究实践是否更多地是由于目前的奖励制度、普遍的人类偏见或不正确的统计或哲学理解，仍然是悬而未决的问题（Feest，2019；Flis，2019；Morawski，2019）。考虑到这些不确定性，探索过去的研究是如何进行的似乎很有趣，当时的制度条件并不适用——或者至少还没有完全适用。换句话说：如果当前的可复制性问题与最近的研究实践有关，这些研究实践在新自由主义资本主义和“大科学”时代作为学术生活的一部分出现，那么我们可能会认为复制在过去的效果不同。因此，在本文中，我采用历史的立场来揭示19世纪心理学实验研究实践的特点，并描述研究的复制方式。我在本文中提出的最初实验是众所周知的，可以追溯到1860年至1900年，这一时期欧洲发生了重要变化，如工业化、工人运动和现代民族国家的宪法。在这一时期，德国大学的普鲁士模式（Charle，2004）在化学、物理、生理学、哲学和心理学等领域提供了广泛的人文主义和彻底的实验培训，这是一门刚刚兴起的实验科学，拥有自己的科学界（Ash，1980；Bringmann和Tweney，1980；Rieber和Robinson，2001）。当进行第一次复制时，无论是年轻的研究人员还是资深教授，都没有承受“出版或灭亡”的压力，期刊也还没有充当基于p值做出决定的看门人。通常情况下，研究人员在发表研究结果之前进行了多年的耐心实验。因此，复制在心理学研究中扮演了什么角色？

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Review of General Psychology PSYCHOLOGY, MULTIDISCIPLINARY-

CiteScore

7.60

自引率

4.80%

发文量

期刊介绍： Review of General Psychology seeks to publish innovative theoretical, conceptual, or methodological articles that cross-cut the traditional subdisciplines of psychology. The journal contains articles that advance theory, evaluate and integrate research literatures, provide a new historical analysis, or discuss new methodological developments in psychology as a whole. Review of General Psychology is especially interested in articles that bridge gaps between subdisciplines in psychology as well as related fields or that focus on topics that transcend traditional subdisciplinary boundaries.

期刊最新文献

Relational Ontology in the Mapuche Thinking: Possibilities for Indigenous Well-Being Amidst Colonial Settings Education and Training: Professional The 4D Model of American Political Conservatism: Disgust, Disorder Aversion, Deontology, and (Social) Dominance The Kokoro in Japanese Spiritual Care Antiracist Psychology to Advance Equitable Public Policy