Geoff Tomaino, Asaf Mazar, Ziv Carmon, Klaus Wertenbroch
{"title":"A simple method for improving generalizability in behavioral science: Scope Testing with AI-Generated Stimuli (STAGS)","authors":"Geoff Tomaino, Asaf Mazar, Ziv Carmon, Klaus Wertenbroch","doi":"10.1002/arcp.1101","DOIUrl":null,"url":null,"abstract":"<p>Behavioral research typically tests hypotheses in a limited set of researcher-selected contexts. This approach can reveal whether an effect <i>can</i> occur (possibility) but does not indicate whether it holds in other contexts (generalizability). We present Scope Testing with AI-Generated Stimuli (STAGS), a simple approach that uses Generative AI (GenAI) to test predictions across a range, or scope, of stimuli. By assessing whether a prediction holds across this range, STAGS sheds light on the generalizability of the effect. In addition, outsourcing stimulus generation to GenAI makes transparent the otherwise opaque process of stimulus selection, requiring researchers to articulate the scope of stimuli to which their hypothesis applies. We illustrate STAGS in an experiment, showing that specifying the population from which stimuli are sampled can help researchers understand the scope of the effect they are studying. We discuss the benefits and limitations of this approach and propose directions for future exploration.</p>","PeriodicalId":100328,"journal":{"name":"Consumer Psychology Review","volume":"8 1","pages":"87-97"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Consumer Psychology Review","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/arcp.1101","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Behavioral research typically tests hypotheses in a limited set of researcher-selected contexts. This approach can reveal whether an effect can occur (possibility) but does not indicate whether it holds in other contexts (generalizability). We present Scope Testing with AI-Generated Stimuli (STAGS), a simple approach that uses Generative AI (GenAI) to test predictions across a range, or scope, of stimuli. By assessing whether a prediction holds across this range, STAGS sheds light on the generalizability of the effect. In addition, outsourcing stimulus generation to GenAI makes transparent the otherwise opaque process of stimulus selection, requiring researchers to articulate the scope of stimuli to which their hypothesis applies. We illustrate STAGS in an experiment, showing that specifying the population from which stimuli are sampled can help researchers understand the scope of the effect they are studying. We discuss the benefits and limitations of this approach and propose directions for future exploration.